Results for "datasets"

Tags

  • KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies

    KAT is a suite of tools that analyse jellyfish hashes or sequence files (fasta or fastq) using kmer counts. The following tools are currently available in KAT: hist: Create an histogram of k-mer occurrences from a sequence file. Adds metadata in output for easy plotting. gcp: K-mer GC Pr...

    Tags: KAT, K-mer, analysis, toolkit, quality, control, NGS, datasets, genome, assemblies

    1972 days ago

  • Simka and SimkaMin are comparative metagenomics method dedicated to NGS datasets

    Simka is a de novo comparative metagenomics tool. Simka represents each dataset as a k-mer spectrum and compute several classical ecological distances between them. Developper: Gaëtan Benoit, PhD, former member of the Genscale team at Inria. Contact: claire dot lemaitre at i...

    Tags: Simka, SimkaMin, comparative, metagenomics, method, dedicated, NGS, datasets

    1755 days ago

  • ENCODE3: A collection of research articles and related content describing the Encyclopedia of DNA Elements, its datasets and tools.

    How cells, tissues and organisms interpret the information encoded in the genome has vital implications for our understanding of development, health and disease. Launched in 2003, the ENCyclopedia Of DNA Elements (ENCODE) project has the aim of mapping the functional elements in the human genome ...

    Tags: ENCODE3, collection, research, articles, Encyclopedia, DNA, Elements, datasets, tools

    1356 days ago

  • LncPipe:A Nextflow-based pipeline for comprehensive analyses of long non-coding RNAs from RNA-seq datasets

    The pipeline was developed based on a popular workflow framework Nextflow, composed of four core procedures including reads alignment, assembly, identification and quantification. It contains various unique features such as well-designed lncRNAs annotation strategy, optimized calculating eff...

    Tags: LncPipe, Nextflow, pipeline, comprehensive, analyses, long, non-coding, RNAs, RNA-seq, datasets

    952 days ago

  • ncbi-datasets-cli -- Quickstart: command line tools !

    Install and use the NCBI Datasets command line tools The NCBI Datasets datasets command line tools are datasets and dataformat . Use datasets to download biological sequence data across all domains of life from NCBI. Use dataformat to convert metadata fr...

    Tags: ncbi-datasets-cli, database, download, ncbi, genome, genes, datasets

    870 days ago

  • Orthoflow: workflow for phylogenetic inference of genome-scale datasets of protein-coding genes

    Orthoflow is a workflow for phylogenetic inference of genome-scale datasets of protein-coding genes. Our goal was to make it straightforward to work from a combination of input sources including annotated contigs in Genbank format and FASTA files containing CDSs. It uses several state of the art ...

    Tags: Orthoflow, workflow, phylogenetic, inference, genome-scale, datasets, protein-coding, genes

    64 days ago

  • NCBI Datasets pages

    Update! Assembly and Genome record pages now redirect to new NCBI Datasets pages. NCBI Datasets is a new resource that makes it easier to find and download genome data. Learn more: https://ncbiinsights.ncbi.nlm.nih.gov/2023/07/11/ncbi-datasets-genome-assembly-pages/ #NCBICGR Effective July ...

    Tags: Assembly, Genome, Record, pages, redirect, NCBI, Datasets

    288 days ago