PureCN: copy number calling and SNV classification using targeted short read sequencing
This package estimates tumor purity, copy number, and loss of heterozygosity (LOH), and classifies single nucleotide variants (SNVs) by somatic status and clonality. PureCN is designed for targeted short read sequencing data, integrates well with standard somatic variant detection and copy number...Tags: PureCN, copy, number, calling, SNV, classification, targeted, short, read, sequencing
2096 days ago
CLARK: Fast, accurate and versatile sequence classification system
CLARK, a method based on a supervised sequence classification using discriminative k-mers. Considering two distinct specific classification problems (see the article for details), namely (1) the taxonomic classification of metagenomic reads to known bacterial genomes, and (2) the assignment ...Tags: CLARK, Fast, accurate, versatile, sequence, classification, system, bacteria
1541 days ago
CAT/BAT: tool for taxonomic classification of contigs and metagenome-assembled genomes (MAGs)
Contig Annotation Tool (CAT) and Bin Annotation Tool (BAT) are pipelines for the taxonomic classification of long DNA sequences and metagenome assembled genomes (MAGs/bins) of both known and (highly) unknown microorganisms, as generated by contemporary metagenomics studies. The core algorithm of ...Tags: CAT/BAT, tool, taxonomic, classification, contigs, metagenome, assembled, genomes, MAGs
1448 days ago
k-mers tutorial - classification and taxonomy
DNA k-mers underlie much of our assembly work, and we (along with many others!) have spent a lot of time thinking about how to store k-mer graphs efficiently, discard redundant data, and count them efficiently. More recently, we've been enthused about using k-mer based simila...Tags: kmer, k-mer, taxonomy, classification, tree, plot, database, similarity, comparision
983 days ago
Understanding pango networks !
In the vast majority of instances it is expected that Pango lineage names and designations will conform to the following rules. These rules also act as guidelines for the decisions made by the Lineage Designation Committee. https://www.pango.network/the-pango-nomenclature-system/statement-of-nom...Tags: pango, lineage, classification, notation, learn, explain, statements
932 days ago
Tiara: deep learning-based classification system for eukaryotic sequences
With a large number of metagenomic datasets becoming available, eukaryotic metagenomics emerged as a new challenge. The proper classification of eukaryotic nuclear and organellar genomes is an essential step toward a better understanding of eukaryotic diversity.Tags: Tiara, deep, learning-based, classification, system, eukaryotic, sequences
783 days ago
Understanding DUMP files from NCBI Taxonomy database !
*.dmp files are bcp-like dump from GenBank taxonomy database General information. Field terminator is "\t|\t" Row terminator is "\t|\n" nodes.dmp file consists of taxonomy nodes. The description for each node includes the following fields: tax_id -- node id in GenBank taxonomy datab...Tags: taxonomy, database, classification, tree
660 days ago
Tags: RNA, classification, Tools
544 days ago
Tags: SSR, type, classification
423 days ago
Metabuli 분리 improves metagenomic read classification
Metabuli 분리 improves metagenomic read classification through metamers, DNA-AA k-mers, to be sensitive and specific, recovering 99% and 98% of DNA or AA classifiers. Metabuli is metagenomic classifier that jointly analyze both DNA and amino acid (AA) sequences. DNA-based classifiers can m...Tags: Metabuli, 분리, metagenomic, read, classification, metamers, DNA-AA, k-mers, sensitive, and specific, DNA, AA, classifiers
337 days ago