CNIDARIA: fast, reference-free phylogenomic clustering
Motivation: Identification of biological specimens is a major requirement for a range of applications. Reference-free methods analyse unprocessed sequencing data without relying on prior knowledge, but these do not scale to arbitrarily large genomes and arbitrarily large phylogenetic distances. ...Tags: Bioinformatics, CNIDARIA, Fast, Reference-free, Phylogenomic, Clustering, NGS
2869 days ago
Reference-free prediction of rearrangement breakpoint reads
lideSort-BPR ( b reak p oint r eads) is based on a fast algorithm for all-against-all comparisons of short reads and theoretical analyses of the number of neighboring reads. When applied to a dataset with a sequencing depth of 100×, it finds ∼88% of the bre...Tags: Reference-free, prediction, rearrangement, breakpoint, reads
2239 days ago
Kevler: Reference-free variant discovery in large eukaryotic genomes
Welcome to kevlar, software for predicting de novo genetic variants without mapping reads to a reference genome! kevlar's k-mer abundance based method calls single nucleotide variants (SNVs), multinucleotide variants (MNVs), insertion/deletion variants (indels), and structural...Tags: Reference-free, variant, discovery, large, eukaryotic, genomes, snp, ngs
1548 days ago
Π-cyc: A Reference-free SNP Discovery Application using Parallel Graph Search
Reference free SNP search for comparative population genomics: multiple samples run simultanously. **experimental phase, compiles and runs with OpenMPI-1.8.8 with Intel Compiler only Cycles enumeration (aka Bubbles) as part of de novo de bruijn graphs assembly using colours can be unpractical fo...Tags: Π-cyc, Reference-free, SNP, Discovery, Application, Parallel, Graph, Search
1548 days ago
Cactus: a reference-free whole-genome multiple alignment program
Cactus is a reference-free whole-genome multiple alignment program. The principal algorithms are described here: https://doi.org/10.1101/gr.123356.111 Cactus uses substantial resources. For primate-sized genomes (3 gigabases each), you should expect Cactus to use approximately 120 CPU-days ...Tags: Cactus, reference-free, whole-genome, multiple, alignment, program
1717 days ago
Merqury: reference-free quality and phasing assessment for genome assemblies
Often, genome assembly projects have illumina whole genome sequencing reads available for the assembled individual. The k-mer spectrum of this read set can be used for independently evaluating assembly quality without the need of a high quality reference. Merqury provides a set of tools for this ...Tags: Merqury, reference-free, quality, phasing, assessment, genome, assemblies
1418 days ago