www.biomedcentral.com - A. Hatem, D. Bozdag, A. E. Toland, U. V. Catalyurek "Benchmarking short sequence mapping tools" BMC Bioinformatics, 14(1):184, 2013.
http://bmi.osu.edu/hpc/software/benchmark/
http://bmi.osu.edu/hpc/software/pmap/pmap.html
Other similiar...
www.news.ucdavis.edu - The enormous size of the loblolly pine genome having 22 billion base pairs compared to only 3 billion in the human genome. In other words, it is seven times larger than a human’s and also the largest and the most...
www.genengnews.com - "By removing the time-consuming step of read mapping, the authors reported, Sailfish able to provide quantification estimates 20–30 times faster than current methods without loss of accuracy."
Tool...
Liver cancer is third leading cause of deaths and fourth most frequent occuring cancer worldwide. There are multiple signaling pathways responsible for causing cancer amongst which TGFb is most important cytokine whose signaling pathway promote...
www.cs.helsinki.fi - LoRMA is a tool for correcting sequencing errors in long reads such those produced by Pacific Biosciences sequencing machines.
Publication:
L. Salmela, R. Walve, E. Rivals, and E. Ukkonen: Accurate selfcorrection of errors in long reads using de...
github.com - GRASS (GeneRic ASsembly Scaffolder)-a novel algorithm for scaffolding second-generation sequencing assemblies capable of using diverse information sources. GRASS offers a mixed-integer programming formulation of the contig scaffolding problem, which...
github.com - Jabba is a hybrid error correction tool to correct third generation (PacBio / ONT) sequencing data, using second generation (Illumina) data.
Input
Jabba takes as input a concatenated de Bruijn graph and a set of sequences:
the de Bruijn graph...
github.com - Apollo is an assembly polishing algorithm that attempts to correct the errors in an assembly. It can take multiple set of reads in a single run and polish the assemblies of genomes of any size. Described by Firtina et al. (preliminary version...
The key to finding a solution is to notice that most genomicsequences differ by very little. It may well be that the number of complete genome sequences being stored is increasing rapidly, but the actual amount of new data is very small. In...