Meraculous: De Novo Genome Assembly with Short Paired-End Reads
We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, wi...Tags: Meraculous, De Novo, Genome, Assembly, Short, Paired-End, Reads
2356 days ago
GAPPadder: A Sensitive Approach for Closing Gaps on Draft Genomes with Short Sequence Reads
This software is provided ``as is” without warranty of any kind. In no event shall the author be held responsible for any damage resulting from the use of this software. The program package, including source codes, executables, and this documentation, is distributed free of charge. If you u...Tags: GAPPadder, Sensitive, Approach, Closing, Gaps, Draft, Genomes, Short, Sequence, Reads, Genome, Assembly
2168 days ago
SOAP2 : Short Oligonucleotide Analysis Package 2
SOAPaligner/soap2 is a member of the SOAP (Short Oligonucleotide Analysis Package). It is an updated version of SOAP software for short oligonucleotide alignment. The new program features in super fast and accurate alignment for huge amounts of short reads generated by Illumina/Solexa Genome Anal...Tags: SOAPaligner, soap2, SOAP, Short, Oligonucleotide, Analysis, Package
2158 days ago
LSC: Improving PacBio Long Read Accuracy by Short Read Alignment
Added Command line argument support. Multi-stage execution modes. Support for parallelization. Now execution proceeds in batches of long reads the size of which can be set by --long_read_batch_size N. Better compressed intermediate files. Added utilities folder. Added support for multiple ...Tags: LSC, Improve, correct, PacBio, Long, Read, Accuracy, Short, Read, Alignment
2052 days ago
SISRS: Site Identification from Short Read Sequences
Next-gen sequence data such as Illumina HiSeq reads. Data must be sorted into folders by taxon (e.g. species or genus). Paired reads in fastq format must be specified by _R1 and _R2 in the (otherwise identical) filenames. Paired and unpaired reads must have a fastq file extension.Tags: SISRS, Site, Identification, Short, Read, Sequences
1969 days ago
HipSTR: Haplotype inference and phasing for Short Tandem Repeats
HipSTR was specifically developed to deal with these errors in the hopes of obtaining more robust STR genotypes. In particular, it accomplishes this by: Learning locus-specific PCR stutter models using an EM algorithm Mining candidate STR alleles from population-scale sequencing dat...Tags: HipSTR, Haplotype, inference, phasing, Short, Tandem, Repeats
1870 days ago
Tags: TAREAN, Computational, tool, identification, characterization, satellite, DNA, unassembled, short, reads
2167 days ago
GenomeMapper: Simultaneous alignment of short reads against multiple genomes
GenomeMapper is a short read mapping tool designed for accurate read alignments. It quickly aligns millions of reads either with ungapped or gapped alignments. It can be used to align against multiple genomes simulanteously or against a single reference. If you are unsure which one is the appropr...Tags: GenomeMapper, Simultaneous, alignment, short, reads, multiple, genomes
2156 days ago
Cerulean: A hybrid assembly using high throughput short and long reads
Cerulean extends contigs assembled using short read datasets like Illumina paired-end reads using long reads like PacBio RS long reads. Cerulean v0.1 has been implemented with bacterial genomes in mind. The method is fully described in Deshpande, V., Fung, E. D., Pham, S., & Bafna, V. (...Tags: Cerulean, hybrid, assembly, short, long, reads, pacbio, nanopore
2145 days ago