github.com - MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. MECAT employs novel alignment and error correction algorithms that are much more efficient than the state of art of aligners and...
Computer simulation is the discipline of designing a model of an actual or theoretical physical/biological system, executing the model on a digital computer, and analyzing the execution output.
cutadapt.readthedocs.io - Cutadapt finds and removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads.
Cleaning your data in this way is often required: Reads from small-RNA sequencing contain the...
The genome assemblers generally take a file of short sequence reads and a file of quality-value as the input. Since the quality-value file for the high throughput short reads is usually highly memory-intensive, only a few assemblers, best suited for...
github.com - Mix is a tool that combines two or more draft assemblies, without relying on a reference genome and has the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices...
github.com - Filtering on quality and/or read length, and optional trimming after passing filters.Reads from stdin, writes to stdout.
Intended to be used:
directly after fastq extraction
prior to mapping
in a stream between extraction and...
github.com - gapFinisher is based on the controlled use of a previously published gap filling tool FGAP and works on all standard Linux/UNIX command lines. They compare the performance of gapFinisher against two other published gap filling tools PBJelly and...
github.com - Key features
Filters SNVs from any variant caller to remove false positives
Calculates metrics based on BAM files and provides filtering not possible with other tools
Fully user-configurable filtering (including which filters to use and their...
github.com - jackalope simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants...