github.com - Despite marked recent improvements in long-read sequencing technology, the assembly of diploid genomes remains a difficult task. A major obstacle is distinguishing between alternative contigs that represent highly heterozygous regions. If primary...
github.com - HapSolo, that identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. HapSolo evaluates candidate primary assemblies using BUSCO scores and then distinguishes among candidate assemblies using...
github.com - fqtools is a software suite for fast processing of FASTQ files. Various file manipulations are supported. See below for a full list of the subcommands available and a brief description of their purpose. Most of the individual...
github.com - v0.0 by Glenn Hickey (hickey@soe.ucsc.edu)
Progressive Cactus is a whole-genome alignment package.
Requirements
git
gcc 4.2 or newer
python 2.7
wget
64bit processor and build environment
150GB+ of memory on at least one machine when...
www.iam.u-tokyo.ac.jp - Cgaln (Coarse grained alignment) is a program designed to align a pair of whole genomic sequences of not only bacteria but also entire chromosomes of vertebrates on a nominal desktop computer. Cgaln performs an alignment job in two steps, at the...
http://www.htslib.org/ - Samtools is a suite of programs for interacting with high-throughput sequencing data. It consists of three separate repositories:
SamtoolsReading/writing/editing/indexing/viewing SAM/BAM/CRAM formatBCFtoolsReading/writing BCF2/VCF/gVCF files and...
journal.frontiersin.org - We have developed gbtools, a software package that allows users to visualize metagenomic assemblies by plotting coverage (sequencing depth) and GC values of contigs, and also to annotate the plots with taxonomic information. Different sets of...
wiki.bits.vib.be - NGS data are just a bunch of sequences, you have no idea which region in the genome each sequences comes from, which gene it represents...To know that you have to align the sequences to the reference sequence. The reference sequence is in most cases...