github.com - Snippy finds SNPs between a haploid reference genome and your NGS sequence reads. It will find both substitutions (snps) and insertions/deletions (indels). It will use as many CPUs as you can give it on a single computer (tested to 64 cores). It is...
github.com - With advances in Cancer Genomics, Mutation Annotation Format (MAF) is being widely accepted and used to store somatic variants detected. The Cancer Genome Atlas Project has sequenced over 30 different cancers with sample size of each cancer type...
github.com - Cactus is a reference-free whole-genome multiple alignment program. The principal algorithms are described here: https://doi.org/10.1101/gr.123356.111
Cactus uses substantial resources. For primate-sized genomes (3 gigabases each), you should...
bioconductor.org - doubletrouble aims to identify duplicated genes from whole-genome protein sequences and classify them based on their modes of duplication. The duplication modes are i. segmental duplication (SD); ii. tandem duplication (TD); iii. proximal...
github.com - Automatic Filtering, Trimming, Error Removing and Quality Control for fastq dataAfterQC can simply go through all fastq files in a folder and then output three folders: good, bad and QC folders, which contains good...
github.com - Other tools focus on getting data out of the fastq or fast5 files, which is slow and computationally intensive. The benefit of this approach is that it works on a single, small, .txt summary file. So it's a lot quicker than most other things out...
bamview.sourceforge.net - To run the application on UNIX from the downloaded jar file run the UNIX:
java -mx512m -jar BamView.jar
and extra command line options are given when '-h' is used:
java -jar BamView.jar -h
BAM files can be specified on the command line with the...
fantom.gsc.riken.jp - ZENBU is a data integration, data analysis, and visualization system enhanced for RNAseq, ChipSeq, CAGE and other types of next-generation-sequence-tag (NGS) based data. ZENBU allows for novel data exploration through "on-demand" data...
github.com - OMTools, an efficient and intuitive data processing and visualization suite to handle and explore large-scale optical mapping profiles. OMTools includes modules for visualization (OMView), data processing and simulation. These modules together form...
github.com - pbmm2 is a SMRT C++ wrapper for minimap2's C API. Its purpose is to support native PacBio in- and output, provide sets of recommended parameters, generate sorted output on-the-fly, and postprocess alignments. Sorted output can be used directly for...