github.com - PyParanoid is a pipeline for rapid identification of homologous gene families in a set of genomes - a central task of any comparative genomics analysis. The "gold standard" for identifying homologs is to use reciprocal best hits (RBHs) which depends...
github.com - Orthoflow is a workflow for phylogenetic inference of genome-scale datasets of protein-coding genes. Our goal was to make it straightforward to work from a combination of input sources including annotated contigs in Genbank format and FASTA files...
jmorp.megabank.tohoku.ac.jp - Japanese Multi Omics Reference Panel, jMorp, is a database of metabolome and proteome data in plasma obtained from volunteers in ToMMo. Metabolome data were measured by proton NMR and LC-MS, and proteome data were obtained by nanoLC-MS. We could...
github.com - Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data. AfterQC can simply go through all fastq files in a folder and then output three folders: good, bad and QC folders, which contain good...
ecogenomics.github.io - CheckM provides a set of tools for assessing the quality of genomes recovered from isolates, single cells, or metagenomes. It provides robust estimates of genome completeness and contamination by using collocated sets of genes that are ubiquitous...
github.com - iSeqQC, an expression-based QC tool that detects outliers either produced due to variable laboratory conditions or due to dissimilarity within a phenotypic group. iSeqQC implements various statistical approaches including unsupervised clustering,...
samtools.sourceforge.net - In the current genome era, our day-to-day work is to handle huge genome sequences, expression data, and several other datasets. This link provides a comprehensive list of commonly used software/tools.
List of generic simulation software/tools/resources with brief descriptions and homepages
ALF A Simulation Framework for Genome Evolution http://www.cbrg.ethz.ch/alf
Bayesian Serial SimCoal Bayesian Serial SimCoal (BayeSSC) is a modification of...
pevsnerlab.kennedykrieger.org - DRAGON: Database Referencing of Array Genes Online
SNOMAD: Standardization and Normalization of Microarray Data
SNPduo: SNP Analysis Between Two Individuals
SNPtrio: Analyzing and Visualizing Inheritance Patterns in Trios
SNPscan: Data...
broadinstitute.github.io - Picard is a set of command line tools for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. These file formats are defined in the Hts-specs repository. See especially the SAM specification and the VCF...