github.com - Nucleus is a library of Python and C++ code designed to make it easy to read, write and analyze data in common genomics file formats like SAM and VCF. In addition, Nucleus enables painless integration with the TensorFlow machine learning framework,...
github.com - snakePipes are flexible and powerful workflows built using snakemake that simplify the analysis of NGS data.
DNA-mapping*
ChIP-seq*
RNA-seq*
ATAC-seq*
scRNA-seq
Hi-C
Whole Genome Bisulfite Seq/WGBS
(*Also available in...
github.com - There is a directory for each chapter of the book. Each directory contains a test.py program you can use with pytest to check that you have written the program correctly. I have included a short README to describe each exercise....
wgs-assembler.sourceforge.net - These are release notes for Celera Assembler version 8.3rc2, which was released on May 24, 2015.This distribution package provides a stable, tested, documented version of the software. The distribution is usable on most Unix-like platforms,...
telatin.github.io - A general-purpose program to manipulate and parse information from FASTA/FASTQ files, supporting gzipped input files. Includes functions to interleave and de-interleave FASTQ files, to rename sequences and...
www.bioinformatics.nl - Caretta – a multiple protein structure alignment and feature extraction suite
Caretta, a multiple structure alignment suite meant for homologous but sequentially divergent protein families which consistently returns accurate alignments...
broadinstitute.github.io - Picard is a set of command line tools for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. These file formats are defined in the Hts-specs repository. See especially the SAM specification and the VCF...