cosmos.hms.harvard.edu - COSMOS, our Python-based management system for implementing large-scale parallel workflows focusing on, but not restricted to, large-scale short-read "NGS" sequencing data is open-access published via Advance Access in Bioinformatics (Gafni et al....
milkweedgenome.org - Some of the useful bioinformatics scripts.
For example ... contig-stats.pl is a Perl script that will automatically describe features of a sequence assembly.
http://milkweedgenome.org/?q=scripts
readthedocs.org - Bipype is a very useful program, which prepare a lot of types of bioinformatics analyses. There are three input options: amplicons, WGS (whole genome sequences) and metatranscriptomic data. If amplicons are input data, then bipype does...
github.com - Nucleus is a library of Python and C++ code designed to make it easy to read, write and analyze data in common genomics file formats like SAM and VCF. In addition, Nucleus enables painless integration with the TensorFlow machine learning framework,...
github.com - Luigi is a Python (3.6, 3.7, 3.8, 3.9 tested) package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more.
Run pip...
http://scikit-bio.org/ - scikit-bio is currently in beta. We are very actively developing it, and backward-incompatible interface changes can and will arise. To avoid these types of changes being a surprise to our users, our public APIs are decorated to make it clear...
github.com - Tinycov is a small standalone command line utility written in python to plot the coverage of a BAM file quickly. This software was inspired by Matt Edwards' genome coverage plotter.
To install the stable version: pip3 install --user...
github.com - A probabilistic framework for structural variant discovery.
Ryan M Layer, Colby Chiang, Aaron R Quinlan, and Ira M Hall. 2014. "LUMPY: a Probabilistic Framework for Structural Variant Discovery." Genome Biology 15 (6):...