The recent revolution in sequencing technologies has led to an exponential growth of sequence data. To tackle this ‘data deluge’, here Lawrence Berkeley National Laboratory introduce the BioPig sequence analysis toolkit as one of the solutions that scale to data and computation.
BioPig: a Hadoop-based analytic toolkit for large-scale sequence data http://bioinformatics.oxfordjournals.org/content/29/23/3014.short?rss=1
This useful repository contains the files from the course in de novo assembly https://github.com/lexnederbragt/denovo-assembly-tutorial
A field guide to whole-genome sequencing, assembly and annotation http://onlinelibrary.wiley.com/doi/10.1111/eva.12178/full
Informatics for RNA-seq: A web resource for analysis on the cloud https://github.com/griffithlab/rnaseq_tutorial?