bedtools.readthedocs.io - Collectively, the bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic: that is, set theory on the genome. For...
code.google.com - splitbam splits a BAM by chromosomes.
Using the reference sequence dictionary (*.dict), it also creates some empty BAM files if no sam record was found for a chromosome. A pair of 'mock' SAM-Records can also be added to those empty BAMs to...
www.vicbioinformatics.com - Prokka is a software tool for the rapid annotation of prokaryotic genomes. A typical 4 Mbp genome can be fully annotated in less than 10 minutes on a quad-core computer, and scales well to 32 core SMP systems. It produces GFF3, GBK and SQN files...
github.com - COCACOLA is a general framework that combines different types of information: sequence COmposition, CoverAge across multiple samples, CO-alignment to reference genomes and paired-end reads LinkAge to automatically bin contigs into OTUs. Furthermore,...
github.com - This project contains scripts and tutorials on how to assemble individual microbial genomes from metagenomes, as described in:
Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes
Mads...
A new “Download assemblies” button is now available in the Assembly database. This makes it easy to download data for multiple genomes without having to write scripts.
github.com - Convenient conversions between genome assemblie. The liftover package makes it easy to remap genomic coordinates to a different genome assembly.
More at...
github.com - Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules:...
www.broadinstitute.org - DISCOVAR is a new variant caller and DISCOVAR de novo a new genome assembler, both designed for state-of-the-art data. Their inputs are chosen to optimize quality while keeping costs low. Currently it takes as input Illumina reads of length 250 or...
github.com - Tools for making blobplots or Taxon-Annotated-GC-Coverage plots (TAGC plots) to visualise the contents of genome assembly data sets as a QC step
Blaxter Lab, Institute of Evolutionary Biology, University of Edinburgh
Goal: To create blobplots or...