github.com - Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. The algorithm uses an A-Bruijn graph to find the overlaps between reads and does not require them to be error-corrected. After...
sourceforge.net - Cerulean uses long reads, such as PacBio RS reads, to extend contigs assembled from short-read datasets such as Illumina paired-end reads.
Cerulean v0.1 has been implemented with bacterial genomes in mind.
The method is fully described in...
This research group works on problems from the fields of Bioinformatics, Biotechnology, Data Mining, and Information Retrieval. The group's research projects include Comparative Genomics of Bacterial genomes, Metagenomics, Genomic databases,...
github.com - Pilon is a software tool which can be used to:
Automatically improve draft assemblies
Find variation among strains, including large event detection
Pilon requires as input a FASTA file of the genome along with one or more BAM files of reads...
tldp.org - This tutorial assumes no previous knowledge of scripting or programming, yet progresses rapidly toward an intermediate/advanced level of instruction . . . all the while sneaking in little nuggets of UNIX® wisdom and lore. It serves as a...
broadinstitute.github.io - Picard is a set of command line tools for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. These file formats are defined in the Hts-specs repository. See especially the SAM specification and the VCF...
mira-assembler.sourceforge.net - MIRA is a multi-pass DNA sequence data assembler/mapper for whole genome and EST/RNASeq projects. MIRA assembles/maps reads gained by
electrophoresis sequencing (aka Sanger sequencing)
454 pyro-sequencing (GS20, FLX or Titanium)
Ion...
www.bioinformatics.babraham.ac.uk - Understanding the following table and graphs
Duplication level
kmer profile
per base GC content
per base N content
per base quality
per base sequence content
per sequence GC content
per sequence quality
sequence length distribution
More at...
http://docs.bpipe.org/ - Bpipe provides a platform for running big bioinformatics jobs that consist of a series of processing stages - known as 'pipelines'.
January 20th, 2016 - New! Bpipe 0.9.9 released!
github.com - In a nutshell
Anvi’o is an analysis and visualization platform for ‘omics data.
Please find the methods paper here: https://peerj.com/articles/1319/
Anvi’o would not have been possible without the help of many people who...