biobits.org - SAMtools: Primer / Tutorial by Ethan Cerami, Ph.D.keywords: samtools, next-gen, next-generation, sequencing, bowtie, sam, bam, primer, tutorial, how-to, introductionRevisions 1.0: May 30, 2013: First public release on...
wiki.bits.vib.be - NGS data are just a bunch of sequences, you have no idea which region in the genome each sequences comes from, which gene it represents...To know that you have to align the sequences to the reference sequence. The reference sequence is in most cases...
github.com - ComplexHeatmap (Gu, Eils, and Schlesner (2016)) is an R Programming Language (R Core Team (2020)) package that is currently listed in the Bioconductor package repository.
install and load required packages...
carpentries-incubator.github.io - A lesson introducing the Snakemake workflow system for bioinformatics analysis.
Prerequisites
This is an intermediate lesson and assumes learners have already done some bioinformatics:
Familiarity with the BASH command shell, including...
biochem218.stanford.edu - Excellent article to introduce different sequencing methods along with tools for de novo assembly of sequencing reads and their relevant references.
Title: Comparison of Short Read De Novo Alignment Algorithms
Author: Nikhil Gopal
github.com - Miniasm is a very fast OLC-based de novo assembler for noisy long reads. It takes all-vs-all read self-mappings (typically by minimap) as input and outputs an assembly graph in the GFA format. Different from mainstream...
github.com - MashMap is a fast and approximate software for mapping long reads (PacBio/ONT) or assembly to reference genome(s). It maps a query sequence against a reference region if and only if its estimated alignment identity is above a specified threshold. It...
www.ncbi.nlm.nih.gov - RGFA, an implementation of the proposed GFA specification in Ruby. It allows the user to conveniently parse, edit and write GFA files. Complex operations such as the separation of the implicit instances of repeats and the merging of linear paths can...
sourceforge.net - Cerulean extends contigs assembled using short read datasets like Illumina paired-end reads using long reads like PacBio RS long reads.
Cerulean v0.1 has been implemented with bacterial genomes in mind.
The method is fully described in...