With the emergence of NGS technologies, and sequencing data most of the bioinformaticians mung and wrangle around massive amounts of genomics text. There are several "standardized" file formats (FASTQ, SAM, VCF, etc.) and some tools for manipulating...
http://orione.crs4.it/ - End-to-end NGS microbiology data analysis requires a diversity of tools covering bacterial resequencing, de novo assembly, scaffolding, bacterial RNA-Seq, gene annotation and metagenomics. However, the construction of computational pipelines that...
pevsnerlab.kennedykrieger.org - DRAGON: Database Referencing of Array Genes Online
SNOMAD: Standardization and Normalization of Microarray Data
SNPduo: SNP Analysis Between Two Individuals
SNPtrio: Analyzing and Visualizing and Inheritance Patterns in Trios
SNPscan: Data...
rosalind.info - Rosalind is a platform for learning bioinformatics and programming through problem solving. Take a tour to get the hang of how Rosalind works.
http://rosalind.info/problems/list-view/
For a beginner this can be is the hardest part, it is also the most important to get right.
It is possible to create a vector by typing data directly into R using the combine function ‘c’
x
same as
x
creates the vector x...
www.bx.psu.edu - LASTZ is a program for aligning DNA sequences, a pairwise aligner. Originally designed to handle sequences the size of human chromosomes and from different species, it is also useful for sequences produced by NGS sequencing technologies such as...
www.ncbi.nlm.nih.gov - The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny,...
mrfast.sourceforge.net - mrFAST is a read mapper that is designed to map short reads to reference genome with a special emphasis on the discovery of structural variation and segmental duplications. mrFAST maps short reads with respect to user defined error threshold,...
bioinformatics.oxfordjournals.org - This is the andi program for estimating the evolutionary distance between closely related genomes. These distances can be used to rapidly infer phylogenies for big sets of genomes. Because andi does not compute full alignments, it is so efficient...