To decide which strategy should be our “preferred” genome assembly approach based on data rather than on my gut feeling about the “best assembly”, I decided to do some testing with a known “true” reference, E. coli K-12 MG1655.
faculty.washington.edu - Beagle is a software package that performs genotype calling, genotype phasing, imputation of ungenotyped markers, and identity-by-descent segment detection.
Beagle version 4.1 has a more accurate genotype phasing algorithm and a very fast and...
github.com - Wtdbg2 is a de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Technologies (ONT). It assembles raw reads without error correction and then builds the consensus from intermediate assembly...
eforge.cs.ucl.ac.uk - The eFORGE tool provides a method to view the tissue-specific regulatory component of a set of EWAS DMPs. eFORGE analysis takes a set of DMPs, such as hits above the genome-wide significance threshold in an EWAS, and analyses whether there...
tritexassembly.bitbucket.io - The pipeline is open-source and hosted in a public Bitbucket repository.
TRITEX has been run on highly inbred genotypes of barley (Hordeum vulgare), tetraploid wheat (Triticum turgidum) and hexaploid wheat (T. aestivum) with reasonable...
www.niehs.nih.gov - ART is a set of simulation tools to generate synthetic next-generation sequencing reads. ART simulates sequencing reads by mimicking the real sequencing process with empirical error models or quality profiles summarized from large recalibrated...
github.com - HASLR is a tool for rapid genome assembly of long sequencing reads. HASLR is a hybrid tool, which means it requires long reads generated by Third Generation Sequencing technologies (such as PacBio or Oxford Nanopore) together with Next Generation...
R Graphics Cookbook by Winston Chang
A very nice book by Winston Chang for R enthusiasts. The R code presented in these pages is the R code actually used to produce the figures in the book. There will be differences compared to the code chunks shown...
ftp.ncbi.nih.gov - Nowadays there are many genomics databases available around the world. This bookmark was created to provide all the links in one place ...
ftp://ftp.ncbi.nih.gov/genomes/
https://hgdownload.soe.ucsc.edu/downloads.html
www.khanacademy.org - Topics
Displaying and describing data
Modeling distributions of data
Describing relationships in quantitative data
Designing studies
Probability
Random variables
Sampling distributions
Confidence intervals (one sample)
Significance tests...