The Roth Lab seeks insight into biological systems through genome- and proteome-scale experimentation and analysis.
Current computational interests:
Systematic analysis of genetic epistasis to identify redundant or compensatory systems and to...
RAST – Web tool (upload contigs), uses the subsystems in the SEED database and provides detailed annotation and pathway analysis. Takes several hours per genome but I think this is the best way to get a high quality annotation...
www.h-invitational.jp - G-compass (http://www.h-invitational.jp/g-compass/) is a comparative genome browser. It visualizes evolutionarily conserved genomic regions between human and other 12 vertebrates based on original genome alignments pursuing higher coverage (1,2)....
github.com - A comparative genome scaffolding tool based on MUMmer
mScaffolder scaffolds a genome using an existing high quality genome as the reference. It aligns the two genomes using nucmer utility from MUMmer and then orders and orients the contigs of the...
github.com - KAT is a suite of tools that analyse jellyfish hashes or sequence files (fasta or fastq) using kmer counts. The following tools are currently available in KAT:
hist: Create an histogram of k-mer occurrences from a sequence file. Adds metadata in...
www.homolog.us - Useful bioinformatics tutorial, such as
De Bruijn Graphs for NGS AssemblyAlgorithms for PacBio ReadsSoftware and Hardware Concepts for BioinformaticsFinding us in Homolog.us (Search Algorithms)NGS Genome and RNAseq Assembly - a Hands on...
FYI, I've found it useful to use MUMmer to extract the specific changes that Racon makes, so I can evaluate them individually:
minimap -t 24 assembly.fasta long_reads.fastq.gz | racon -t 24 long_reads.fastq.gz - assembly.fasta...
github.com - pyGenomeTracks aims to produce high-quality genome browser tracks that are highly customizable. Currently, it is possible to plot:
bigwig
bed (many options)
bedgraph
links (represented as arcs)
Hi-C matrices (if HiCExplorer is...
peerj.com - The sequencing, assembly, and basic analysis of microbial genomes, once a painstaking and expensive undertaking, has become almost trivial for research labs with access to standard molecular biology and computational tools. However, there are a wide...
github.com - The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells.
Computational methods used by the Shasta assembler include:
Using...