The Roth Lab seeks insight into biological systems through genome- and proteome-scale experimentation and analysis.
Current computational interests:
Systematic analysis of genetic epistasis to identify redundant or compensatory systems and to...
github.com - InfoGenomeR is the Integrative Framework for Genome Reconstruction that uses a breakpoint graph to model the connectivity among genomic segments at the genome-wide scale. InfoGenomeR integrates cancer purity and ploidy, total CNAs, allele-specific...
sanger-pathogens.github.io - A tool to circularize genome assemblies. The algorithm and benchmarks are described in the Genome Biology manuscript.
Citation: "Circlator: automated circularization of genome assemblies using long sequencing reads", Hunt et al., Genome Biology...
github.com - A comparative genome scaffolding tool based on MUMmer
mScaffolder scaffolds a genome using an existing high-quality genome as the reference. It aligns the two genomes using the nucmer utility from MUMmer and then orders and orients the contigs of the...
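The general idea of reference-guided ordering and orientation can be sketched as follows. This is a minimal illustration under stated assumptions, not mScaffolder's actual implementation: the `Alignment` record, its fields, and the "keep the longest alignment per contig" rule are all hypothetical simplifications of what a nucmer-style alignment table might provide.

```python
from typing import NamedTuple


class Alignment(NamedTuple):
    """Hypothetical, simplified alignment record (not a real MUMmer format)."""
    contig: str      # name of the contig being scaffolded
    ref_name: str    # reference sequence the contig aligned to
    ref_start: int   # alignment start on the reference
    ref_end: int     # alignment end on the reference
    reverse: bool    # True if the contig aligned in reverse orientation


def ref_span(aln):
    """Length of reference covered by one alignment."""
    return abs(aln.ref_end - aln.ref_start)


def order_and_orient(alignments):
    """Place each contig by its longest alignment, then sort along the reference."""
    best = {}
    for aln in alignments:
        current = best.get(aln.contig)
        # Assumption: the longest alignment decides where a contig belongs.
        if current is None or ref_span(aln) > ref_span(current):
            best[aln.contig] = aln
    placed = sorted(
        best.values(),
        key=lambda a: (a.ref_name, min(a.ref_start, a.ref_end)),
    )
    return [(a.ref_name, a.contig, "-" if a.reverse else "+") for a in placed]


alignments = [
    Alignment("ctg2", "chr1", 5000, 9000, True),
    Alignment("ctg1", "chr1", 100, 4000, False),
    Alignment("ctg2", "chr1", 200, 500, False),   # shorter hit, ignored
    Alignment("ctg3", "chr1", 9500, 12000, False),
]
scaffold = order_and_orient(alignments)
for ref_name, contig, strand in scaffold:
    print(ref_name, contig, strand)
```

A real scaffolder also has to handle contigs that align to several reference sequences, overlapping placements, and gap sizing, all of which this sketch leaves out.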
github.com - KAT is a suite of tools that analyse Jellyfish hashes or sequence files (FASTA or FASTQ) using k-mer counts. The following tools are currently available in KAT:
hist: Create a histogram of k-mer occurrences from a sequence file. Adds metadata in...
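To make the idea of a k-mer occurrence histogram concrete, here is a small pure-Python sketch. It is not KAT's implementation (KAT works on Jellyfish hashes and handles far larger inputs); the function names `kmer_counts` and `kmer_histogram` are made up for illustration, and the sketch counts k-mers exactly as written, without merging reverse complements.

```python
from collections import Counter


def kmer_counts(sequences, k):
    """Count every k-mer across the given sequences (no canonicalization)."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip k-mers containing ambiguous bases.
            if "N" not in kmer:
                counts[kmer] += 1
    return counts


def kmer_histogram(counts):
    """Map occurrence count -> number of distinct k-mers seen that many times."""
    return dict(sorted(Counter(counts.values()).items()))


reads = ["ACGTACGT", "CGTACG"]
hist = kmer_histogram(kmer_counts(reads, 3))
print(hist)  # {2: 2, 3: 2}
```

In this toy input, ACG and CGT each occur three times while GTA and TAC each occur twice, so the histogram has two distinct k-mers at count 2 and two at count 3.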
There are many R and Bioconductor packages for NGS data analysis. Some of them are as follows:
Biostrings
The Biostrings package from Bioconductor provides an advanced environment for efficient sequence management and analysis in R. It...
FYI, I've found it useful to use MUMmer to extract the specific changes that Racon makes, so I can evaluate them individually:
minimap -t 24 assembly.fasta long_reads.fastq.gz | racon -t 24 long_reads.fastq.gz - assembly.fasta...
www.homolog.us - Useful bioinformatics tutorials, such as:
De Bruijn Graphs for NGS Assembly
Algorithms for PacBio Reads
Software and Hardware Concepts for Bioinformatics
Finding us in Homolog.us (Search Algorithms)
NGS Genome and RNAseq Assembly - a Hands on...
http://orione.crs4.it/ - End-to-end NGS microbiology data analysis requires a diversity of tools covering bacterial resequencing, de novo assembly, scaffolding, bacterial RNA-Seq, gene annotation and metagenomics. However, the construction of computational pipelines that...