The Institute of Bioinformatics conducts internationally renowned research and provides profound education in bioinformatics. Its research focuses on development and application of machine learning and statistical methods in biology and...
https://dfast.nig.ac.jp/ - We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST was originally started as an on-line annotation server, and to date, over 7,000 jobs have been processed since its...
github.com - A comparative genome scaffolding tool based on MUMmer
mScaffolder scaffolds a genome using an existing high-quality genome as the reference. It aligns the two genomes using the nucmer utility from MUMmer and then orders and orients the contigs of the...
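The general idea of ordering and orienting contigs against a reference can be illustrated with a minimal sketch. This is not mScaffolder's actual implementation: the function name `order_and_orient`, the tuple layout of the alignment records, and the rule used here (place each contig by its longest alignment, orient it by the strand with the most aligned bases) are all assumptions made for illustration.

```python
from collections import defaultdict

def order_and_orient(alignments):
    """Order contigs along a reference and pick an orientation for each.

    alignments: iterable of (contig, ref_start, strand, aligned_len) tuples.
    This record layout is hypothetical; real nucmer output would need to be
    parsed into it first.
    """
    fwd = defaultdict(int)   # aligned bases on the forward strand, per contig
    rev = defaultdict(int)   # aligned bases on the reverse strand, per contig
    best = {}                # contig -> (aligned_len, ref_start) of its longest alignment

    for contig, ref_start, strand, length in alignments:
        if strand == "+":
            fwd[contig] += length
        else:
            rev[contig] += length
        if contig not in best or length > best[contig][0]:
            best[contig] = (length, ref_start)

    # Order by the reference position of each contig's longest alignment.
    ordered = sorted(best, key=lambda c: best[c][1])
    # Orient by whichever strand carries more aligned bases.
    return [(c, "+" if fwd[c] >= rev[c] else "-") for c in ordered]

layout = order_and_orient([
    ("ctgB", 5000, "-", 900),
    ("ctgA", 100, "+", 1200),
    ("ctgB", 6200, "+", 150),
])
print(layout)
```

Here `ctgA` is placed first in forward orientation, and `ctgB` follows in reverse orientation because most of its aligned bases are on the reverse strand.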
github.com - KAT is a suite of tools that analyse jellyfish hashes or sequence files (fasta or fastq) using kmer counts. The following tools are currently available in KAT:
hist: Create a histogram of k-mer occurrences from a sequence file. Adds metadata in...
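To make the idea of a k-mer occurrence histogram concrete, here is a minimal pure-Python sketch. It does not use KAT or jellyfish; the function name `kmer_histogram` is hypothetical, and as a simplification it does not merge a k-mer with its reverse complement and skips any k-mer containing `N`.

```python
from collections import Counter

def kmer_histogram(sequences, k):
    """Return {occurrence_count: number_of_distinct_kmers}.

    sequences: iterable of DNA strings (already read from a FASTA/FASTQ file).
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width k across the sequence.
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" not in kmer:
                counts[kmer] += 1
    # Histogram: how many distinct k-mers were seen once, twice, and so on.
    return dict(Counter(counts.values()))

hist = kmer_histogram(["ACGTACGT", "ACGTTT"], k=3)
print(hist)
```

In this toy input, `ACG` and `CGT` each occur three times and four other 3-mers occur once, so the histogram has two bins.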
FYI, I've found it useful to use MUMmer to extract the specific changes that Racon makes, so I can evaluate them individually:
minimap -t 24 assembly.fasta long_reads.fastq.gz | racon -t 24 long_reads.fastq.gz - assembly.fasta...
There are many R and Bioconductor packages for NGS data analysis; some of them are as follows:
Biostrings
The Biostrings package from Bioconductor provides an advanced environment for efficient sequence management and analysis in R. It...
github.com - The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells.
Computational methods used by the Shasta assembler include:
Using...
www.homolog.us - Useful bioinformatics tutorials, such as:
De Bruijn Graphs for NGS Assembly
Algorithms for PacBio Reads
Software and Hardware Concepts for Bioinformatics
Finding us in Homolog.us (Search Algorithms)
NGS Genome and RNAseq Assembly - a Hands on...