github.com - Second generation sequencing technologies paved the way to an exceptional increase in the number of sequenced genomes, both prokaryotic and eukaryotic. However, short reads are difficult to assemble and often lead to highly fragmented assemblies....
github.com - Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The...
www.cs.helsinki.fi - LoRMA is a tool for correcting sequencing errors in long reads such those produced by Pacific Biosciences sequencing machines.
Publication:
L. Salmela, R. Walve, E. Rivals, and E. Ukkonen: Accurate selfcorrection of errors in long reads using de...
https://cblab.org/camsa/ - CAMSA – is a tool for Comparative Analysis and Merging of Scaffold Assemblies, distributed both as a standalone software package and as Python library under the MIT license.
Main features:
works with any number of...
github.com - AccessSyRI: finding genomic rearrangements andlocal sequence differences from whole-genome assemblies
SyRI, a pairwise whole-genome comparison tool for chromosome-level assemblies. SyRI starts by finding rearranged regions and then searches for...
csbl.bmb.uga.edu - dbCAN is a web server and DataBase for automated Carbohydrate-active enzyme ANnotation, funded by the BioEnergy Science Center of the DOE. Similar resources on the web include CAZy database and CAT. All...
github.com - A computational pipeline for genome-wide detection of putative horizontal gene transfer (HGT) events based on sequence homology search hit distribution statistics
Authors: Qiyun Zhu (qiyunzhu@gmail.com), Katharina Dittmar...
RAST – Web tool (upload contigs), uses the subsystems in the SEED database and provides detailed annotation and pathway analysis. Takes several hours per genome but I think this is the best way to get a high quality annotation...
Genome browsers are useful not only for showing final results but also for improving analysis protocols, testing data quality, and generating result drafts. Its integration in analysis pipelines allows the optimization of parameters, which...