Hercules: a profile HMM-based hybrid error correction algorithm for long reads
Choosing whether to use second or third generation sequencing platforms can lead to trade-offs between accuracy and read length. Several studies require long and accurate reads including de novo assembly, fusion and structural variation detection. In such cases researchers often combine both tech...Tags: Hercules, profile, HMM-based, hybrid, error, correction, algorithm, long, reads, PacBio
2073 days ago
CoLoRMap: Correcting Long Reads by Mapping short reads
Second generation sequencing technologies paved the way to an exceptional increase in the number of sequenced genomes, both prokaryotic and eukaryotic. However, short reads are difficult to assemble and often lead to highly fragmented assemblies. The recent developments in long reads sequencing m...Tags: CoLoRMap, Correcting, Long, Reads, Mapping, short, reads, pacbio
2073 days ago
SimLoRD: A read simulator for third generation sequencing reads
SimLoRD is a read simulator for third generation sequencing reads and is currently focused on the Pacific Biosciences SMRT error model. Reads are simulated from both strands of a provided or randomly generated reference sequence. The reference can be read from a FASTA file or randomly gener...Tags: SimLoRD, read, simulator, third, generation, sequencing, reads, Pacific, Biosciences, SMRT, error, model
2072 days ago
LRCstats: a tool for evaluating long reads correction methods
LRCstats is an open-source pipeline for benchmarking DNA long read correction algorithms for long reads outputted by third generation sequencing technology such as machines produced by Pacific Biosciences. The reads produced by third generation sequencing technology, as the name suggests, are lon...Tags: LRCstats, tool, evaluation, long, reads, correction, methods
2072 days ago
LoRMA: A tool for correcting sequencing errors in long reads
An error correction method that uses long reads only. The method consists of two phases: first, we use an iterative alignment-free correction method based on de Bruijn graphs with increasing length of k-mers, and second, the corrected reads are further polished using long-distance dependenci...Tags: LoRMA, tool, correction, sequencing, errors, long, reads, PacBio, SMRT
2056 days ago
P_RNA_scaffolder: a fast and accurate genome scaffolder using paired-end RNA-sequencing reads
P_RNA_scaffolder is a novel scaffolding tool using Pair-end RNA-seq to scaffold genome fragments. The method is suitable for most genomes. The program could utilize Illumina Paired-end RNA-sequencing reads from target speciesies. Our method provides another practical alternative to existing mate-...Tags: P_RNA_scaffolder, fast, accurate, genome, scaffolder, paired-end, RNA, sequencing, reads, scaffold
2056 days ago
Rebaler: program for conducting reference-based assemblies using long reads.
Rebaler is a program for conducting reference-based assemblies using long reads. It relies mainly on minimap2 for alignment and Racon for making consensus sequences. I made Rebaler for bacterial genomes (specifically for the task of testing basecallers). It should in pri...Tags: Rebaler, program, conducting, reference-based, assemblies, long, reads, ont, pacbio
2045 days ago
Tags: PANDASEQ, program, align, Illumina, reads, PCR, primers, sequence, reconstruct, overlapping, sequence
2042 days ago
Tags: longread, ngs, assembly, genome, reads, tutorial
2028 days ago
BASE: a practical de novo assembler for large genomes using long NGS reads
new de novo assembler called BASE. It enhances the classic seed-extension approach by indexing the reads efficiently to generate adaptive seeds that have high probability to appear uniquely in the genome. Such seeds form the basis for BASE to build extension trees and then to use revers...Tags: BASE, practical, de novo, assembler, large, genomes, long, NGS, reads
2014 days ago