rHAT: a seed-and-extension-based noisy long read alignment tool
rHAT is a seed-and-extension-based noisy long read alignment tool. It is suitable for aligning 3rd generation sequencing reads which are in large read length with relatively high error rate, especially Pacbio's Single Molecule Read-time (SMRT) sequencing reads.Tags: rHAT, seed-and-extension-based, noisy, long, read, alignment, tool, ont, pacbio
2044 days ago
MSAProbs - Parallel and accurate multiple sequence alignment
MSAProbs is a well-established state-of-the-art multiple sequence alignment algorithm for protein sequences. The design of MSAProbs is based on a combination of pair hidden Markov models and partition functions to calculate posterior probabilities. Assessed using the popular benchmarks: BAli...Tags: MSAProbs, Parallel, accurate, multiple, sequence, alignment
1755 days ago
Cactus: a reference-free whole-genome multiple alignment program
Cactus is a reference-free whole-genome multiple alignment program. The principal algorithms are described here: https://doi.org/10.1101/gr.123356.111 Cactus uses substantial resources. For primate-sized genomes (3 gigabases each), you should expect Cactus to use approximately 120 CPU-days ...Tags: Cactus, reference-free, whole-genome, multiple, alignment, program
1721 days ago
Kalign: fast multiple sequence alignment program for biological sequences.
Kalign is a fast multiple sequence alignment program for biological sequences. Align sequences and output the alignment in MSF format: kalign -i BB11001.tfa -f msf -o out.msf Align sequences and output the alignment in clustal format: kalign -i BB11001.tfa -f clu -o out.clu Re-align seq...Tags: Kalign, fast, multiple, sequence, alignment, biological, sequences
1641 days ago
Shouji: a fast and efficient pre-alignment filter for sequence alignment
The ability to generate massive amounts of sequencing data continues to overwhelm the processing capacity of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of sho...Tags: Shouji, fast, efficient, pre-alignment, filter, sequence, alignment
1637 days ago
Tags: VG, variation, graph, structures, interchange, formats, alignment, genotyping, variant, calling, methods, snp
1552 days ago
The wavefront alignment (WFA) algorithm
The wavefront alignment (WFA) algorithm is an exact gap-affine algorithm that takes advantage ofhomologous regions between the sequences to accelerate the alignment process. As opposed to traditional dynamic programming algorithms that run in quadratic time, the WFA runs in time O(ns), proportion...Tags: wavefront, alignment, WFA, algorithm
1400 days ago
parallelLastz: Lastz with multi-threads support.
Running Lastz (https://github.com/lastz/lastz) in parallel mode. This program is for single computer with multiple core processors. When the query file format is fasta, you can specify many threads to process it. It can reduce run time linearly, and use almost equal memory as the original lastz ...Tags: parallelLastz, Lastz, multi-threads, support, alignment
1345 days ago
Caretta – A multiple protein structure alignment and feature extraction suite
Caretta – a multiple protein structure alignment and feature extraction suite Caretta, a multiple structure alignment suite meant for homologous but sequentially divergent protein families which consistently returns accurate alignments with a higher coverage than current state-of-the-...Tags: Caretta, multiple, protein, structure, alignment, feature, extraction, suite
1228 days ago