SDA: Long-read sequence and assembly of segmental duplications
Segmental Duplication Assembler (SDA; https://github.com/mvollger/SDA) constructs graphs in which paralogous sequence variants define the nodes and long-read sequences provide attraction and repulsion edges, enabling the partition and assembly of long reads corresponding to distinct paralogs. ht...Tags: SDA, Long-read, sequence, assembly, segmental, duplications
1887 days ago
iRNAD: a computational tool for identifying D modification sites in RNA sequence
iRNAD is a predictor for identifying dihydrouridine (D) modification sites in RNA sequences. In this predictor, the RNA samples derived from five species were encoded by nucleotide chemical property and nucleotide density, and a support vector machine was used to perform the classification. http://lin-group.cn/server/iRNAD...Tags: iRNAD, computational, tool, identifying, modification, sites, RNA, sequence
1815 days ago
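The iRNAD entry above names two sequence encodings, nucleotide chemical property (NCP) and nucleotide density (ND). A minimal sketch of how such an encoding is commonly built follows. The triple assigned to each base (ring structure, functional group, hydrogen bonding) is a widely used convention and is assumed here rather than taken from the listing, and the function name `encode_ncp_nd` is hypothetical.

```python
# Assumed NCP convention: (ring structure, functional group, hydrogen bonding).
# purine=1/pyrimidine=0, amino=1/keto=0, weak H-bond=1/strong H-bond=0.
NCP = {
    "A": (1, 1, 1),
    "C": (0, 1, 0),
    "G": (1, 0, 0),
    "U": (0, 0, 1),
}


def encode_ncp_nd(seq):
    """Return one (x, y, z, density) tuple per nucleotide of an RNA sequence.

    density at position i (1-based) is the number of times the nucleotide at
    position i occurs in seq[0:i], divided by i.
    """
    seq = seq.upper().replace("T", "U")  # accept DNA-style input
    counts = {}
    features = []
    for i, nt in enumerate(seq, start=1):
        if nt not in NCP:
            raise ValueError(f"unsupported nucleotide {nt!r} at position {i}")
        counts[nt] = counts.get(nt, 0) + 1
        density = counts[nt] / i
        features.append(NCP[nt] + (density,))
    return features


if __name__ == "__main__":
    for row in encode_ncp_nd("ACAU"):
        print(row)
```

Each position yields four numbers, so a window of length L around a candidate site becomes a 4L-dimensional vector that a classifier such as a support vector machine can consume.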
MSAProbs - Parallel and accurate multiple sequence alignment
MSAProbs is a well-established state-of-the-art multiple sequence alignment algorithm for protein sequences. The design of MSAProbs is based on a combination of pair hidden Markov models and partition functions to calculate posterior probabilities. Assessed using the popular benchmarks: BAli...Tags: MSAProbs, Parallel, accurate, multiple, sequence, alignment
1760 days ago
TRITEX sequence assembly pipeline for Triticeae genomes
The pipeline is open-source and hosted in a public Bitbucket repository. TRITEX has been run on highly inbred genotypes of barley (Hordeum vulgare), tetraploid wheat (Triticum turgidum) and hexaploid wheat (T. aestivum) with reasonable results: super-scaffold N50 values in the range o...Tags: TRITEX, sequence, assembly, pipeline, Triticeae, genomes
1719 days ago
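The TRITEX entry above reports results as super-scaffold N50 values. For readers unfamiliar with the metric, here is a minimal sketch of the usual N50 definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name `n50` and the example lengths are illustrative only and are not TRITEX results.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


if __name__ == "__main__":
    toy_lengths = [2, 3, 4, 5, 6]  # made-up scaffold lengths
    print(n50(toy_lengths))
```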
Miropeats: discovers regions of sequence similarity amongst any set of DNA sequences
Miropeats discovers regions of sequence similarity amongst any set of DNA sequences and then presents this similarity information graphically. Sequence similarity searching is a very general tool that forms the basis of many different biological sequence analyses but it is limited by the verbosit...Tags: Miropeats, discovers, regions, sequence, similarity, DNA, sequences
1713 days ago
Apollo: a sequence annotation editor
The well-established inaccuracy of purely computational methods for annotating genome sequences necessitates an interactive tool to allow biological experts to refine these approximations by viewing and independently evaluating the data supporting each annotation. Apollo was developed to meet thi...Tags: Apollo, sequence, annotation, editor, synteny
1712 days ago
Kalign: fast multiple sequence alignment program for biological sequences
Kalign is a fast multiple sequence alignment program for biological sequences.
Align sequences and output the alignment in MSF format:
    kalign -i BB11001.tfa -f msf -o out.msf
Align sequences and output the alignment in Clustal format:
    kalign -i BB11001.tfa -f clu -o out.clu
Re-align seq...Tags: Kalign, fast, multiple, sequence, alignment, biological, sequences
1646 days ago
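The two Kalign invocations quoted above differ only in the output format and file name, so they are easy to script. The sketch below builds the same command lines from Python using only the `-i`, `-f` and `-o` flags that appear in the listing. The helper name `kalign_command` is hypothetical, and only the two format values shown above (`msf`, `clu`) are accepted here; Kalign may well support others.

```python
import shlex

# Format values that appear in the listing; this set is deliberately narrow.
KNOWN_FORMATS = {"msf", "clu"}


def kalign_command(input_path, fmt, output_path):
    """Build the argument list for a kalign run as shown in the listing."""
    if fmt not in KNOWN_FORMATS:
        raise ValueError(f"format {fmt!r} is not one of {sorted(KNOWN_FORMATS)}")
    return ["kalign", "-i", input_path, "-f", fmt, "-o", output_path]


if __name__ == "__main__":
    cmd = kalign_command("BB11001.tfa", "msf", "out.msf")
    print(shlex.join(cmd))
    # To actually run it (requires kalign on PATH and the input file):
    #   import subprocess
    #   subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when file names contain spaces.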
Shouji: a fast and efficient pre-alignment filter for sequence alignment
The ability to generate massive amounts of sequencing data continues to overwhelm the processing capacity of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of sho...Tags: Shouji, fast, efficient, pre-alignment, filter, sequence, alignment
1643 days ago
RePS: Repeat-masked Phrap with scaffolding, a WGS sequence assembler
RePS (Repeat-masked Phrap with scaffolding) is a WGS sequence assembler that explicitly identifies exact k-mer repeats from the shotgun data and removes them prior to assembly. The established software Phrap is used to compute meaningful error probabilities for each base. Clone-end-pairing info...Tags: RePS, Repeat, masked, Phrap, scaffolding, WGS, sequence, assembler
1582 days ago
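The RePS entry above describes identifying exact k-mer repeats directly from the shotgun reads and taking them out before assembly. The following is a toy sketch of that general idea, not RePS itself: it counts every k-mer across the reads, treats those seen at least `min_count` times as repeats, and masks them with `N`. The function names, the choice of `k`, the threshold and the masking character are all assumptions made for illustration.

```python
from collections import Counter


def find_repeat_kmers(reads, k, min_count):
    """Return the set of k-mers that occur at least min_count times in reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return {kmer for kmer, count in counts.items() if count >= min_count}


def mask_repeats(read, repeat_kmers, k, mask_char="N"):
    """Replace every base covered by a repeat k-mer with mask_char."""
    masked = list(read)
    for i in range(len(read) - k + 1):
        # Look up the k-mer in the original read, not the partially masked one.
        if read[i:i + k] in repeat_kmers:
            masked[i:i + k] = mask_char * k
    return "".join(masked)


if __name__ == "__main__":
    reads = ["ACGTAC", "TTACGT", "GGACGA"]  # made-up reads
    repeats = find_repeat_kmers(reads, k=3, min_count=3)
    for read in reads:
        print(read, "->", mask_repeats(read, repeats, k=3))
```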
GfaViz: flexible and interactive visualization of GFA sequence graphs
GFA (Graphical Fragment Assembly) is an emerging standard format for representing sequence graphs. Although it was originally conceived as a format for sequence assembly (hence the name), and this remains its core application, it is more general, and able to represent many different types of sequ...Tags: GfaViz, flexible, interactive, visualization, GFA, sequence, graphs
1563 days ago
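To make the GFA format mentioned above more concrete, here is a minimal sketch that reads the two most common GFA 1 record types: `S` (segment: name and sequence) and `L` (link: two oriented segments plus an overlap). It is unrelated to GfaViz's own code, the function name `parse_gfa` is hypothetical, and optional tag fields and other record types are simply ignored.

```python
def parse_gfa(text):
    """Parse S and L records from GFA 1 text.

    Returns (segments, links): segments maps name -> sequence, and links is a
    list of (from_name, from_orient, to_name, to_orient, overlap) tuples.
    """
    segments = {}
    links = []
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        record_type = fields[0]
        if record_type == "S":
            segments[fields[1]] = fields[2]
        elif record_type == "L":
            links.append(tuple(fields[1:6]))
        # Header (H), path (P) and other records are ignored in this sketch.
    return segments, links


if __name__ == "__main__":
    example = (
        "H\tVN:Z:1.0\n"
        "S\ts1\tACGT\n"
        "S\ts2\tGTTA\n"
        "L\ts1\t+\ts2\t+\t2M\n"
    )
    segments, links = parse_gfa(example)
    print(segments)
    print(links)
```

The resulting dictionary and edge list are enough to build an adjacency structure for drawing or traversing a small sequence graph.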