pbalign: maps PacBio reads to reference sequences and saves alignments to a BAM file
pbalign aligns PacBio reads to reference sequences, filters aligned reads according to user-specific filtering criteria, and converts the output to either the SAM format or PacBio Compare HDF5 (e.g., .cmp.h5) format. The output Compare HDF5 file will be compatible with Quiver if --forQuiver optio...Tags: pbalign, maps, PacBio, reads, reference, sequences, alignments, BAM, NGS
2217 days ago
minimap2: A versatile pairwise aligner for genomic and spliced nucleotide sequences
git clone https://github.com/lh3/minimap2 cd minimap2 && make # long sequences against a reference genome ./minimap2 -a test/MT-human.fa test/MT-orang.fa > test.sam # create an index first and then map ./minimap2 -d MT-human.mmi test/MT-human.fa ./minimap2 -a MT-human.mmi test/MT...Tags: minimap2, versatile, pairwise, aligner, genomic, spliced, nucleotide, sequences
2190 days ago
KAST: Perform Alignment-free k-tuple frequency comparisons from sequences
Perform Alignment-free k-tuple frequency comparisons from sequences. This can be in the form of two input files (e.g. a reference and a query) or a single file for pairwise comparisons to be made.Tags: KAST, Perform, Alignment-free, k-tuple, frequency, comparisons, sequences
2098 days ago
S-plot2: creates an interactive, two-dimensional heatmap of sequences
S-plot2 creates an interactive, two-dimensional heatmap capturing the similarities and dissimilarities in nucleotide usage between genomic sequences (partial or complete). In S-plot2, whole eukaryotic chromosomes and smaller prokaryotic genomes can be efficiently compared. The tool includes funct...Tags: S-plot2, interactive, two-dimensional, heatmap, capture, similarities, dissimilarities, nucleotide, genomic, sequences
2090 days ago
Tags: GRSR, tool, genome, rearrangement, scenarios, multiple, unichromosomal, genome, sequences
2090 days ago
RopeBWT2: Incremental construction of FM-index for DNA sequences
RopeBWT2 is an tool for constructing the FM-index for a collection of DNA sequences. It works by incrementally inserting one or multiple sequences into an existing pseudo-BWT position by position, starting from the end of the sequences. This algorithm can be largely considered a mixture of B...Tags: RopeBWT2, Incremental, construction, FM-index, DNA, sequences
2063 days ago
COSINE: non-seeding method for mapping long noisy sequences
Third generation sequencing (TGS) are highly promising technologies but the long and noisy reads from TGS are difficult to align using existing algorithms. Here, we present COSINE, a conceptually new method designed specifically for aligning long reads contaminated by a high level of errors.Tags: COSINE, non-seeding, method, mapping, long, noisy, sequences
2062 days ago
Tags: DECIPHER, software, toolset, decipher, biological, sequences, efficiently, R
2017 days ago
SiLiX: implements an ultra-efficient algorithm for the clustering of homologous sequences
The software package SiLiX implements an ultra-efficient algorithm for the clustering of homologous sequences, based on single transitive links (single linkage) with alignment coverage constraints. SiLiX adopts a graph-theoretical framework to interpret similarity pairs as edges of a networ...Tags: SiLiX, ultra-efficient, algorithm, clustering, homologous, sequences, Bacteria
2015 days ago