LAMSA: fast split read alignment with long approximate matches
LAMSA (Long Approximate Matches-based Split Aligner) is a novel split alignment approach with faster speed and good ability of handling SV events. It is well-suited to align long reads (over thousands of base-pairs). LAMSA takes takes the advantage of the rareness of SVs to implement a specifi...Tags: LAMSA, fast, split, read, alignment, long, approximate, matches
2183 days ago
TULIP - The Uncorrected Long read Integration Pipeline
TULIP currently consists of two Perl scripts, tulipseed.perl and tulipbulb.perl. These are very much intended as prototypes, and additional components and/or implementations are likely to follow. Tulipseed takes as input alignments files of long reads to sparse short seeds, and outputs a graph...Tags: TULIP, Uncorrected, Long, read, Integration, Pipeline
2183 days ago
HALC: High throughput algorithm for long read error correction
HALC, a high throughput algorithm for long read error correction. HALC aligns the long reads to short read contigs from the same species with a relatively low identity requirement so that a long read region can be aligned to at least one contig region, including its true genome region’s repeats i...Tags: HALC, algorithm, long, read, error, correction, NGS, PacBio, Nanopore
2158 days ago
SALSA: A tool to scaffold long read assemblies with Hi-C
This code is used to scaffold your assemblies using Hi-C data. This version implements some improvements in the original SALSA algorithm. If you want to use the old version, it can be found in the old_salsa branch. To use the latest version, first run the following commands: cd SALSA m...Tags: SALSA, tool, scaffold, long, read, assemblies, Hi-C, Genome
2152 days ago
nanofilt: Filtering and trimming of long read sequencing data
Filtering on quality and/or read length, and optional trimming after passing filters.Reads from stdin, writes to stdout. Intended to be used: directly after fastq extraction prior to mapping in a stream between extraction and mapping https://github.com/wdecoster/nanofiltTags: nanofilt, filter, trim, long, read, sequencing, NGS, nanopore
2106 days ago
LSC :a long read error correction tool
Getting Started These simple steps will help you integrate LSC into your transcriptomics analysis pipeline. Read the LSC_requirements for running LSC. Download and set-up the LSC package. Follow the tutorial to see how LSC works on some example data. Read the ...Tags: LSC, long, read, error, correction, tool, fast, correction, sensitivity, good, accuracy
2104 days ago
PureCN: copy number calling and SNV classification using targeted short read sequencing
This package estimates tumor purity, copy number, and loss of heterozygosity (LOH), and classifies single nucleotide variants (SNVs) by somatic status and clonality. PureCN is designed for targeted short read sequencing data, integrates well with standard somatic variant detection and copy number...Tags: PureCN, copy, number, calling, SNV, classification, targeted, short, read, sequencing
2097 days ago
SimLoRD: A read simulator for third generation sequencing reads
SimLoRD is a read simulator for third generation sequencing reads and is currently focused on the Pacific Biosciences SMRT error model. Reads are simulated from both strands of a provided or randomly generated reference sequence. The reference can be read from a FASTA file or randomly gener...Tags: SimLoRD, read, simulator, third, generation, sequencing, reads, Pacific, Biosciences, SMRT, error, model
2084 days ago
rHAT: a seed-and-extension-based noisy long read alignment tool
rHAT is a seed-and-extension-based noisy long read alignment tool. It is suitable for aligning 3rd generation sequencing reads which are in large read length with relatively high error rate, especially Pacbio's Single Molecule Read-time (SMRT) sequencing reads.Tags: rHAT, seed-and-extension-based, noisy, long, read, alignment, tool, ont, pacbio
2052 days ago