github.com - Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The...
clark.cs.ucr.edu - CLARK is a method based on supervised sequence classification using discriminative k-mers. Considering two distinct specific classification problems (see the article for details), namely (1) the taxonomic classification of metagenomic reads to...
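The idea of classifying with discriminative k-mers can be illustrated with a minimal Python sketch. It assumes that a k-mer is "discriminative" when it occurs in exactly one target sequence, and that a read is assigned to the target with which it shares the most such k-mers. The function names, the toy sequences, and the value of k are hypothetical and are not taken from CLARK itself.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all k-mers in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def discriminative_kmers(targets, k):
    """Map each target name to the k-mers found in that target only."""
    per_target = {name: kmers(seq, k) for name, seq in targets.items()}
    # Count in how many targets each k-mer appears.
    counts = Counter(km for s in per_target.values() for km in s)
    return {name: {km for km in s if counts[km] == 1}
            for name, s in per_target.items()}


def classify(read, disc, k):
    """Return the target sharing the most discriminative k-mers, or None."""
    read_kmers = kmers(read, k)
    hits = {name: len(read_kmers & s) for name, s in disc.items()}
    best = max(hits, key=hits.get)
    return best if hits[best] > 0 else None


# Toy targets (hypothetical): they share a prefix and differ in the suffix.
targets = {"A": "ACGTACGTTTGA", "B": "ACGTACGGGCCA"}
disc = discriminative_kmers(targets, 4)
```

A read drawn from the shared prefix carries no discriminative k-mer and stays unclassified, while a read drawn from a target-specific suffix is assigned to that target.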
github.com - MashMap is a fast and approximate software for mapping long reads (PacBio/ONT) or assembly to reference genome(s). It maps a query sequence against a reference region if and only if its estimated alignment identity is above a specified threshold. It...
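The threshold rule described for MashMap can be sketched as follows. This is only an illustration under stated assumptions: it computes an exact Jaccard similarity between k-mer sets (a real tool would more likely estimate it from sketches), and it converts Jaccard similarity to an identity estimate with the Mash distance formula, d = -ln(2J / (1 + J)) / k. The function names and the default values for k and the threshold are hypothetical.

```python
import math


def kmers(seq, k):
    """Return the set of all k-mers in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b, k):
    """Exact Jaccard similarity between the k-mer sets of a and b."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def estimated_identity(j, k):
    """Identity estimate 1 - d, with d the Mash distance for Jaccard j."""
    if j <= 0:
        return 0.0
    return max(0.0, 1.0 + math.log(2 * j / (1 + j)) / k)


def maps(query, ref_region, k=16, threshold=0.85):
    """True only if the estimated identity is above the threshold."""
    return estimated_identity(jaccard(query, ref_region, k), k) > threshold
```

For example, `maps(read, region, k=8, threshold=0.9)` reports a mapping only when the identity estimated from shared 8-mers exceeds 90%.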
github.com - LAMSA (Long Approximate Matches-based Split Aligner) is a novel split alignment approach with high speed and good handling of SV events. It is well suited to aligning long reads (thousands of base pairs or more).
LAMSA takes the...
www.healthcare.uiowa.edu - Getting Started
These simple steps will help you integrate LSC into your transcriptomics analysis pipeline.
Read the LSC_requirements for running LSC.
Download and set up the LSC package.
Follow the tutorial to see how...
www.fishbrowser.org - P_RNA_scaffolder is a novel scaffolding tool that uses paired-end RNA-seq to scaffold genome fragments. The method is suitable for most genomes. The program uses Illumina paired-end RNA-sequencing reads from the target species. Our method provides...
github.com - Kalign is a fast multiple sequence alignment program for biological sequences.
Align sequences and output the alignment in MSF format:
kalign -i BB11001.tfa -f msf -o out.msf
Align sequences and output the alignment in clustal format:
kalign...
github.com - RefKA is a reference-based approach for long read genome assembly. This approach relies on breaking up a closely related reference genome into bins, aligning k-mers unique to each bin with PacBio reads, and then assembling each bin in parallel...
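The binning step can be pictured with a small Python sketch. It makes several simplifying assumptions that are not confirmed by the RefKA description: bins are fixed-size and non-overlapping, "aligning" a k-mer means an exact match, and each read goes to the bin with the most unique k-mer hits. All names and values are hypothetical, and the per-bin assembly itself is left out.

```python
from collections import Counter


def split_into_bins(reference, bin_size):
    """Cut the reference into consecutive, non-overlapping bins."""
    return [reference[i:i + bin_size]
            for i in range(0, len(reference), bin_size)]


def unique_kmers_per_bin(bins, k):
    """For each bin, keep only the k-mers that occur in no other bin.

    k-mers spanning a bin boundary are ignored in this sketch.
    """
    per_bin = [{b[i:i + k] for i in range(len(b) - k + 1)} for b in bins]
    counts = Counter(km for s in per_bin for km in s)
    return [{km for km in s if counts[km] == 1} for s in per_bin]


def assign_reads(reads, bin_kmers, k):
    """Group reads by the bin whose unique k-mers they match most often."""
    assignment = {i: [] for i in range(len(bin_kmers))}
    for read in reads:
        read_kmers = {read[i:i + k] for i in range(len(read) - k + 1)}
        hits = [len(read_kmers & s) for s in bin_kmers]
        best = max(range(len(hits)), key=hits.__getitem__)
        if hits[best] > 0:  # reads with no unique k-mer stay unassigned
            assignment[best].append(read)
    return assignment


# Toy data (hypothetical): a 20 bp reference split into two 10 bp bins.
bins = split_into_bins("AAAAACCCCC" + "GGGGGTTTTT", 10)
bin_kmers = unique_kmers_per_bin(bins, 4)
groups = assign_reads(["AAACCC", "GGTTTT", "ACGTAC"], bin_kmers, 4)
```

Because each group of reads depends only on its own bin, the groups could then be handed to independent assembly jobs.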
github.com - chromeister: An ultra fast, heuristic approach to detect conserved signals in extremely large pairwise genome comparisons.
USAGE:
-query: sequence A in fasta format
-db: sequence B in fasta format
-out: output matrix
-kmer Integer: k>1...
Abstract:
Background/Motivation:
The dearth of structural information on alpha-helical membrane proteins (MPs) has so far hindered the development of reliable knowledge-based potentials that can be used for automatic prediction of...