github.com - Minda is a tool for evaluating structural variant (SV) callers that
standardizes VCF records for compatibility with both germline and somatic SV callers,
benchmarks against a single VCF input file, or
benchmarks against an ensemble call set...
github.com - LoVis4u is a bioinformatics tool for loci visualisation.
It is a command-line tool and Python API designed for highly customizable and fast visualisation of multiple genomic loci. LoVis4u generates vector images in...
github.com - VariantBam is a tool to extract/count specific sets of sequencing reads from next-generation sequencing files. To save money, disk space and I/O, one may not want to store an entire BAM on disk. In many cases, it would be more efficient to store...
github.com - Third generation sequencing (TGS) technologies are highly promising, but their long and noisy reads are difficult to align using existing algorithms. Here, we present COSINE, a conceptually new method designed specifically for aligning long...
kakitone.github.io - FinisherSC, a repeat-aware and scalable tool for upgrading de novo assembly using long reads. Experiments with real data suggest that FinisherSC can provide longer and higher quality contigs than existing tools while maintaining high...
github.com - DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies
Our work is published in Scientific Reports:
Ye, C. et al. DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous...
github.com - Long Read Correction and other Correction tools
This package is a loose collection of scripts. To run the correction routine, see the section below. Descriptions of the other scripts are at the bottom of this file.
Contact: gurtowsk@cshl.edu
In...
www.bioinformatics.babraham.ac.uk - Understanding the following table and graphs:
Duplication level
kmer profile
per base GC content
per base N content
per base quality
per base sequence content
per sequence GC content
per sequence quality
sequence length distribution
More at...
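Two of the per-base metrics listed above can be illustrated with a small sketch. This is not the code behind the linked page; the function name `per_base_gc_and_n` and the toy reads are hypothetical, and it assumes GC content is computed over called bases (A, C, G, T) only, while N content is computed over all reads that reach a given position.

```python
def per_base_gc_and_n(reads):
    """Return a list of (gc_percent, n_percent) tuples, one per read position.

    Assumption: GC% is measured over called bases (A/C/G/T) at a position,
    N% over every read that is long enough to cover that position.
    """
    longest = max(len(r) for r in reads)
    profile = []
    for pos in range(longest):
        # Bases observed at this position across all reads that reach it
        column = [r[pos].upper() for r in reads if len(r) > pos]
        called = [b for b in column if b in "ACGT"]
        gc = sum(b in "GC" for b in called)
        gc_pct = 100.0 * gc / len(called) if called else 0.0
        n_pct = 100.0 * column.count("N") / len(column)
        profile.append((gc_pct, n_pct))
    return profile


# Hypothetical toy reads of unequal length
reads = ["GCAT", "GGNA", "AC"]
profile = per_base_gc_and_n(reads)
for pos, (gc_pct, n_pct) in enumerate(profile, start=1):
    print(f"position {pos}: GC {gc_pct:.1f}%  N {n_pct:.1f}%")
```

A real quality-control tool would stream reads from a FASTQ file and typically bin positions for long reads; the sketch keeps everything in memory for clarity.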
github.com - KAT is a suite of tools that analyse Jellyfish hashes or sequence files (FASTA or FASTQ) using k-mer counts. The following tools are currently available in KAT:
hist: Create a histogram of k-mer occurrences from a sequence file. Adds metadata in...
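As a rough illustration of what a histogram of k-mer occurrences is, here is a minimal pure-Python sketch. It is not KAT's implementation and does not use Jellyfish; the helper names `kmer_counts` and `occurrence_histogram` are hypothetical, and the sketch assumes k-mers are counted as they appear (no canonical/reverse-complement merging) and that k-mers containing `N` are skipped.

```python
from collections import Counter


def kmer_counts(sequences, k):
    """Count every k-mer across the sequences (assumption: no canonicalisation)."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" not in kmer:  # assumption: skip k-mers with ambiguous bases
                counts[kmer] += 1
    return counts


def occurrence_histogram(counts):
    """Map each occurrence count to the number of distinct k-mers seen that often."""
    return dict(sorted(Counter(counts.values()).items()))


# Hypothetical toy sequences
reads = ["ACGTACGT", "CGTACG"]
hist = occurrence_histogram(kmer_counts(reads, 3))
print(hist)  # {2: 2, 3: 2}
```

Here two distinct 3-mers occur twice and two occur three times. On real sequencing data the same histogram, plotted, is what reveals the error peak at low counts and the coverage peak of genuine k-mers.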
github.com - OMArk is a software for proteome (protein-coding gene repertoire) quality assessment. It provides measures of proteome completeness, characterizes the consistency of all protein coding genes with regard to their homologs, and identifies the presence...