github.com - Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The...
github.com - mosdepth can output:
per-base depth about 2x as fast samtools depth--about 25 minutes of CPU time for a 30X genome.mean per-window depth given a window size--as would be used for CNV calling.the mean per-region given a BED file of regions.a...
bioinfo.ut.ee - FastGT is a program package for whole-genome genotyping of genome variants directly from raw sequencing reads. It is written in C and runs in Linux. FastGT uses a list of variant-specific k-mer pairs that are unique in human genome, counts the...
github.com - This tool extracts heterozygous kmer pairs from kmer count databases and performs gymnastics with them. We are able to disentangle genome structure by comparing the sum of kmer pair coverages (CovA + CovB) to their relative coverage (CovB / (CovA +...
github.com - ClipCrop for detecting SVs with single-base resolution using soft-clipping information. A soft-clipped sequence is an unmatched fragment in a partially mapped read. To assess the performance of ClipCrop with other SV-detecting tools, we generated...
biokinet.belozersky.msu.ru - The vsFilt is the first open application for post-docking structural filtration, available as a web-server. The new tool is easy to use and configure to detect a wide range of interaction types that are known to be involved in molecular recognition,...
github.com - Severus is a somatic structural variation (SV) caller for long reads (both PacBio and ONT). It is designed for matching tumor/normal analysis, supports multiple tumor samples, and produces accurate and complete somatic and germline calls. Severus...
github.com - Structural variant comparison tool for VCFs
Given benchmark and comparsion sets of SVs, calculate the recall, precision, and f-measure.
Spiral Genetics
Motivation
www.nature.com - The human genome is arguably the most complete mammalian reference assembly yet more than 160 euchromatic gaps remain and aspects of its structural variation remain poorly understood ten years after its completion. The results in this paper...
Recently, the progress of the Human Genome Project, aiming to decode all human DNA sequences, has highlighted a research field called bioinformatics. In this new field, computers and techniques from information science are not just used as tools to...