Indexcov: fast coverage quality control for whole-genome sequencing
indexcov, an efficient estimator of whole-genome sequencing coverage to rapidly identify samples with aberrant coverage profiles, reveal large-scale chromosomal anomalies, recognize potential batch effects, and infer the sex of a sample. Indexcov is available at https://github.com/...Tags: Indexcov, fast, coverage, quality, control, whole-genome, sequencing
2073 days ago
Referee: Genome assembly quality scores
Modern genome sequencing technologies provide a succint measure of quality at each position in every read, however all of this information is lost in the assembly process. Referee summarizes the quality information from the reads that map to a site in an assembled genome to calculate a quality sc...Tags: Referee, Genome, assembly, quality, scores, ngs
2005 days ago
Merqury: reference-free quality and phasing assessment for genome assemblies
Often, genome assembly projects have illumina whole genome sequencing reads available for the assembled individual. The k-mer spectrum of this read set can be used for independently evaluating assembly quality without the need of a high quality reference. Merqury provides a set of tools for this ...Tags: Merqury, reference-free, quality, phasing, assessment, genome, assemblies
1426 days ago
iSeqQC: a tool for expression-based quality control in RNA sequencing
iSeqQC, an expression-based QC tool that detects outliers either produced due to variable laboratory conditions or due to dissimilarity within a phenotypic group. iSeqQC implements various statistical approaches including unsupervised clustering, agglomerative hierarchical clustering and correlat...Tags: iSeqQC, tool, expression-based, quality, control, RNA, sequencing
1537 days ago
LoFreq*: A sequence-quality aware, ultra-sensitive variant caller for NGS data
LoFreq* (i.e. LoFreq version 2) is a fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data. It makes full use of base-call qualities and other sources of errors inherent in sequencing (e.g. mapping or base/indel alignment uncertainty), which are usua...Tags: LoFreq, sequence, quality, aware, ultra-sensitive, variant, caller, NGS, data
1535 days ago
Filtlong: quality filtering tool for long reads
Filtlong is a tool for filtering long reads by quality. It can take a set of long reads and produce a smaller, better subset. It uses both read length (longer is better) and read identity (higher is better) when choosing which reads pass the filter. Filtlong builds into a stand-alone executable:...Tags: Filtlong, quality, filtering, tool, long, reads
1450 days ago
Ktrim: an extra-fast and accurate adapter- and quality-trimmer for sequencing data
Ktrim is written in C++ for GNU Linux/Unix platforms. After uncompressing the source package, you can find an executable file ktrim under bin/ directory compiled using g++ v4.8.5 and linked with libz v1.2.7 for Linux x86_64 system. If you cou...Tags: Ktrim, extra-fast, accurate, adapter, quality, trimmer, sequencing, data
1175 days ago
GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations
The GenomeQC web application is implemented in R/Shiny version 1.5.9 and Python 3.6 and is freely available at https://genomeqc.maizegdb.org/ under the GPL license. All source code and a containerized version of the GenomeQC pipeline is available in the GitHub repository https://gi...Tags: GenomeQC, quality, assessment, tool, genome, assemblies, gene, structure, annotations
714 days ago
OMArk: software for proteome (protein-coding gene repertoire) quality assessment
OMArk is a software for proteome (protein-coding gene repertoire) quality assessment. It provides measures of proteome completeness, characterizes the consistency of all protein coding genes with regard to their homologs, and identifies the presence of contamination from other species. OMArk reli...Tags: OMArk, software, proteome, protein-coding, gene, repertoire, quality, assessment
70 days ago