<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: AfterQC: Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data]]></title>
	<link>https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data?</link>
	<atom:link href="https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data</guid>
	<pubDate>Fri, 29 Jun 2018 03:26:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data</link>
	<title><![CDATA[AfterQC: Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data]]></title>
	<description><![CDATA[Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data
AfterQC can simply go through all fastq files in a folder and then output three folders: good, bad and QC folders, which contains good reads, bad reads and the QC results of each fastq file/pair.
Currently it supports processing data from HiSeq 2000/2500/3000/4000, Nextseq 500/550, MiniSeq...and other Illumina 1.8 or newer formats

The author has reimplemented this tool in C++ with multithreading support to make it much faster. The new tool is called fastp and can be found at: https://github.com/OpenGene/fastp . If you prefer a C++ based tool, please use fastp instead.

https://github.com/OpenGene/AfterQC<p>Address of the bookmark: <a href="https://github.com/OpenGene/AfterQC" rel="nofollow">https://github.com/OpenGene/AfterQC</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink='true'>https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data#item-annotation-3421</guid>
	<pubDate>Fri, 29 Jun 2018 03:30:04 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37205/afterqc-automatic-filtering-trimming-error-removing-and-quality-control-for-fastq-data#item-annotation-3421</link>
	<title><![CDATA[Comment by Jit]]></title>
	<description><![CDATA[<ul>
<li><strong>AfterQC</strong>&nbsp;<em><a href="https://github.com/OpenGene/AfterQC">AfterQC</a></em>&nbsp;- Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.</li>
<li><strong>dupRadar</strong>&nbsp;<em><a href="http://bioconductor.org/packages/devel/bioc/html/dupRadar.html">dupRadar</a></em>.&nbsp;An R package which provides functions for plotting and analyzing the duplication rates dependent on the expression levels.</li>
<li><strong>FastQC</strong>&nbsp;<em><a href="http://www.bioinformatics.babraham.ac.uk/projects/fastqc/">FastQC</a></em>&nbsp;is a quality control tool for high-throughput sequence data (<a href="https://en.wikipedia.org/wiki/Babraham_Institute" title="Babraham Institute">Babraham Institute</a>) and is developed in&nbsp;<a href="https://en.wikipedia.org/wiki/Java_(programming_language)" title="Java (programming language)">Java</a>. Import of data is possible from&nbsp;<a href="https://en.wikipedia.org/wiki/FASTQ_format" title="FASTQ format">FastQ</a>files, BAM or SAM format. This tool provides an overview to inform about problematic areas, summary graphs and tables to rapid assessment of data. Results are presented in&nbsp;<a href="https://en.wikipedia.org/wiki/HTML" title="HTML">HTML</a>&nbsp;permanent reports. FastQC can be run as a stand-alone application or it can be integrated into a larger pipeline solution.</li>
<li><strong>fastqp</strong>&nbsp;<em><a href="https://github.com/mdshw5/fastqp">fastqp</a></em>. Simple FASTQ quality assessment using Python.</li>
<li><strong>Kraken</strong>&nbsp;<em><a href="http://www.ebi.ac.uk/research/enright/software/kraken">kraken</a></em>:A set of tools for quality control and analysis of high-throughput sequence data.</li>
<li><strong>HTSeq</strong>&nbsp;<em><a href="http://www-huber.embl.de/users/anders/HTSeq/doc/qa.html#qa">HTSeq</a></em>.The Python script htseq-qa takes a file with sequencing reads (either raw or aligned reads) and produces a PDF file with useful plots to assess the technical quality of a run.</li>
<li><strong>mRIN</strong>&nbsp;<em><a href="http://zhanglab.c2b2.columbia.edu/index.php/MRIN">mRIN</a></em>&nbsp;- Assessing mRNA integrity directly from RNA-Seq data.</li>
<li><strong>MultiQC</strong>&nbsp;<a href="http://multiqc.info/"><em>MultiQC</em></a>- Aggregate and visualise results from numerous tools&nbsp;<em>(FastQC, HTSeq, RSeQC, Tophat, STAR, others..)</em>&nbsp;across all samples into a single report.</li>
<li><strong>NGSQC</strong>&nbsp;<em><a href="http://brainarray.mbni.med.umich.edu/brainarray/ngsqc/">NGSQC</a></em>: cross-platform quality analysis pipeline for deep sequencing data.</li>
<li><strong>NGS QC Toolkit</strong>&nbsp;<em><a href="http://www.nipgr.res.in/ngsqctoolkit.html">NGS QC Toolkit</a></em>&nbsp;A toolkit for the quality control (QC) of next generation sequencing (NGS) data. The toolkit comprises user-friendly stand alone tools for quality control of the sequence data generated using Illumina and Roche 454 platforms with detailed results in the form of tables and graphs, and filtering of high-quality sequence data. It also includes few other tools, which are helpful in NGS data quality control and analysis.</li>
<li><strong>PRINSEQ</strong>&nbsp;<em><a href="http://prinseq.sourceforge.net/manual.html">PRINSEQ</a></em>&nbsp;is a tool that generates summary statistics of sequence and quality data and that is used to filter, reformat and trim next-generation sequence data. It is particular designed for 454/Roche data, but can also be used for other types of sequence.</li>
<li><strong>QC-Chain</strong>&nbsp;<em><a href="http://www.computationalbioenergy.org/qc-chain.html">QC-Chain</a></em>&nbsp;is a package of quality control tools for next generation sequencing (NGS) data, consisting of both raw reads quality evaluation and de novo contamination screening, which could identify all possible contamination sequences.</li>
<li><strong>QC3</strong>&nbsp;<em><a href="https://github.com/slzhao/QC3">QC3</a></em>&nbsp;a quality control tool designed for DNA sequencing data for raw data, alignment, and variant calling.</li>
<li><strong>qrqc</strong>&nbsp;<em><a href="http://bioconductor.org/packages/2.8/bioc/html/qrqc.html">qrqc</a></em>. Quickly scans reads and gathers statistics on base and quality frequencies, read length, and frequent sequences. Produces graphical output of statistics for use in quality control pipelines, and an optional HTML quality report. S4 SequenceSummary objects allow specific tests and functionality to be written around the data collected.</li>
<li><strong>RNA-SeQC</strong>&nbsp;<em><a href="http://www.broadinstitute.org/cancer/cga/rna-seqc">RNA-SeQC</a></em>is a tool with application in experiment design, process optimization and quality control before computational analysis. Essentially, provides three types of quality control: read counts (such as duplicate reads, mapped reads and mapped unique reads, rRNA reads, transcript-annotated reads, strand specificity), coverage (like mean coverage, mean coefficient of variation, 5&rsquo;/3&rsquo; coverage, gaps in coverage, GC bias) and expression correlation (the tool provides RPKM-based estimation of expression levels). RNA-SeQC is implemented in Java and is not required installation, however can be run using the&nbsp;<em><a href="http://genepattern.org/">GenePattern</a></em>&nbsp;web interface. The input could be one or more BAM files. HTML reports are generated as output.</li>
<li><strong>RSeQC</strong>&nbsp;<em><a href="https://code.google.com/p/rseqc/">RSeQC</a>&nbsp;</em>analyzes diverse aspects of RNA-Seq experiments: sequence quality, sequencing depth, strand specificity, GC bias, read distribution over the genome structure and coverage uniformity. The input can be SAM, BAM, FASTA, BED files or Chromosome size file (two-column, plain text file). Visualization can be performed by genome browsers like UCSC, IGB and IGV. However, R scripts can also be used to visualization.</li>
<li><strong>SAMStat</strong>&nbsp;<em><a href="http://samstat.sourceforge.net/">SAMStat</a></em>&nbsp;identifies problems and reports several statistics at different phases of the process. This tool evaluates unmapped, poorly and accurately mapped sequences independently to infer possible causes of poor mapping.</li>
<li><strong>SolexaQA</strong>&nbsp;<em><a href="http://solexaqa.sourceforge.net/">SolexaQA</a></em>&nbsp;calculates sequence quality statistics and creates visual representations of data quality for second-generation sequencing data. Originally developed for the Illumina system (historically known as &ldquo;Solexa&rdquo;), SolexaQA now also supports Ion Torrent and 454 data.</li>
<li><strong>Trim galore!</strong>&nbsp;<em><a href="http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/">Trim_galore</a></em>&nbsp;is a wrapper script to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files (for directional, non-directional (or paired-end) sequencing).</li>
</ul>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>