<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/36895?offset=130</link>
	<atom:link href="https://bioinformaticsonline.com/related/36895?offset=130" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</guid>
	<pubDate>Thu, 02 Jan 2025 20:11:07 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</link>
	<title><![CDATA[The &quot;Ifs&quot; and &quot;Buts&quot; of NGS Quality Control and Trimming]]></title>
	<description><![CDATA[<p>Next-Generation Sequencing (NGS) has revolutionized biological research, providing vast amounts of data for a wide range of applications. However, the reliability of NGS analyses heavily depends on the quality of raw sequencing data. Quality control (QC) and trimming are critical preprocessing steps that can make or break your downstream analyses. In this blog, we explore the "ifs" (why you should perform QC and trimming) and the "buts" (challenges or considerations) of this vital step in NGS workflows.</p><h3><strong>The "Ifs" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Ensures Data Integrity</strong><br />If you want to minimize errors in downstream analyses, QC and trimming remove low-quality reads and bases, ensuring high-confidence data. This step is essential for reliable variant calling, assembly, and other applications.</p>
</li>
<li>
<p><strong>Removes Contaminants</strong><br />If adapter sequences or contaminants are present in the raw reads, trimming can eliminate them. This prevents issues like misalignment or incorrect biological interpretations, ensuring cleaner data for analysis.</p>
</li>
<li>
<p><strong>Improves Mapping and Assembly</strong><br />If your goal is better alignment to a reference genome or improved de novo assembly, trimming low-quality bases and adapters is critical. High-quality reads map more efficiently and generate more accurate assemblies.</p>
</li>
<li>
<p><strong>Reduces Computational Load</strong><br />If you want to save computational resources, trimming reduces the dataset size, which speeds up processing and analysis. Clean datasets mean less computational time spent on processing low-quality data.</p>
</li>
<li>
<p><strong>Prepares for Standardized Analyses</strong><br />If your project involves multiple datasets, QC and trimming ensure uniformity across them. This standardization makes comparisons valid and reproducible, particularly in large collaborative studies.</p>
</li>
</ol><h3><strong>The "Buts" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Risk of Over-Trimming</strong><br />But excessive trimming can lead to the loss of informative sequences, reducing read depth and potentially discarding biologically relevant data. This is especially critical in studies with limited sequencing depth.</p>
</li>
<li>
<p><strong>Bias Introduction</strong><br />But trimming algorithms might introduce biases, especially if they inadvertently remove sequences with specific biological patterns. This can skew results and compromise biological insights.</p>
</li>
<li>
<p><strong>Loss of Context in Paired-End Reads</strong><br />But trimming one read in a pair more than the other can lead to loss of pairing information. This complicates downstream analyses that rely on paired-end data, such as structural variant detection.</p>
</li>
<li>
<p><strong>Time and Resource Intensive</strong><br />But running QC and trimming for large datasets can be computationally expensive and time-consuming. As sequencing depth increases, preprocessing becomes a bottleneck in the analysis pipeline.</p>
</li>
<li>
<p><strong>Variable Standards</strong><br />But the criteria for trimming (e.g., quality threshold, minimum read length) can vary between tools and datasets. This variability may affect reproducibility and comparability of results across studies.</p>
</li>
</ol><h3><strong>Balancing the "Ifs" and "Buts"</strong></h3><p>To maximize the benefits of QC and trimming while mitigating the challenges, consider the following best practices:</p><ul>
<li>
<p><strong>Use QC Tools Wisely:</strong> Start with tools like <strong>FastQC</strong> to identify quality issues in your raw data. Visualizing quality metrics helps tailor your trimming parameters.</p>
</li>
<li>
<p><strong>Choose Reliable Trimming Tools:</strong> Tools like <strong>Trimmomatic</strong>, <strong>Cutadapt</strong>, and <strong>BBduk</strong> offer adaptive and customizable trimming options. Select one that aligns with your dataset and project goals.</p>
</li>
<li>
<p><strong>Set Reasonable Parameters:</strong> Avoid over-trimming by setting quality thresholds and minimum read lengths that balance data retention and quality improvement.</p>
</li>
<li>
<p><strong>Test Downstream Effects:</strong> Validate the impact of QC and trimming on downstream analyses, such as alignment efficiency, variant calling accuracy, or assembly quality.</p>
</li>
<li>
<p><strong>Document Your Workflow:</strong> Maintain detailed records of the parameters and tools used for QC and trimming. This ensures reproducibility and enables better troubleshooting.</p>
</li>
</ul><h3><strong>Conclusion</strong></h3><p>NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and carefully balancing these considerations, you can optimize your preprocessing workflow and unlock the full potential of your sequencing data.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40604/gapfinisher-a-reliable-gap-filling-pipeline-for-sspace-longread-scaffolder-output</guid>
	<pubDate>Fri, 24 Jan 2020 06:04:40 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40604/gapfinisher-a-reliable-gap-filling-pipeline-for-sspace-longread-scaffolder-output</link>
	<title><![CDATA[gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output]]></title>
	<description><![CDATA[<p><span>gapFinisher is based on the controlled use of a previously published gap filling tool FGAP and works on all standard Linux/UNIX command lines. They compare the performance of gapFinisher against two other published gap filling tools PBJelly and GMcloser. </span></p>
<p><span>gapFinisher can fill gaps in draft genomes quickly and reliably.</span></p><p>Address of the bookmark: <a href="https://github.com/kammoji/gapFinisher" rel="nofollow">https://github.com/kammoji/gapFinisher</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/989/bioinformatics-approach-to-boar-taint</guid>
	<pubDate>Wed, 17 Jul 2013 15:50:37 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/989/bioinformatics-approach-to-boar-taint</link>
	<title><![CDATA[Bioinformatics approach to Boar Taint]]></title>
	<description><![CDATA[<p><span>Meat products obtained from intact male pigs often produce offensive smell or odour which is recognized as a complex genetic trait called boar taint.Androstenone and Skatole&nbsp;in the fat primarily cause boar taint. Metabolism of androstenone and sex steroids share a common pathway which makes removal of boar taint a very challenging task. Castration is a traditional solution to remove boar taint but it also results in bad quality of meat due to low level of steroids which is objectionable to many consumers. Detected functional variant(s) underlying boar taint compounds can be used as genetic markers in selection of male pigs with reduced boar taint levels. Resequencing of a total of 47 samples belong to Norwegian Landrace (NL) and Duroc (D) pigs with varied boar taint levels were done in Illumina HiSeq2000 to &gt;10X average coverage. Short reads generated from these samples mapped to&nbsp;<em>Sus Scrofa</em>&nbsp;version 10.2 reference assembly using Bowtie2. Alignment file then used for calling SNPs and InDels inside previousy identified QTL regions on SSC5,13, and 7 with the aid of FreeBayes , a variant caller tool. A final list of SNPs was prepared after filtering SNPs on the basis of SNP quality, coverage of SNP allele, functional and structural annotation, and repeats, etc. Selected SNPs will be genotyped in sample population for validation and then used for constructing SNPs haplotypes in close linkage disequilibrium with QTLs and fine mapping of QTLs through association mapping of genotyped SNPs.</span><span>&nbsp;</span></p><p><span>&nbsp;</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/989" length="19688" type="image/jpeg" />
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/view/1926</guid>
	<pubDate>Sun, 11 Aug 2013 11:42:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/view/1926</link>
	<title><![CDATA[Want to Know which genome assembler rule the world ?]]></title>
	<description><![CDATA[<p><span><strong>Assemblathon 2</strong>: evaluating de novo methods of genome assembly&nbsp;</span></p><p><span><a href="http://www.gigasciencejournal.com/content/2/1/10/abstract">http://www.gigasciencejournal.com/content/2/1/10/abstract</a></span></p><p><span><a href="http://blogs.nature.com/news/2013/07/genome-assembly-contest-prompts-soul-searching.html">http://blogs.nature.com/news/2013/07/genome-assembly-contest-prompts-soul-searching.html</a></span></p><p><a href="http://assemblathon.org/post/44431915644/feedback-and-analysis-of-the-assemblathon-2-p">http://assemblathon.org/post/44431915644/feedback-and-analysis-of-the-assemblathon-2-p</a></p><p>&nbsp;</p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/4195/barber-pole-worm-sheep-pathogen-sequenced</guid>
	<pubDate>Tue, 03 Sep 2013 16:32:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/4195/barber-pole-worm-sheep-pathogen-sequenced</link>
	<title><![CDATA[Barber pole worm , sheep pathogen sequenced !!!]]></title>
	<description><![CDATA[<p>Haemonchus contortus is a highly pathogenic parasitic nematode of that can infect a large number of wild and domesticated ruminant species and is the most economically important parasite of sheep and goats worldwide. Scientists at the Wellcome Trust Sanger Institute have sequenced the genome of the barber's pole worm (Haemonchus contortus), which will help to explore the this tropical parasite which&nbsp;been disseminated around the world by livestock movement.&nbsp;</p><p>H. contortus is a member of the superfamily trichostrongyloidea (Strongylida) which contains most of the economically important parasitic nematodes of grazing livestock. These parasites cost the global livestock industry billions of dollars per annum in lost production and drug costs.&nbsp;A common type of clover may be a preventative or palliative for the disease. However, some particular breeds of sheep, such as the Gulf Coast Native from the Southern United States, have been shown to have developed special resistance to H. contortus.</p><p>Getting the full genome can help to tackle the problem and understand the resistance mechanism with an ease. Moreover, the genome could now provide a comprehensive understanding of how treatments against parasitic worms work and point to further new treatments and vaccines.&nbsp;By comparing the genome of the barber's pole worm with those of worms that have acquired drug resistance, researchers expect to reveal information about how and why resistance has occurred. Till now, researchers have uncovered essential information in the fight against drug resistance in worms.</p><p>Reference:</p><p><a href="http://www.fwi.co.uk/articles/28/08/2013/140758/researchers-close-in-on-worm-resistance-in-sheep.htm">http://www.fwi.co.uk/articles/28/08/2013/140758/researchers-close-in-on-worm-resistance-in-sheep.htm</a></p><p><a href="http://www.sciencedaily.com/releases/2013/08/130828103351.htm?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+sciencedaily%2Fplants_animals+(ScienceDaily%3A+Plants+%26+Animals+News)">http://www.sciencedaily.com/releases/2013/08/130828103351.htm?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+sciencedaily%2Fplants_animals+(ScienceDaily%3A+Plants+%26+Animals+News)</a></p><p>Image source: Wikipedia</p><p><img src="http://upload.wikimedia.org/wikipedia/commons/8/8e/Haemonchus_contortus.jpg" alt="image" width="800" height="533" style="border: 0px; border: 0px;"></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/6896/dna-tale-of-3-to-4-years-old-serbia-boy</guid>
	<pubDate>Tue, 26 Nov 2013 17:34:00 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/6896/dna-tale-of-3-to-4-years-old-serbia-boy</link>
	<title><![CDATA[DNA tale of 3 to 4 years old Serbia boy]]></title>
	<description><![CDATA[<p><span>The genome of a young boy found underground at Mal&rsquo;ta near Lake Baikal of eastern Siberia around 24,000 years ago came out as close relative of Europeans and Native Indians.</span></p><p><span>Link:</span></p><p><span><a href="http://www.nytimes.com/2013/11/21/science/two-surprises-in-dna-of-boy-found-buried-in-siberia.html?_r=0">http://www.nytimes.com/2013/11/21/science/two-surprises-in-dna-of-boy-found-buried-in-siberia.html?_r=0</a></span></p><p>&nbsp;</p><p><a href="http://www.nature.com/nature/journal/vaop/ncurrent/full/nature12736.html">http://www.nature.com/nature/journal/vaop/ncurrent/full/nature12736.html</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/10237/genome-of-rainbow-trout-sequenced</guid>
	<pubDate>Fri, 25 Apr 2014 10:36:51 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/10237/genome-of-rainbow-trout-sequenced</link>
	<title><![CDATA[Genome of Rainbow Trout Sequenced]]></title>
	<description><![CDATA[<p>Major finding:</p><p><span>&ldquo;In humans and most vertebrates the duplication events were older so there are fewer duplicated genes still present. Most of the duplicated genes get lost or modified so much that they are no longer recognizable as duplicates over time. In the trout and salmon we can see an earlier stage in the process and many duplicated genes are still present,&rdquo; said Dr Gary Thorgaard of Washington State University, a co-author of the paper published in the journal Nature Communications.</span></p><p><span>Source:</span></p><p><span>http://www.sci-news.com/genetics/science-genome-rainbow-trout-01877.html</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/10378/real-time-sequencing</guid>
	<pubDate>Sun, 04 May 2014 18:16:42 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/10378/real-time-sequencing</link>
	<title><![CDATA[Real time Sequencing]]></title>
	<description><![CDATA[<p><span>&ldquo;... we now know we can do high-throughput sequencing at any location on Earth,&rdquo; Moroz said.</span></p><p><span>Source:</span></p><p><span>http://news.ufl.edu/2014/04/28/real-time-genome-sequencing-at-sea/</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/11365/drawback-of-exome-sequencing</guid>
	<pubDate>Mon, 02 Jun 2014 05:46:43 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/11365/drawback-of-exome-sequencing</link>
	<title><![CDATA[Drawback of Exome Sequencing]]></title>
	<description><![CDATA[<p><span><span>Dr Eric Londin, Assistant Professor, Thomas Jefferson University, USA, stated that analysis of 44 exome datasets from four different testing kits showed that they missed a high proportion of clinically relevant regions in the 56 ACMG genes. "At least one gene in each exome method was missing more than 40 percent of disease-causing genetic variants, and we found that the worst-performing method missed more than 90 percent of such variants in four of the 56 genes," he says.</span><br /></span></p><p><span><strong>Source</strong>:&nbsp;http://www.eurekalert.org/pub_releases/2014-05/esoh-pco052914.php</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/17843/pathway-analysis</guid>
	<pubDate>Fri, 03 Oct 2014 08:51:13 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/17843/pathway-analysis</link>
	<title><![CDATA[Pathway Analysis]]></title>
	<description><![CDATA[<p>Pathway Analysis is usually performed with aim to enrich the genes with their functional information and reveal the underlying biological mechanisms pursue by genes. Pathway Analysis is not only limited to what biological pathways a particular set of expressed genes follow but also to disclose the relationships between these genes. With availability of more genomics, transcriptomics and proteomics data, interactions between genes involve in multiple pathways become more clear and also relationships between the genes, their transcripts, and their gene products. However, existing tools and dbs mainly based on knowledge driven approach in which pathways will be identified by finding the correlation between the&nbsp;<span>information in one of the pathway knowledge databases (KEGG,Reactome,Panther,BioCarta, Panther,GO,NCI,WikiPathways,etc) and gene expression result for a specific conditions for instance tumor, obesity , cold resistant crops/plants, etc.</span></p><p><span><strong>Introductory Articles/ppt/sources</strong>:</span></p><p><a href="http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002375"><span>http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002375</span></a></p><p><a href="http://bioinformatics.mdanderson.org/MicroarrayCourse/Lectures09/Pathway%20Analysis.pdf"><span>http://bioinformatics.mdanderson.org/MicroarrayCourse/Lectures09/Pathway%20Analysis.pdf</span></a></p><p><a href="http://gettinggeneticsdone.blogspot.de/2012/03/pathway-analysis-for-high-throughput.html"><span>http://gettinggeneticsdone.blogspot.de/2012/03/pathway-analysis-for-high-throughput.html</span></a></p><p><a href="http://davetang.org/muse/tag/pathway/"><span>http://davetang.org/muse/tag/pathway/</span></a></p><p><a href="https://www.biostars.org/p/42219/"><span>https://www.biostars.org/p/42219/</span></a></p><p><a href="http://bioinformatics.ca//files/public/Pathways_2014_Module4_v2.pdf"><span>http://bioinformatics.ca//files/public/Pathways_2014_Module4_v2.pdf</span></a></p><p><a href="http://bioinformatics.ca//files/public/Pathways_2014_Module2.pdf"><span>http://bioinformatics.ca//files/public/Pathways_2014_Module2.pdf</span></a></p><p><span><strong>Impotant Database and Tools</strong>:</span></p><p>GeneMANIA, Cytoscape,&nbsp;<a href="http://www.ingenuity.com/products/ipa">IPA</a>&nbsp;and <a href="http://thomsonreuters.com/metacore/">Metacore</a> (Commerical ),&nbsp;<span>Pathway Commons, Reactome ,Panther, BioCyc, WikiPathways, Pathvisio, KEGG, NCI, Stringdb, Amigo,&nbsp;<span>WebGestalt ,<span>ConsensusPathDB ,GSEA,Blast2go</span></span></span></p><p><span><strong>Popular R based tools</strong>:</span></p><p><span>Reactome.db, ReactomePA, ClusterProfiler, Gage, SPIA, topGO, Pathview,DOSE,GOStat</span></p><p><span><strong>More</strong>:</span></p><p><a href="http://www.bioconductor.org/help/search/index.html?q=Enrichment+analysis+"><span>http://www.bioconductor.org/help/search/index.html?q=Enrichment+analysis+</span></a></p><p>&nbsp;</p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

</channel>
</rss>