<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/39671?offset=110</link>
	<atom:link href="https://bioinformaticsonline.com/related/39671?offset=110" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</guid>
	<pubDate>Thu, 02 Jan 2025 20:11:07 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</link>
	<title><![CDATA[The &quot;Ifs&quot; and &quot;Buts&quot; of NGS Quality Control and Trimming]]></title>
	<description><![CDATA[<p>Next-Generation Sequencing (NGS) has revolutionized biological research, providing vast amounts of data for a wide range of applications. However, the reliability of NGS analyses heavily depends on the quality of raw sequencing data. Quality control (QC) and trimming are critical preprocessing steps that can make or break your downstream analyses. In this blog, we explore the "ifs" (why you should perform QC and trimming) and the "buts" (challenges or considerations) of this vital step in NGS workflows.</p><h3><strong>The "Ifs" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Ensures Data Integrity</strong><br />If you want to minimize errors in downstream analyses, QC and trimming remove low-quality reads and bases, ensuring high-confidence data. This step is essential for reliable variant calling, assembly, and other applications.</p>
</li>
<li>
<p><strong>Removes Contaminants</strong><br />If adapter sequences or contaminants are present in the raw reads, trimming can eliminate them. This prevents issues like misalignment or incorrect biological interpretations, ensuring cleaner data for analysis.</p>
</li>
<li>
<p><strong>Improves Mapping and Assembly</strong><br />If your goal is better alignment to a reference genome or improved de novo assembly, trimming low-quality bases and adapters is critical. High-quality reads map more efficiently and generate more accurate assemblies.</p>
</li>
<li>
<p><strong>Reduces Computational Load</strong><br />If you want to save computational resources, trimming reduces the dataset size, which speeds up processing and analysis. Clean datasets mean less computational time spent on processing low-quality data.</p>
</li>
<li>
<p><strong>Prepares for Standardized Analyses</strong><br />If your project involves multiple datasets, QC and trimming ensure uniformity across them. This standardization makes comparisons valid and reproducible, particularly in large collaborative studies.</p>
</li>
</ol><h3><strong>The "Buts" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Risk of Over-Trimming</strong><br />But excessive trimming can lead to the loss of informative sequences, reducing read depth and potentially discarding biologically relevant data. This is especially critical in studies with limited sequencing depth.</p>
</li>
<li>
<p><strong>Bias Introduction</strong><br />But trimming algorithms might introduce biases, especially if they inadvertently remove sequences with specific biological patterns. This can skew results and compromise biological insights.</p>
</li>
<li>
<p><strong>Loss of Context in Paired-End Reads</strong><br />But trimming one read in a pair more than the other can lead to loss of pairing information. This complicates downstream analyses that rely on paired-end data, such as structural variant detection.</p>
</li>
<li>
<p><strong>Time and Resource Intensive</strong><br />But running QC and trimming for large datasets can be computationally expensive and time-consuming. As sequencing depth increases, preprocessing becomes a bottleneck in the analysis pipeline.</p>
</li>
<li>
<p><strong>Variable Standards</strong><br />But the criteria for trimming (e.g., quality threshold, minimum read length) can vary between tools and datasets. This variability may affect reproducibility and comparability of results across studies.</p>
</li>
</ol><h3><strong>Balancing the "Ifs" and "Buts"</strong></h3><p>To maximize the benefits of QC and trimming while mitigating the challenges, consider the following best practices:</p><ul>
<li>
<p><strong>Use QC Tools Wisely:</strong> Start with tools like <strong>FastQC</strong> to identify quality issues in your raw data. Visualizing quality metrics helps tailor your trimming parameters.</p>
</li>
<li>
<p><strong>Choose Reliable Trimming Tools:</strong> Tools like <strong>Trimmomatic</strong>, <strong>Cutadapt</strong>, and <strong>BBduk</strong> offer adaptive and customizable trimming options. Select one that aligns with your dataset and project goals.</p>
</li>
<li>
<p><strong>Set Reasonable Parameters:</strong> Avoid over-trimming by setting quality thresholds and minimum read lengths that balance data retention and quality improvement.</p>
</li>
<li>
<p><strong>Test Downstream Effects:</strong> Validate the impact of QC and trimming on downstream analyses, such as alignment efficiency, variant calling accuracy, or assembly quality.</p>
</li>
<li>
<p><strong>Document Your Workflow:</strong> Maintain detailed records of the parameters and tools used for QC and trimming. This ensures reproducibility and enables better troubleshooting.</p>
</li>
</ul><h3><strong>Conclusion</strong></h3><p>NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and carefully balancing these considerations, you can optimize your preprocessing workflow and unlock the full potential of your sequencing data.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/989/bioinformatics-approach-to-boar-taint</guid>
	<pubDate>Wed, 17 Jul 2013 15:50:37 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/989/bioinformatics-approach-to-boar-taint</link>
	<title><![CDATA[Bioinformatics approach to Boar Taint]]></title>
	<description><![CDATA[<p><span>Meat products obtained from intact male pigs often produce offensive smell or odour which is recognized as a complex genetic trait called boar taint.Androstenone and Skatole&nbsp;in the fat primarily cause boar taint. Metabolism of androstenone and sex steroids share a common pathway which makes removal of boar taint a very challenging task. Castration is a traditional solution to remove boar taint but it also results in bad quality of meat due to low level of steroids which is objectionable to many consumers. Detected functional variant(s) underlying boar taint compounds can be used as genetic markers in selection of male pigs with reduced boar taint levels. Resequencing of a total of 47 samples belong to Norwegian Landrace (NL) and Duroc (D) pigs with varied boar taint levels were done in Illumina HiSeq2000 to &gt;10X average coverage. Short reads generated from these samples mapped to&nbsp;<em>Sus Scrofa</em>&nbsp;version 10.2 reference assembly using Bowtie2. Alignment file then used for calling SNPs and InDels inside previousy identified QTL regions on SSC5,13, and 7 with the aid of FreeBayes , a variant caller tool. A final list of SNPs was prepared after filtering SNPs on the basis of SNP quality, coverage of SNP allele, functional and structural annotation, and repeats, etc. Selected SNPs will be genotyped in sample population for validation and then used for constructing SNPs haplotypes in close linkage disequilibrium with QTLs and fine mapping of QTLs through association mapping of genotyped SNPs.</span><span>&nbsp;</span></p><p><span>&nbsp;</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/989" length="19688" type="image/jpeg" />
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/2518/genome-browsers</guid>
	<pubDate>Fri, 16 Aug 2013 19:04:47 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/2518/genome-browsers</link>
	<title><![CDATA[Genome Browsers]]></title>
	<description><![CDATA[<p>Genome Browser is the platform/database used for searching and retreiving sequences and annotation of genomes belong to various eukaryotes, prokaryotes, etc.</p><p>Following are the weblink for different available browsers:</p><p><a href="http://www.ensembl.org/index.html">http://www.ensembl.org/index.html</a></p><p><a href="http://ensemblgenomes.org/">http://ensemblgenomes.org/</a></p><p><a href="http://genome.ucsc.edu/">http://genome.ucsc.edu/</a></p><p><a href="http://www.ncbi.nlm.nih.gov/genome">http://www.ncbi.nlm.nih.gov/genome</a></p><p><a href="http://www.ebi.ac.uk/genomes/">http://www.ebi.ac.uk/genomes/</a></p><p><a href="http://flybase.org/">http://flybase.org/</a></p><p><a href="http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi">http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi</a></p><p><a href="http://www.sanger.ac.uk/resources/databases/">http://www.sanger.ac.uk/resources/databases/</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/4164/two-major-breakthrough</guid>
	<pubDate>Mon, 02 Sep 2013 10:18:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/4164/two-major-breakthrough</link>
	<title><![CDATA[Two major breakthrough!!]]></title>
	<description><![CDATA[<p>"Scientists in Uruguay in colloboration with European partners sequenced the genome of the high-value Tannat grape, from which "the most healthy of red wines" are fermented.</p><p>A quick, $1 syphilis&nbsp;test in development by researchers from UNU-BIOLAC."</p><p><strong>Source</strong>:</p><p><a href="http://www.sciencedaily.com/releases/2013/09/130902101846.htm">http://www.sciencedaily.com/releases/2013/09/130902101846.htm</a></p><p><a href="http://www.eurekalert.org/pub_releases/2013-09/tca-ssg082613.php">http://www.eurekalert.org/pub_releases/2013-09/tca-ssg082613.php</a></p><p>&nbsp;</p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/9032/encode-sequencing-data-freely-available-to-download-and-use-for-academic-means</guid>
	<pubDate>Thu, 13 Mar 2014 18:18:08 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/9032/encode-sequencing-data-freely-available-to-download-and-use-for-academic-means</link>
	<title><![CDATA[Encode sequencing data freely available to download and use for academic means]]></title>
	<description><![CDATA[<p>In <span style="text-decoration: underline;"><strong>Encode</strong></span>,&nbsp;<span>regulatory elements investigated via DNA hypersensitivity assays, assays of DNA methylation, and chromatin immunoprecipitation (ChIP) of proteins that interact with DNA, including modified histones and transcription factors, followed by sequencing (ChIP-Seq).</span></p>
<p><span>More information:</span></p>
<p><span>https://genome.ucsc.edu/ENCODE/pilot.html</span></p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://genome.ucsc.edu/ENCODE/" rel="nofollow">https://genome.ucsc.edu/ENCODE/</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/10238/tsetse-fly-genome-sequenced</guid>
	<pubDate>Fri, 25 Apr 2014 10:48:35 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/10238/tsetse-fly-genome-sequenced</link>
	<title><![CDATA[Tsetse Fly Genome sequenced]]></title>
	<description><![CDATA[<p><span><span>As it&nbsp;</span><a href="http://www.sciencemag.org/content/344/6182/380" target="_blank">reported online today</a><span>&nbsp;in&nbsp;</span><em>Science</em><span>, the team used several sequencing approaches to tackle the tsetse fly's 366 million base genome.</span></span></p><p><span>The current study, and companion articles slated to appear in&nbsp;</span><em>PLOS One</em><span>,&nbsp;</span><em>PLOS Genetics</em><span>, and&nbsp;</span><em>PLOS Neglected Tropic Diseases</em><span>, are the result of &nbsp;nearly 150 researchers based in 18 countries.</span></p><p><span>Source:</span></p><p><span>http://www.genomeweb.com/sequencing/international-team-sequences-tsetse-fly-genome</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/10739/science-for-life-laboratory-scilifelab-sweden</guid>
  <pubDate>Sat, 10 May 2014 06:22:30 -0500</pubDate>
  <link></link>
  <title><![CDATA[Science for Life Laboratory (SciLifeLab)-Sweden]]></title>
  <description><![CDATA[
<p>Science for Life Laboratory (SciLifeLab) is a national center for molecular biosciences with focus on health and environmental research. The center combines frontline technical expertise with advanced knowledge of translational medicine and molecular bioscience. SciLifeLab is a national resource and a collaboration between four universities: Karolinska Institutet, KTH Royal Institute of Technology, Stockholm University and Uppsala University.</p>

<p>Webpage : https://www.scilifelab.se/about-us/<br />Opportunity: https://www.scilifelab.se/about-us/career/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/11644/mirna-database-and-tools</guid>
	<pubDate>Mon, 09 Jun 2014 07:58:40 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/11644/mirna-database-and-tools</link>
	<title><![CDATA[miRNA database and tools]]></title>
	<description><![CDATA[<p>Since few years miRNA has shown to play important role in therapeutic related research and also known to play vital role in controlling gene expression specifically at transcriptional and post-transcription levels. Here are some important DBs and tools related with miRNA:</p><p><strong>miRNA Sequencing data analysis</strong> :&nbsp;http://tools.genxpro.net/omiras/</p><p><strong>miRNApath( R based tool)&nbsp;</strong>: &nbsp;<a href="http://www.bioconductor.org/packages/release/bioc/html/miRNApath.html">http://www.bioconductor.org/packages/release/bioc/html/miRNApath.html</a></p><p><strong>miRWalk DB</strong> :&nbsp;http://www.umm.uni-heidelberg.de/apps/zmf/mirwalk/</p><p><strong>TargetScanHuman</strong> :&nbsp;http://www.targetscan.org/</p><p><strong>RNAhybrid</strong> :&nbsp;http://bibiserv.techfak.uni-bielefeld.de/rnahybrid/welcome.html</p><p><strong>RNA22 predictor</strong> :&nbsp;http://cbcsrv.watson.ibm.com/rna22.html</p><p><strong>miRNA predictor</strong> :&nbsp;http://www.microrna.org/microrna/home.do</p><p><strong>Plant miRNA DB</strong> :http://bioinformatics.cau.edu.cn/PMRD/</p><p><strong>miRBASE DB</strong>:&nbsp;http://www.mirbase.org/</p><p><strong>Plant RNA predictor</strong> : http://plantgrn.noble.org/psRNATarget/</p><p><strong>miRNA Interaction DB</strong> :&nbsp;http://starbase.sysu.edu.cn/</p><p><strong>Sequencing based miRNA DB</strong> :&nbsp;http://mirgator.kobic.re.kr/</p><p><strong>predicted A-to-I edited miRNA DB </strong>:&nbsp;http://microrna.osumc.edu/mireditar/</p><p><strong>Animal, plant and virus miRNA DB</strong> :&nbsp;http://lemur.amu.edu.pl/share/php/mirnest/</p><p><strong>Atlantic Salmon&nbsp;miRNAs DB </strong>:<strong>&nbsp;</strong>http://www.molgenv.com/ssa_mirnas_db_home.php</p><p><strong>miRNA prediction on UTRs</strong> :&nbsp;http://genie.weizmann.ac.il/pubs/mir07/mir07_prediction.html</p><p><span style="text-decoration: underline;"><strong>Idea of analysing miRNA Sequencing data</strong></span> :</p><p>http://www.illumina.com/applications/epigenetics/small_rna_analysis.ilmn</p><p><strong>More:</strong></p><p><a href="http://www.bioconductor.org/help/search/index.html?q=miRNA+target">http://www.bioconductor.org/help/search/index.html?q=miRNA+target</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/19560/alien-genome</guid>
	<pubDate>Sat, 13 Dec 2014 00:24:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/file/view/19560/alien-genome</link>
	<title><![CDATA[Alien Genome !!!]]></title>
	<description><![CDATA[<p>Genome sequencing, analysis and expression of Alien genome.</p><p>Note: This image/cartoon is create only for fun. It has nothing to do with any scientific findings.</p>]]></description>
	<dc:creator>Jit</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/19560" length="40389" type="image/jpeg" />
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/32862/gam-ngs-genomic-assemblies-merger-for-next-generation-sequencing</guid>
	<pubDate>Fri, 19 May 2017 07:44:14 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/32862/gam-ngs-genomic-assemblies-merger-for-next-generation-sequencing</link>
	<title><![CDATA[GAM-NGS: genomic assemblies merger for next generation sequencing]]></title>
	<description><![CDATA[<p><span>GAM-NGS is a tool able to merge two or more assemblies in order to improve contiguity and correctness. It can be used on all NGS-based assembly projects and it shows its full potential with multi-library Illumina-based projects. With more than 20 available assemblers it is hard to select the best tool. In this context we propose a tool that improves assemblies (and, as a by-product, perhaps even assemblers) by merging them and selecting the generating that is most likely to be correct.</span></p><p>Address of the bookmark: <a href="https://github.com/vice87/gam-ngs" rel="nofollow">https://github.com/vice87/gam-ngs</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>