<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/36267?offset=230</link>
	<atom:link href="https://bioinformaticsonline.com/related/36267?offset=230" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</guid>
	<pubDate>Thu, 02 Jan 2025 20:11:07 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</link>
	<title><![CDATA[The &quot;Ifs&quot; and &quot;Buts&quot; of NGS Quality Control and Trimming]]></title>
	<description><![CDATA[<p>Next-Generation Sequencing (NGS) has revolutionized biological research, providing vast amounts of data for a wide range of applications. However, the reliability of NGS analyses heavily depends on the quality of raw sequencing data. Quality control (QC) and trimming are critical preprocessing steps that can make or break your downstream analyses. In this blog, we explore the "ifs" (why you should perform QC and trimming) and the "buts" (challenges or considerations) of this vital step in NGS workflows.</p><h3><strong>The "Ifs" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Ensures Data Integrity</strong><br />If you want to minimize errors in downstream analyses, QC and trimming remove low-quality reads and bases, ensuring high-confidence data. This step is essential for reliable variant calling, assembly, and other applications.</p>
</li>
<li>
<p><strong>Removes Contaminants</strong><br />If adapter sequences or contaminants are present in the raw reads, trimming can eliminate them. This prevents issues like misalignment or incorrect biological interpretations, ensuring cleaner data for analysis.</p>
</li>
<li>
<p><strong>Improves Mapping and Assembly</strong><br />If your goal is better alignment to a reference genome or improved de novo assembly, trimming low-quality bases and adapters is critical. High-quality reads map more efficiently and generate more accurate assemblies.</p>
</li>
<li>
<p><strong>Reduces Computational Load</strong><br />If you want to save computational resources, trimming reduces the dataset size, which speeds up processing and analysis. Clean datasets mean less computational time spent on processing low-quality data.</p>
</li>
<li>
<p><strong>Prepares for Standardized Analyses</strong><br />If your project involves multiple datasets, QC and trimming ensure uniformity across them. This standardization makes comparisons valid and reproducible, particularly in large collaborative studies.</p>
</li>
</ol><h3><strong>The "Buts" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Risk of Over-Trimming</strong><br />But excessive trimming can lead to the loss of informative sequences, reducing read depth and potentially discarding biologically relevant data. This is especially critical in studies with limited sequencing depth.</p>
</li>
<li>
<p><strong>Bias Introduction</strong><br />But trimming algorithms might introduce biases, especially if they inadvertently remove sequences with specific biological patterns. This can skew results and compromise biological insights.</p>
</li>
<li>
<p><strong>Loss of Context in Paired-End Reads</strong><br />But trimming one read in a pair more than the other can lead to loss of pairing information. This complicates downstream analyses that rely on paired-end data, such as structural variant detection.</p>
</li>
<li>
<p><strong>Time and Resource Intensive</strong><br />But running QC and trimming for large datasets can be computationally expensive and time-consuming. As sequencing depth increases, preprocessing becomes a bottleneck in the analysis pipeline.</p>
</li>
<li>
<p><strong>Variable Standards</strong><br />But the criteria for trimming (e.g., quality threshold, minimum read length) can vary between tools and datasets. This variability may affect reproducibility and comparability of results across studies.</p>
</li>
</ol><h3><strong>Balancing the "Ifs" and "Buts"</strong></h3><p>To maximize the benefits of QC and trimming while mitigating the challenges, consider the following best practices:</p><ul>
<li>
<p><strong>Use QC Tools Wisely:</strong> Start with tools like <strong>FastQC</strong> to identify quality issues in your raw data. Visualizing quality metrics helps tailor your trimming parameters.</p>
</li>
<li>
<p><strong>Choose Reliable Trimming Tools:</strong> Tools like <strong>Trimmomatic</strong>, <strong>Cutadapt</strong>, and <strong>BBduk</strong> offer adaptive and customizable trimming options. Select one that aligns with your dataset and project goals.</p>
</li>
<li>
<p><strong>Set Reasonable Parameters:</strong> Avoid over-trimming by setting quality thresholds and minimum read lengths that balance data retention and quality improvement.</p>
</li>
<li>
<p><strong>Test Downstream Effects:</strong> Validate the impact of QC and trimming on downstream analyses, such as alignment efficiency, variant calling accuracy, or assembly quality.</p>
</li>
<li>
<p><strong>Document Your Workflow:</strong> Maintain detailed records of the parameters and tools used for QC and trimming. This ensures reproducibility and enables better troubleshooting.</p>
</li>
</ul><h3><strong>Conclusion</strong></h3><p>NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and carefully balancing these considerations, you can optimize your preprocessing workflow and unlock the full potential of your sequencing data.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2728/statistics-of-current-sequencing-and-bioinformatics-market</guid>
	<pubDate>Wed, 21 Aug 2013 08:29:21 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2728/statistics-of-current-sequencing-and-bioinformatics-market</link>
	<title><![CDATA[Statistics of current Sequencing and Bioinformatics market]]></title>
	<description><![CDATA[<p>This survey conducted by&nbsp;<strong>Oxford&nbsp;<a href="http://www.ogt.co.uk/" target="_blank">Gene</a>&nbsp;Technology,</strong>&nbsp;<span>provider of innovative&nbsp;genetics&nbsp;research and&nbsp;biomarker</span>&nbsp;<span>solutions to advance molecular medicine, has released the results from a recent survey of researchers using next generation sequencing. (Source:<a href="http://www.news-medical.net/news/20130821/Oxford-Gene-Technology-releases-next-generation-sequencing-survey-results.aspx">http://www.news-medical.net/news/20130821/Oxford-Gene-Technology-releases-next-generation-sequencing-survey-results.aspx</a>&nbsp;)</span></p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="http://www.ogt.com/assets/0000/3190/NGS_Survey_2013_Infographic_Web.pdf" rel="nofollow">http://www.ogt.com/assets/0000/3190/NGS_Survey_2013_Infographic_Web.pdf</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/3029/bioinformatics-market-in-india</guid>
	<pubDate>Fri, 23 Aug 2013 07:08:49 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/3029/bioinformatics-market-in-india</link>
	<title><![CDATA[Bioinformatics market in India]]></title>
	<description><![CDATA[<div><strong>Key Topics Covered in the Report:</strong></div>
<ul>
<li>The market size of the Indian Bioinformatics Industry , FY&rsquo;2007-FY&rsquo;2013</li>
<li>Market segmentation of India bioinformatics industry by application by sectors, FY&rsquo;2007-FY&rsquo;2013</li>
<li>Market Segmentation of India bioinformatics industry by products and services,FY&rsquo;2007-FY&rsquo;2013</li>
<li>Market Segmentation of India bioinformatics industry by applications of bioinformatics ,FY&rsquo;2007-FY&rsquo;2013</li>
<li>India bioinformatics industry trends and developments</li>
<li>Government regulations and initiatives of India bioinformatics industry</li>
<li>Major bioinformatics research institutes in India</li>
<li>Market Share of leading players in bioinformatics industry in India,FY&rsquo;2013</li>
<li>Company profiles of major players in India bioinformatics industry</li>
<li>Future outlook and projections on the basis of revenue in India bioinformatics market, FY&rsquo;2014-FY&rsquo;2018</li>
</ul>
<p>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;(Source: Ken Research)</p><p>Address of the bookmark: <a href="http://www.kenresearch.com/healthcare/biotechnology/india-bioinformatics-industry-research-report/392-91.html" rel="nofollow">http://www.kenresearch.com/healthcare/biotechnology/india-bioinformatics-industry-research-report/392-91.html</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/4943/molecular-genetics-lecture</guid>
	<pubDate>Fri, 27 Sep 2013 04:24:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/4943/molecular-genetics-lecture</link>
	<title><![CDATA[Molecular Genetics Lecture]]></title>
	<description><![CDATA[<p><span>"Robert Sapolsky makes interdisciplinary connections between behavioral biology and molecular genetic influences. He relates protein synthesis and point mutations to microevolutionary change, and discusses conflicting theories of gradualism and punctuated equilibrium and the influence of epigenetics on development theories."&nbsp;</span></p>
<p><span>"<span><strong>Robert Sapolsky</strong> is an American neuroendocrinologist, professor of biology, neuroscience, and neurosurgery at Stanford University, researcher and author" ----Wikipedia</span></span></p><p>Address of the bookmark: <a href="http://www.youtube.com/watch?v=_dRXA1_e30o" rel="nofollow">http://www.youtube.com/watch?v=_dRXA1_e30o</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/7812/bioinformatics-infrastructure-speed-up-indian-agriculture</guid>
	<pubDate>Tue, 07 Jan 2014 12:44:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/7812/bioinformatics-infrastructure-speed-up-indian-agriculture</link>
	<title><![CDATA[Bioinformatics infrastructure speed up Indian agriculture]]></title>
	<description><![CDATA[<p>"<span>Realizing the paradigm shift it can bring about, the government is focusing on increased bioinformatics intervention in agri-sciences. Currently under process, the national grid on bioinformatics is expected make much better sense out of huge genomic" - </span></p><p><span></span><a href="http://www.biospectrumindia.com/biospecindia/features/203849/supercomputing-indian-agriculture-fast-track-mode/page/1">http://www.biospectrumindia.com/biospecindia/features/203849/supercomputing-indian-agriculture-fast-track-mode/page/1</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/10739/science-for-life-laboratory-scilifelab-sweden</guid>
  <pubDate>Sat, 10 May 2014 06:22:30 -0500</pubDate>
  <link></link>
  <title><![CDATA[Science for Life Laboratory (SciLifeLab)-Sweden]]></title>
  <description><![CDATA[
<p>Science for Life Laboratory (SciLifeLab) is a national center for molecular biosciences with focus on health and environmental research. The center combines frontline technical expertise with advanced knowledge of translational medicine and molecular bioscience. SciLifeLab is a national resource and a collaboration between four universities: Karolinska Institutet, KTH Royal Institute of Technology, Stockholm University and Uppsala University.</p>

<p>Webpage : https://www.scilifelab.se/about-us/<br />Opportunity: https://www.scilifelab.se/about-us/career/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/26290/webinar-on-streamlining-large-scale-analysis-using-the-strand-ngs-pipeline-manager-on-24-feb-2016</guid>
	<pubDate>Fri, 05 Feb 2016 06:43:28 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/26290/webinar-on-streamlining-large-scale-analysis-using-the-strand-ngs-pipeline-manager-on-24-feb-2016</link>
	<title><![CDATA[Webinar on Streamlining large scale analysis using the Strand NGS Pipeline Manager on 24 Feb 2016]]></title>
	<description><![CDATA[<p><a href="http://www.strand-ngs.com/webinar_registration" title="webinar"><strong>Live Webinar on Streamlining large scale NGS data analysis using the Strand NGS Pipeline Manager on 24 Feb 2016</strong></a></p><p><strong>Abstract:</strong> Strand NGS includes comprehensive workflows for DNA-Seq, RNA-Seq, Small RNA-Seq, ChIP-Seq, MeDIP-Seq, and Methyl-Seq analysis. Each workflow includes a quality assessment and filter section, followed by a workflow-specific analysis section. The pipeline functionality in Strand NGS allows users to execute a sequence of analysis steps with specific parameters - all without any manual intervention. This simplifies the analysis in large scale sequencing projects where every sample needs to be processed identically.</p><p>In this webinar we will discuss the pre-packaged pipelines present in Strand NGS. The packaged pipelines have well-chosen default parameters and are suitable for users analyzing data for the first time in the tool. We will also show how advanced users can customize pipelines and share them with other Strand NGS users. Finally, we will show a brief glimpse of an elaborate pipeline that aligns reads, filters poor-quality matches, computes coverage metrics, identifies variants, checks for sample cross-contamination, and emails quality reports - all from within Strand NGS.</p><p><strong>Speaker:</strong> Dr. Vamsi Veeramachaneni, Vice President - Bioinformatics, Strand Life Sciences</p><p><strong>Details:</strong> Session 1: 2:30 PM IST, Session 2 : 10:30 PM IST<br /><strong>Register here:</strong> http://www.strand-ngs.com/webinar_registration</p><h3>&nbsp;</h3>]]></description>
	<dc:creator>Yeshodari</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/34912/list-of-cancer-genomics-research-web-resources</guid>
	<pubDate>Wed, 27 Dec 2017 20:33:09 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/34912/list-of-cancer-genomics-research-web-resources</link>
	<title><![CDATA[List of cancer genomics research web resources !]]></title>
	<description><![CDATA[<p>Major web resources for cancer genomics research</p><p>CGHub <br />https://cghub.ucsc.edu/ <br />Comprehensive data repository; huge data size</p><p>EGA <br />https://www.ebi.ac.uk/ega/ <br />Comprehensive data repository; huge data size</p><p>COSMIC <br />http://cancer.sanger.ac.uk <br />Largest somatic mutation database; genome sequencing paper curation</p><p>CPRG <br />http://www.broadinstitute.org/software/cprg <br />Interface for cancer program resources</p><p>GDAC <br />http://gdac.broadinstitute.org/ <br />Data analysis; automatic pipelines; user-friendly reports</p><p>SNP500Cancer <br />http://snp500cancer.nci.nih.gov <br />Sequence and genotype verification of SNPs</p><p>canEvolve <br />www.canevolve.org/ <br />Comprehensive analysis of tumor profile; Data from 90 studies involving more than 10,000 patients</p><p>MethyCancer <br />http://methycancer.psych.ac.cn <br />Relationship among DNA methylation, gene expression and cancer</p><p>SomamiR <br />http://compbio.uthsc.edu/SomamiR/ <br />Correlation between somatic mutation and microRNA; genome-wide displaying</p><p>cBioPortal <br />http://www.cbioportal.org/public-portal/ <br />Graphical summaries; gene alteration; processed data; visualization</p><p>UCSC Cancer Genomics Browser <br />https://genome-cancer.soe.ucsc.edu/ <br />Clinical information; gene expression; copy number variation; visualization</p><p>CGWB <br />https://cgwb.nci.nih.gov/ <br />Visualization; gene mutation and variation; automated analysis pipeline</p><p>GDSC <br />http://www.cancerrxgene.org <br />Drug sensitivity information; drug response information</p><p>canSAR <br />https://cansar.icr.ac.uk/ <br />Multidisciplinary information; drug discovery</p><p>NONCODE <br />http://www.noncode.org/ ncRNAs; <br />lncRNAs; up-to-date and comprehensive resource</p>]]></description>
	<dc:creator>biogeek</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</guid>
	<pubDate>Thu, 10 Jan 2019 04:42:06 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</link>
	<title><![CDATA[NGS Platforms launched by BGI’s MGI Tech]]></title>
	<description><![CDATA[<p>MGI Tech Co., Ltd. (MGI), a subsidiary of BGI Group, is committed to enabling effective and affordable healthcare solutions for all. Based on its proprietary technology, MGI produces sequencing devices, equipment, consumables and reagents to support life science research, medicine and healthcare. MGI's multi-omics platforms include genetic sequencing, mass spectrometry and medical imaging. Providing real-time, comprehensive, life-long solutions, its mission&nbsp;is to&nbsp;develop and promote advanced life science tools for future healthcare.</p><p>MGI, a subsidiary of global genomics leader BGI Group, announced pricing and its first early access customer for the new ultra high-throughput sequencer, MGISEQ-T7, saying it has driven down sequencing cost to&nbsp;$5&nbsp;per gigabyte, with exceptionally high accuracy. Such innovations are helping more people to realize the benefits of genomic information.</p><p>In October, MGI launched the MGISEQ-T7, a highly flexible production-scale platform that is the most powerful sequencer to date. It can produce as many as 60 whole human genomes in one day. The instrument sells for&nbsp;$1 million.</p><p>The T7 enables simultaneous but independent operation of up to four flow cells, which means different applications such as single-cell RNA sequencing, whole exome sequencing and whole genome sequencing can be run in different flow cells at the same time. This helps to reduce costs, allowing MGI to offer the most competitive sequencing price in the market.</p><p><span>Powered by DNBseq&trade;, MGISEQ delivers quality data with accuracy for SNP and Indel calling rate of 99.9% and 99%, respectively, along with decreased duplication rate down to less than 2 percent, and almost zero Index mis-assignment rate.</span></p><p><span><span>SOURCE MGI</span></span></p><p>https://www.bgi.com/global/company/news/bgis-mgi-tech-launches-two-new-ngs-platforms/</p><p>http://en.mgitech.cn/</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40834/nucleus-python-and-c-code-for-reading-and-writing-genomics-data</guid>
	<pubDate>Sun, 02 Feb 2020 08:14:19 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40834/nucleus-python-and-c-code-for-reading-and-writing-genomics-data</link>
	<title><![CDATA[Nucleus: Python and C++ code for reading and writing genomics data.]]></title>
	<description><![CDATA[<p>Nucleus is a library of Python and C++ code designed to make it easy to read, write and analyze data in common genomics file formats like SAM and VCF. In addition, Nucleus enables painless integration with the TensorFlow machine learning framework, as anywhere a genomics file is consumed or produced, a TensorFlow tfrecords file may be used instead.</p><p>Address of the bookmark: <a href="https://github.com/google/nucleus" rel="nofollow">https://github.com/google/nucleus</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>