<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/1514?offset=670</link>
	<atom:link href="https://bioinformaticsonline.com/related/1514?offset=670" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43362/machine-learning-for-genomics</guid>
	<pubDate>Thu, 09 Sep 2021 11:26:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43362/machine-learning-for-genomics</link>
	<title><![CDATA[Machine Learning for Genomics]]></title>
	<description><![CDATA[<h3>Module 1: Statistics for genomics (2-8 August 2021)</h3>
<ul>
<li>A simple intro to statistical distributions</li>
<li>hypothesis testing</li>
<li>linear models.</li>
</ul>
<p>reading:&nbsp;<a href="http://compgenomr.github.io/book/stats.html">http://compgenomr.github.io/book/stats.html</a></p>
<p>slides:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week1/compgen2021_stats.pdf">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week1/compgen2021_stats.pdf</a></p>
<p>exercises+code:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week1/">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week1/</a></p>
<h3><a href="https://github.com/BIMSBbioinfo/compgen2021#module-2-unsupervised-learning-for-genomics-9-15-august-2021"></a>Module 2: Unsupervised learning for genomics (9-15 August 2021)</h3>
<ul>
<li>Understanding basic intuition behind machine learning approaches.</li>
<li>Using unsupervised learning to cluster and visualise data points</li>
<li>Dimension reduction techniques for visualisation and as input to clustering methods</li>
</ul>
<p>reading:&nbsp;<a href="http://compgenomr.github.io/book/unsupervisedLearning.html">http://compgenomr.github.io/book/unsupervisedLearning.html</a></p>
<p>slides:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week2/compgen2021_unsupervisedLearning.pdf">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week2/compgen2021_unsupervisedLearning.pdf</a></p>
<p>exercises+code:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week2/">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week2/</a></p>
<h3><a href="https://github.com/BIMSBbioinfo/compgen2021#module-3-supervised-learning-for-genomics-16-22-august-2021"></a>Module 3: Supervised learning for genomics (16-22 August 2021)</h3>
<ul>
<li>Understanding and using supervised learning methods for predictive purposes</li>
<li>How to measure prediction performance</li>
<li>Understand and use cross-validation and related concepts</li>
</ul>
<p>reading:&nbsp;<a href="http://compgenomr.github.io/book/supervisedLearning.html">http://compgenomr.github.io/book/supervisedLearning.html</a></p>
<p>slides:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week3/compgen2021_supervisedLearning.pdf">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week3/compgen2021_supervisedLearning.pdf</a></p>
<p>exercises+code:&nbsp;<a href="https://github.com/BIMSBbioinfo/compgen2021/tree/main/week3/">https://github.com/BIMSBbioinfo/compgen2021/tree/main/week3/</a></p>
<p>https://github.com/BIMSBbioinfo/compgen2021</p><p>Address of the bookmark: <a href="https://github.com/BIMSBbioinfo/compgen2021" rel="nofollow">https://github.com/BIMSBbioinfo/compgen2021</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/44412/scientist-positions-csir-institute-of-genomics-integrative-biology-igib</guid>
  <pubDate>Sat, 02 Dec 2023 00:51:08 -0600</pubDate>
  <link></link>
  <title><![CDATA[Scientist positions @ CSIR-Institute of Genomics &amp; Integrative Biology (IGIB)]]></title>
  <description><![CDATA[
<p>CSIR-Institute of Genomics &amp; Integrative Biology (IGIB) is a premier Institute of Council of Scientific<br />and Industrial Research (CSIR), engaged in research of national importance in the areas of genomics,<br />molecular medicine, bioinformatics and proteomics. For more details, kindly refer to website<br />https://igib.res.in.<br />The Institute is looking for dynamic and creative Indian researchers having excellent academic record<br />and interested in Product Development / Technology Innovation / Applied Technology / Translational<br />Research in the above broad areas. The eligible candidates may apply for the following positions<br />through the CSIR-IGIB website.</p>

<p>More at https://www.igib.res.in/bdmg/ScientistRecruitmentAdvt2023.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44713/understanding-rna-seq-normalization-methods-tpm-vs-fpkm-vs-cpm</guid>
	<pubDate>Wed, 11 Dec 2024 00:59:15 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44713/understanding-rna-seq-normalization-methods-tpm-vs-fpkm-vs-cpm</link>
	<title><![CDATA[Understanding RNA-Seq Normalization Methods: TPM vs. FPKM vs. CPM]]></title>
	<description><![CDATA[<p>RNA sequencing (RNA-Seq) is a powerful technology used to study transcriptomes, providing insights into gene expression levels. However, raw RNA-Seq data requires normalization to account for sequencing depth and gene length, enabling accurate comparisons between genes and samples. Among the most widely used normalization methods are TPM (Transcripts Per Million), FPKM (Fragments Per Kilobase Million), and CPM (Counts Per Million). Each method has its unique principles and applications, which we&rsquo;ll explore in this blog.</p><h2>Why Normalize RNA-Seq Data?</h2><p>Normalization is a crucial step in RNA-Seq analysis for the following reasons:</p><ul>
<li>
<p><strong>Sequencing depth:</strong> Different RNA-Seq experiments produce varying numbers of reads, making direct comparisons between samples misleading.</p>
</li>
<li>
<p><strong>Gene length:</strong> Longer genes inherently generate more reads, irrespective of their actual expression level.</p>
</li>
<li>
<p><strong>Bias reduction:</strong> Normalization mitigates technical biases, enabling meaningful biological interpretation.</p>
</li>
</ul><h2>TPM (Transcripts Per Million)</h2><p>TPM measures the proportion of reads mapped to a transcript, normalized by transcript length and sequencing depth. It is calculated as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Proportionality:</strong> TPM values sum to 1,000,000 across all transcripts in a sample, making it easier to compare between samples.</p>
</li>
<li>
<p><strong>Intuitive interpretation:</strong> TPM values directly represent the abundance of transcripts in a sample.</p>
</li>
<li>
<p><strong>Preferred for comparisons:</strong> TPM facilitates between-sample comparisons better than FPKM.</p>
</li>
</ol><h2>FPKM (Fragments Per Kilobase Million)</h2><p>FPKM normalizes read counts by transcript length and sequencing depth, but without enforcing proportionality like TPM. It is defined as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Historical significance:</strong> FPKM was one of the first normalization methods used for RNA-Seq.</p>
</li>
<li>
<p><strong>Single-end vs. paired-end:</strong> In paired-end sequencing, FPKM becomes RPKM (Reads Per Kilobase Million).</p>
</li>
<li>
<p><strong>Limited utility:</strong> FPKM values are not as robust as TPM for cross-sample comparisons due to lack of proportionality.</p>
</li>
</ol><h2>CPM (Counts Per Million)</h2><p>CPM normalizes raw read counts by sequencing depth, without considering gene length. It is expressed as:</p><h3>Key Features:</h3><ol>
<li>
<p><strong>Simplicity:</strong> CPM is straightforward and computationally less intensive.</p>
</li>
<li>
<p><strong>Application:</strong> Suitable for non-length-dependent analyses, such as comparing total expression levels or differential expression analysis.</p>
</li>
<li>
<p><strong>Gene length agnostic:</strong> CPM does not correct for gene length, making it less ideal for measuring expression levels.</p>
</li>
</ol><h2>When to Use Each Method</h2><ul>
<li>
<p><strong>TPM:</strong> Best for comparing expression levels between samples, especially when transcript length and sequencing depth vary.</p>
</li>
<li>
<p><strong>FPKM:</strong> Useful for historical consistency but generally replaced by TPM.</p>
</li>
<li>
<p><strong>CPM:</strong> Ideal for differential expression analysis when gene length normalization is unnecessary.</p>
</li>
</ul><h2>Conclusion</h2><p>Choosing the right normalization method depends on the specific objectives of your RNA-Seq analysis. TPM&rsquo;s proportionality and robustness make it the preferred choice for most applications, while CPM serves well for differential expression studies. Although FPKM paved the way for RNA-Seq normalization, it has largely been supplanted by TPM in modern workflows. Understanding these methods and their nuances ensures accurate and meaningful interpretations of RNA-Seq data.</p><h3>References:</h3><ol>
<li>
<p>Li, B., &amp; Dewey, C. N. (2011). RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. <em>BMC Bioinformatics.</em></p>
</li>
<li>
<p>Trapnell, C., et al. (2010). Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. <em>Nature Biotechnology.</em></p>
</li>
<li>
<p>Law, C. W., et al. (2014). voom: precision weights unlock linear model analysis tools for RNA-seq read counts. <em>Genome Biology.</em></p>
</li>
</ol>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/3885/precision-medicine</guid>
	<pubDate>Sat, 24 Aug 2013 15:47:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/3885/precision-medicine</link>
	<title><![CDATA[Precision Medicine]]></title>
	<description><![CDATA[<p>Coupling established clinical&ndash;pathological indexes with state-of-the-art molecular profiling to create diagnostic, prognostic, and therapeutic strategies precisely tailored to each patient's requirements &mdash; hence the term &ldquo;Precision medicine&rdquo;&nbsp;</p>
<p>Source:<a href="http://www.nejm.org/doi/full/10.1056/NEJMp1114866">http://www.nejm.org/doi/full/10.1056/NEJMp1114866</a></p>
<p><strong>Another video on precision medicine</strong>:</p>
<p><a href="http://www.youtube.com/watch?v=Pi8W0yOXnzE">http://www.youtube.com/watch?v=Pi8W0yOXnzE</a></p>
<p>Precision Medicine basically intergrates bioinformatics, genomics , genetics, molecular biology and nanotechnology to deliver precise cure/diagnotics to a specific patient.</p>
<p>Examples:</p>
<ul>
<li><span>The drug imatinib (Gleevec) designed to inhibit an altered enzyme produced by a fused version of two genes found in chronic myelogenous leukemia.</span></li>
<li><span>The breast cancer drug trastuzumab (Herceptin) works only for women whose tumors have a particular genetic profile called HER-2 positive.</span></li>
</ul>
<p><span>E.g. source :</span></p>
<p><span><a href="http://www.bionews-tx.com/news/2013/08/15/how-the-impact-of-cancer-genomics-on-precision-medicine-is-revolutionizing-cancer-treatment/">http://www.bionews-tx.com/news/2013/08/15/how-the-impact-of-cancer-genomics-on-precision-medicine-is-revolutionizing-cancer-treatment/</a></span></p><p>Address of the bookmark: <a href="http://www.cbsnews.com/video/watch/?id=50149783n" rel="nofollow">http://www.cbsnews.com/video/watch/?id=50149783n</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/14024/grapher</guid>
	<pubDate>Thu, 14 Aug 2014 14:02:17 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/14024/grapher</link>
	<title><![CDATA[GrapheR !!!]]></title>
	<description><![CDATA[<p>What a wonderful gem <em>GrapheR</em> is.... Oh yes it is. <em>GrapheR</em> is a GUI for base graphics in R by http://www.maximeherve.com/. The package provides a graphical user interface for creating base charts in R. It is ideal for beginners in R, as the user interface is very clear and the code is written along side into a text file, allowing users to recreate the charts directly in the console. <br /><br />Adding and changing legends? Messing around with the plotting window settings? It is much easier/quicker with this GUI than reading the help file and trying to understand the various parameters.<br />Here is a little example using the iris data set.<br /><br />library(GrapheR)<br />data(iris)<br />run.GrapheR()<br /><br />This will bring up a window that helps me to create the chart and tweak the various parameters.</p><p><img src="http://4.bp.blogspot.com/-NbnCM1dPh3E/U9aW9YxJ9oI/AAAAAAAABgo/gEPzPhOpf2Y/s1600/GrapheR.png" alt="image" width="878" height="868" style="border: 0px; border: 0px;"><br /><br />Finally, I find the underlying R code in a file created by <em>GrapheR</em>. For more details read also the <a href="http://cran.r-project.org/web/packages/GrapheR/index.html" target="_blank">package vignette</a>, which is available in <a href="http://cran.r-project.org/web/packages/GrapheR/vignettes/manual_en.pdf" target="_blank">English</a>, <a href="http://cran.r-project.org/web/packages/GrapheR/vignettes/manual_fr.pdf" target="_blank">French</a> and <a href="http://cran.r-project.org/web/packages/GrapheR/vignettes/manual_de.pdf" target="_blank">German</a>!</p>]]></description>
	<dc:creator>John Parker</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34485/phyloxml-xml-for-evolutionary-biology-and-comparative-genomics</guid>
	<pubDate>Wed, 29 Nov 2017 10:04:48 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34485/phyloxml-xml-for-evolutionary-biology-and-comparative-genomics</link>
	<title><![CDATA[phyloXML:  XML for evolutionary biology and comparative genomics]]></title>
	<description><![CDATA[<p><a href="http://www.biomedcentral.com/1471-2105/10/356/">phyloXML</a><span>&nbsp;(</span><a href="http://www.phyloxml.org/examples_syntax/phyloxml_syntax_example_1.html">example</a><span>) is an&nbsp;</span><a href="http://en.wikipedia.org/wiki/XML">XML</a><span>&nbsp;language designed to describe phylogenetic trees (or networks) and associated data. PhyloXML provides elements for commonly used features, such as taxonomic information, gene names and identifiers, branch lengths, support values, and gene duplication and speciation events. Using these standardized elements allows interoperability between various applications and databases. Furthermore, both due to extensible nature of XML itself and the provision of &lt;property&gt; elements by phyloXML, extensibility as well as domain specific applications are ensured. The structure of phyloXML is described by&nbsp;</span><a href="http://en.wikipedia.org/wiki/XML_Schema_%28W3C%29">XML Schema Definition (XSD)</a><span>&nbsp;language.</span></p><p>Address of the bookmark: <a href="http://www.phyloxml.org/" rel="nofollow">http://www.phyloxml.org/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/14186/pybedtools</guid>
	<pubDate>Wed, 20 Aug 2014 01:03:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/14186/pybedtools</link>
	<title><![CDATA[pybedtools]]></title>
	<description><![CDATA[<p>pybedtools is a Python wrapper for Aaron Quinlan's BEDtools programs (https://github.com/arq5x/bedtools), which are widely used for genomic interval manipulation or "genome algebra". pybedtools extends BEDTools by offering feature-level manipulations from with Python. See full online documentation, including installation instructions, at http://pythonhosted.org/pybedtools/.</p><p>More at http://pythonhosted.org/pybedtools/</p><p>A powerful toolset for genome arithmetic.http://code.google.com/p/bedtools/</p>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36267/pspairwise-sequentially-markovian-coalescent-psmc-model</guid>
	<pubDate>Thu, 19 Apr 2018 05:29:23 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36267/pspairwise-sequentially-markovian-coalescent-psmc-model</link>
	<title><![CDATA[PSPairwise Sequentially Markovian Coalescent (PSMC) model]]></title>
	<description><![CDATA[<p><span>Implementation of the Pairwise Sequentially Markovian Coalescent (PSMC) model</span></p><p>Address of the bookmark: <a href="https://github.com/lh3/psmc" rel="nofollow">https://github.com/lh3/psmc</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</guid>
	<pubDate>Thu, 10 Jan 2019 04:42:06 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/38649/ngs-platforms-launched-by-bgi%E2%80%99s-mgi-tech</link>
	<title><![CDATA[NGS Platforms launched by BGI’s MGI Tech]]></title>
	<description><![CDATA[<p>MGI Tech Co., Ltd. (MGI), a subsidiary of BGI Group, is committed to enabling effective and affordable healthcare solutions for all. Based on its proprietary technology, MGI produces sequencing devices, equipment, consumables and reagents to support life science research, medicine and healthcare. MGI's multi-omics platforms include genetic sequencing, mass spectrometry and medical imaging. Providing real-time, comprehensive, life-long solutions, its mission&nbsp;is to&nbsp;develop and promote advanced life science tools for future healthcare.</p><p>MGI, a subsidiary of global genomics leader BGI Group, announced pricing and its first early access customer for the new ultra high-throughput sequencer, MGISEQ-T7, saying it has driven down sequencing cost to&nbsp;$5&nbsp;per gigabyte, with exceptionally high accuracy. Such innovations are helping more people to realize the benefits of genomic information.</p><p>In October, MGI launched the MGISEQ-T7, a highly flexible production-scale platform that is the most powerful sequencer to date. It can produce as many as 60 whole human genomes in one day. The instrument sells for&nbsp;$1 million.</p><p>The T7 enables simultaneous but independent operation of up to four flow cells, which means different applications such as single-cell RNA sequencing, whole exome sequencing and whole genome sequencing can be run in different flow cells at the same time. This helps to reduce costs, allowing MGI to offer the most competitive sequencing price in the market.</p><p><span>Powered by DNBseq&trade;, MGISEQ delivers quality data with accuracy for SNP and Indel calling rate of 99.9% and 99%, respectively, along with decreased duplication rate down to less than 2 percent, and almost zero Index mis-assignment rate.</span></p><p><span><span>SOURCE MGI</span></span></p><p>https://www.bgi.com/global/company/news/bgis-mgi-tech-launches-two-new-ngs-platforms/</p><p>http://en.mgitech.cn/</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40834/nucleus-python-and-c-code-for-reading-and-writing-genomics-data</guid>
	<pubDate>Sun, 02 Feb 2020 08:14:19 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40834/nucleus-python-and-c-code-for-reading-and-writing-genomics-data</link>
	<title><![CDATA[Nucleus: Python and C++ code for reading and writing genomics data.]]></title>
	<description><![CDATA[<p>Nucleus is a library of Python and C++ code designed to make it easy to read, write and analyze data in common genomics file formats like SAM and VCF. In addition, Nucleus enables painless integration with the TensorFlow machine learning framework, as anywhere a genomics file is consumed or produced, a TensorFlow tfrecords file may be used instead.</p><p>Address of the bookmark: <a href="https://github.com/google/nucleus" rel="nofollow">https://github.com/google/nucleus</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>