<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/30147?offset=1210</link>
	<atom:link href="https://bioinformaticsonline.com/related/30147?offset=1210" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44254/bioinformatics-chat</guid>
	<pubDate>Mon, 13 Mar 2023 13:20:27 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44254/bioinformatics-chat</link>
	<title><![CDATA[Bioinformatics Chat !]]></title>
	<description><![CDATA[<p>The bioinformatics chat is a podcast about computational biology, bioinformatics, and next generation sequencing.</p>
<p>The bioinformatics chat is produced by&nbsp;<a href="https://ro-che.info/">Roman&nbsp;Cheplyaka</a>&nbsp;and hosted by Roman and&nbsp;<a href="https://jmschrei.github.io/">Jacob&nbsp;Schreiber</a>.</p><p>Address of the bookmark: <a href="https://bioinformatics.chat/" rel="nofollow">https://bioinformatics.chat/</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/view/2021</guid>
	<pubDate>Mon, 12 Aug 2013 09:27:57 -0500</pubDate>
	<link>https://bioinformaticsonline.com/view/2021</link>
	<title><![CDATA[What are the difference between BioRuby and BioGem?]]></title>
	<description><![CDATA[<p>I came across two diferent but matching term BioRuby and BioGem. What are the difference between these two term? If both are using same Ruby language for development then why did they develope two different biological packages.</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44618/important-bioinformatics-tools</guid>
	<pubDate>Tue, 30 Jul 2024 05:03:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44618/important-bioinformatics-tools</link>
	<title><![CDATA[Important Bioinformatics Tools !]]></title>
	<description><![CDATA[<p><span>1. Ktrim: An extra-fast, accurate adapter trimmer for sequencing data. It processes FASTQ files from multiple lanes with minimal mismatching and over-trimming of adapters.</span><span><br /></span><span><br /></span><span>2. BWA MEM: A reliable alignment tool (particularly for mapping ALT contigs and HLA genes, which are not fully addressed in BWA-MEM2).</span><span><br /></span><span><br /></span><span>3. Sambamba markdup: Quickly marks or removes duplicate reads using Picard's criteria.</span><span><br /></span><span><br /></span><span>4. ichorCNA: Estimates the tumor DNA fraction in cell-free DNA from ultra-low-pass whole genome sequencing (0.1x coverage) based on copy number alterations (CNA).</span><span><br /></span><span><br /></span><span>5. Fragle: A deep learning method for quantifying ctDNA levels from cell-free DNA fragmentomic profiles. It detects TF as low as ~1% ctDNA and works with targeted genomic panel sequencing data.</span><span><br /></span><span><br /></span><span>6. AlfredQC: A quality control tool for high-throughput sequencing data. It assesses metrics like read quality scores, GC content, and duplication rates, visualized through detailed plots and summary statistics.</span><span><br /></span><span><br /></span><span>7. Mosdepth: A fast tool for calculating sequencing coverage depth, offering a quicker alternative to samtools/sambamba depth by processing BAM and CRAM files.</span><span><br /></span><span><br /></span><span>8. Bedtools: A versatile toolkit for genomics, enabling operations like intersect, merge, count, and shuffle on genomic intervals across formats such as BAM, BED, GFF/GTF, and VCF.</span><span><br /></span><span><br /></span><span>9. Datamash: A command-line tool for basic numeric, textual, and statistical operations on input data streams. It supports operations such as grouping, sorting, transposing, and performing arithmetic calculations on tabular data.</span><span><br /></span><span><br /></span><span>10.</span><span> </span><a href="http://gwf.app/" target="_self">gwf.app</a><span>: A pragmatic alternative to Snakemake. Developed at</span><span> </span><a href="https://www.linkedin.com/company/aarhus-university-denmark-/" target="_self"><span>Aarhus University</span></a><span>, this flexible, generic workflow tool builds and runs large scientific workflows.</span></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/4090/computational-biology-in-the-21st-century-making-sense-out-of-massive-data</guid>
	<pubDate>Thu, 29 Aug 2013 08:32:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/4090/computational-biology-in-the-21st-century-making-sense-out-of-massive-data</link>
	<title><![CDATA[Computational Biology in the 21st Century: Making Sense out of Massive Data]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/I99UiA_vaJQ" frameborder="0" allowfullscreen></iframe>Computational Biology in the 21st Century: Making Sense out of Massive Data    
    
Air date:  Wednesday, February 01, 2012, 3:00:00 PM
Category:  Wednesday Afternoon Lectures  
 
Description:  The last two decades have seen an exponential increase in genomic and biomedical data, which will soon outstrip advances in computing power to perform current methods of analysis. Extracting new science from these massive datasets will require not only faster computers; it will require smarter algorithms. We show how ideas from cutting-edge algorithms, including spectral graph theory and modern data structures, can be used to attack challenges in sequencing, medical genomics and biological networks. 

The NIH Wednesday Afternoon Lecture Series includes weekly scientific talks by some of the top researchers in the biomedical sciences worldwide. 

Author:  Dr. Bonnie Berger  
Runtime:  00:58:06  
Permanent link:  http://videocast.nih.gov/launch.asp?17563]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</guid>
	<pubDate>Tue, 01 Oct 2024 03:45:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</link>
	<title><![CDATA[Bioinformatics Lecture Notes]]></title>
	<description><![CDATA[<h1 style="text-align: center;">Study Resources for</h1><h1 style="text-align: center;">ECM3413 - Bioinformatics</h1><p style="text-align: center;">Contents</p><p style="text-align: center;">&nbsp;</p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#GenInfo">General Information</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Lecture Slides</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Past Exam Paper</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Assess">Continuous Assessments</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Reading">Suggested Reading List</a></p><p><a name="GenInfo" id="GenInfo"></a><strong>General Information</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">This module runs in Semester 2.&nbsp;</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">It is taught by&nbsp;<a href="http://www.secam.ex.ac.uk/staff/index.php?nav=40&amp;group=Teaching%20Fellows&amp;user_directory_limit=&amp;user_directory_order=&amp;sid=182">Dr Ed Keedwell</a>&nbsp;(Module Coordinator)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Module Descriptor</strong>:&nbsp;&nbsp;<a href="http://www.secam.ex.ac.uk/student/modules?mid=393">ECM3413</a></td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Lecture Times</strong>: Tuesday 5pm,&nbsp; 171| Thursday, 171</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Workshop Times</strong>: Wednesday 11am Blue Room (Weeks 29,33 &amp;40)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p><strong>Assessment:&nbsp;</strong>2 CAs each worth 15% | 1 Examination worth 70%</p>
</td>
</tr>
</tbody>
</table><p>&nbsp;&nbsp;</p><p style="text-align: left;"><strong><a name="Slides" id="Slides"></a>Lecture Slides&nbsp;</strong>(if you have to print slides, to save your ink choose 'print in black and white' on the print menu)</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.pdf">PDF</a>| Lecture 1 - Introduction to Bioinformatics (&amp; Biology)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.pdf">PDF</a>| Lecture 2 - Genome Sequences: from fragments to sequences</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.pdf">PDF</a>| Lecture 3 - Sequence Alignment 1</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.pdf">PDF</a>| Lecture 4 - Global Pairwise Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.pdf">PDF</a>| Lecture 5 - Local Pairwise Sequence Alignment (Smith-Waterman &amp; BLAST)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop1.doc">DOC</a>| Workshop 1 - Using BLAST and other Bioinformatics Databases</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.pdf">PDF</a>| Lecture 6 - Multiple Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.pdf">PDF</a>| Lecture 7 - BLAST (in more detail) &amp; FASTA</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.pdf">PDF</a>| Lecture 8 - Sequence Alignment Conclusion &amp; Other Sequence Analyses</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.pdf">PDF</a>| Lecture 9 - Markov Chains and Intro to Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.pdf">PDF</a>| Lecture 10 - Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.pdf">PDF</a>| Lecture 11 - Classification in Bioinformatics</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop2.doc">DOC</a>|Workshop 2 - Using See5</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Workshop Data - Part 1 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.names">adult.names&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.data">adult.data&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.test">adult.test,&nbsp;</a>Part 3 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.names">wdbc.names</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.data">wdbc.data</a></p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.pdf">PDF</a>| Lecture 12 - Gene Expression Data</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.pdf">PDF</a>| Lecture 13 - Decision Trees and Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.pdf">PDF</a>| Lecture 14 - Other Methods for Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.pdf">PDF</a>| Lecture 15 - Gene Regulation</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.pdf">PDF</a>| Lecture 16 - Neural Networks in Gene Expression Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.pdf">PDF</a>| Lecture 17 - Genome Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.pdf">PDF</a>| Lecture 18 - Conclusion/Revision Lecture</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;">For some reason best known to itself, my PDF creator doesn't like the slide with the substitution matrix on.&nbsp; Therefore this has been removed from Lectures 3 and 7 for the PDF copy only - however, more information on these matrices can be found&nbsp;<a href="http://www.ebi.ac.uk/help/matrix.html">here</a>.</p><p style="text-align: left;"><strong><a name="Past%20Paper"></a>Past Exam Paper</strong></p><p style="text-align: left;">The paper from 2007/8 can be found&nbsp;<a href="http://library.exeter.ac.uk/exampapers/">here</a>.</p><p style="text-align: left;"><strong><a name="Assess" id="Assess"></a>Continuous Assessments</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA1ECM3413.pdf">PDF</a>|&nbsp; CA1 - Manual Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA2ECM3413.pdf">PDF</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.names">Promoter.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.data">Promoter.data</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.names">ML.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.data">ML.data</a>| CA2 - Data Mining in Bioinformatics</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong><a name="Reading" id="Reading"></a>Suggested Reading List</strong></p><p style="text-align: left;"><strong>General Bioinformatics</strong></p><p>&lt;="top"&gt;Xiong, J., (2006) Essential Bioinformatics, Cambridge University Press</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Lesk, A.M., (2002) Introduction to Bioinformatics, Oxford University Press</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Higgs, P.G., (2005) Bioinformatics and Molecular Evolution,&nbsp; Blackwell Publishing</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong>Machine Learning in Bioinformatics</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Baldi, P., Brunak, S., (2001) Bioinformatics: The Machine Learning Approach, MIT Press</p>
</td>
</tr>
<tr>
<td><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Keedwell, E., Narayanan, A., (2005) Intelligent Bioinformatics: The Application of Artificial Intelligence Techniques to Bioinformatics Problems, Wiley</p>
</td>
</tr>
</tbody>
</table>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2334/binc-bioinformatics-national-certification-website-address</guid>
	<pubDate>Wed, 14 Aug 2013 09:40:22 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2334/binc-bioinformatics-national-certification-website-address</link>
	<title><![CDATA[BINC (BioInformatics National Certification) Website address]]></title>
	<description><![CDATA[<p><span>BINC (BioInformatics National Certification) is an initiative of Department of Biotechnology(DBT), Government Of India in coordination with Bioinformatics Center, University of Pune. The objective of the examination is to recognize trained manpower in the area of Bioinformatics. Currently, various Indian universities, Government and private institutions are involved in imparting courses in Bioinformatics in India.</span></p>
<p>Foreign nationals intending to have certification are eligible to appear for BINC examination.<br>Minimum qualification includes a degree from a recognized university/institute in the areas listed in FAQ.<br>Formal training in the area of Bioinformatics is not a prerequisite.<br>Note that the foreign students will only be certified by DBT and are not eligible for the cash award as well as junior research fellowship.</p><p>Address of the bookmark: <a href="http://binc.scisjnu.ernet.in/" rel="nofollow">http://binc.scisjnu.ernet.in/</a></p>]]></description>
	<dc:creator>Kamalakshi Mukherjee</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/44700/professorsenior-lecturer-of-comparative-genomics-university-of-glasgow</guid>
  <pubDate>Fri, 06 Dec 2024 05:16:09 -0600</pubDate>
  <link></link>
  <title><![CDATA[Professor/Senior Lecturer of Comparative Genomics @ University of Glasgow]]></title>
  <description><![CDATA[
<p>University of Glasgow<br />College of Medical, Veterinary and Life Sciences<br />School of Biodiversity, One Health and Veterinary Medicine</p>

<p>Professor/Senior Lecturer of Comparative Genomics<br />Vacancy Ref: 153610<br />Salary: Professor, Grade 10 will be within the Professorial range and<br />subject to negotiation<br />Senior Lecturer, Grade 9, 57,696 - 64,914 per annum</p>

<p>The School of Biodiversity, One Health and Veterinary Medicine has an<br />exciting opportunity to appoint a Professor/Senior Lecturer in Comparative<br />Genomics. You will make a substantial and positive contribution to the<br />strategic direction of the School/College through leading and contributing<br />to research of international standard, high quality teaching at both<br />undergraduate and postgraduate level, securing research funding, and<br />providing academic leadership and management within the School/College.</p>

<p>Applications are invited from candidates of international standing with<br />an appropriate record of academic achievement in comparative genomics<br />and associated omics technologies. We are looking for a candidate who<br />will complement our existing strengths in clinical veterinary medicine,<br />evolutionary biology, and animal physiology, with a demonstrable interest<br />in using domestic mammals among their study systems. We are particularly<br />interested in applications from candidates with a track record of<br />studying health related traits and their underlying genomic basis in<br />companion animals. Traits of specific interest include those related<br />to metabolism, ageing, and disease (e.g. cancer, autoimmune diseases,<br />neuromuscular disorders).</p>

<p>The School of Biodiversity, One Health and Veterinary Medicine is home to<br />researchers studying organismal biology and animal health across a diverse<br />range of systems, approaches and disciplines with existing strengths<br />in infectious disease, physiology, ageing, veterinary epidemiology, and<br />evolution among others. You will be based on the University of Glasgow's<br />Garscube campus, where the majority of veterinary teaching and research<br />infrastructure is located. This includes the Small Animal Hospital (a<br />recent 15M investment) and our Veterinary Diagnostic Services, offering<br />excellent opportunities for collaborative research at the clinical and<br />translational interface, especially with respect to companion animals.</p>

<p>We welcome applications from candidates with a Scottish Credit and<br />Qualification Framework level 12 (PhD) in animal biology, genomics and<br />health or related discipline with an extensive and established reputation<br />in research and significant teaching experience within the subject area.</p>

<p>This post is full time and open ended.</p>

<p>Visit our website for further information on The University of<br />Glasgow's, School of Biodiversity, One Health &amp; Veterinary Medicine,<br />https://www.gla.ac.uk/schools/bohvm/</p>

<p>Informal Enquiries should be directed to Professor Roman Biek,<br />Roman.Biek@glasgow.ac.uk</p>

<p>Apply online at:<br />https://my.corehr.com/pls/uogrecruit/erq_jobspec_version_4.jobspec?p_id=153610</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/7674/useful-publications-and-websites-for-deep-sequencing-data-analysis</guid>
	<pubDate>Sun, 29 Dec 2013 22:30:45 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/7674/useful-publications-and-websites-for-deep-sequencing-data-analysis</link>
	<title><![CDATA[Useful Publications and Websites for Deep Sequencing Data Analysis]]></title>
	<description><![CDATA[<h3>Global overview papers</h3><p>Next generation quantitative genetics in plants. Jim&eacute;nez-G&oacute;mez, Frontiers in Plant Science 2:77, 2011 <span style="text-decoration: underline;"><a href="http://www.frontiersin.org/Plant_Physiology/10.3389/fpls.2011.00077/full">Full Text</a> </span><em>[equally relevant to animal and microbial systems]</em></p><p>Sense from sequence reads: methods for alignment and assembly. Flicek &amp; Birney, Nat Methods 6(11 Suppl):S6-S12, 2009. <a href="http://www.nature.com/nmeth/journal/v6/n11s/full/nmeth.1376.html"><span style="text-decoration: underline;">Full Text</span></a></p><h3>Library construction and experimental design</h3><p>Statistical design and analysis of RNA sequencing data. Auer &amp; Doerge, Genetics 185(2):405-16, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881125"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Biases in Illumina transcriptome sequencing caused by random hexamer priming. Hansen et al., Nucleic Acids Res. 38(12): e131, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896536"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Aird et al, Genome Biology 12:R18, 2011 <a href="http://genomebiology.com/2011/12/2/R18"><span style="text-decoration: underline;">Full Text</span></a></p><p>Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of GC-biased genomes. Kozarewa et al, Nature Methods 6(4):291-5, 2009 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664327/"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture. Rohland &amp; Reich, Genome Research 22(5): 939&ndash;946. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3337438/"><span style="text-decoration: underline;">PubMedCentral</span></a></p><h3>Data formats, data management, and alignment software tools<span style="text-decoration: underline;"> </span></h3><p>The Sequence Alignment/Map format and SAMtools. Li et al, Bioinformatics 25(16):2078-9, 2009 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2723002"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>SAM format specification <a href="http://samtools.sourceforge.net/SAM1.pdf"><span style="text-decoration: underline;">file</span></a></p><p>Efficient storage of high throughput sequencing data using reference-based compression. Fritz et al, Genome Res 21(5):734-40, 2011. <a href="http://genome.cshlp.org/content/21/5/734.long"><span style="text-decoration: underline;">Full Text</span></a></p><p>Compression of DNA sequence reads in FASTQ format. Deorowicz &amp; Grabowski, Bioinformatics 27(6):860-2, 2011. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21252073"><span style="text-decoration: underline;">PubMed</span></a></p><p>Fast and accurate short read alignment with Burrows-Wheeler transform. Li &amp; Durbin, Bioinformatics 25(14):1754-60, 2009. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705234"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Improving SNP discovery by base alignment quality. Li H, Bioinformatics 27(8):1157-8, 2011. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21320865"><span style="text-decoration: underline;">PubMed</span></a></p><p>BEDTools: a flexible suite of utilities for comparing genomic features. Quinlan and Hall, Bioinformatics 26:841-842, 2010. <a href="http://bioinformatics.oxfordjournals.org/content/26/6/841.full.pdf+html"><span style="text-decoration: underline;">Publisher Website</span></a></p><h3>Data quality assessment, filtering, and correction</h3><p>SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. Cox et al, BMC Bioinformatics 11:485, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2956736"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>TileQC: a system for tile-based quality control of Solexa data. Dolan &amp; Denver, BMC Bioinformatics 9:250, 2008 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2443380"><span style="text-decoration: underline;">PubMedCentral</span></a> <em>[requires a reference sequence]</em></p><p>Quake: quality-aware detection and correction of sequencing errors. Kelley et al, Genome Biol 11(11):R116, 2010. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21114842"> <span style="text-decoration: underline;">PubMed</span></a></p><p>FastQC: a quality control tool for high-throughput sequence data. <a href="http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/"><span style="text-decoration: underline;">Home Page</span></a></p><p>FASTX-toolkit: FASTQ/A short-reads pre-processing tools <a href="http://hannonlab.cshl.edu/fastx_toolkit/"><span style="text-decoration: underline;">Home Page</span></a></p><p>Reference-free validation of short read data. Schr&ouml;der et al, PLoS One 5(9):e12681, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2943903"> <span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Correction of sequencing errors in a mixed set of reads. Salmela, Bioinformatics 26(10):1284, 2010. <a href="http://bioinformatics.oxfordjournals.org/content/26/10/1284.long"><span style="text-decoration: underline;">Full Text</span></a> <em>[includes error correction of SOLiD reads in colorspace]</em></p><p>Repeat-aware modeling and correction of short read errors. Yang et al, BMC Bioinformatics 12(Supp1):S52, 2011 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3044310"> <span style="text-decoration: underline;">PubMedCentral</span></a> <em>[requires a reference sequence]</em></p><p>HiTEC: accurate error correction in high-throughput sequencing data. Ilie et al, Bioinformatics 27(3):295, 2011 <a href="http://bioinformatics.oxfordjournals.org/content/27/3/295.long"><span style="text-decoration: underline;">Full Text</span></a></p><p>Error correction of high-throughput sequencing datasets with non-uniform coverage. Medvedev et al., Bioinformatics 27(13):i137-41, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117386"><span style="text-decoration: underline;">PubMedCentral</span></a></p><h3>De novo assembly<span style="text-decoration: underline;"> </span></h3><p>Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Zerbino &amp; Birney, Genome Res 18(5):821-9, 2008. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2336801">u&gt;PubMedCentral</a></p><p>Assembly of large genomes using second-generation sequencing. Schatz et al, Genome Res 20(9):1165-73, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928494"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Gnerre et al, PNAS 108(4): 1513-18, 2011 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3029755"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Genome assembly has a major impact on gene content: a comparison of annotation in two <em>Bos taurus </em> assemblies. Florea&nbsp; et al., PLoS One 6(6):e21400, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120881/"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data. Carver et al, Bioinformatics 28(4):464 - 469, 2012 <span style="text-decoration: underline;"><a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278759/">PubMedCentral</a></span></p><p>Efficient de novo assembly of large genomes using compressed data structures. Simpson &amp; Durbin, Genome Research 22:549-556, 2012 <span style="text-decoration: underline;"><a href="http://genome.cshlp.org/content/22/3/549.full">Full Text</a></span> <em>[Describes the String Graph Assembler (SGA), which assembled a human genome in less than 6 days using 54 Gb of RAM and a 123-processor compute cluster for calculation of an FM-index of the 1.2 billion reads]</em></p><p>Readjoiner: a fast and memory efficient string graph-based sequence assembler. Gonnella &amp; Kurtz, BMC Bioinformatics 13: 82, 2012 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507659"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Assemblathon 1: A competitive assessment of de novo short read assembly methods. Earl et al, Genome Research 21:2224-2241, 2011 <span style="text-decoration: underline;"><a href="http://genome.cshlp.org/content/early/2011/09/16/gr.126599.111.full.pdf+html">Full Text</a></span></p><h3>Chromatin immunoprecipation analysis: ChIP-seq</h3><p>ChIP-seq: advantages and challenges of a maturing technology. Park, Nat Rev Genet. 10:669-80, 2009 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3191340/"><span style="text-decoration: underline;">PubMed</span></a></p><p>ChIP-seq and Beyond: new and improved methodologies to detect and characterize protein-DNA interactions. Furey, Nat Rev Genet 13: 840&ndash;852, 2012 <a href="http://www.nature.com/nrg/journal/v13/n12/full/nrg3306.html"> <span style="text-decoration: underline;">Publisher Web Site</span></a></p><p>MuMoD: a Bayesian approach to detect multiple modes of protein&ndash;DNA binding from genome-wide ChIP data. Narlikar, Nucleic Acids Res 41:21&ndash;32, 2013 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592440/"><span style="text-decoration: underline;">PubMed</span></a></p><h3>Transcriptome analysis</h3><h3>Assembly and comparison to genome</h3><p>Full-length transcriptome assembly from RNA-Seq data without a reference genome. Grabherr et al, Nature Biotechnology 29:644 - 652, 2011. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21572440"><span style="text-decoration: underline;">PubMed</span></a> <em>[The software is called <a href="http://trinityrnaseq.sourceforge.net/"><span style="text-decoration: underline;">Trinity</span></a>, and is available on Sourceforge.]</em></p><p>Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome. Peng et al, Nature Biotechnology 30:253 - 260, 2012. <span style="text-decoration: underline;"><a href="http://www.ncbi.nlm.nih.gov/pubmed/22327324">PubMed</a></span> <em>[Several comments on this paper question whether the reported differences are in fact evidence of editing or are simply sequencing errors - the authors stand by their conclusions, but the controversy demonstrates the importance of robust data analysis methods.] </em></p><p>Optimization of de novo transcriptome assembly from next-generation sequencing data. Surget-Groba &amp; Montoya-Burgos, Genome Res 20(10):1432-40, 2010. <a href="http://genome.cshlp.org/content/20/10/1432.long"><span style="text-decoration: underline;">Full Text</span></a></p><p>Rnnotator: an automated <em>de novo</em> transcriptome assembly pipeline from stranded RNA-Seq reads. Martin et al, BMC Genomics 11:663, 2010 <a href="http://www.biomedcentral.com/1471-2164/11/663"><span style="text-decoration: underline;">Full Text</span></a></p><p><em>De novo</em> assembly and analysis of RNA-seq data. Robertson et al, Nature Methods 7:909-912, 2010 <a href="http://www.nature.com/nmeth/journal/v7/n11/full/nmeth.1517.html"><span style="text-decoration: underline;">Full Text</span></a> <em>[describes Trans-ABySS, a pipeline to use the ABySS parallel assembler for de novo transcriptome analysis]</em></p><h3>Differential expression analysis</h3><p>R-SAP: a multi-threading computational pipeline for the characterization of high-throughput RNA-sequencing data. Mittal &amp; McDonald, Nucleic Acids Res, 2012 <span style="text-decoration: underline;"><a href="http://nar.oxfordjournals.org/content/early/2012/01/28/nar.gks047.long">Full Text</a></span></p><p>Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Mercer et al, Nature Biotechnology 30:99 - 104, 2012 <span style="text-decoration: underline;"><a href="http://www.nature.com/nbt/journal/v30/n1/full/nbt.2024.html"> Publisher Website</a></span></p><p>Differential gene and transcript expression analysis of RNA-Seq experiments with TopHat and Cufflinks. Trapnell et al, Nature Protocols 7:562 - 578, 2012 <span style="text-decoration: underline;"><a href="http://www.nature.com/nprot/journal/v7/n3/full/nprot.2012.016.html"> Publisher Website</a></span></p><p>Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling. Łabaj et al, Bioinformatics 27:i383 - i391, 2011 <span style="text-decoration: underline;"><a href="http://bioinformatics.oxfordjournals.org/content/27/13/i383.full.pdf+html"> Full Text</a></span></p><p>Improving RNA-Seq expression estimates by correcting for fragment bias. Roberts et al, Genome Biol 12:R22, 2011 <span style="text-decoration: underline;"><a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3129672/">PubMed Central</a></span></p><p>Cloud-scale RNA-sequencing differential expression analysis with Myrna. Langmead et al, Genome Biol 11:R83, 2010 <a href="http://genomebiology.com/2010/11/8/R83"><span style="text-decoration: underline;">Full Text</span></a></p><p>From RNA-seq reads to differential expression results. Oshlack et al, Genome Biol 11(12):220, 2010 <a href="http://genomebiology.com/content/11/12/220"><span style="text-decoration: underline;">Full Text</span></a></p><p>DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Wang et al., Bioinformatics. 26(1):136-8. 2010 <a href="http://www.ncbi.nlm.nih.gov/pubmed/19855105"><span style="text-decoration: underline;"> PubMed</span></a></p><p>DEseq: Differential expression analysis for sequence count data. Anders and Huber, Genome Biology 11:R106, 2010 <a href="http://genomebiology.com/2010/11/10/R106"><span style="text-decoration: underline;">Full Text</span></a></p><p>edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Robinson et al., Bioinformatics 26(1):139-40 2010 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2796818"> <span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Two-stage Poisson model for testing RNA-seq data. Auer and Doerge, SAGMB 10(1), article 26 <a href="http://www.bepress.com/sagmb/vol10/iss1/art26/"><span style="text-decoration: underline;">Full Text</span></a></p><p>Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments. McCormick et al., Silence2(1):2, 2011 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3055805"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>RNA-Seq gene expression estimation with read mapping uncertainty. Li et al, Bioinformatics 26:493-500, 2010 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2820677">PubMedCentral</a> <em>[describes the RSEM software package]</em></p><h3>Comparing genomes and assemblies; variant detection<span style="text-decoration: underline;"> </span></h3><p>Versatile and open software for comparing large genomes. Kurtz et al, Genome Biol (5(2):R12, 2004. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC395750"><span style="text-decoration: underline;">PubMedCentral</span></a> <em>[describes the MUMmer software for full-genome alignment &amp; comparisons]</em></p><p>Searching for SNPs with cloud computing. Langmead et al, Genome Biol 10(11):R134, 2009 <a href="http://genomebiology.com/content/10/11/R134"><span style="text-decoration: underline;">Full Text</span></a></p><p>Calling SNPs without a reference sequence. Ratan et al, BMC Bioinformatics 11:130, 2010 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2851604"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Microindel detection in short-read sequence data. Krawitz et al, Bioinformatics 26(6):722-9, 2010. <a href="http://bioinformatics.oxfordjournals.org/content/26/6/722.long"><span style="text-decoration: underline;">Full Text</span></a></p><p>vipR: variant identification in pooled DNA using R. Altmann et al., Bioinformatics 27: i77-i84, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117388"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Geoseq: a tool for dissecting deep-sequencing datasets. Gurtowski et al, BMC Bioinformatics 11:506, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2972303/"><span style="text-decoration: underline;">PubMedCentral</span></a> <em>[Geoseq is a web service that allows searching deep sequencing datasets with a reference sequence of a gene of interest]</em></p><p>Detecting and annotating genetic variations using the HugeSeq pipeline. Lam et al, Nature Biotechnology 30:226 - 229, 2012 <span style="text-decoration: underline;"><a href="http://www.nature.com/nbt/journal/v30/n3/full/nbt.2134.html">Publisher Website</a></span>, <span style="text-decoration: underline;"><a href="http://hugeseq.snyderlab.org/">Home Page</a></span></p><p>Genome-wide LORE1 retrotransposon mutagenesis and high-throughput insertion detection in <em>Lotus japonicus</em>. Urbański et al, Plant J 64:731-741, 2012. <span style="text-decoration: underline;"><a href="http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2011.04827.x/abstract">Publisher Website</a></span> <em>[This paper describes a 2-dimensional pooling strategy with barcoding to allow use of Illumina sequencing to screen for retrotransposon insertion mutations, and includes a software package called FSTpoolit for analysis of the resulting sequence reads.]</em></p><h3>Genotyping by sequencing</h3><p>Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Davey et al., Nat Rev Genet 12(7):499-510, 2011 <a href="http://www.ncbi.nlm.nih.gov/pubmed/21681211"><span style="text-decoration: underline;">PubMed</span></a> <em>[A review of methods available at the time]</em></p><p>A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. Elshire et al., PLoS One 6(5):e19379, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3087801"><span style="text-decoration: underline;">Full Text</span></a></p><p>Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. Poland et al., PLoS One 7(2): e32253, 2012. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289635/"><span style="text-decoration: underline;">Full Text</span></a></p><p>Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. Peterson et al, PLoS One 7(5):e37135, . 2012. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3365034/"><span style="text-decoration: underline;">Full Text</span></a></p><p>Imputation of unordered markers and the impact on genomic selection accuracy. Rutkowski et al, G3 3(3):427-39, 2013. <a href="http://www.g3journal.org/content/3/3/427.long"><span style="text-decoration: underline;">Full Text</span></a></p><p>Diversity Arrays Technology (DArT) and next-generation sequencing combined: genome-wide, high-throughput, highly informative genotyping for molecular breeding of <em>Eucalyptus</em>. Sansaloni et al., BMC Proceedings 5(Suppl 7):P54, 2011 <span style="text-decoration: underline;"><a href="http://www.biomedcentral.com/1753-6561/5/S7/P54">Full Text</a></span></p><p>High-throughput genotyping by whole-genome resequencing. Huang et al., Genome Res 19(6):1068-76, 2009. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2694477"><span style="text-decoration: underline;">Full Text</span></a></p><p>Multiplexed shotgun genotyping for rapid and efficient genetic mapping. Andolfatto et al. Genome Res 21(4):610-7, 2011. <a href="http://genome.cshlp.org/content/21/4/610.long"><span style="text-decoration: underline;">Full Text</span></a></p><h3>Restriction-site Associated DNA (RAD) markers</h3><p>Rapid SNP discovery and genetic mapping using sequenced RAD markers. Baird et al, PLoS One 3(10):e3376, 2008 <span style="text-decoration: underline;"><a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003376">Full Text</a></span></p><p>Linkage mapping and comparative genomics using next-generation RAD sequencing of a non-model organism. Baxter et al., PLoS One 6(4):e19315, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3082572"><span style="text-decoration: underline;">Full Text</span></a></p><p>Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication. Amores et al, Genetics 188(4):799-808, 2011. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21828280"><span style="text-decoration: underline;"> PubMed</span></a></p><p>Construction and application for QTL analysis of a Restriction-site Associated DNA (RAD) linkage map in barley. Chutimanitsakun et al, BMC Genomics 4; 12:4, 2011. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3023751"><span style="text-decoration: underline;">Full Text</span></a></p><p>RAD tag sequencing as a source of SNP markers in <em>Cynara cardunculus </em>L. Scaglione et al., BMC Genomics 13:3, 2012. <span style="text-decoration: underline;"><a href="http://www.biomedcentral.com/1471-2164/13/3">Full Text</a></span></p><p>Paired-end RAD-seq for de novo assembly and marker design without available reference. Willing et al., Bioinformatics 27(16):2187-93, 2011. <a href="http://bioinformatics.oxfordjournals.org/content/27/16/2187.long"><span style="text-decoration: underline;">Publisher Website</span></a></p><p>Local de novo assembly of RAD paired-end contigs using short sequencing reads. Etter et al., PLOS ONE 6(4): e18561, 2011. <a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0018561"><span style="text-decoration: underline;">Full Text</span></a></p><p>Stacks: building and genotyping loci de novo from short-read sequences. Catchen et al., G3: Genes, Genomes, Genetics, 1:171-182, 2011. <span style="text-decoration: underline;"> Full Text</span>, <a href="http://creskolab.uoregon.edu/stacks/"><span style="text-decoration: underline;">Home Page</span></a></p><p>Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads. Chong et al, Bioinformatics 28(21):2732-7, 2012. <a href="http://bioinformatics.oxfordjournals.org/content/28/21/2732.long"> <span style="text-decoration: underline;">Publisher Website</span></a></p><p>UK RAD Sequencing Wiki page, with bibliography and RADTools software download <a href="https://www.wiki.ed.ac.uk/display/RADSequencing/Home"><span style="text-decoration: underline;">Home Page</span></a></p><h3>Workspace environments</h3><p><span style="text-decoration: underline;">Papers</span></p><p>Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Goecks et al, Genome Biol 11(8):R86, 2010 <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945788"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>Galaxy Cloudman: Delivering compute clusters. BMC Bioinformatics 11(Suppl. 12):S4, 2010 <a href="http://www.biomedcentral.com/content/pdf/1471-2105-11-S12-S4.pdf"><span style="text-decoration: underline;">Full Text</span></a></p><p><a href="http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit"><span style="text-decoration: underline;">The Genome Analysis Toolkit</span></a>: a MapReduce framework for analyzing next-generation DNA sequencing data. McKenna et al, Genome Res 20(9):1297-303, 2010. <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928508"><span style="text-decoration: underline;">PubMedCentral</span></a></p><p>A framework for variation discovery and genotyping using next-generation DNA sequencing data. DePristo et al., Nat Genet 43(5):491-8, 2011. <a href="http://www.ncbi.nlm.nih.gov/pubmed/21478889"><span style="text-decoration: underline;"> PubMed</span></a></p><p><span style="text-decoration: underline;">Online resources</span></p><p>The <a href="http://cran.r-project.org/"><span style="text-decoration: underline;">R statistical computing</span></a> environment includes<a href="http://www.bioconductor.org/"><span style="text-decoration: underline;"> Bioconductor</span></a>, a specialized set of tools for analysis of microarray and high-throughput sequencing data. Introductory materials from on-line or short workshops are widely available online; examples are <span style="text-decoration: underline;"><a href="http://bioconductor.org/help/course-materials/2012/Evomics2012/Bioconductor-tutorial.pdf">Evomics2012 Bioconductor-tutorial.pdf</a></span>, and <a href="http://bcb.dfci.harvard.edu/%7Eaedin/courses/Bioconductor/"><span style="text-decoration: underline;">Intro to Bioconductor</span></a>. Materials from an advanced course on high-throughput genetic data analysis are at <span style="text-decoration: underline;"><a href="http://bioconductor.org/help/course-materials/2012/SeattleFeb2012/">Seattle 2012 materials</a></span>. Thomas Girke of UC-Riverside has written a very complete set of manuals describing the use of R and Bioconductor for analysis of genomic datasets, available at <a href="http://manuals.bioinformatics.ucr.edu/home/R_BioCondManual">R and Bioconductor Manuals</a>. <br /> <a href="http://cran.r-project.org/manuals.html"><span style="text-decoration: underline;">Manuals</span></a> and contributed <a href="http://cran.r-project.org/other-docs.html"><span style="text-decoration: underline;">documentation</span></a> for R are available at the R-project.org website, and video tutorials are also available on Youtube; those posted by Tutorlol are brief, clear, and to the point. <br /> Materials from a series of mini-courses in R taught in 2010 at UCLA are available:</p><ul>
<li><a href="http://scc.stat.ucla.edu/page_attachments/0000/0141/10S-basicR.pdf">Intro to programming and graphics</a></li>
<li><a href="http://scc.stat.ucla.edu/page_attachments/0000/0143/S10_RProgII.pdf">Data manipulation and functions</a></li>
<li><a href="http://scc.stat.ucla.edu/page_attachments/0000/0185/Graphics_course.pdf">Graphics for exploratory data analysis</a></li>
<li><a href="http://scc.stat.ucla.edu/page_attachments/0000/0147/20100503_IntroStats.pdf">Introductory statistics</a></li>
<li><a href="http://scc.stat.ucla.edu/page_attachments/0000/0188/reg_R_1_09S_slides.pdf">Linear regression</a></li>
</ul><p><a href="http://a-little-book-of-r-for-bioinformatics.readthedocs.org/en/latest/"> <span style="text-decoration: underline;">A Little Book of R for Bioinformatics</span></a> is an on-line resource with information and exercises to provide practice in bioinformatics analysis of DNA sequences and other biological data in R. <br /> Many books on specific topics in R programming are also available through Amazon or other vendors.</p><h3>Cloud computing resources</h3><p>The case for cloud computing in genome informatics. Lincoln Stein, Genome Biol. 11(5):207, 2010 <a href="http://www.ncbi.nlm.nih.gov/pubmed/20441614"><span style="text-decoration: underline;">Pubmed</span></a></p><p>Galaxy Cloudman: delivering cloud compute clusters. Afgan et al, BMC Bioinformatics <span style="text-decoration: underline;">11</span>(Suppl 12):S4, 2010 <a href="http://www.biomedcentral.com/1471-2105/11/S12/S4"><span style="text-decoration: underline;">Full Text</span></a></p><p><a href="http://cloudbiolinux.com/">CloudBioLinux</a> is an open-source project that provides a bioinformatics Linux system for cloud computing, pre-configured with a variety of software tools installed and ready to use.</p><p>A <a href="https://github.com/chapmanb/cloudbiolinux/blob/master/doc/intro/gettingStarted_CloudBioLinux.pdf?raw=true"><span style="text-decoration: underline;">tutorial</span></a> on getting started with CloudBioLinux on the Amazon Web Services Elastic Compute Cloud (EC2)</p><p><a href="http://userwww.service.emory.edu/%7Eeafgan/content/ppt/EnisAfgan_BOSC_2010.pdf"><span style="text-decoration: underline;">Deploying Galaxy on the Cloud</span></a>  slides from a presentation by Enis Afgan (Emory University) at the <br /> &nbsp;Bioinformatics Open Source Conference in Boston, July 2010</p><p>A <a href="http://screencast.g2.bx.psu.edu/cloud/"><span style="text-decoration: underline;"> screencast</span></a> that provides a step-by-step guide to starting a Galaxy cluster in the EC2 environment</p><p>A <a href="https://bitbucket.org/galaxy/galaxy-central/wiki/cloud"><span style="text-decoration: underline;">webpage</span></a> that has the same information in text form, and is the basis for the screencast</p><p>The iPlant Collaborative, an NSF-funded project to create computational resources for plant biology research, provides access to cloud computing resources through <span style="text-decoration: underline;"><a href="http://www.iplantcollaborative.org/discover/atmosphere">Atmosphere</a></span></p><p>SeqWare Query Engine: storing and searching sequence data in the cloud. OConnor et al, BMC Bioinformatics <strong>11</strong>(Suppl 12)<strong>:</strong>S2, 2010 <a href="http://www.biomedcentral.com/1471-2105/11/S12/S2"><span style="text-decoration: underline;">Full Text</span></a></p><p>An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics. Taylor, BMC Bioinformatics <strong>11</strong>(Suppl 12)<strong>:</strong>S1, 2010 <a href="http://www.biomedcentral.com/1471-2105/11/S12/S1"><span style="text-decoration: underline;">Full Text</span></a></p><h3>Links to Linux command-line tutorials and resources</h3><p>Tutorials for AWK, a powerful tool for handling data tables</p><ul>
<li>A set of <a href="http://people.bu.edu/scottm/AWK.NOTES"><span style="text-decoration: underline;">awk notes</span></a> from Boston University</li>
<li>Bruce Barnett's <a href="http://www.grymoire.com/Unix/Awk.html"><span style="text-decoration: underline;">awk tutorial</span></a></li>
<li>Greg Goebel's <a href="http://www.vectorsite.net/tsawk.html"><span style="text-decoration: underline;">awk tutorial</span></a></li>
<li><a href="http://teaching.software-carpentry.org/2013/01/16/1433/"><span style="text-decoration: underline;">Executing an awk command from R</span></a> to simplify data exploratory analysis, from Lex Nederbragt</li>
</ul><p>Tutorials for bash shell scripting</p><ul>
<li>A <a href="http://www.linuxconfig.org/bash-scripting-tutorial"><span style="text-decoration: underline;">tutorial</span></a> at linuxconfig.org</li>
<li>A <a href="http://www.hypexr.org/bash_tutorial.php"><span style="text-decoration: underline;">Getting Started With Bash</span></a> tutorial at hypexr.org</li>
<li>Mendel Cooper's <a href="http://tldp.org/LDP/abs/html/"><span style="text-decoration: underline;">Advanced Bash Shell-Scripting Guide</span></a></li>
</ul><p>Tutorials for sed, the command-line stream editor</p><ul>
<li>A <a href="http://www.panix.com/%7Eelflord/unix/sed.html"><span style="text-decoration: underline;">tutorial</span></a> at Rutgers</li>
<li>Peteris Krumins claims to have the <a href="http://www.catonmat.net/blog/worlds-best-introduction-to-sed/"><span style="text-decoration: underline;"> World's Best Introduction to Sed</span></a>; take a look and judge for yourself.</li>
<li>Bruce Barnett's <a href="http://www.grymoire.com/Unix/Sed.html"><span style="text-decoration: underline;">sed tutorial</span></a>.</li>
</ul><h3>Links to other useful sites</h3><p>The<a href="http://seqanswers.com/"><span style="text-decoration: underline;"> SEQanswers</span></a> online community has forums on several topics related to sequencing; the bioinformatics forum is the most active.</p><p>The SEQanswers <span style="text-decoration: underline;"><a href="http://seqanswers.com/wiki/Software">Software Wiki</a></span> is a list of software for analysis of sequencing data</p><p><a href="http://biostar.stackexchange.com/">Biostar</a> is another online community for questions and answers on bioinformatics and computational genomics.</p><p>Information on file formats used by the University of California - Santa Cruz Genome Browser is on the <a href="http://genome.ucsc.edu/FAQ/FAQformat"><span style="text-decoration: underline;"> FAQ list</span></a></p><p>A manual for the Integrated Genome Browser visualization tool is <a href="http://wiki.transvar.org/confluence/display/igbman/Home"><span style="text-decoration: underline;">here</span></a></p><p>Course materials for a short course entitled <a href="http://bioconductor.org/help/course-materials/2010/SeattleIntro/"><span style="text-decoration: underline;">Introduction to R and Bioconductor</span></a>, held in Seattle in Dec 2010</p><p><a href="http://great.stanford.edu/"><span style="text-decoration: underline;">Genomic Regions Enrichment of Annotations Tool</span></a> - A web service to test for over-representation of specific ontology categories among genes near ChIP-seq peaks</p><p><a href="http://www.animalgenome.org/bioinfo/resources/nextgensoft.html"><span style="text-decoration: underline;">Next-gen-seq software</span></a> - a list of software packages, both commercial and open-source, related to analysis of deep sequencing datasets</p><p><a href="http://www.cbcb.umd.edu/software/"><span style="text-decoration: underline;">Software</span></a> from the Center for Bioinformatics and Computational Biology, University of Maryland - many useful programs, all open-source</p><p><a href="http://bioinformatics.psb.ugent.be/plaza/"><span style="text-decoration: underline;"> PLAZA</span></a>: a comparative genomics resource to study gene and genome evolution in plants; described by Proost et al, Plant Cell 21:3718, 2010 <a href="http://www.plantcell.org/content/21/12/3718.full"><span style="text-decoration: underline;">Full Text</span></a></p><p>The European Bioinformatics Institute provides tools <a href="http://www.ebi.ac.uk/Tools/rcloud/"><span style="text-decoration: underline;">ArrayExpressHTS</span><span style="text-decoration: underline;"> and R-Cloud</span></a> for analysis of transcriptome data</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44707/rna-seq-analysis-a-guide-for-bioinformaticians</guid>
	<pubDate>Sat, 07 Dec 2024 22:22:24 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44707/rna-seq-analysis-a-guide-for-bioinformaticians</link>
	<title><![CDATA[RNA-Seq Analysis: A Guide for Bioinformaticians]]></title>
	<description><![CDATA[<p>RNA sequencing (RNA-Seq) has revolutionized transcriptomics, offering unprecedented insights into gene expression, splicing, and transcript diversity. For bioinformaticians, RNA-Seq analysis is a gateway to exploring the complexity of RNA biology and its implications in health and disease. This blog post provides an overview of RNA-Seq analysis, key computational steps, and tools for bioinformaticians eager to delve into this powerful technique.</p><h3>What is RNA-Seq?</h3><p>RNA-Seq is a next-generation sequencing (NGS) technology used to study the transcriptome&mdash;the complete set of RNA molecules in a cell. It quantifies gene expression, detects novel transcripts, and captures alternative splicing events with high sensitivity and resolution.</p><h3>Workflow for RNA-Seq Analysis</h3><p>RNA-Seq analysis involves several stages, each requiring computational tools and expertise.</p><h4>1. <strong>Experimental Design and Data Acquisition</strong></h4><p>Before diving into analysis, bioinformaticians should consider:</p><ul>
<li><strong>Biological Replicates</strong>: Ensure statistical power to detect meaningful differences.</li>
<li><strong>Sequencing Depth</strong>: Align sequencing depth to study objectives (e.g., higher depth for low-abundance transcripts).</li>
<li><strong>Paired-End vs. Single-End</strong>: Paired-end sequencing provides more detailed information on transcript structure.</li>
</ul><p>Once sequencing is complete, raw data is provided in FASTQ format, containing sequence reads and quality scores.</p><h4>2. <strong>Quality Control and Preprocessing</strong></h4><p>Quality control (QC) ensures data integrity. Tools such as <strong>FastQC</strong> evaluate metrics like base quality, GC content, and adapter contamination.</p><p><strong>Preprocessing Steps</strong>:</p><ul>
<li><strong>Trimming</strong>: Tools like <strong>Trimmomatic</strong> or <strong>Cutadapt</strong> remove low-quality bases and adapter sequences.</li>
<li><strong>Filtering</strong>: Discard reads below a certain quality threshold or length.</li>
</ul><h4>3. <strong>Read Alignment</strong></h4><p>Reads are mapped to a reference genome or transcriptome to determine their origin. Alignment tools include:</p><ul>
<li><strong>HISAT2</strong>: Handles large genomes efficiently and supports spliced alignments.</li>
<li><strong>STAR</strong>: High-speed aligner optimized for RNA-Seq.</li>
<li><strong>Bowtie2</strong>: Suitable for short-read alignment.</li>
</ul><p><strong>Output</strong>: A SAM/BAM file containing aligned reads.</p><h4>4. <strong>Transcript Assembly and Quantification</strong></h4><p>This step involves identifying transcripts and quantifying their expression levels. Tools used include:</p><ul>
<li><strong>StringTie</strong>: Assembles and quantifies transcripts from aligned reads.</li>
<li><strong>Salmon/Kallisto</strong>: Perform pseudo-alignment for rapid and accurate quantification.</li>
</ul><p>Expression levels are typically measured as TPM (transcripts per million) or FPKM (fragments per kilobase of transcript per million mapped reads).</p><h4>5. <strong>Differential Expression Analysis</strong></h4><p>To identify genes with altered expression between conditions, bioinformaticians use tools such as:</p><ul>
<li><strong>DESeq2</strong>: Accounts for data normalization and variability.</li>
<li><strong>edgeR</strong>: Handles overdispersed count data efficiently.</li>
<li><strong>Limma-voom</strong>: Combines linear modeling with RNA-Seq count data.</li>
</ul><p>The output includes a list of differentially expressed genes (DEGs) with statistical significance and fold-change values.</p><h4>6. <strong>Functional Annotation and Pathway Analysis</strong></h4><p>Understanding the biological significance of DEGs involves:</p><ul>
<li><strong>Gene Ontology (GO) Analysis</strong>: Tools like <strong>DAVID</strong> or <strong>clusterProfiler</strong> categorize genes based on their biological functions.</li>
<li><strong>Pathway Enrichment Analysis</strong>: Identifies pathways enriched in DEGs using tools like <strong>KEGG</strong>, <strong>Reactome</strong>, or <strong>GSEA</strong>.</li>
</ul><h4>7. <strong>Visualization</strong></h4><p>Visualizing results enhances interpretability. Common visualizations include:</p><ul>
<li><strong>Heatmaps</strong>: Show expression patterns across samples (e.g., <strong>pheatmap</strong>).</li>
<li><strong>Volcano Plots</strong>: Highlight significant DEGs (e.g., <strong>ggplot2</strong>).</li>
<li><strong>PCA/UMAP</strong>: Assess sample clustering and variability (e.g., <strong>Seurat</strong>).</li>
</ul><h3>Challenges in RNA-Seq Analysis</h3><ol>
<li><strong>Batch Effects</strong>: Technical variability can confound biological signals. Combat this with normalization techniques or batch-correction tools like <strong>ComBat</strong>.</li>
<li><strong>Low-Quality Samples</strong>: Poor-quality RNA impacts downstream analyses.</li>
<li><strong>Computational Complexity</strong>: RNA-Seq generates massive datasets, requiring robust computing resources and optimized pipelines.</li>
</ol><h3>Key Tools and Resources</h3><ul>
<li><strong>Bioconductor</strong>: A treasure trove of R packages for RNA-Seq analysis.</li>
<li><strong>Galaxy</strong>: A web-based platform for running RNA-Seq workflows.</li>
<li><strong>Nextflow/Snakemake</strong>: Workflow management tools to streamline analyses.</li>
</ul><h3>Applications of RNA-Seq</h3><p>RNA-Seq is used in diverse research areas, including:</p><ul>
<li><strong>Cancer Transcriptomics</strong>: Identifying tumor-specific expression profiles.</li>
<li><strong>Developmental Biology</strong>: Studying dynamic transcriptome changes.</li>
<li><strong>Drug Discovery</strong>: Screening genes modulated by therapeutic compounds.</li>
</ul><h3>Conclusion</h3><p>RNA-Seq analysis is a cornerstone of modern transcriptomics, offering bioinformaticians a versatile toolkit for unraveling gene expression and regulation. Mastering RNA-Seq workflows and tools empowers researchers to transform raw sequencing data into biological discoveries.</p><p>Whether you&rsquo;re investigating disease mechanisms, exploring cellular pathways, or developing new therapeutics, RNA-Seq is a powerful ally in your bioinformatics arsenal.</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2457/rdataminingcom-r-and-data-mining</guid>
	<pubDate>Thu, 15 Aug 2013 18:37:23 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2457/rdataminingcom-r-and-data-mining</link>
	<title><![CDATA[Rdatamining.com : R and Data Mining]]></title>
	<description><![CDATA[<p>This website presents examples, documents and resources on data mining with R. <br>Documents on using R for data mining are available to download for non-commercial personal use, including&nbsp;R Reference card for Data Mining, R and Data Mining: Examples and Case Studies and Time Series Analysis and Mining with R.</p><p>Address of the bookmark: <a href="http://www.rdatamining.com/" rel="nofollow">http://www.rdatamining.com/</a></p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>

</channel>
</rss>