<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/41843?offset=190</link>
	<atom:link href="https://bioinformaticsonline.com/related/41843?offset=190" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44707/rna-seq-analysis-a-guide-for-bioinformaticians</guid>
	<pubDate>Sat, 07 Dec 2024 22:22:24 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44707/rna-seq-analysis-a-guide-for-bioinformaticians</link>
	<title><![CDATA[RNA-Seq Analysis: A Guide for Bioinformaticians]]></title>
	<description><![CDATA[<p>RNA sequencing (RNA-Seq) has revolutionized transcriptomics, offering unprecedented insights into gene expression, splicing, and transcript diversity. For bioinformaticians, RNA-Seq analysis is a gateway to exploring the complexity of RNA biology and its implications in health and disease. This blog post provides an overview of RNA-Seq analysis, key computational steps, and tools for bioinformaticians eager to delve into this powerful technique.</p><h3>What is RNA-Seq?</h3><p>RNA-Seq is a next-generation sequencing (NGS) technology used to study the transcriptome&mdash;the complete set of RNA molecules in a cell. It quantifies gene expression, detects novel transcripts, and captures alternative splicing events with high sensitivity and resolution.</p><h3>Workflow for RNA-Seq Analysis</h3><p>RNA-Seq analysis involves several stages, each requiring computational tools and expertise.</p><h4>1. <strong>Experimental Design and Data Acquisition</strong></h4><p>Before diving into analysis, bioinformaticians should consider:</p><ul>
<li><strong>Biological Replicates</strong>: Ensure statistical power to detect meaningful differences.</li>
<li><strong>Sequencing Depth</strong>: Align sequencing depth to study objectives (e.g., higher depth for low-abundance transcripts).</li>
<li><strong>Paired-End vs. Single-End</strong>: Paired-end sequencing provides more detailed information on transcript structure.</li>
</ul><p>Once sequencing is complete, raw data is provided in FASTQ format, containing sequence reads and quality scores.</p><h4>2. <strong>Quality Control and Preprocessing</strong></h4><p>Quality control (QC) ensures data integrity. Tools such as <strong>FastQC</strong> evaluate metrics like base quality, GC content, and adapter contamination.</p><p><strong>Preprocessing Steps</strong>:</p><ul>
<li><strong>Trimming</strong>: Tools like <strong>Trimmomatic</strong> or <strong>Cutadapt</strong> remove low-quality bases and adapter sequences.</li>
<li><strong>Filtering</strong>: Discard reads below a certain quality threshold or length.</li>
</ul><h4>3. <strong>Read Alignment</strong></h4><p>Reads are mapped to a reference genome or transcriptome to determine their origin. Alignment tools include:</p><ul>
<li><strong>HISAT2</strong>: Handles large genomes efficiently and supports spliced alignments.</li>
<li><strong>STAR</strong>: High-speed aligner optimized for RNA-Seq.</li>
<li><strong>Bowtie2</strong>: Suitable for short-read alignment.</li>
</ul><p><strong>Output</strong>: A SAM/BAM file containing aligned reads.</p><h4>4. <strong>Transcript Assembly and Quantification</strong></h4><p>This step involves identifying transcripts and quantifying their expression levels. Tools used include:</p><ul>
<li><strong>StringTie</strong>: Assembles and quantifies transcripts from aligned reads.</li>
<li><strong>Salmon/Kallisto</strong>: Perform pseudo-alignment for rapid and accurate quantification.</li>
</ul><p>Expression levels are typically measured as TPM (transcripts per million) or FPKM (fragments per kilobase of transcript per million mapped reads).</p><h4>5. <strong>Differential Expression Analysis</strong></h4><p>To identify genes with altered expression between conditions, bioinformaticians use tools such as:</p><ul>
<li><strong>DESeq2</strong>: Accounts for data normalization and variability.</li>
<li><strong>edgeR</strong>: Handles overdispersed count data efficiently.</li>
<li><strong>Limma-voom</strong>: Combines linear modeling with RNA-Seq count data.</li>
</ul><p>The output includes a list of differentially expressed genes (DEGs) with statistical significance and fold-change values.</p><h4>6. <strong>Functional Annotation and Pathway Analysis</strong></h4><p>Understanding the biological significance of DEGs involves:</p><ul>
<li><strong>Gene Ontology (GO) Analysis</strong>: Tools like <strong>DAVID</strong> or <strong>clusterProfiler</strong> categorize genes based on their biological functions.</li>
<li><strong>Pathway Enrichment Analysis</strong>: Identifies pathways enriched in DEGs using tools like <strong>KEGG</strong>, <strong>Reactome</strong>, or <strong>GSEA</strong>.</li>
</ul><h4>7. <strong>Visualization</strong></h4><p>Visualizing results enhances interpretability. Common visualizations include:</p><ul>
<li><strong>Heatmaps</strong>: Show expression patterns across samples (e.g., <strong>pheatmap</strong>).</li>
<li><strong>Volcano Plots</strong>: Highlight significant DEGs (e.g., <strong>ggplot2</strong>).</li>
<li><strong>PCA/UMAP</strong>: Assess sample clustering and variability (e.g., <strong>Seurat</strong>).</li>
</ul><h3>Challenges in RNA-Seq Analysis</h3><ol>
<li><strong>Batch Effects</strong>: Technical variability can confound biological signals. Combat this with normalization techniques or batch-correction tools like <strong>ComBat</strong>.</li>
<li><strong>Low-Quality Samples</strong>: Poor-quality RNA impacts downstream analyses.</li>
<li><strong>Computational Complexity</strong>: RNA-Seq generates massive datasets, requiring robust computing resources and optimized pipelines.</li>
</ol><h3>Key Tools and Resources</h3><ul>
<li><strong>Bioconductor</strong>: A treasure trove of R packages for RNA-Seq analysis.</li>
<li><strong>Galaxy</strong>: A web-based platform for running RNA-Seq workflows.</li>
<li><strong>Nextflow/Snakemake</strong>: Workflow management tools to streamline analyses.</li>
</ul><h3>Applications of RNA-Seq</h3><p>RNA-Seq is used in diverse research areas, including:</p><ul>
<li><strong>Cancer Transcriptomics</strong>: Identifying tumor-specific expression profiles.</li>
<li><strong>Developmental Biology</strong>: Studying dynamic transcriptome changes.</li>
<li><strong>Drug Discovery</strong>: Screening genes modulated by therapeutic compounds.</li>
</ul><h3>Conclusion</h3><p>RNA-Seq analysis is a cornerstone of modern transcriptomics, offering bioinformaticians a versatile toolkit for unraveling gene expression and regulation. Mastering RNA-Seq workflows and tools empowers researchers to transform raw sequencing data into biological discoveries.</p><p>Whether you&rsquo;re investigating disease mechanisms, exploring cellular pathways, or developing new therapeutics, RNA-Seq is a powerful ally in your bioinformatics arsenal.</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35437/dupradar-package</guid>
	<pubDate>Sun, 04 Feb 2018 14:28:57 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35437/dupradar-package</link>
	<title><![CDATA[dupRadar package]]></title>
	<description><![CDATA[<p><span>The&nbsp;</span><em>dupRadar</em><span>&nbsp;package gives an insight into the duplication problem by graphically relating the gene expression level and the duplication rate present on it. Thus, failed experiments can be easily identified at a glance</span></p><p>Address of the bookmark: <a href="https://bioconductor.org/packages/3.7/bioc/vignettes/dupRadar/inst/doc/dupRadar.html" rel="nofollow">https://bioconductor.org/packages/3.7/bioc/vignettes/dupRadar/inst/doc/dupRadar.html</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34398/ont-assembly-and-illumina-polishing-pipeline</guid>
	<pubDate>Thu, 23 Nov 2017 10:13:42 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34398/ont-assembly-and-illumina-polishing-pipeline</link>
	<title><![CDATA[ONT assembly and Illumina polishing pipeline]]></title>
	<description><![CDATA[<p>This pipeline performs the following steps:</p>
<ul>
<li>Assembly of nanopore reads using&nbsp;<a href="http://canu.readthedocs.io/">Canu</a>.</li>
<li>Polish canu contigs using&nbsp;<a href="https://github.com/isovic/racon">racon</a>&nbsp;(<em>optional</em>).</li>
<li>Map a paired-end Illumina dataset onto the contigs obtained in the previous steps using&nbsp;<a href="http://bio-bwa.sourceforge.net/">BWA</a>&nbsp;mem.</li>
<li>Perform correction of contigs using&nbsp;<a href="https://github.com/broadinstitute/pilon/wiki">pilon</a>&nbsp;and the Illumina dataset.</li>
</ul><p>Address of the bookmark: <a href="https://github.com/nanoporetech/ont-assembly-polish" rel="nofollow">https://github.com/nanoporetech/ont-assembly-polish</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34501/dnapipete-de-novo-assembly-annotation-pipeline-for-transposable-elements</guid>
	<pubDate>Sat, 02 Dec 2017 18:25:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34501/dnapipete-de-novo-assembly-annotation-pipeline-for-transposable-elements</link>
	<title><![CDATA[dnaPipeTE: de-novo assembly &amp; annotation Pipeline for Transposable Elements]]></title>
	<description><![CDATA[<p>dnaPipeTE (for de-novo assembly &amp; annotation Pipeline for Transposable Elements), is a pipeline designed to find, annotate and quantify Transposable Elements in small samples of NGS datasets. It is very useful to quantify the proportion of TEs in newly sequenced genomes since it does not require genome assembly and works on small datasets (&lt; 1X).</p>
<ul>
<li>
<p>dnaPipeTE is developped by Cl&eacute;ment Goubert, Laurent Modolo and the TREEP team of the LBBE:&nbsp;<a href="http://lbbe.univ-lyon1.fr/-Equipe-Elements-transposables-.html?lang=en">http://lbbe.univ-lyon1.fr/-Equipe-Elements-transposables-.html?lang=en</a></p>
</li>
<li>
<p>You can find the original publication in GBE here:&nbsp;<a href="https://academic.oup.com/gbe/article/7/4/1192/533768">https://academic.oup.com/gbe/article/7/4/1192/533768</a></p>
</li>
</ul>
<p><a href="https://github.com/clemgoub/dnaPipeTE/blob/dev/dnaPipefront.png" target="_blank"><img src="https://github.com/clemgoub/dnaPipeTE/raw/dev/dnaPipefront.png" alt="Front" style="border: 0px;"></a><em>output examples of quantification and TE landscape (relative age) produced by dnaPipeTE</em></p>
<p><em>&nbsp;</em></p><p>Address of the bookmark: <a href="https://github.com/clemgoub/dnaPipeTE" rel="nofollow">https://github.com/clemgoub/dnaPipeTE</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34914/ra-assembler-a-de-novo-dna-assembler-for-third-generation-sequencing-data</guid>
	<pubDate>Wed, 27 Dec 2017 20:36:54 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34914/ra-assembler-a-de-novo-dna-assembler-for-third-generation-sequencing-data</link>
	<title><![CDATA[Ra assembler - a de novo DNA assembler for third generation sequencing data]]></title>
	<description><![CDATA[<p>Integration of the Ra assembler - a de novo DNA assembler for third generation sequencing data developed on Faculty of Electrical Engineering and Computing (FER), Ruder Boskovic Institute (RBI) and Genome Institute of Singapore (GIS).</p>
<p>Ra is in development since 2014 in the form of several separate components that used to be run individually.<br>This project aims to ease the usage of Ra by integrating it into a complete de novo assembly tool.</p>
<p>Unlike other state-of-the-art assemblers,&nbsp;<span>Ra does not have an error correction step.</span>&nbsp;Instead, it relies on detecting overlaps using a very sensitive and specific overlapper ("graphmap -w owler",&nbsp;<a href="https://github.com/isovic/graphmap">https://github.com/isovic/graphmap</a>) and constructing and reducing an overlap graph (Ra layout,&nbsp;<a href="https://github.com/mariokostelac/ra">https://github.com/mariokostelac/ra</a>).</p><p>Address of the bookmark: <a href="https://github.com/mariokostelac/ra-integrate/" rel="nofollow">https://github.com/mariokostelac/ra-integrate/</a></p>]]></description>
	<dc:creator>biogeek</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36257/aligngraph-algorithm-for-secondary-de-novo-genome-assembly-guided-by-closely-related-references</guid>
	<pubDate>Tue, 17 Apr 2018 16:21:20 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36257/aligngraph-algorithm-for-secondary-de-novo-genome-assembly-guided-by-closely-related-references</link>
	<title><![CDATA[AlignGraph: algorithm for secondary de novo genome assembly guided by closely related references]]></title>
	<description><![CDATA[<p>AlignGraph is a software that extends and joins contigs or scaffolds by reassembling them with help provided by a reference genome of a closely related organism.</p>
<p>Using AlignGraph</p>
<pre><code>AlignGraph --read1 reads_1.fa --read2 reads_2.fa --contig contigs.fa --genome genome.fa --distanceLow distanceLow --distanceHigh distancehigh --extendedContig extendedContigs.fa --remainingContig remainingContigs.fa [--kMer k --insertVariation insertVariation --coverage coverage --part p --fastMap --ratioCheck --iterativeMap --misassemblyRemoval --resume]</code></pre>
<h3>&nbsp;</h3><p>Address of the bookmark: <a href="https://github.com/baoe/AlignGraph" rel="nofollow">https://github.com/baoe/AlignGraph</a></p>]]></description>
	<dc:creator>Manisha Mishra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36861/eagler-a-scaffolding-tool-for-long-reads</guid>
	<pubDate>Mon, 04 Jun 2018 05:26:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36861/eagler-a-scaffolding-tool-for-long-reads</link>
	<title><![CDATA[EAGLER: a scaffolding tool for long reads.]]></title>
	<description><![CDATA[<p>EAGLER is a scaffolding tool for long reads. The scaffolder takes as input a draft genome created by any NGS assembler and a set of long reads. The long reads are used to extend the contigs present in the NGS draft and possibly join overlapping contigs. EAGLER supports both PacBio and Oxford Nanopore reads.</p>
<p>The tool should be compatible with most UNIX flavors and has been successfully tested on the following operating systems:</p>
<ul>
<li>Mac OS X 10.11.1</li>
<li>Mac OS X 10.10.3</li>
<li>Ubuntu 14.04 LTS</li>
</ul>

https://bib.irb.hr/datoteka/844447.Diplomski_2015_Luka_terbi.pdf<p>Address of the bookmark: <a href="https://github.com/mculinovic/EAGLER" rel="nofollow">https://github.com/mculinovic/EAGLER</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36985/swalo-scaffolding-with-assembly-likelihood-optimization</guid>
	<pubDate>Wed, 20 Jun 2018 02:45:16 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36985/swalo-scaffolding-with-assembly-likelihood-optimization</link>
	<title><![CDATA[SWALO: Scaffolding with assembly likelihood optimization]]></title>
	<description><![CDATA[SWALO (scaffolding with assembly likelihood optimization) is a method for scaffolding based on likelihood of genome assemblies computed using generative models for sequencing.

Please email your questions, comments, suggestions, and bug reports to atif.bd@gmail.com.<p>Address of the bookmark: <a href="https://atifrahman.github.io/SWALO/" rel="nofollow">https://atifrahman.github.io/SWALO/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/37396/converting-a-vcf-into-a-fasta-given-some-reference</guid>
	<pubDate>Fri, 20 Jul 2018 10:03:53 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/37396/converting-a-vcf-into-a-fasta-given-some-reference</link>
	<title><![CDATA[Converting a VCF into a FASTA given some reference !]]></title>
	<description><![CDATA[<p>Samtools/BCFtools (Heng Li) provides a Perl script&nbsp;<a href="https://github.com/lh3/samtools/blob/master/bcftools/vcfutils.pl"><code>vcfutils.pl</code></a>&nbsp;which does this, the function&nbsp;<code>vcf2fq</code>&nbsp;(lines 469-528)</p><p>This script has been modified by others to convert InDels as well, e.g.&nbsp;<a href="https://github.com/gringer/bioinfscripts/blob/master/vcf2fq.pl">this</a>&nbsp;by David Eccles</p><pre><code><span>./</span><span>vcf2fq</span><span>.</span><span>pl </span><span>-</span><span>f </span><span>&lt;</span><span>input</span><span>.</span><span>fasta</span><span>&gt;</span><span> </span><span>&lt;</span><span>all</span><span>-</span><span>site</span><span>.</span><span>vcf</span><span>&gt;</span><span> </span><span>&gt;</span><span> </span><span>&lt;</span><span>output</span><span>.</span><span>fastq</span><span>&gt;</span></code></pre><p>https://github.com/gringer/bioinfscripts/blob/master/vcf2fq.pl</p><p>https://github.com/lh3/samtools/blob/master/bcftools/vcfutils.pl</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37840/long-read-assembly-workshop</guid>
	<pubDate>Thu, 04 Oct 2018 17:23:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37840/long-read-assembly-workshop</link>
	<title><![CDATA[Long read assembly workshop !]]></title>
	<description><![CDATA[<p>This is a tutorial for a workshop on long-read (PacBio) genome assembly.</p>
<p>It demonstrates how to use long PacBio sequencing reads to assemble a bacterial genome, and includes additional steps for circularising, trimming, finding plasmids, and correcting the assembly with short-read Illumina data.</p>
<p>&nbsp;Please comment if you know any other long read addembly tutorial.</p><p>Address of the bookmark: <a href="http://sepsis-omics.github.io/tutorials/modules/cmdline_assembly_v2/" rel="nofollow">http://sepsis-omics.github.io/tutorials/modules/cmdline_assembly_v2/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>