<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/19555?offset=1430</link>
	<atom:link href="https://bioinformaticsonline.com/related/19555?offset=1430" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</guid>
	<pubDate>Thu, 02 Jan 2025 20:11:07 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</link>
	<title><![CDATA[The &quot;Ifs&quot; and &quot;Buts&quot; of NGS Quality Control and Trimming]]></title>
	<description><![CDATA[<p>Next-Generation Sequencing (NGS) has revolutionized biological research, providing vast amounts of data for a wide range of applications. However, the reliability of NGS analyses heavily depends on the quality of raw sequencing data. Quality control (QC) and trimming are critical preprocessing steps that can make or break your downstream analyses. In this blog, we explore the "ifs" (why you should perform QC and trimming) and the "buts" (challenges or considerations) of this vital step in NGS workflows.</p><h3><strong>The "Ifs" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Ensures Data Integrity</strong><br />If you want to minimize errors in downstream analyses, QC and trimming remove low-quality reads and bases, ensuring high-confidence data. This step is essential for reliable variant calling, assembly, and other applications.</p>
</li>
<li>
<p><strong>Removes Contaminants</strong><br />If adapter sequences or contaminants are present in the raw reads, trimming can eliminate them. This prevents issues like misalignment or incorrect biological interpretations, ensuring cleaner data for analysis.</p>
</li>
<li>
<p><strong>Improves Mapping and Assembly</strong><br />If your goal is better alignment to a reference genome or improved de novo assembly, trimming low-quality bases and adapters is critical. High-quality reads map more efficiently and generate more accurate assemblies.</p>
</li>
<li>
<p><strong>Reduces Computational Load</strong><br />If you want to save computational resources, trimming reduces the dataset size, which speeds up processing and analysis. Clean datasets mean less computational time spent on processing low-quality data.</p>
</li>
<li>
<p><strong>Prepares for Standardized Analyses</strong><br />If your project involves multiple datasets, QC and trimming ensure uniformity across them. This standardization makes comparisons valid and reproducible, particularly in large collaborative studies.</p>
</li>
</ol><h3><strong>The "Buts" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Risk of Over-Trimming</strong><br />But excessive trimming can lead to the loss of informative sequences, reducing read depth and potentially discarding biologically relevant data. This is especially critical in studies with limited sequencing depth.</p>
</li>
<li>
<p><strong>Bias Introduction</strong><br />But trimming algorithms might introduce biases, especially if they inadvertently remove sequences with specific biological patterns. This can skew results and compromise biological insights.</p>
</li>
<li>
<p><strong>Loss of Context in Paired-End Reads</strong><br />But trimming one read in a pair more than the other can lead to loss of pairing information. This complicates downstream analyses that rely on paired-end data, such as structural variant detection.</p>
</li>
<li>
<p><strong>Time and Resource Intensive</strong><br />But running QC and trimming for large datasets can be computationally expensive and time-consuming. As sequencing depth increases, preprocessing becomes a bottleneck in the analysis pipeline.</p>
</li>
<li>
<p><strong>Variable Standards</strong><br />But the criteria for trimming (e.g., quality threshold, minimum read length) can vary between tools and datasets. This variability may affect reproducibility and comparability of results across studies.</p>
</li>
</ol><h3><strong>Balancing the "Ifs" and "Buts"</strong></h3><p>To maximize the benefits of QC and trimming while mitigating the challenges, consider the following best practices:</p><ul>
<li>
<p><strong>Use QC Tools Wisely:</strong> Start with tools like <strong>FastQC</strong> to identify quality issues in your raw data. Visualizing quality metrics helps tailor your trimming parameters.</p>
</li>
<li>
<p><strong>Choose Reliable Trimming Tools:</strong> Tools like <strong>Trimmomatic</strong>, <strong>Cutadapt</strong>, and <strong>BBduk</strong> offer adaptive and customizable trimming options. Select one that aligns with your dataset and project goals.</p>
</li>
<li>
<p><strong>Set Reasonable Parameters:</strong> Avoid over-trimming by setting quality thresholds and minimum read lengths that balance data retention and quality improvement.</p>
</li>
<li>
<p><strong>Test Downstream Effects:</strong> Validate the impact of QC and trimming on downstream analyses, such as alignment efficiency, variant calling accuracy, or assembly quality.</p>
</li>
<li>
<p><strong>Document Your Workflow:</strong> Maintain detailed records of the parameters and tools used for QC and trimming. This ensures reproducibility and enables better troubleshooting.</p>
</li>
</ul><h3><strong>Conclusion</strong></h3><p>NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and carefully balancing these considerations, you can optimize your preprocessing workflow and unlock the full potential of your sequencing data.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37835/variantbam-filtering-and-profiling-of-next-generational-sequencing-data-using-region-specific-rules</guid>
	<pubDate>Thu, 04 Oct 2018 16:30:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37835/variantbam-filtering-and-profiling-of-next-generational-sequencing-data-using-region-specific-rules</link>
	<title><![CDATA[VariantBam: Filtering and profiling of next-generational sequencing data using region-specific rules]]></title>
	<description><![CDATA[<p>VariantBam is a tool to extract/count specific sets of sequencing reads from next-generational sequencing files. To save money, disk space and I/O, one may not want to store an entire BAM on disk. In many cases, it would be more efficient to store only those read-pairs or reads who intersect some region around the variant locations. Alternatively, if your scientific question is focused on only one aspect of the data (e.g. breakpoints), many reads can be removed without losing the information relevant to the problem.</p>
<h5>&nbsp;</h5><p>Address of the bookmark: <a href="https://github.com/broadinstitute/VariantBam" rel="nofollow">https://github.com/broadinstitute/VariantBam</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/39726/jackalope-a-swift-versatile-phylogenomic-and-high-throughput-sequencing-simulator</guid>
	<pubDate>Fri, 26 Jul 2019 00:58:12 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/39726/jackalope-a-swift-versatile-phylogenomic-and-high-throughput-sequencing-simulator</link>
	<title><![CDATA[jackalope: A swift, versatile phylogenomic and high-throughput sequencing simulator]]></title>
	<description><![CDATA[<p><code>jackalope</code> simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants can be simulated using summary statistics, phylogenies, Variant Call Format (VCF) files, and coalescent simulations&mdash;the latter of which can include selection, recombination, and demographic fluctuations. <code>jackalope</code> can simulate single, paired-end, or mate-pair Illumina reads, as well as reads from Pacific Biosciences These simulations include sequencing errors, mapping qualities, multiplexing, and optical/PCR duplicates. All outputs can be written to standard file formats.</p>
<p><span>A swift, versatile phylogenomic and high-throughput sequencing simulator </span> <span><a href="https://jackalope.lucasnell.com">https://jackalope.lucasnell.com</a></span></p><p>Address of the bookmark: <a href="https://github.com/lucasnell/jackalope" rel="nofollow">https://github.com/lucasnell/jackalope</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41009/genomics-public-data-links</guid>
	<pubDate>Thu, 13 Feb 2020 00:20:00 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41009/genomics-public-data-links</link>
	<title><![CDATA[genomics public data links !]]></title>
	<description><![CDATA[<p>List of publically available databases on google server.</p>
<p>More at <a href="https://software.broadinstitute.org/gatk/download/bundle">https://software.broadinstitute.org/gatk/download/bundle</a></p>
<p><a href="ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/VCF/GATK/">ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/VCF/GATK/</a>.</p>
<p><a href="ftp://ftp.broadinstitute.org/bundle/hg38/hg38bundle/">ftp://ftp.broadinstitute.org/bundle/hg38/hg38bundle/</a></p><p>Address of the bookmark: <a href="https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0?pli=1" rel="nofollow">https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0?pli=1</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37545/ncbi-magic-blast</guid>
	<pubDate>Tue, 14 Aug 2018 18:11:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37545/ncbi-magic-blast</link>
	<title><![CDATA[NCBI Magic-BLAST]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p>
<p>Magic-BLAST incorporates within the NCBI BLAST code framework ideas developed in the NCBI Magic pipeline, in particular hit extensions by local walk and jump&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26109056">(http://www.ncbi.nlm.nih.gov/pubmed/26109056)</a>, and recursive clipping of mismatches near the edges of the reads, which avoids accumulating artefactual mismatches near splice sites and is needed to distinguish short indels from substitutions near the edges.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43384/lncpipea-nextflow-based-pipeline-for-comprehensive-analyses-of-long-non-coding-rnas-from-rna-seq-datasets</guid>
	<pubDate>Fri, 17 Sep 2021 01:57:02 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43384/lncpipea-nextflow-based-pipeline-for-comprehensive-analyses-of-long-non-coding-rnas-from-rna-seq-datasets</link>
	<title><![CDATA[LncPipe:A Nextflow-based pipeline for comprehensive analyses of long non-coding RNAs from RNA-seq datasets]]></title>
	<description><![CDATA[<p><span>The pipeline was developed based on a popular workflow framework&nbsp;</span><a href="https://github.com/nextflow-io/nextflow">Nextflow</a><span>, composed of four core procedures including reads alignment, assembly, identification and quantification. It contains various unique features such as well-designed lncRNAs annotation strategy, optimized calculating efficiency, diversified classification and interactive analysis report.&nbsp;</span><a href="https://github.com/likelet/LncPipe">LncPipe</a><span>&nbsp;allows users additional control in interuppting the pipeline, resetting parameters from command line, modifying main script directly and resume analysis from previous checkpoint.</span></p>
<p>Ref&nbsp;https://www.lncrnablog.com/lncpipe-a-nextflow-based-pipeline-for-identification-and-analysis-of-long-non-coding-rnas-from-rna-seq-data/</p>
<p><img src="https://ars.els-cdn.com/content/image/1-s2.0-S1673852718301176-gr1.jpg" alt="image" style="border: 0px;"></p><p>Address of the bookmark: <a href="https://github.com/likelet/LncPipe" rel="nofollow">https://github.com/likelet/LncPipe</a></p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/10966/genxpro-gmbh</guid>
	<pubDate>Thu, 22 May 2014 07:18:35 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/10966/genxpro-gmbh</link>
	<title><![CDATA[GenXPro GmbH]]></title>
	<description><![CDATA[<p><strong>GenXPro</strong>&nbsp;GMbH is service provider for entire spectrum of nucleotide-based information&nbsp;of any biological sample. By combining intelligent data reduction techniques and&nbsp;latest next generation sequencing technologies, our service portfolio provides most accurate and cost efficient solutions for&nbsp;transcriptomic-, genomic- or epigenomic research.</p><p><span><span><strong><span>GENXPRO GMBH</span>,&nbsp;</strong></span></span><span>ALTENH&Ouml;FERALLEE 3,&nbsp;</span><span>60438 FRANKFURT MAIN,&nbsp;</span><span>GERMANY</span></p><p><span><span><strong>Website</strong></span>:&nbsp;<a href="http://www.genxpro.info/products_and_services/"></a><a href="http://www.genxpro.info/products_and_services/">http://www.genxpro.info/products_and_services/</a></span></p><p><span><strong>PHONE</strong>: +49 (0)69- 95 73 97 10,&nbsp;FAX: +49 (0)69- 95 73 97 06</span></p><p><span>EMAIL: info@genxpro.de</span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/34197/strand-life-sciences-announces-the-release-of-strand-ngs-v31-at-ashg-2017</guid>
	<pubDate>Mon, 23 Oct 2017 02:39:24 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/34197/strand-life-sciences-announces-the-release-of-strand-ngs-v31-at-ashg-2017</link>
	<title><![CDATA[Strand Life Sciences announces the release of Strand NGS v3.1 at ASHG 2017]]></title>
	<description><![CDATA[<h1><a href="http://www.strand-ngs.com/strand-announce-strandngss-v31">Strand Life Sciences announces the release of Strand NGS v3.1 at ASHG 2017</a></h1><p><strong><em>ORLANDO, USA, Oct 17, 2017/ PRNewswire/</em></strong></p><p><em>Strand NGS now supports large scale RNA- and small-RNA-Seq and Unique Molecular Identifiers (UMIs) for DNA-, RNA-, and small-RNA-Seq.</em></p><p>Strand Life Sciences announced the latest version release of its bioinformatics flagship product, Strand NGS, at the Annual Meeting of the American Society of Human Genetics today. Two major themes in Strand NGS v3.1 address recent challenges in next generation sequencing (NGS).</p><p>The first theme is large-scale RNA-Seq data analysis. Current cross-cohort RNA- and small-RNA-Seq studies span tens of replicates and batches across hundreds of samples, sometimes conducted across several different institutions. For such studies, Strand NGS v3.1 includes confounding variable analysis to eliminate technical effects, including batch effects; the t-SNE plot; profile and heat-map plots of gene-body coverage; and several other notable visual enhancements.</p><p>The second new feature is support for Unique Molecular Identifiers, or UMIs, for DNA-, RNA- and small-RNA-Seq. UMI support in Strand NGS is end-to-end, spanning alignment to variant calling in DNA-Seq, and alignment to quantification in RNA- and small-RNA-Seq. The Bioo Scientific, Qiagen, and Rubicon UMI protocols are natively supported, and an intuitive interface allows the specification of custom UMI protocols.</p><p><em>&ldquo;For liquid biopsies and low-grade FFPE samples, UMI support in DNA-Seq enables the detection of somatic variants at low concentrations. In RNA-Seq, large-scale and UMI support can be used in single-cell-based studies that reveal tumor-cell heterogeneity, even at low concentrations&rdquo;, says<strong>&nbsp;Dr. Vamsi Veeramachaneni, Chief Scientific Officer, Strand Life Sciences.</strong></em></p><p><em>&ldquo;At Strand, we are continuously working towards improving the accuracy and efficiency of NGS data analysis. Customers can look forward to Strand NGS becoming available on the cloud in the near future&rdquo;, says&nbsp;<strong>Dr. Ramesh Hariharan, Chief Executive Officer, Strand Life Sciences.</strong></em></p><p>Visit Strand Life Sciences at ASHG booth #1017 to know more about Strand NGS v3.1 and other products and service offerings from Strand Life Sciences. Click here to access detailed agenda and v3.1&nbsp;<a href="http://www.strand-ngs.com/download/releasenotes">release notes</a>.</p><p><strong>About Strand Life Sciences</strong></p><p>Strand Life Sciences is a premier life science informatics innovation company. Founded in 2000, Strand is a leader in technology innovations for healthcare using genomics. By enhancing sequence-based diagnostics and clinical genomic data interpretation using a strong foundation of computational, scientific, and medical expertise, Strand is bringing individualized medicine to the world. To know more, visit&nbsp;<a href="http://www.strandls.com/" title="www.strandls.com">www.strandls.com</a></p>]]></description>
	<dc:creator>Yeshodari</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42040/proactiv-estimation-of-promoter-activity-from-rna-seq-data</guid>
	<pubDate>Thu, 13 Aug 2020 10:21:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42040/proactiv-estimation-of-promoter-activity-from-rna-seq-data</link>
	<title><![CDATA[proActiv: Estimation of Promoter Activity from RNA-Seq data]]></title>
	<description><![CDATA[<p>proActiv is an R package that estimates promoter activity from RNA-Seq data. proActiv uses aligned reads and genome annotations as input, and provides absolute and relative promoter activity as output. The package can be used to identify active promoters and alternative promoters, the details of the method are described in&nbsp;<a href="https://github.com/GoekeLab/proActiv#reference">Demircioglu et al</a>.</p>
<p>Additional data on differential promoters in tissues and cancers from TCGA, ICGC, GTEx, and PCAWG can be downloaded here:&nbsp;<a href="https://jglab.org/data-and-software/">https://jglab.org/data-and-software/</a></p><p>Address of the bookmark: <a href="https://github.com/GoekeLab/proActiv" rel="nofollow">https://github.com/GoekeLab/proActiv</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/32950/genespring-webinar-uncovering-mechanisms-of-hepatotoxicity-on-14-june-at-8am-pst</guid>
	<pubDate>Tue, 23 May 2017 06:48:40 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/32950/genespring-webinar-uncovering-mechanisms-of-hepatotoxicity-on-14-june-at-8am-pst</link>
	<title><![CDATA[GeneSpring webinar- Uncovering mechanisms of hepatotoxicity on 14 June at 8AM PST]]></title>
	<description><![CDATA[<p><a href="http://genespring-support.com/content/webinar-uncovering-mechanisms-hepatotoxicity-high-affinity-antisense-oligonucleotides-using-"><strong>Uncovering Mechanisms of Hepatotoxicity for High Affinity Antisense Oligonucleotides &ndash; 3&rsquo; end RNA-seq Profiling Using GeneSpring GX</strong></a></p><p>High affinity antisense oligonucleotides (ASOs) containing bicylic modifications (BNA) such as locked nucleic acid (LNA) or constrained ethyl (cEt) designed to induce target RNA cleavage have been shown to have enhanced potency along with a higher propensity to cause hepatotoxicity. In order to unravel the mechanism of this hepatotoxicity, we leveraged GeneSpring GX analysis software to analyze transcriptional profiles from the livers of mice treated with a panel of highly efficacious hepatotoxic or non-hepatotoxic LNA ASOs.</p><p><a href="http://genespring-support.com/content/webinar-uncovering-mechanisms-hepatotoxicity-high-affinity-antisense-oligonucleotides-using-"><strong>Speaker:</strong></a><br />Sebastien A. Burel, PhD<br />Director, Nonclinical Development, Ionis Pharmaceuticals, California</p><p><a href="http://genespring-support.com/content/webinar-uncovering-mechanisms-hepatotoxicity-high-affinity-antisense-oligonucleotides-using-"><strong>Details:</strong></a><br />14 June 2017, 8 AM PST</p><h3><a href="http://genespring-support.com/content/webinar-uncovering-mechanisms-hepatotoxicity-high-affinity-antisense-oligonucleotides-using-">Register for this Webinar</a></h3>]]></description>
	<dc:creator>Yeshodari</dc:creator>
</item>

</channel>
</rss>