<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/29614?offset=230</link>
	<atom:link href="https://bioinformaticsonline.com/related/29614?offset=230" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37957/base-a-practical-de-novo-assembler-for-large-genomes-using-long-ngs-reads</guid>
	<pubDate>Fri, 19 Oct 2018 07:25:21 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37957/base-a-practical-de-novo-assembler-for-large-genomes-using-long-ngs-reads</link>
	<title><![CDATA[BASE: a practical de novo assembler for large genomes using long NGS reads]]></title>
	<description><![CDATA[<p><span>new&nbsp;</span><em>de novo</em><span>&nbsp;assembler called BASE. It enhances the classic seed-extension approach by indexing the reads efficiently to generate adaptive seeds that have high probability to appear uniquely in the genome. Such seeds form the basis for BASE to build extension trees and then to use reverse validation to remove the branches based on read coverage and paired-end information, resulting in high-quality consensus sequences of reads sharing the seeds. Such consensus sequences are then extended to contigs.</span></p><p>Address of the bookmark: <a href="https://github.com/dhlbh/BASE" rel="nofollow">https://github.com/dhlbh/BASE</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</guid>
	<pubDate>Thu, 02 Jan 2025 20:11:07 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44758/the-ifs-and-buts-of-ngs-quality-control-and-trimming</link>
	<title><![CDATA[The &quot;Ifs&quot; and &quot;Buts&quot; of NGS Quality Control and Trimming]]></title>
	<description><![CDATA[<p>Next-Generation Sequencing (NGS) has revolutionized biological research, providing vast amounts of data for a wide range of applications. However, the reliability of NGS analyses heavily depends on the quality of raw sequencing data. Quality control (QC) and trimming are critical preprocessing steps that can make or break your downstream analyses. In this blog, we explore the "ifs" (why you should perform QC and trimming) and the "buts" (challenges or considerations) of this vital step in NGS workflows.</p><h3><strong>The "Ifs" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Ensures Data Integrity</strong><br />If you want to minimize errors in downstream analyses, QC and trimming remove low-quality reads and bases, ensuring high-confidence data. This step is essential for reliable variant calling, assembly, and other applications.</p>
</li>
<li>
<p><strong>Removes Contaminants</strong><br />If adapter sequences or contaminants are present in the raw reads, trimming can eliminate them. This prevents issues like misalignment or incorrect biological interpretations, ensuring cleaner data for analysis.</p>
</li>
<li>
<p><strong>Improves Mapping and Assembly</strong><br />If your goal is better alignment to a reference genome or improved de novo assembly, trimming low-quality bases and adapters is critical. High-quality reads map more efficiently and generate more accurate assemblies.</p>
</li>
<li>
<p><strong>Reduces Computational Load</strong><br />If you want to save computational resources, trimming reduces the dataset size, which speeds up processing and analysis. Clean datasets mean less computational time spent on processing low-quality data.</p>
</li>
<li>
<p><strong>Prepares for Standardized Analyses</strong><br />If your project involves multiple datasets, QC and trimming ensure uniformity across them. This standardization makes comparisons valid and reproducible, particularly in large collaborative studies.</p>
</li>
</ol><h3><strong>The "Buts" of NGS QC and Trimming</strong></h3><ol>
<li>
<p><strong>Risk of Over-Trimming</strong><br />But excessive trimming can lead to the loss of informative sequences, reducing read depth and potentially discarding biologically relevant data. This is especially critical in studies with limited sequencing depth.</p>
</li>
<li>
<p><strong>Bias Introduction</strong><br />But trimming algorithms might introduce biases, especially if they inadvertently remove sequences with specific biological patterns. This can skew results and compromise biological insights.</p>
</li>
<li>
<p><strong>Loss of Context in Paired-End Reads</strong><br />But trimming one read in a pair more than the other can lead to loss of pairing information. This complicates downstream analyses that rely on paired-end data, such as structural variant detection.</p>
</li>
<li>
<p><strong>Time and Resource Intensive</strong><br />But running QC and trimming for large datasets can be computationally expensive and time-consuming. As sequencing depth increases, preprocessing becomes a bottleneck in the analysis pipeline.</p>
</li>
<li>
<p><strong>Variable Standards</strong><br />But the criteria for trimming (e.g., quality threshold, minimum read length) can vary between tools and datasets. This variability may affect reproducibility and comparability of results across studies.</p>
</li>
</ol><h3><strong>Balancing the "Ifs" and "Buts"</strong></h3><p>To maximize the benefits of QC and trimming while mitigating the challenges, consider the following best practices:</p><ul>
<li>
<p><strong>Use QC Tools Wisely:</strong> Start with tools like <strong>FastQC</strong> to identify quality issues in your raw data. Visualizing quality metrics helps tailor your trimming parameters.</p>
</li>
<li>
<p><strong>Choose Reliable Trimming Tools:</strong> Tools like <strong>Trimmomatic</strong>, <strong>Cutadapt</strong>, and <strong>BBduk</strong> offer adaptive and customizable trimming options. Select one that aligns with your dataset and project goals.</p>
</li>
<li>
<p><strong>Set Reasonable Parameters:</strong> Avoid over-trimming by setting quality thresholds and minimum read lengths that balance data retention and quality improvement.</p>
</li>
<li>
<p><strong>Test Downstream Effects:</strong> Validate the impact of QC and trimming on downstream analyses, such as alignment efficiency, variant calling accuracy, or assembly quality.</p>
</li>
<li>
<p><strong>Document Your Workflow:</strong> Maintain detailed records of the parameters and tools used for QC and trimming. This ensures reproducibility and enables better troubleshooting.</p>
</li>
</ul><h3><strong>Conclusion</strong></h3><p>NGS quality control and trimming are essential steps to ensure reliable and accurate data for analysis. While the "ifs" highlight the clear benefits of these steps, the "buts" remind us of the potential pitfalls. By adopting best practices and carefully balancing these considerations, you can optimize your preprocessing workflow and unlock the full potential of your sequencing data.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/39726/jackalope-a-swift-versatile-phylogenomic-and-high-throughput-sequencing-simulator</guid>
	<pubDate>Fri, 26 Jul 2019 00:58:12 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/39726/jackalope-a-swift-versatile-phylogenomic-and-high-throughput-sequencing-simulator</link>
	<title><![CDATA[jackalope: A swift, versatile phylogenomic and high-throughput sequencing simulator]]></title>
	<description><![CDATA[<p><code>jackalope</code> simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants can be simulated using summary statistics, phylogenies, Variant Call Format (VCF) files, and coalescent simulations&mdash;the latter of which can include selection, recombination, and demographic fluctuations. <code>jackalope</code> can simulate single, paired-end, or mate-pair Illumina reads, as well as reads from Pacific Biosciences These simulations include sequencing errors, mapping qualities, multiplexing, and optical/PCR duplicates. All outputs can be written to standard file formats.</p>
<p><span>A swift, versatile phylogenomic and high-throughput sequencing simulator </span> <span><a href="https://jackalope.lucasnell.com">https://jackalope.lucasnell.com</a></span></p><p>Address of the bookmark: <a href="https://github.com/lucasnell/jackalope" rel="nofollow">https://github.com/lucasnell/jackalope</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/19090/deeptools</guid>
	<pubDate>Sat, 08 Nov 2014 15:02:08 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/19090/deeptools</link>
	<title><![CDATA[deepTools]]></title>
	<description><![CDATA[<p>deepTools addresses the challenge of handling the large amounts of data that are now routinely generated from DNA sequencing centers. To do so, deepTools contains useful modules to process the mapped reads data to create coverage files in standard bedGraph and bigWig file formats. By doing so, deepTools allows the creation of normalized coverage files or the comparison between two files (for example, treatment and control). Finally, using such normalized and standardized files, multiple visualizations can be created to identify enrichments with functional annotations of the genome.<br /><br />Publicaton: http://nar.oxfordjournals.org/content/early/2014/05/05/nar.gku365.full<br /><br />Source Code and Wiki: https://github.com/fidelram/deepTools/wiki<br /><br />Galaxy Tool Shed repository: http://toolshed.g2.bx.psu.edu/view/bgruening/deeptools<br /><br />and example Galaxy workflows: http://toolshed.g2.bx.psu.edu/view/bgruening/deeptools_workflows</p>]]></description>
	<dc:creator>Martin Jones</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/19631/rosalind-bioinformatics-problems</guid>
	<pubDate>Thu, 18 Dec 2014 10:32:48 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/19631/rosalind-bioinformatics-problems</link>
	<title><![CDATA[Rosalind Bioinformatics problems !!!]]></title>
	<description><![CDATA[<p>Rosalind is a platform for learning bioinformatics and programming through problem solving. <a href="http://rosalind.info/problems/list-view/">Take a tour</a> to get the hang of how Rosalind works.</p>
<p>http://rosalind.info/problems/list-view/</p><p>Address of the bookmark: <a href="http://rosalind.info/problems/list-view/" rel="nofollow">http://rosalind.info/problems/list-view/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/21241/pacman</guid>
	<pubDate>Mon, 16 Feb 2015 12:15:17 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/21241/pacman</link>
	<title><![CDATA[Pacman]]></title>
	<description><![CDATA[<p><span>The pacman package is an R package management tool that combines the functionality of base library related functions into intuitively named functions. This package is ideally added to .Rprofile to increase workflow by reducing time recalling obscurely named functions, reducing code and integrating functionality of base functions to simultaneously perform multiple actions.<br /><br />Function names in the pacman package follow the format of p_xxx where &lsquo;xxx&rsquo; is the task the function performs. For instance the p_load function allows the user to load one or more packages as a more generic substitute for the library or require functions and if the package isn&rsquo;t available locally it will install it for you.<br /><br /></span></p><p><strong>Installation</strong></p><p><span>To download the development version of pacman:</span></p><p><span>Download the </span><a href="https://github.com/trinker/pacman/zipball/master">zip ball</a><span> or </span><a href="https://github.com/trinker/pacman/tarball/master">tar ball</a><span>, decompress and run </span><code>R CMD INSTALL</code><span> on it, or use th</span><span>e </span><strong>devtools</strong><span> package to install the development version:</span></p><pre title="">## Make sure your current packages are up to date
update.packages()
## devtools is required
devtools::install_github("trinker/pacman")
</pre><p>Note: Windows users need <a href="http://www.murdoch-sutherland.com/Rtools/">Rtools</a> and <a href="http://CRAN.R-project.org/package=devtools">devtools</a> to install this way.</p><p>More at https://github.com/trinker/pacman</p><p>&nbsp;</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26993/lastz</guid>
	<pubDate>Mon, 18 Apr 2016 04:41:55 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26993/lastz</link>
	<title><![CDATA[LASTZ]]></title>
	<description><![CDATA[<p>LASTZ is a program for aligning DNA sequences, a pairwise aligner. Originally designed to handle sequences the size of human chromosomes and from different species, it is also useful for sequences produced by NGS sequencing technologies such as Roche 454.</p>
<p>More at http://www.bx.psu.edu/~rsharris/lastz/</p>
<p>Thesis: http://www.bx.psu.edu/~rsharris/rsharris_phd_thesis_2007.pdf</p><p>Address of the bookmark: <a href="http://www.bx.psu.edu/~rsharris/lastz/" rel="nofollow">http://www.bx.psu.edu/~rsharris/lastz/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/24297/bioinformatics-walkin-at-nii</guid>
  <pubDate>Fri, 04 Sep 2015 21:48:15 -0500</pubDate>
  <link></link>
  <title><![CDATA[Bioinformatics WalkIn at NII]]></title>
  <description><![CDATA[
<p>ADVERTISEMENT OF WALK-IN-INTERVIEW</p>

<p>NAME OF THE POST : Bioinformatician (Part time 3 days in a week) (One Position only)</p>

<p>DURATION : One Year</p>

<p>NAME OF THE PROJECT : Next generation sequencing facility</p>

<p>EDUCATIONAL QUALIFICATIONS : At least a Masters degree in Bioinformatics and Bachelors degree in any stream of life sciences</p>

<p>REQUIREMENTS :</p>

<p>Around 5 years of experience and proven track record in next generation sequence data analysis (supported by publications in peer-reviewed journals), ability to analyze transcriptomics, Chip-seq, and small RNA –seq data.</p>

<p>: Should have the ability to analyze raw primary data generated by Illumina next generation sequencing platforms and create / troubleshoot custom analysis Pipelines.</p>

<p>Should have ability to handle all downstream secondary and tertiary data analysis using commercially available as well as open source softwares (transcriptomics, ChIP-seq, small RNA-seq)</p>

<p>Apart from these, the applicant should have knowledge of the following: Programming: Perl and Python. Operating system:</p>

<p>Linux and Windows. NGS Analysis tools: Maq, BWA, Bowtie, SAM tools, BEDTools, MACS, Galaxy, FastQC, Bismark, MEDIPS, Tophat, Cufflinks, AvadisNGS, CLC Genomics Workbench, Galaxy, BaseSpace, Trinity Statistics: Microsoft Excel and R. Database: MySQL Genome Browser: UCSC, Ensemble, IGV, IGB Motif Analysis Tools: MEME Suite, Transfac and RSAT Functional Annotation Tools: DAVID, GeneCodis, Gene Cards Networking Tools: Cytoscape</p>

<p>EMOLUMENTS : The incumbent will be paid a fee of Rs. 2000/- per sitting/ per day.</p>

<p>SCIENTIST NAME : Dr. Arnab Mukhopadhyay,</p>

<p>Staff Scientific V Next generation sequencing facility</p>

<p>SCIENTIST’S E-MAIL ID : arnab@nii.ac.in</p>

<p>WALK IN INTERVIEW ON : 18th September, 2015</p>

<p>REGISTRATION OF CANDIDATES: 10.30 AM to 11.00 AM</p>

<p>PLEASE NOTE- 1. CANDIDATE MAY FILL UP APPLICATION IN THE PRECRIBED FORMAT ALONG WITH NECESSARY DOCUMENTS FOR VERIFICATION. 2. APPLICATIONS CONTAINING INCOMPLETE INFORMATION SHALL NOT BE ENTERTAINED. 3. DATE OF PASSING THE EXAMINATIONS MUST BE INDICATED CLEARLY. 4. ONLY REGISTERED CANDIDATES WILL BE INTERVIEWED. 5. NO TA/DA WILL BE PAID FOR ATTENDING THE INTERVIEW PRESCRIBED FORM 1. NAME 2. FATHER’S NAME 3. MOTHER’S NAME 4. DATE OF BIRTH 5. SEX (MALE/FEMALE) 6. CATEGORY (SC/ ST/ OBC/ PH) 7. ADDRESS a. (CORRSPONDENCE) b. (PERMANENT) 8. E MAIL, TELEPHONE NO. &amp; MOBILE No (if any) 9. ACADEMIC &amp; PROFESSIONAL QUALIFICATIONS NAME OF EXAMINATION PASSED WITH SUBJECTS YEAR OF PASSING BOARD/ UNIVERSITY PERCENTAGE/ DIVISION REMARKS 10. PAST EXPERIENCE &amp; PRESENT EMPLOYMENT, IF ANY 11. CANDIDATES SHOULD STATE CLEARLY WHETHER THEY HAVE BEEN AWARDED PH.D DEGREE OR THESIS HAS BEEN SUBMITTED. 12. HAVE YOU APPLIED FOR A POSITION EARLIER IN THE INSTITUTE? IF SO:- (1) THE DETAILS OF THE PROJECT AND PROJECT INVESTIGATOR (2) IF CALLED FOR INVERVIEW, RESULTS THEREOF</p>

<p>More at http://www1.nii.res.in/sites/default/files/walkininterview-18sept2015.pdf</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/25410/srfjrf-bioinformatics-at-ciari</guid>
  <pubDate>Fri, 04 Dec 2015 00:10:09 -0600</pubDate>
  <link></link>
  <title><![CDATA[SRF/JRF Bioinformatics at CIARI]]></title>
  <description><![CDATA[
<p>Realizing the importance of Island Agriculture to meet the requirements of local population and tourists, Indian Council of Agricultural research (ICAR) established Central Island Agricultural Research Institute, Port Blair on June 23rd, 1978 by merging different regional research stations of ICAR institutes located in Islands. The ultimate aim of CIARI is the developments of island agricultural production technologies which utilizes the strengths of the island and convert the constraints in opportunities, without causing any ill effect to the fragile ecosystem of the island.The institute has made tremendous progress in the Agriculture development of the islands during the last three decades. Keeping in view the natural resources of the islands diversity, fragile ecosystem, research program would be designed to maximize the productivity without disturbing to the islands ecosystem to provide better and decent livelihood and as a source of revenue and resource generation. Research and development in Agriculture sector should cover all disciplines in order to have a balanced progress in all disciplines for the overall benefits of the farmers of these islands.</p>

<p>Position I</p>

<p>Job Title : Junior Research Fellow</p>

<p>No. of Posts : One</p>

<p>Project : Establishment of sub distributed information centre</p>

<p>Qualification : M.Sc in Basic Science with NET or B.Sc in professional course with NET or M.Sc in professional course</p>

<p>Desired Experience : Experience in Bioinformatics and molecular biology</p>

<p>Payscale : Rs. 25000 per month</p>

<p>Age Limit : Upto 35 for men and 40 for women with 5 years relaxation to SC/ST and 3 years relaxation for OBC.</p>

<p>Position II</p>

<p>Job Title : Traineeship</p>

<p>No. of Posts : One</p>

<p>Project : Establishment of sub distributed information centre</p>

<p>Qualification : B.Sc Bioinformatics /Biotechnology / Life Science / Computer Science</p>

<p>Desired Experience : Experience in Bioinformatics and molecular biology</p>

<p>Payscale : Rs. 8000 per month</p>

<p>Age Limit : Upto 35 for men and 40 for women with 5 years relaxation to SC/ST and 3 years relaxation for OBC.</p>

<p>Position III</p>

<p>Job Title : Studentship</p>

<p>No. of Posts : Two</p>

<p>Project : Establishment of sub distributed information centre</p>

<p>Qualification : B.Sc Bioinformatics /Biotechnology / Life Science / Computer Science</p>

<p>Desired Experience : Experience in Bioinformatics and molecular biology</p>

<p>Payscale : Rs. 8000 per month</p>

<p>Age Limit : Upto 35 for men and 40 for women with 5 years relaxation to SC/ST and 3 years relaxation for OBC.</p>

<p>How to Apply : Candidates who meet the requirements can attend the walk in interview at CIARI,Port Blair on 09.12.2015 10.30AM.</p>

<p>http://icar-ciari.res.in/employment/9-12-15.pdf</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/25674/post-doc-position-at-labgem-evry-france</guid>
  <pubDate>Fri, 11 Dec 2015 06:24:00 -0600</pubDate>
  <link></link>
  <title><![CDATA[Post-doc position at LABGeM - Evry, France]]></title>
  <description><![CDATA[
<p>The LABGeM team (CEA/Genoscope, CNRS UMR 8030, France, Dir. Claudine Médigue) is developing integrated approaches which combines bioinformatics methods based (i) on genomic and metabolic contexts, (ii) on an orignal metabolic network representation and (iii) on a structural classification of active sites for the discovery of new metabolic enzymatic activities.</p>

<p>We are hiring a post-doctoral fellow for the development of innovative bioinformatics methods to explore metabolic networks and enzyme families. These methods will be based on protein family analysis and graph approaches combining genomic and metabolic contexts.</p>

<p>For more details, please see this link : http://goo.gl/tHQOqk</p>

<p>Qualifications:<br />PhD degree in bioinformatics or computational biology<br />- Previous experience in network or protein family analysis<br />- Programming skills (C/C++, Python, Java) and in common biostatistical analyses<br />- Team player, innovative and creative thinking, good oral and written communication skills</p>

<p>24 months, Post Doctoral position<br />Start: from March 2016<br />Place: CEA, Genoscope UMR8030, LABGeM (Laboratory of Bioinformatics Analyses for Genomics and Metabolism), Evry, France<br />Contact: David Vallenet, vallenet@genoscope.cns.fr<br />Publications: https://scholar.google.com/citations?user=rJNPLSAAAAAJ<br />Remuneration per month: from 2,850 €</p>

<p>Interested candidates should send their CV, statement of research interests, and contact information of at least 2 references to David Vallenet (vallenet@genoscope.cns.fr).</p>
]]></description>
</item>

</channel>
</rss>