<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/27321?offset=990</link>
	<atom:link href="https://bioinformaticsonline.com/related/27321?offset=990" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44352/bioinformatics-tools-for-genome-assembly</guid>
	<pubDate>Mon, 24 Jul 2023 07:04:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44352/bioinformatics-tools-for-genome-assembly</link>
	<title><![CDATA[Bioinformatics tools for genome assembly !]]></title>
	<description><![CDATA[<p>There are numerous genome assembly tools available, each with its strengths and weaknesses. Here is a list of some widely used genome assembly tools as of my last update in September 2021:</p><ol>
<li>
<p><span>SPAdes:</span> An assembler specifically designed for single-cell and multi-cell bacterial genomes, as well as small eukaryotic genomes.</p>
</li>
<li>
<p><span>ABySS:</span> A parallelized assembler for large genomes that uses de Bruijn graphs.</p>
</li>
<li>
<p><span>Velvet:</span> Another de Bruijn graph-based assembler optimized for short-read sequencing data.</p>
</li>
<li>
<p><span>SOAPdenovo:</span> A de Bruijn graph-based assembler designed for short reads, widely used for assembling large and complex genomes.</p>
</li>
<li>
<p><span>MaSuRCA:</span> A hybrid assembler that combines data from multiple sequencing technologies, such as Illumina and PacBio.</p>
</li>
<li>
<p><span>Canu:</span> A long-read assembler optimized for PacBio and Oxford Nanopore sequencing data.</p>
</li>
<li>
<p><span>Flye:</span> A long-read assembler suitable for bacterial and small eukaryotic genomes.</p>
</li>
<li>
<p><span>SMARTdenovo:</span> An assembler designed for long reads, particularly suited for PacBio data.</p>
</li>
<li>
<p><span>SPAdes Long Read (SPAdesLR):</span> An extension of SPAdes for long-read data, such as those from PacBio or Nanopore.</p>
</li>
<li>
<p><span>Minia:</span> An assembler optimized for low memory consumption, suitable for small and medium-sized genomes.</p>
</li>
<li>
<p><span>Unicycler:</span> A hybrid assembler that combines short and long reads for circular bacterial genome assembly.</p>
</li>
<li>
<p><span>wtdbg2:</span> A de Bruijn graph assembler for long reads, efficient for very large genomes.</p>
</li>
<li>
<p><span>Shasta:</span> A long-read assembler that uses the Overlap-Layout-Consensus approach, suitable for PacBio and Nanopore data.</p>
</li>
<li>
<p><span>Sparc:</span> An assembler designed to handle noisy long reads from Nanopore sequencing.</p>
</li>
<li>
<p><span>CANA:</span> An assembler for metagenomic data, particularly for complex and diverse microbial communities.</p>
</li>
<li>
<p><span>Ra</span> Assembler: A metagenome assembler for long reads, designed for highly complex metagenomic samples.</p>
</li>
</ol><p>Please note that the field of bioinformatics is constantly evolving, and new assembly tools may have emerged since my last update. Additionally, the performance of these tools can vary depending on the characteristics of the sequencing data and the genome being assembled. When selecting an assembly tool, consider the specific requirements of your project, the available data types, and the computational resources at your disposal. Always refer to the respective tool's documentation and publications for the most up-to-date information and recommendations.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/10124/jrf-at-bose-institute-kolkata</guid>
  <pubDate>Mon, 21 Apr 2014 19:41:14 -0500</pubDate>
  <link></link>
  <title><![CDATA[JRF at Bose Institute, Kolkata]]></title>
  <description><![CDATA[
<p>ADVT. No. S/BIC/01/2014-15</p>

<p>Bose Institute, Kolkata, invites applications from Indian Citizens for ONE (01) temporary position of Junior Research Fellow in the DBT sponsored project entitled, “Centre of Excellance (CoE) in Bioinformatics at Bose Institute”, running under Prof. Pinakpani Chakrabarti, Project Co-ordinatior, Bioinformatics Centre. The project is tenable upto 31.03.2017, but duration of the fellowship is one year only. The JRF will work with one of the faculty members of the center based on his / her motivation in any specific area on Bioinformatics.</p>

<p>Essential Qualification: 1st class M.Sc. / M.Tech degree in any stream of Chemical/ Biological Sciences with CSIR-UGC-NET-JRF / ICMR-JRF / DBT-JRF or CSIR-UGCNET- LS / GATE qualification.</p>

<p>Desirable qualification:</p>

<p>(i) Specialized knowledge in Organic / Physical chemistry.<br />(ii) Any exposure to research involving the small molecules (like drug) and / or protein structure determination or prediction.<br />(iii) Basic knowledge in computer programming, e.g. using FORTRAN, C, shell, perl etc.<br />(iv) Hands-on-experience on any of the following software : CHARMM/AMBER/NAMD/GROMACS,Gaussian/Gamess, Haddock/Autodock, Schrodinger etc. (or any other software serving similar purposes in molecular modeling)</p>

<p>Fellowship :</p>

<p>(i) Rs. 16,000/- p.m., plus admissible HRA &amp; Medical Benefit for M.Sc. with CSIRUGC NET-JRF/ICMR-JRF/DBT-JRF or M.Tech. with CSIR-UGC NETJRF/<br />ICMR-JRF/DBT-JRF/CSIR-UGC NET-LS/GATE<br />(ii) Rs. 12,000/- p.m., plus admissible HRA &amp; Medical Benefit for M.Sc. with CSIRUGC NET-LS/GATE</p>

<p>Age : Below 28 years as on the day on which the application is made (relaxable in case of SC/ST/OBC/WOMEN candidates only as per rule).</p>

<p>Interested and eligible candidates should apply on plain paper duly signed by them clearly mentioning the area of interest in research, possession of any desirable qualification (s) as mentioned above and quoting Advertisement No. on the envelop as well as application with complete Bio-data giving e-mail ID, Phone No. and details of qualification i.e. examination passed, year, division, percentage of marks, from Secondary onwards with attested copies of testimonials, addressed to the Registrar, Bose Institute, P-1/12, CIT Scheme VII-M, Kankurgachi, Kolkata-700054 on or before April 25, 2014.</p>

<p>The shortlisted candidates will be called for an interview. Applicants are advised to check our website for future updates.</p>

<p>Advertisement: www.boseinst.ernet.in/ADVT/14/p_2.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</guid>
	<pubDate>Tue, 18 Jun 2024 02:04:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</link>
	<title><![CDATA[BioKit: a set of tools dedicated to bioinformatics, data visualisation]]></title>
	<description><![CDATA[<p><span>BioKit is a set of tools dedicated to bioinformatics, data visualisation (</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.viz" title="biokit.viz"><code><span>biokit.viz</span></code></a><span>), access to online biological data (e.g. UniProt, NCBI thanks to bioservices). It also contains more advanced tools related to data analysis (e.g.,&nbsp;</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.stats" title="biokit.stats"><code><span>biokit.stats</span></code></a><span>). Since R is quite common in bioinformatics, we also provide a convenient module to run R inside your Python scripts or shell (:mod:biokit.rtools module).</span></p><p>Address of the bookmark: <a href="https://biokit.readthedocs.io/en/latest/index.html" rel="nofollow">https://biokit.readthedocs.io/en/latest/index.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/10394/bioinformatics-protocols</guid>
	<pubDate>Mon, 05 May 2014 10:21:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/10394/bioinformatics-protocols</link>
	<title><![CDATA[Bioinformatics Protocols]]></title>
	<description><![CDATA[<h2><span> RNA Seq </span></h2>
<p><strong> Basic Galaxy Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/pub?id=1KbTiBHtvHLfPRZ39AY3uriazrINA8TJzgjjwn1zPP7Y">RNA-Seq tutorial</a> based on <a href="http://www.nature.com/protocolexchange/protocols/2327">Trapnell et al. (2012)</a> <em>Nature Protocols</em></li>
</ul>
<dl><dd>In this tutorial we cover the concepts of <a href="http://en.wikipedia.org/wiki/RNA-Seq">RNA-Seq</a> differential gene expression (DGE) analysis using a very small synthetic dataset from a well studied organism.</dd></dl>
<p><strong> Advanced Galaxy Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1fQ1XfeOKhezJUDTzMXtZVY20c3RGoHe-HLvFOGzqU4s/pub">RNA-Seq (Advanced) Tutorial</a></li>
</ul>
<dl><dd>In this tutorial we compare the performance of three statistically-based differential expression tools:</dd><dd>* CuffDiff</dd><dd>* EdgeR</dd><dd>* DESeq2</dd></dl>
<p><strong> Advanced Command Line Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1ayJXtgBP1OXtnV7o7lq4QHKMNk5SdPHFq4hGkqndBtI/pub">Graphical Output with CummeRbund</a> introduces some basic commands using the cummeRbund package of the R programming language</li>
</ul>
<dl><dd>You will need to install R, RStudio and cummeRbund on your PC (explained in the Tutorial). You will learn how to produce graphical output from RNA-Seq analysis previously done using a Cuffdiff analysis.</dd></dl>
<h2><span> Variant Detection </span></h2>
<p><strong> Basic Galaxy Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/pub?id=1ZRzrjjOCvtAu3m-IKL-rbJ1f4On60dDL_IEwG7oejdI">Variant Detection tutorial</a></li>
</ul>
<dl><dd>In this tutorial we cover the concepts of detecting small variants (SNVs and indels) in human genomic DNA using a small set of reads from chromosome 22.</dd></dl>
<p><strong>Advanced Galaxy Tutorial</strong></p>
<ul>
<li><a href="https://docs.google.com/document/pub?id=1CuKkKylVDb03tnN7RSWl5EUzleetn0ctjmvaidPKLxM">Variant Detection (Advanced) Tutorial</a></li>
</ul>
<dl><dd>In this tutorial we compare the performance of three statistically-based variant detection tools:</dd><dd>* SAMtools: Mpileup</dd><dd>* GATK: Unified Genotyper</dd><dd>* FreeBayes</dd><dd>Each of these tools takes as its input a BAM file of aligned reads and generates a list of likely variants in VCF format</dd></dl>
<p><strong>Pipelines</strong> are for those who are comfortable with using the UNIX command line; and often allow more control over branching and iteration logic.</p>
<ul>
<li><a href="https://github.com/claresloggett/variant_calling_pipeline">WGS/exome GATK-based variant calling pipeline</a></li>
</ul>
<dl><dd>This is a basic variant-calling and annotation pipeline developed at the Victorian Life Sciences Computation Initiative (VLSCI), University of Melbourne. It is based around BWA, GATK and ENSEMBL and was originally designed for human (or similar) data. The master branch is configured for WGS data; there is an exome branch configured for variant calling in exome data.</dd><dd>To run the pipeline you will need Rubra: <a href="https://github.com/bjpop/rubra">https://github.com/bjpop/rubra</a>. Rubra uses the python Ruffus library: <a href="http://www.ruffus.org.uk/">http://www.ruffus.org.uk/</a>.</dd></dl>
<p><strong>Protocols</strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1lfDYNzHjfDA1pHTHd-0w3xHhg7L4TipT1gRfzgiV8es/pub">Familial Variant Calling</a></li>
</ul>
<dl><dd>In this protocol we discuss and outline the process of calling familial related mutations.</dd></dl>
<ul>
<li><a href="https://docs.google.com/document/d/1PIhm8NrFGaSK0hxpDcp8wUOz11ZkOaHIrpnJshMgDec/pub">Somatic Variant Calling</a></li>
</ul>
<dl><dd>In this protocol we discuss and outline the process of identifying somatic variants or mutations.</dd></dl>
<h2><span> Assembly </span></h2>
<p><strong> Basic Galaxy Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/pub?id=1N3AB9ptISUu4zULqe1kXpVF0BDyGb5f5yzxWSJd_WNM">Genome assembly tutorial</a></li>
</ul>
<dl><dd>In this tutorial we carry out de novo assembly of a microbial genome. We have also written a <a href="https://docs.google.com/document/d/1xs-TI5MejQARqo0pcocGlymsXldwJbJII890gnmjI0o/pub">De novo Genome Assembly for Illumina Data</a> Protocol for a more generic description of the method.</dd></dl>
<p><strong> Protocol </strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1xs-TI5MejQARqo0pcocGlymsXldwJbJII890gnmjI0o/pub">De novo Genome Assembly for Illumina Data</a></li>
</ul>
<dl><dd>In this protocol we discuss and outline the process of de novo assembly for small to medium sized genomes. Use our <a href="https://docs.google.com/document/pub?id=1N3AB9ptISUu4zULqe1kXpVF0BDyGb5f5yzxWSJd_WNM">Genome assembly tutorial</a> to learn a specific case of using Galaxy to carry out de novo assembly of a microbial genome.</dd></dl>
<h2><span> Small RNAs </span></h2>
<p><strong> Basic Galaxy Tutorial </strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1WAObJr7M0m8U-2ku-0Y0Sdt_IHmqd1h8WaJHPhnJ1lM/pub">Quality control for small RNA</a></li>
</ul>
<dl><dd>This tutorial covers initial steps of the workflow for analysis of short RNA expression such as a quality control of the raw reads, processing of the raw reads for the subsequent analysis and initial quality assessment of the library.</dd></dl>
<h2><span> ChIP Seq </span></h2>
<p><strong> Protocol </strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1UPJC8dsiDeP5R9MH9U0IvoDgPF2Q3EOstAuzS3e6WCE/pub">ChIP-Seq</a></li>
</ul>
<dl><dd>In this protocol we discuss ChIP-Seq: a method to analyze the interaction between proteins and DNA.</dd></dl>
<h2><span> Amplicons </span></h2>
<p><strong>Protocol</strong></p>
<ul>
<li><a href="https://docs.google.com/document/d/1uW7JzxG86QzS92hTyeuNsLhX_d1XFbaZPSjh7jWxcSg/pub">Amplicon Alignment</a></li>
</ul>
<dl><dd>In this protocol we discuss and outline the process of aligning custom amplicons using primers for high precision.</dd></dl>
<h2><span> Learn Galaxy </span></h2>
<p><a href="https://docs.google.com/document/d/1wsdJDYfjZVg2uJxm9AHi_j0mY3X1M1F4gB-elkuYL7c/pub">Introduction to Galaxy,</a> for those who are very new to Galaxy.</p>
<p><a href="https://docs.google.com/document/d/1t7vVqa3mdeZYPv5-8hiHBFBYhNiynV_3mWByno9-wUM/pub">Using Histories and Workflows,</a> for those with some Galaxy knowledge.</p>
<p>The Galaxy project website has many <a href="http://wiki.galaxyproject.org/Learn">tutorials</a> and <a href="http://wiki.galaxyproject.org/Learn/Screencasts">screencasts</a> about using Galaxy and the tools, and developing new tools.</p><p>Address of the bookmark: <a href="https://genome.edu.au/wiki/Learn" rel="nofollow">https://genome.edu.au/wiki/Learn</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/38950/assistant-professor-job-recruitment-in-university-of-pune-on-temporary-basis</guid>
  <pubDate>Mon, 11 Feb 2019 22:32:48 -0600</pubDate>
  <link></link>
  <title><![CDATA[Assistant Professor job recruitment in University of Pune on temporary basis]]></title>
  <description><![CDATA[
<p>Assistant Professor  job recruitment in University of Pune on temporary basis</p>

<p>No. of Post  :  01</p>

<p>Department : Institute of Bioinformatics and Biotechnology</p>

<p>Qualification : (i) Good academic record as defined by the concerned University with at least 55% marks (or an equivalent grade in a point scale wherever grading system is followed) at the Master's degree level in a relevant subject from an Indian University, or an equivalent degree from an accredited foreign University. (ii) Besides fulfilling the above qualifications, the candidate must have cleared the National Eligibility Test (NET) conducted by the UGC, CSIR or similar test accredited by the UGC like SLET / SET. (iii) Candidates, who are, or have been awarded a Ph.D. degree in accordance with the University Grants Commission (Minimum Standards and Procedures for award of Ph.D. Degree) Regulations, 2009, shall be exempted from the requirement of the minimum eligibility condition of NET/SLET/SET. (iv) NET/SLET/SET shall not be required for such Master's Degree Programmes in disciplines for which NET/SLET/SET accredited test is not conducted.</p>

<p>Pay Band :  Rs. 15,600 - Rs. 39,100/- with AGP of Rs.6,000</p>

<p>Application Fee : Application fees of Rs. 600/- (for Open Category) and Rs. 300/- (for candidates belonging to reserved categories),should be paid by the Challan (at Bank of Maharashtra/HDFC Bank)</p>

<p>More at http://collegecirculars.unipune.ac.in/sites/documents/Job%20Openings/ATNF161%20IBB_05.022019.pdf</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/10457/assistant-professor-bio-informatics-at-health-and-family-welfare-department-medical-education-in-raipur</guid>
  <pubDate>Wed, 07 May 2014 00:08:38 -0500</pubDate>
  <link></link>
  <title><![CDATA[Assistant Professor (Bio-Informatics) at Health and Family Welfare Department (Medical Education) in Raipur]]></title>
  <description><![CDATA[
<p>Advertisement No.05/2014/ Exam/Dated 17/04/2014</p>

<p>No of vacancies: 01</p>

<p>Pay scale:Rs. 15600 – 39100 + 6600/-</p>

<p>Essential Academic Qualifications / Experience : Good academic record as defined by the concerned university with at least 55% marks (or an equivalent grade in a point scale wherever grading system is followed) at the Master's Degree level in a relevant subject from an Indian University, or an equivalent degree from an accredited foreign university.</p>

<p>Besides fulfilling the above qualifications, the candidate must have cleared the National Eligibility Test (NET) conducted by the UGC, CSIR or similar test accredited by the UGC like SLET/ SET.</p>

<p>Notwithstanding anything contained in sub-clauses (a) and (b) to this Clause, candidates, who are, or have been awarded a Ph.D. Degree in accordance with the University Grants Commission (Minimum Standards and Procedure for Award of Ph.D. Degree) Regulations, 2009, shall be exempted from the requirement of the minimum eligibility condition of NET/SLET/SET for recruitment and appointment of Assistant Professor or equivalent positions in Universities/Colleges/Institutions.</p>

<p>NET/SLET/SET shall also not be required for such Masters Programmes in disciplines for which NET/SLET/SET is not conducted.</p>

<p>Apply online: http://www.psc.cg.gov.in/htm/OA_ME2014.html</p>

<p>Last Date for Online Registration: 22/05/2014</p>

<p>For more details: http://www.psc.cg.gov.in/pdf/Advertisement/ADV_ME2014.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44722/step-by-step-guide-to-running-genome-assembly</guid>
	<pubDate>Fri, 13 Dec 2024 11:35:55 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44722/step-by-step-guide-to-running-genome-assembly</link>
	<title><![CDATA[Step-by-Step Guide to Running Genome Assembly]]></title>
	<description><![CDATA[<p>Genome assembly is a critical process in bioinformatics, enabling the reconstruction of an organism's genome from short DNA sequence reads. Whether you&rsquo;re working on a new microbial genome or a complex eukaryotic organism, this guide will walk you through the steps of genome assembly using state-of-the-art tools and best practices.</p><h4><strong>What is Genome Assembly?</strong></h4><p>Genome assembly involves piecing together short DNA sequence reads generated by sequencing platforms (e.g., Illumina, PacBio, Oxford Nanopore) into longer, contiguous sequences called contigs. This can be performed as:</p><ul>
<li><strong>De Novo Assembly</strong>: Without a reference genome.</li>
<li><strong>Reference-Guided Assembly</strong>: Using a reference genome to guide the assembly process.</li>
</ul><h4><strong>Step 1: Preparing Your Data</strong></h4><p>Before starting the assembly, ensure that your raw sequencing data is high quality.</p><ol>
<li>
<p><strong>Input Data</strong></p>
<ul>
<li><strong>Short Reads</strong>: Illumina sequencing generates short, accurate reads ideal for scaffolding.</li>
<li><strong>Long Reads</strong>: PacBio and Nanopore sequencing provide long reads for resolving repetitive regions.</li>
</ul>
</li>
<li>
<p><strong>Quality Control (QC)</strong><br />Use tools like <strong>FastQC</strong> or <strong>MultiQC</strong> to assess the quality of your reads:</p>
<div>
<div dir="ltr"><code>fastqc reads.fastq multiqc . </code></div>
</div>
<p>Look for issues like low-quality bases, adapter contamination, or overrepresented sequences.</p>
</li>
<li>
<p><strong>Read Trimming and Filtering</strong><br />Trim low-quality bases and adapters using <strong>Trimmomatic</strong> or <strong>Cutadapt</strong>:</p>
<div>
<div dir="ltr"><code>trimmomatic PE reads_R1.fastq reads_R2.fastq trimmed_R1.fastq trimmed_R2.fastq \ ILLUMINACLIP:adapters.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:20 MINLEN:36 </code></div>
</div>
</li>
</ol><h4><strong>Step 2: Choosing an Assembly Strategy</strong></h4><p>Select an assembly strategy based on your data type:</p><ul>
<li>
<p><strong>Short-Read Assemblers</strong>:</p>
<ul>
<li>SPAdes: Popular for microbial genomes.</li>
<li>Velvet: Fast for smaller genomes.</li>
</ul>
</li>
<li>
<p><strong>Long-Read Assemblers</strong>:</p>
<ul>
<li>Canu: Ideal for long-read datasets.</li>
<li>Flye: Versatile for small and large genomes.</li>
</ul>
</li>
<li>
<p><strong>Hybrid Assemblers</strong>:</p>
<ul>
<li>MaSuRCA: Combines short and long reads.</li>
<li>Unicycler: Optimized for bacterial genomes.</li>
</ul>
</li>
</ul><h4><strong>Step 3: Running the Assembly</strong></h4><h5><strong>3.1. SPAdes (Short-Read Assembly)</strong></h5><p>SPAdes is an excellent choice for small genomes, such as bacteria.</p><div><div dir="ltr"><code>spades.py -1 trimmed_R1.fastq -2 trimmed_R2.fastq -o spades_output </code></div></div><p>The output includes assembled contigs (<code>contigs.fasta</code>) and scaffolds (<code>scaffolds.fasta</code>).</p><h5><strong>3.2. Canu (Long-Read Assembly)</strong></h5><p>Canu is designed for high-error long reads from PacBio or Nanopore.</p><div><div dir="ltr"><code>canu -p genome -d canu_output genomeSize=4.7m -nanopore-raw reads.fastq </code></div></div><p>The output will be in <code>canu_output/genome.contigs.fasta</code>.</p><h5><strong>3.3. Hybrid Assembly with Unicycler</strong></h5><p>Unicycler combines short and long reads for improved assemblies.</p><div><div dir="ltr"><code>unicycler -1 trimmed_R1.fastq -2 trimmed_R2.fastq -l long_reads.fastq -o unicycler_output </code></div></div><h4><strong>Step 4: Assessing Assembly Quality</strong></h4><p>After assembly, evaluate its quality using the following tools:</p><ol>
<li>
<p><strong>QUAST</strong><br />QUAST generates assembly statistics, such as N50, genome size, and GC content:</p>
<div>
<div dir="ltr"><code>quast contigs.fasta -o quast_output </code></div>
</div>
</li>
<li>
<p><strong>BUSCO</strong><br />BUSCO checks genome completeness by identifying conserved genes:</p>
<div>
<div dir="ltr"><code>busco -i contigs.fasta -o busco_output -l fungi_odb10 -m genome </code></div>
</div>
</li>
<li>
<p><strong>Assembly Graph Visualization</strong><br />Visualize assembly graphs with <strong>Bandage</strong>:</p>
<div>
<div dir="ltr"><code>Bandage load assembly_graph.gfa </code></div>
</div>
</li>
</ol><hr><h4><strong>Step 5: Post-Assembly Steps</strong></h4><ol>
<li>
<p><strong>Polishing</strong><br />Improve assembly accuracy using tools like <strong>Pilon</strong> (for short reads) or <strong>Racon</strong> (for long reads).</p>
<div>
<div dir="ltr"><code>racon long_reads.fasta mapped_reads.sam contigs.fasta &gt; polished_contigs.fasta </code></div>
</div>
</li>
<li>
<p><strong>Scaffolding</strong><br />Link contigs into scaffolds using tools like <strong>SSPACE</strong> or <strong>Opera-LG</strong> if required.</p>
</li>
<li>
<p><strong>Annotation</strong><br />Annotate the assembled genome using <strong>Prokka</strong> for prokaryotes or <strong>Maker</strong> for eukaryotes.</p>
<div>
<div dir="ltr"><code>prokka --outdir annotation_output --prefix genome contigs.fasta </code></div>
</div>
</li>
</ol><h4><strong>Step 6: Sharing and Archiving</strong></h4><ol>
<li>
<p><strong>Submit to Public Repositories</strong><br />Share your assembly in databases like <strong>NCBI GenBank</strong>, <strong>ENA</strong>, or <strong>DDBJ</strong>.</p>
</li>
<li>
<p><strong>Metadata Preparation</strong><br />Include detailed metadata for your submission, such as organism name, sequencing platform, and coverage.</p>
</li>
</ol><h4><strong>Best Practices</strong></h4><ul>
<li>Always perform quality checks at each stage to ensure data integrity.</li>
<li>Use multiple tools to cross-validate results when working with complex genomes.</li>
<li>Document parameters and software versions for reproducibility.</li>
</ul><h4><strong>Conclusion</strong></h4><p>Genome assembly is a powerful process that transforms raw sequencing data into a coherent representation of an organism&rsquo;s genome. By following this step-by-step guide, you can successfully assemble genomes and uncover valuable biological insights. Whether you&rsquo;re assembling a microbial genome or tackling the complexities of a eukaryotic genome, these tools and strategies will set you on the path to success.</p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/10659/gps-dna-tracking-university-of-sheffield</guid>
	<pubDate>Sat, 10 May 2014 04:33:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/10659/gps-dna-tracking-university-of-sheffield</link>
	<title><![CDATA[GPS DNA tracking - University of Sheffield]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/Aap-s1kle4Q" frameborder="0" allowfullscreen></iframe>University of Sheffield geneticist and bioinformatics expert Dr Eran Elhaik demonstrates the power of his new DNA research, which allows people to discover their genetic homeland from 1000 years ago. Find out more about our biological research here http://www.sheffield.ac.uk/aps]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34940/jpred4-a-protein-secondary-structure-prediction-server</guid>
	<pubDate>Fri, 29 Dec 2017 16:14:28 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34940/jpred4-a-protein-secondary-structure-prediction-server</link>
	<title><![CDATA[JPred4: A Protein Secondary Structure Prediction Server]]></title>
	<description><![CDATA[<p><span>JPred4 (</span><a href="http://www.compbio.dundee.ac.uk/jpred4" target="">http://www.compbio.dundee.ac.uk/jpred4</a><span>) is the latest version of the popular JPred protein secondary structure prediction server which provides predictions by the JNet algorithm, one of the most accurate methods for secondary structure prediction.</span></p><p>Address of the bookmark: <a href="http://www.compbio.dundee.ac.uk/jpred4/" rel="nofollow">http://www.compbio.dundee.ac.uk/jpred4/</a></p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/10741/managing-and-analyzing-next-generation-sequence-data</guid>
	<pubDate>Sat, 10 May 2014 06:28:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/10741/managing-and-analyzing-next-generation-sequence-data</link>
	<title><![CDATA[Managing and Analyzing Next-Generation Sequence Data]]></title>
	<description><![CDATA[<p>Centralized Bioinformatics Core Facilities provide shared resources for the computational and IT requirements of the investigators in their department or institution. As such, they must be able to effectively react to new types of experimental technology. Recently faced with an unprecedented flood of data generated by the next generation of DNA sequencers, these groups found it necessary to respond quickly and efficiently to the informatics and infrastructure demands. Centralized Facilities newly facing this challenge need to anticipate time and design considerations of necessary components, including infrastructure upgrades, staffing, and tools for data analyses and management ...</p>
<p>More at http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000369</p><p>Address of the bookmark: <a href="http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000369" rel="nofollow">http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000369</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

</channel>
</rss>