<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/33856?offset=20</link>
	<atom:link href="https://bioinformaticsonline.com/related/33856?offset=20" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38804/grabb-selective-assembly-of-genomic-regions-a-new-niche-for-genomic-research</guid>
	<pubDate>Sat, 26 Jan 2019 18:58:16 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38804/grabb-selective-assembly-of-genomic-regions-a-new-niche-for-genomic-research</link>
	<title><![CDATA[GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research]]></title>
	<description><![CDATA[<p><span>GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at&nbsp;</span><a href="https://github.com/b-brankovics/grabb">https://github.com/b-brankovics/grabb</a><span>. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (</span><a href="https://hub.docker.com/r/brankovics/grabb/">https://hub.docker.com/r/brankovics/grabb/</a><span>).</span></p><p>Address of the bookmark: <a href="https://github.com/b-brankovics/grabb" rel="nofollow">https://github.com/b-brankovics/grabb</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40940/consed-a-finishing-package-bam-file-viewer-assembly-editor-autofinish-autoreport-autoedit-and-align-reads-to-reference-sequence</guid>
	<pubDate>Fri, 07 Feb 2020 07:16:22 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40940/consed-a-finishing-package-bam-file-viewer-assembly-editor-autofinish-autoreport-autoedit-and-align-reads-to-reference-sequence</link>
	<title><![CDATA[Consed--A Finishing Package (BAM File Viewer, Assembly Editor, Autofinish, Autoreport, Autoedit, and Align Reads To Reference Sequence)]]></title>
	<description><![CDATA[<ul>
<li>Supports Illumina, 454, other Next-Gen and Sanger Reads and allows mixtures of these read types</li>
<li>Consed includes BamScape which can view bam files with unlimited numbers of reads. BamScape can bring up consed to edit reads and the reference sequence in targeted regions.</li>
<li>Consed is compatible with Newbler, Cross_match, Phrap, MIRA, Velvet and PCAP output.</li>
<li>Quickly takes the user to each variant site for viewing (also available as an automated report)</li>
<li>Overview of assembly can help detect and fix misassemblies</li>
<li>Editing time reduced by the program's ability to pin-point problem areas</li>
<li>Editing is guided by error probabilities</li>
</ul><p>Address of the bookmark: <a href="http://www.phrap.org/consed/consed.html" rel="nofollow">http://www.phrap.org/consed/consed.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/42633/protocol-for-de-novo-genome-assembly-using-illumina-reads</guid>
	<pubDate>Sat, 16 Jan 2021 21:42:11 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/42633/protocol-for-de-novo-genome-assembly-using-illumina-reads</link>
	<title><![CDATA[Protocol for De novo Genome Assembly using Illumina Reads]]></title>
	<description><![CDATA[<p>In this protocol, we address and describe the de novo assembly method for small to medium-sized genomes.</p><p><strong>What is de novo genome assembly?<br /></strong>The method of taking a large number of short DNA sequences and placing them back together to create a reflection of the original chromosomes from which the DNA originated relates to genome assembly. No previous knowledge of the source DNA sequence length, structure or composition is inferred by De novo genome assemblies. The DNA of the target organism is split up into millions of tiny parts and read on a sequencing computer in a genome sequencing experiment. Depending on the sequencing system used, these "reads" range from 20 to 1000 nucleotide base pairs (bp) in length. Usually, length reads of 36 - 150 bp are produced for Illumina style short read sequencing. These reads can be either &ldquo;single ended&rdquo; as described above or &ldquo;paired end.&rdquo;</p><p><strong>Why genome assembly?</strong><br />In basic research into why and how they live, as well as in applied topics, identifying the DNA sequence of an organism is useful. Awareness of a DNA sequence may be useful in virtually any biological research because of the relevance of DNA to living things. For example, it may be used in medicine to classify, diagnose and eventually improve genetic disorder therapies. Similarly, pathogens study can lead to treatments for infectious diseases.</p><p><strong>Raw NGS data</strong><br />Reads can be saved as a Fasta file as text or in a FastQ file with their attributes.&nbsp;FastQ is the most common read file format since this is what the Illumina sequencing pipeline creates. This will henceforth be the subject of our conversation.</p><p><strong>In a nutshell the protocol:</strong> <br />Get the sequence file(s) read from the sequencing machine (s). <br />Look at the readings - have an idea of what you have and what the standard is like. <br />If required, raw data cleanup/quality trimming. <br />Choose an adequate parameter set for assembly. <br />Assemble the data into scaffolds/contigs. <br />Examine the assembly performance and determine the efficiency of the assembly.</p><p><strong>Read Quality Control:</strong><br />Check the qualiy with fastQC.<br />Script<br />https://bioinformaticsonline.com/snippets/view/42540/install-fastqc-using-conda</p><p>Quality trimming/cleanup of read files.<br />This function trims adapters, barcodes and other contaminants from the reads.<br />Script<br />https://bioinformaticsonline.com/snippets/view/42542/trimmomatic-command</p><p><strong>Genome Assembly:</strong><br />The object of this portion of the protocol is to explain the method of assembling the reads trimmed by quality into draft contigs.</p><blockquote><p>spades.py -1 illumina_R1.fastq.gz -2 illumina_R2.fastq.gz --careful --cov-cutoff auto -o result_of_spades_assembly_all_illumina</p></blockquote><p>A significant range of short-read assemblers are available. Everyone with strengths and disadvantages of their own. <br /><em>Some of the assemblers available include:</em><br />Velvet<br />SOAP-denovo<br />MIRA<br />ALLPATHS</p><p>Next step is to assess the suitability and what to do with a draft package of contiguous details for the remainder of the study now.&nbsp;Few stuff you can note about the contigs you just created:&nbsp;They're the draft Contigs. Any mis-assemblies can occur.</p><p><strong>Mis-assembly checking and assembly metric tools:</strong><br />QUAST - Quality assessment tool for genome assembly http://bioinf.spbau.ru/quast<br />Mauve assembly metrics - http://code.google.com/p/ngopt/wiki/How_To_Score_Genome_Assemblies_with_Mauve<br />InGAP-SV - https://sites.google.com/site/nextgengenomics/ingap and http://ingap.sourceforge.net/<br />inGAP is also useful for finding structural variants between genomes from read mappings.</p><p><strong>Genome finishing tools:</strong><br />Semi-automated gap fillers:<br />Gap filler - http://www.baseclear.com/landingpages/basetools-a-wide-range-of-bioinformatics-solutions/gapfiller/</p><p>IMAGE (V2) - http://sourceforge.net/apps/mediawiki/image2/index.php?title=Main_Page</p><p><strong>Genome visualisers and editors:</strong><br />Artemis - http://www.sanger.ac.uk/resources/software/artemis/<br />IGV - http://www.broadinstitute.org/igv/</p><p><strong>Automated and semi automated annotation tools:</strong><br />Prokka - https://github.com/tseemann/prokka<br />RAST - http://www.nmpdr.org/FIG/wiki/view.cgi/FIG/RapidAnnotationServer<br />JCVI Annotation Service - http://www.jcvi.org/cms/research/projects/annotation-service/</p><p><strong>Frequent command use for the analysis are at:</strong></p><p>https://bioinformaticsonline.com/blog/view/38765/list-of-tools-frequently-used-while-genome-assembly<br />https://bioinformaticsonline.com/pages/view/42275/frequent-parameters-for-bioinformatics-tools</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/43419/senior-bioinformatician-assembly-moore-aquatic-symbiosis-project-tree-of-life</guid>
  <pubDate>Sat, 02 Oct 2021 00:28:30 -0500</pubDate>
  <link></link>
  <title><![CDATA[Senior Bioinformatician (Assembly) Moore Aquatic Symbiosis Project Tree of Life]]></title>
  <description><![CDATA[
<p>You will have some previous experience with genome bioinformatics or other large scale scientific data analysis, or a newly qualified graduate student with data science skills interested in DNA sequence data. While desirable, previous experience with DNA sequencing data is not strictly necessary for the position. We have a strong publication record and culture of producing open data resources and open source software development. This role requires an investigative and solution-oriented mindset and excellent communication skills to work effectively within large national and international consortia. </p>

<p>More at https://jobs.sanger.ac.uk/vacancy/senior-bioinformatician-assembly-moore-aquatic-symbiosis-project-tree-of-life-458923.html</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/44342/ncbi-datasets%E2%80%AFpages</guid>
	<pubDate>Wed, 12 Jul 2023 06:29:31 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/44342/ncbi-datasets%E2%80%AFpages</link>
	<title><![CDATA[NCBI Datasets pages]]></title>
	<description><![CDATA[<p>Update! Assembly and Genome record pages now redirect to new NCBI Datasets pages. NCBI Datasets is a new resource that makes it easier to find and download genome data. Learn more: https://ncbiinsights.ncbi.nlm.nih.gov/2023/07/11/ncbi-datasets-genome-assembly-pages/&nbsp;<a href="https://ow.ly/GU3o50P8QH4"></a><a href="https://www.linkedin.com/feed/hashtag/?keywords=ncbicgr&amp;highlightedUpdateUrns=urn%3Ali%3Aactivity%3A7084592728260386816">#NCBICGR</a></p><p><span>Effective July 10, 2023, NCBI&rsquo;s Assembly and Genome record pages now redirect to&nbsp;</span>new<a href="https://www.ncbi.nlm.nih.gov/datasets/?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=datasets-genome-assembly-redirect-20230711"> NCBI Datasets </a><span>pages. As&nbsp;</span><a href="https://ncbiinsights.ncbi.nlm.nih.gov/2023/03/07/ncbi-datasets-genome-taxonomy-pages/?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=datasets-genome-assembly-redirect-20230711">previously announced</a><span>, these updates are part of our ongoing effort to modernize and improve your user experience. NCBI Datasets is a new resource that makes it easier to find and download genome data.  </span><span>&nbsp;</span></p><h5>The following pages have been updated:</h5><ul>
<li><span>The NCBI Assembly record pages now redirect to the new </span><a href="https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_023065955.2/?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=datasets-genome-assembly-redirect-20230711"><span>NCBI Datasets</span><strong><span> </span></strong><span>Genome</span></a><span> </span><span>record pages that describe assembled genomes and provide links to related NCBI tools such as Genome Data Viewer and BLAST. </span><span>&nbsp;</span></li>
<li><span>The NCBI</span><strong> </strong><span>Genome record pages now redirect to the </span><a href="https://www.ncbi.nlm.nih.gov/datasets/taxonomy/9644/?utm_source=ncbi_insights&amp;utm_medium=referral&amp;utm_campaign=datasets-genome-assembly-redirect-20230711"><span>NCBI Datasets</span><strong><span> </span></strong><span>Taxonomy</span></a><span> </span><span>record pages that provide a taxonomy-focused portal to genes, genomes, and additional NCBI resources.  </span><span>&nbsp;</span></li>
</ul><p><span>During this transition, you will have the option to return to the legacy Genome and Assembly record pages. We will remove the legacy pages in early 2024. </span><span>&nbsp;</span></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44722/step-by-step-guide-to-running-genome-assembly</guid>
	<pubDate>Fri, 13 Dec 2024 11:35:55 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44722/step-by-step-guide-to-running-genome-assembly</link>
	<title><![CDATA[Step-by-Step Guide to Running Genome Assembly]]></title>
	<description><![CDATA[<p>Genome assembly is a critical process in bioinformatics, enabling the reconstruction of an organism's genome from short DNA sequence reads. Whether you&rsquo;re working on a new microbial genome or a complex eukaryotic organism, this guide will walk you through the steps of genome assembly using state-of-the-art tools and best practices.</p><h4><strong>What is Genome Assembly?</strong></h4><p>Genome assembly involves piecing together short DNA sequence reads generated by sequencing platforms (e.g., Illumina, PacBio, Oxford Nanopore) into longer, contiguous sequences called contigs. This can be performed as:</p><ul>
<li><strong>De Novo Assembly</strong>: Without a reference genome.</li>
<li><strong>Reference-Guided Assembly</strong>: Using a reference genome to guide the assembly process.</li>
</ul><h4><strong>Step 1: Preparing Your Data</strong></h4><p>Before starting the assembly, ensure that your raw sequencing data is high quality.</p><ol>
<li>
<p><strong>Input Data</strong></p>
<ul>
<li><strong>Short Reads</strong>: Illumina sequencing generates short, accurate reads ideal for scaffolding.</li>
<li><strong>Long Reads</strong>: PacBio and Nanopore sequencing provide long reads for resolving repetitive regions.</li>
</ul>
</li>
<li>
<p><strong>Quality Control (QC)</strong><br />Use tools like <strong>FastQC</strong> or <strong>MultiQC</strong> to assess the quality of your reads:</p>
<div>
<div dir="ltr"><code>fastqc reads.fastq multiqc . </code></div>
</div>
<p>Look for issues like low-quality bases, adapter contamination, or overrepresented sequences.</p>
</li>
<li>
<p><strong>Read Trimming and Filtering</strong><br />Trim low-quality bases and adapters using <strong>Trimmomatic</strong> or <strong>Cutadapt</strong>:</p>
<div>
<div dir="ltr"><code>trimmomatic PE reads_R1.fastq reads_R2.fastq trimmed_R1.fastq trimmed_R2.fastq \ ILLUMINACLIP:adapters.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:20 MINLEN:36 </code></div>
</div>
</li>
</ol><h4><strong>Step 2: Choosing an Assembly Strategy</strong></h4><p>Select an assembly strategy based on your data type:</p><ul>
<li>
<p><strong>Short-Read Assemblers</strong>:</p>
<ul>
<li>SPAdes: Popular for microbial genomes.</li>
<li>Velvet: Fast for smaller genomes.</li>
</ul>
</li>
<li>
<p><strong>Long-Read Assemblers</strong>:</p>
<ul>
<li>Canu: Ideal for long-read datasets.</li>
<li>Flye: Versatile for small and large genomes.</li>
</ul>
</li>
<li>
<p><strong>Hybrid Assemblers</strong>:</p>
<ul>
<li>MaSuRCA: Combines short and long reads.</li>
<li>Unicycler: Optimized for bacterial genomes.</li>
</ul>
</li>
</ul><h4><strong>Step 3: Running the Assembly</strong></h4><h5><strong>3.1. SPAdes (Short-Read Assembly)</strong></h5><p>SPAdes is an excellent choice for small genomes, such as bacteria.</p><div><div dir="ltr"><code>spades.py -1 trimmed_R1.fastq -2 trimmed_R2.fastq -o spades_output </code></div></div><p>The output includes assembled contigs (<code>contigs.fasta</code>) and scaffolds (<code>scaffolds.fasta</code>).</p><h5><strong>3.2. Canu (Long-Read Assembly)</strong></h5><p>Canu is designed for high-error long reads from PacBio or Nanopore.</p><div><div dir="ltr"><code>canu -p genome -d canu_output genomeSize=4.7m -nanopore-raw reads.fastq </code></div></div><p>The output will be in <code>canu_output/genome.contigs.fasta</code>.</p><h5><strong>3.3. Hybrid Assembly with Unicycler</strong></h5><p>Unicycler combines short and long reads for improved assemblies.</p><div><div dir="ltr"><code>unicycler -1 trimmed_R1.fastq -2 trimmed_R2.fastq -l long_reads.fastq -o unicycler_output </code></div></div><h4><strong>Step 4: Assessing Assembly Quality</strong></h4><p>After assembly, evaluate its quality using the following tools:</p><ol>
<li>
<p><strong>QUAST</strong><br />QUAST generates assembly statistics, such as N50, genome size, and GC content:</p>
<div>
<div dir="ltr"><code>quast contigs.fasta -o quast_output </code></div>
</div>
</li>
<li>
<p><strong>BUSCO</strong><br />BUSCO checks genome completeness by identifying conserved genes:</p>
<div>
<div dir="ltr"><code>busco -i contigs.fasta -o busco_output -l fungi_odb10 -m genome </code></div>
</div>
</li>
<li>
<p><strong>Assembly Graph Visualization</strong><br />Visualize assembly graphs with <strong>Bandage</strong>:</p>
<div>
<div dir="ltr"><code>Bandage load assembly_graph.gfa </code></div>
</div>
</li>
</ol><hr><h4><strong>Step 5: Post-Assembly Steps</strong></h4><ol>
<li>
<p><strong>Polishing</strong><br />Improve assembly accuracy using tools like <strong>Pilon</strong> (for short reads) or <strong>Racon</strong> (for long reads).</p>
<div>
<div dir="ltr"><code>racon long_reads.fasta mapped_reads.sam contigs.fasta &gt; polished_contigs.fasta </code></div>
</div>
</li>
<li>
<p><strong>Scaffolding</strong><br />Link contigs into scaffolds using tools like <strong>SSPACE</strong> or <strong>Opera-LG</strong> if required.</p>
</li>
<li>
<p><strong>Annotation</strong><br />Annotate the assembled genome using <strong>Prokka</strong> for prokaryotes or <strong>Maker</strong> for eukaryotes.</p>
<div>
<div dir="ltr"><code>prokka --outdir annotation_output --prefix genome contigs.fasta </code></div>
</div>
</li>
</ol><h4><strong>Step 6: Sharing and Archiving</strong></h4><ol>
<li>
<p><strong>Submit to Public Repositories</strong><br />Share your assembly in databases like <strong>NCBI GenBank</strong>, <strong>ENA</strong>, or <strong>DDBJ</strong>.</p>
</li>
<li>
<p><strong>Metadata Preparation</strong><br />Include detailed metadata for your submission, such as organism name, sequencing platform, and coverage.</p>
</li>
</ol><h4><strong>Best Practices</strong></h4><ul>
<li>Always perform quality checks at each stage to ensure data integrity.</li>
<li>Use multiple tools to cross-validate results when working with complex genomes.</li>
<li>Document parameters and software versions for reproducibility.</li>
</ul><h4><strong>Conclusion</strong></h4><p>Genome assembly is a powerful process that transforms raw sequencing data into a coherent representation of an organism&rsquo;s genome. By following this step-by-step guide, you can successfully assemble genomes and uncover valuable biological insights. Whether you&rsquo;re assembling a microbial genome or tackling the complexities of a eukaryotic genome, these tools and strategies will set you on the path to success.</p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</guid>
	<pubDate>Tue, 01 Oct 2024 03:45:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</link>
	<title><![CDATA[Bioinformatics Lecture Notes]]></title>
	<description><![CDATA[<h1 style="text-align: center;">Study Resources for</h1><h1 style="text-align: center;">ECM3413 - Bioinformatics</h1><p style="text-align: center;">Contents</p><p style="text-align: center;">&nbsp;</p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#GenInfo">General Information</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Lecture Slides</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Past Exam Paper</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Assess">Continuous Assessments</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Reading">Suggested Reading List</a></p><p><a name="GenInfo" id="GenInfo"></a><strong>General Information</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">This module runs in Semester 2.&nbsp;</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">It is taught by&nbsp;<a href="http://www.secam.ex.ac.uk/staff/index.php?nav=40&amp;group=Teaching%20Fellows&amp;user_directory_limit=&amp;user_directory_order=&amp;sid=182">Dr Ed Keedwell</a>&nbsp;(Module Coordinator)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Module Descriptor</strong>:&nbsp;&nbsp;<a href="http://www.secam.ex.ac.uk/student/modules?mid=393">ECM3413</a></td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Lecture Times</strong>: Tuesday 5pm,&nbsp; 171| Thursday, 171</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Workshop Times</strong>: Wednesday 11am Blue Room (Weeks 29,33 &amp;40)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p><strong>Assessment:&nbsp;</strong>2 CAs each worth 15% | 1 Examination worth 70%</p>
</td>
</tr>
</tbody>
</table><p>&nbsp;&nbsp;</p><p style="text-align: left;"><strong><a name="Slides" id="Slides"></a>Lecture Slides&nbsp;</strong>(if you have to print slides, to save your ink choose 'print in black and white' on the print menu)</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.pdf">PDF</a>| Lecture 1 - Introduction to Bioinformatics (&amp; Biology)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.pdf">PDF</a>| Lecture 2 - Genome Sequences: from fragments to sequences</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.pdf">PDF</a>| Lecture 3 - Sequence Alignment 1</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.pdf">PDF</a>| Lecture 4 - Global Pairwise Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.pdf">PDF</a>| Lecture 5 - Local Pairwise Sequence Alignment (Smith-Waterman &amp; BLAST)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop1.doc">DOC</a>| Workshop 1 - Using BLAST and other Bioinformatics Databases</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.pdf">PDF</a>| Lecture 6 - Multiple Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.pdf">PDF</a>| Lecture 7 - BLAST (in more detail) &amp; FASTA</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.pdf">PDF</a>| Lecture 8 - Sequence Alignment Conclusion &amp; Other Sequence Analyses</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.pdf">PDF</a>| Lecture 9 - Markov Chains and Intro to Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.pdf">PDF</a>| Lecture 10 - Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.pdf">PDF</a>| Lecture 11 - Classification in Bioinformatics</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop2.doc">DOC</a>|Workshop 2 - Using See5</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Workshop Data - Part 1 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.names">adult.names&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.data">adult.data&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.test">adult.test,&nbsp;</a>Part 3 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.names">wdbc.names</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.data">wdbc.data</a></p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.pdf">PDF</a>| Lecture 12 - Gene Expression Data</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.pdf">PDF</a>| Lecture 13 - Decision Trees and Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.pdf">PDF</a>| Lecture 14 - Other Methods for Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.pdf">PDF</a>| Lecture 15 - Gene Regulation</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.pdf">PDF</a>| Lecture 16 - Neural Networks in Gene Expression Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.pdf">PDF</a>| Lecture 17 - Genome Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.pdf">PDF</a>| Lecture 18 - Conclusion/Revision Lecture</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;">For some reason best known to itself, my PDF creator doesn't like the slide with the substitution matrix on.&nbsp; Therefore this has been removed from Lectures 3 and 7 for the PDF copy only - however, more information on these matrices can be found&nbsp;<a href="http://www.ebi.ac.uk/help/matrix.html">here</a>.</p><p style="text-align: left;"><strong><a name="Past%20Paper"></a>Past Exam Paper</strong></p><p style="text-align: left;">The paper from 2007/8 can be found&nbsp;<a href="http://library.exeter.ac.uk/exampapers/">here</a>.</p><p style="text-align: left;"><strong><a name="Assess" id="Assess"></a>Continuous Assessments</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA1ECM3413.pdf">PDF</a>|&nbsp; CA1 - Manual Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA2ECM3413.pdf">PDF</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.names">Promoter.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.data">Promoter.data</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.names">ML.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.data">ML.data</a>| CA2 - Data Mining in Bioinformatics</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong><a name="Reading" id="Reading"></a>Suggested Reading List</strong></p><p style="text-align: left;"><strong>General Bioinformatics</strong></p><p>&lt;="top"&gt;Xiong, J., (2006) Essential Bioinformatics, Cambridge University Press</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Lesk, A.M., (2002) Introduction to Bioinformatics, Oxford University Press</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Higgs, P.G., (2005) Bioinformatics and Molecular Evolution,&nbsp; Blackwell Publishing</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong>Machine Learning in Bioinformatics</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Baldi, P., Brunak, S., (2001) Bioinformatics: The Machine Learning Approach, MIT Press</p>
</td>
</tr>
<tr>
<td><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Keedwell, E., Narayanan, A., (2005) Intelligent Bioinformatics: The Application of Artificial Intelligence Techniques to Bioinformatics Problems, Wiley</p>
</td>
</tr>
</tbody>
</table>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2726/comparison-of-short-read-de-novo-alignment-algorithms</guid>
	<pubDate>Wed, 21 Aug 2013 07:56:01 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2726/comparison-of-short-read-de-novo-alignment-algorithms</link>
	<title><![CDATA[Comparison of Short Read De Novo Alignment Algorithms]]></title>
	<description><![CDATA[<p>Excellent article to introduce different sequencing methods along with tools for de novo assembly of sequencing reads and their relevant references.</p>
<p>Title:&nbsp;<strong>Comparison of Short Read De Novo Alignment Algorithms&nbsp;</strong></p>
<p>Author<strong>: Nikhil Gopal</strong></p><p>Address of the bookmark: <a href="http://biochem218.stanford.edu/Projects%202011/Gopal%202011.pdf" rel="nofollow">http://biochem218.stanford.edu/Projects%202011/Gopal%202011.pdf</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34413/coursera-genome-assembly-tutorial</guid>
	<pubDate>Sat, 25 Nov 2017 08:57:25 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34413/coursera-genome-assembly-tutorial</link>
	<title><![CDATA[coursera genome assembly tutorial]]></title>
	<description><![CDATA[<p><span>Solutions to Coursera Genome Sequencing (Bioinformatics II)</span></p><p>Address of the bookmark: <a href="https://github.com/iansealy/coursera-assembly" rel="nofollow">https://github.com/iansealy/coursera-assembly</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34528/cope-an-accurate-k-mer-based-pair-end-reads-connection-tool-to-facilitate-genome-assembly</guid>
	<pubDate>Wed, 06 Dec 2017 02:08:14 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34528/cope-an-accurate-k-mer-based-pair-end-reads-connection-tool-to-facilitate-genome-assembly</link>
	<title><![CDATA[COPE: an accurate k-mer-based pair-end reads connection tool to facilitate genome assembly]]></title>
	<description><![CDATA[<p><span>An efficient tool called Connecting Overlapped Pair-End (COPE) reads, to connect overlapping pair-end reads using k-mer frequencies. We evaluated our tool on 30&times; simulated pair-end reads from Arabidopsis thaliana with 1% base error. COPE connected over 99% of reads with 98.8% accuracy, which is, respectively, 10 and 2% higher than the recently published tool FLASH. When COPE is applied to real reads for genome assembly, the resulting contigs are found to have fewer errors and give a 14-fold improvement in the N50 measurement when compared with the contigs produced using unconnected reads.</span></p><p>Address of the bookmark: <a href="ftp://ftp.genomics.org.cn/pub/cope" rel="nofollow">ftp://ftp.genomics.org.cn/pub/cope</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>