<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: All site pages]]></title>
	<link>https://bioinformaticsonline.com/pages/all?</link>
	<atom:link href="https://bioinformaticsonline.com/pages/all?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44672/libraries-or-management-tools-for-high-throughput-sequencing-data</guid>
	<pubDate>Fri, 04 Oct 2024 02:45:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44672/libraries-or-management-tools-for-high-throughput-sequencing-data</link>
	<title><![CDATA[Libraries or management tools for high throughput sequencing data]]></title>
	<description><![CDATA[<ul>
<li><a href="http://gatb.inria.fr/"><span>GATB</span></a>&nbsp;Library.&nbsp;The&nbsp;<span>Genome Analysis Toolbox with de-Bruijn graph.&nbsp;</span>A large part of tools developed by the GenScale team are based on this library.<br />These methods enable the analysis of data sets of any size on multi-core desktop computers, including very huge amount of reads data coming from any kind of organisms such as bacteria, plants, animals and even complex samples (<em>e.g.</em>&nbsp;metagenomes). Among them are (the full is available here:&nbsp;<a href="https://gatb.inria.fr/software/">https://gatb.inria.fr/software/</a>):</li>
<li><a href="https://github.com/morispi/LRez"><span>LRez</span></a>: C++ Library and toolkit for the barcode-based management and indexation of linked-read datasets.</li>
</ul><h2>Variant calling and/or genotyping</h2><ul>
<li><a href="https://gatb.inria.fr/software/discosnp/" title="DiscoSNP">DiscoSNP++ and&nbsp;discoSnpRAD</a>: Reference-free small variant discovery (SNPs and indels)</li>
<li><a href="https://gatb.inria.fr/software/mind-the-gap/" title="MindTheGap">MindTheGap</a>: Detection and assembly of large insertion variants</li>
<li><a href="https://gatb.inria.fr/software/takeabreak/" title="TakeABreak">TakeABreak</a>:&nbsp;reference-free inversion discovery tool</li>
<li><a href="https://github.com/llecompte/SVJedi">SVJedi</a>: Structural Variant genotyper with long read data</li>
<li><a href="https://github.com/SandraLouise/SVJedi-graph">SVJedi-graph</a>: Structural Variant genotyper with long read data using a variation graph</li>
</ul><h2>Sequence assembly</h2><ul>
<li><a href="https://github.com/cguyomar/MinYS">MinYS</a>: reference-guided genome assembly in metagenomics data</li>
<li><a href="https://github.com/anne-gcd/MTG-Link">MTG-link</a>: local assembly tool for linked-read data</li>
<li><a href="https://gatb.inria.fr/software/minia/" title="Minia">Minia</a>: De novo short read assembler</li>
<li><a href="https://gatb.inria.fr/de-novo-genome-assembly/">de-novo pipeline</a>:&nbsp;<em>de-novo</em>&nbsp;assembly pipeline (error correction / contigs / scaffolding) for genomes and meta-genomes</li>
<li><a href="https://gatb.inria.fr/software/mapsembler/" title="Mapsembler2">Mapsembler2</a>: Targeted assembly (not maintained)</li>
</ul><h2>Managing k-mers &amp; indexation</h2><ul>
<li><a href="https://github.com/lrobidou/findere">findere</a>:&nbsp;simple strategy for speeding up queries and for reducing false positive calls from any Approximate Membership Query data structure.
<ul>
<li><a href="https://github.com/lrobidou/fimpera">fimpera</a>&nbsp;extends findere adding the abundance information.</li>
</ul>
</li>
<li><a href="https://github.com/tlemane/kmtricks">kmtricks</a>:&nbsp;modular tool suite for counting kmers, and constructing Bloom filters or kmer matrices, for large collections of sequencing data.</li>
<li><a href="https://github.com/tlemane/kmindex">kmindex&nbsp;</a>is a tool for indexing and querying sequencing samples. It is built on top of kmtricks.</li>
<li><a href="https://github.com/pierrepeterlongo/back_to_sequences">back to sequences</a>: Find sequences (reads, unitigs, genes) related to a set of kmers in large datasets, in a matter of seconds.</li>
<li><a href="https://github.com/vicLeva/bqf">Backpack Quotient Filter</a>:&nbsp;k-mer indexing data structure with abundance</li>
<li><a href="http://github.com/GATB/rconnector">short read connector</a>:&nbsp;Detect similar reads from potentially large read set</li>
<li><a href="https://gatb.inria.fr/software/dsk/" title="DSK">DSK</a>:&nbsp;Count K-mer in sequences</li>
</ul><h2>Pangenome graph manipulation</h2><ul>
<li><a href="https://github.com/Tharos-ux/pancat">Pancat</a>: Pangenome Comparison and Analysis Toolkit</li>
<li><a href="https://pypi.org/project/gfagraphs/">GFAGraphs</a>: a Python library to handle pangenome graph files in GFA format.</li>
</ul><h2>Comparative metagenomics with k-mers</h2><ul>
<li><a href="https://github.com/GATB/simka">Simka and SimkaMin</a>:&nbsp;Comparative metagenomics for large-scale datasets</li>
<li><a href="https://team.inria.fr/genscale/high-throughput-sequence-analysis/compreads-metagenomic-data-analysis/">Comparead &amp; Commet</a>:&nbsp;comparison of metagenomic datasets</li>
</ul><h2>Species and bacterial strains identification</h2><ul>
<li><a href="https://github.com/gsiekaniec/ORI">ORI</a>: software using long nanopore reads to identify bacteria present in a sample at the strain level</li>
<li><a href="https://github.com/kevsilva/StrainFLAIR">StrainFLAIR</a>:&nbsp;STRAIN-level proFiLing using vArIation gRaph</li>
</ul><h2>General-purpose sequencing data manipulation</h2><ul>
<li><a href="https://team.inria.fr/genscale/ngs-software/gassst/">GASSST</a>:&nbsp;long read mapper</li>
<li><a href="https://gatb.inria.fr/software/leon/" title="Leon">Leon</a>: short read compressor (now included in GATB-core)</li>
<li><a href="https://gatb.inria.fr/software/bloocoo/" title="Bloocoo">Bloocoo</a>:&nbsp;short read corrector</li>
<li><a href="https://github.com/GATB/bcalm">BCALM</a>:&nbsp;Construct compacted de Bruijn graphs (unitigs)</li>
</ul><h2>&nbsp;Protein Structure</h2><ul>
<li><a href="https://team.inria.fr/genscale/protein-structure/a-purva-contact-map-overlap-solver/">A_Purva</a>:&nbsp;Contact Map Overlap solver</li>
<li><a href="https://team.inria.fr/genscale/protein-structure/md-jeep-distance-geomtry-solver/">MD-Jeep</a>:&nbsp;Distance Geometry solver</li>
<li><a href="https://team.inria.fr/genscale/csa-comparative-structural-alignment/">CSA</a>:&nbsp;Comparative Structural Alignment</li>
</ul><h2>Workflow</h2><ul>
<li><a href="https://team.inria.fr/genscale/workflows/slicee/">SLICEE</a>:&nbsp;parallel execution of bioinformatics workflows</li>
</ul><h3>Comparative Genomics</h3><ul>
<li><a href="https://team.inria.fr/genscale/comparative-genomics/cassis/">CASSIS</a>:&nbsp;detection of rearrangement breakpoints</li>
<li><a href="https://team.inria.fr/genscale/high-throughput-sequence-analysis/plast-intensive-sequence-comparison/">PLAST</a>:&nbsp;intensive bank-to-bank sequence comparison</li>
<li><a href="https://github.com/stephanierobin/DrjBreakpointFinder">DRJBreakpointFinder</a>: detection and precise localization of excision sites in proviral segments</li>
</ul>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</guid>
	<pubDate>Tue, 01 Oct 2024 03:45:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44667/bioinformatics-lecture-notes</link>
	<title><![CDATA[Bioinformatics Lecture Notes]]></title>
	<description><![CDATA[<h1 style="text-align: center;">Study Resources for</h1><h1 style="text-align: center;">ECM3413 - Bioinformatics</h1><p style="text-align: center;">Contents</p><p style="text-align: center;">&nbsp;</p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#GenInfo">General Information</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Lecture Slides</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Past%20Paper">Past Exam Paper</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Assess">Continuous Assessments</a></p><p style="text-align: center;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/#Reading">Suggested Reading List</a></p><p><a name="GenInfo" id="GenInfo"></a><strong>General Information</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">This module runs in Semester 2.&nbsp;</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">It is taught by&nbsp;<a href="http://www.secam.ex.ac.uk/staff/index.php?nav=40&amp;group=Teaching%20Fellows&amp;user_directory_limit=&amp;user_directory_order=&amp;sid=182">Dr Ed Keedwell</a>&nbsp;(Module Coordinator)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Module Descriptor</strong>:&nbsp;&nbsp;<a href="http://www.secam.ex.ac.uk/student/modules?mid=393">ECM3413</a></td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Lecture Times</strong>: Tuesday 5pm,&nbsp; 171| Thursday, 171</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top"><strong>Workshop Times</strong>: Wednesday 11am Blue Room (Weeks 29,33 &amp;40)</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p><strong>Assessment:&nbsp;</strong>2 CAs each worth 15% | 1 Examination worth 70%</p>
</td>
</tr>
</tbody>
</table><p>&nbsp;&nbsp;</p><p style="text-align: left;"><strong><a name="Slides" id="Slides"></a>Lecture Slides&nbsp;</strong>(if you have to print slides, to save your ink choose 'print in black and white' on the print menu)</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture1.pdf">PDF</a>| Lecture 1 - Introduction to Bioinformatics (&amp; Biology)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture2.pdf">PDF</a>| Lecture 2 - Genome Sequences: from fragments to sequences</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture3.pdf">PDF</a>| Lecture 3 - Sequence Alignment 1</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture4.pdf">PDF</a>| Lecture 4 - Global Pairwise Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture5.pdf">PDF</a>| Lecture 5 - Local Pairwise Sequence Alignment (Smith-Waterman &amp; BLAST)</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop1.doc">DOC</a>| Workshop 1 - Using BLAST and other Bioinformatics Databases</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture6.pdf">PDF</a>| Lecture 6 - Multiple Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture7.pdf">PDF</a>| Lecture 7 - BLAST (in more detail) &amp; FASTA</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture8.pdf">PDF</a>| Lecture 8 - Sequence Alignment Conclusion &amp; Other Sequence Analyses</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture9.pdf">PDF</a>| Lecture 9 - Markov Chains and Intro to Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture10.pdf">PDF</a>| Lecture 10 - Hidden Markov Models</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture11.pdf">PDF</a>| Lecture 11 - Classification in Bioinformatics</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOWorkshop2.doc">DOC</a>|Workshop 2 - Using See5</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Workshop Data - Part 1 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.names">adult.names&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.data">adult.data&nbsp;</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/adult.test">adult.test,&nbsp;</a>Part 3 -&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.names">wdbc.names</a>|&nbsp;<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/wdbc.data">wdbc.data</a></p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture12.pdf">PDF</a>| Lecture 12 - Gene Expression Data</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture13.pdf">PDF</a>| Lecture 13 - Decision Trees and Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture14.pdf">PDF</a>| Lecture 14 - Other Methods for Gene Expression Classification</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture15.pdf">PDF</a>| Lecture 15 - Gene Regulation</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture16.pdf">PDF</a>| Lecture 16 - Neural Networks in Gene Expression Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture17.pdf">PDF</a>| Lecture 17 - Genome Analysis</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.ppt">PPT</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/BIOLecture18.pdf">PDF</a>| Lecture 18 - Conclusion/Revision Lecture</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;">For some reason best known to itself, my PDF creator doesn't like the slide with the substitution matrix on.&nbsp; Therefore this has been removed from Lectures 3 and 7 for the PDF copy only - however, more information on these matrices can be found&nbsp;<a href="http://www.ebi.ac.uk/help/matrix.html">here</a>.</p><p style="text-align: left;"><strong><a name="Past%20Paper"></a>Past Exam Paper</strong></p><p style="text-align: left;">The paper from 2007/8 can be found&nbsp;<a href="http://library.exeter.ac.uk/exampapers/">here</a>.</p><p style="text-align: left;"><strong><a name="Assess" id="Assess"></a>Continuous Assessments</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA1ECM3413.pdf">PDF</a>|&nbsp; CA1 - Manual Sequence Alignment</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;"><a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/CA2ECM3413.pdf">PDF</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.names">Promoter.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/Promoter.data">Promoter.data</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.names">ML.names</a>|<a href="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/ML.data">ML.data</a>| CA2 - Data Mining in Bioinformatics</p>
</td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong><a name="Reading" id="Reading"></a>Suggested Reading List</strong></p><p style="text-align: left;"><strong>General Bioinformatics</strong></p><p>&lt;="top"&gt;Xiong, J., (2006) Essential Bioinformatics, Cambridge University Press</p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Lesk, A.M., (2002) Introduction to Bioinformatics, Oxford University Press</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Higgs, P.G., (2005) Bioinformatics and Molecular Evolution,&nbsp; Blackwell Publishing</p>
</td>
</tr>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
</tr>
</tbody>
</table><p style="text-align: left;">&nbsp;</p><p style="text-align: left;"><strong>Machine Learning in Bioinformatics</strong></p><table width="100%" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="baseline"><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Baldi, P., Brunak, S., (2001) Bioinformatics: The Machine Learning Approach, MIT Press</p>
</td>
</tr>
<tr>
<td><img src="https://empslocal.ex.ac.uk/people/staff/reverson/sr/oldECM3413/blubul1a.gif" alt="bullet" width="15" height="15" style="border: 0px; margin-left: 13px; margin-right: 13px; border: 0px;"></td>
<td valign="top">
<p style="text-align: left;">Keedwell, E., Narayanan, A., (2005) Intelligent Bioinformatics: The Application of Artificial Intelligence Techniques to Bioinformatics Problems, Wiley</p>
</td>
</tr>
</tbody>
</table>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44637/tools-to-access-the-quality-of-your-assembled-genome</guid>
	<pubDate>Thu, 08 Aug 2024 23:31:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44637/tools-to-access-the-quality-of-your-assembled-genome</link>
	<title><![CDATA[Tools to access the quality of your assembled genome !]]></title>
	<description><![CDATA[<ul dir="auto">
<li><a href="https://github.com/linsalrob/fasta_validator">FASTA VALIDATOR</a>&nbsp;+&nbsp;<a href="https://github.com/shenwei356/seqkit">SEQKIT RMDUP</a>: FASTA validation</li>
<li><a href="https://genometools.org/tools/gt_gff3validator.html">GENOMETOOLS GT GFF3VALIDATOR</a>: GFF3 validation</li>
<li><a href="https://github.com/PlantandFoodResearch/assemblathon2-analysis/blob/a93cba25d847434f7eadc04e63b58c567c46a56d/assemblathon_stats.pl">ASSEMBLATHON STATS</a>: Assembly statistics</li>
<li><a href="https://genometools.org/tools/gt_stat.html">GENOMETOOLS GT STAT</a>: Annotation statistics</li>
<li><a href="https://github.com/ncbi/fcs">NCBI FCS ADAPTOR</a>: Adaptor contamination pass/fail</li>
<li><a href="https://github.com/ncbi/fcs">NCBI FCS GX</a>: Foreign organism contamination pass/fail</li>
<li><a href="https://gitlab.com/ezlab/busco">BUSCO</a>: Gene-space completeness estimation</li>
<li><a href="https://github.com/tolkit/telomeric-identifier">TIDK</a>: Telomere repeat identification</li>
<li><a href="https://github.com/oushujun/LTR_retriever/blob/master/LAI">LAI</a>: Continuity of repetitive sequences</li>
<li><a href="https://github.com/DerrickWood/kraken2">KRAKEN2</a>: Taxonomy classification</li>
<li><a href="https://github.com/igvteam/juicebox.js">HIC CONTACT MAP</a>: Alignment and visualisation of HiC data</li>
<li><a href="https://github.com/mummer4/mummer">MUMMER</a>&nbsp;&rarr;&nbsp;<a href="http://circos.ca/documentation/">CIRCOS</a>&nbsp;+&nbsp;<a href="https://plotly.com/">DOTPLOT</a>&nbsp;&amp;&nbsp;<a href="https://github.com/lh3/minimap2">MINIMAP2</a>&nbsp;&rarr;&nbsp;<a href="https://github.com/schneebergerlab/plotsr">PLOTSR</a>: Synteny analysis</li>
<li><a href="https://github.com/marbl/merqury">MERQURY</a>: K-mer completeness, consensus quality and phasing assessment</li>
</ul>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44618/important-bioinformatics-tools</guid>
	<pubDate>Tue, 30 Jul 2024 05:03:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44618/important-bioinformatics-tools</link>
	<title><![CDATA[Important Bioinformatics Tools !]]></title>
	<description><![CDATA[<p><span>1. Ktrim: An extra-fast, accurate adapter trimmer for sequencing data. It processes FASTQ files from multiple lanes with minimal mismatching and over-trimming of adapters.</span><span><br /></span><span><br /></span><span>2. BWA MEM: A reliable alignment tool (particularly for mapping ALT contigs and HLA genes, which are not fully addressed in BWA-MEM2).</span><span><br /></span><span><br /></span><span>3. Sambamba markdup: Quickly marks or removes duplicate reads using Picard's criteria.</span><span><br /></span><span><br /></span><span>4. ichorCNA: Estimates the tumor DNA fraction in cell-free DNA from ultra-low-pass whole genome sequencing (0.1x coverage) based on copy number alterations (CNA).</span><span><br /></span><span><br /></span><span>5. Fragle: A deep learning method for quantifying ctDNA levels from cell-free DNA fragmentomic profiles. It detects TF as low as ~1% ctDNA and works with targeted genomic panel sequencing data.</span><span><br /></span><span><br /></span><span>6. AlfredQC: A quality control tool for high-throughput sequencing data. It assesses metrics like read quality scores, GC content, and duplication rates, visualized through detailed plots and summary statistics.</span><span><br /></span><span><br /></span><span>7. Mosdepth: A fast tool for calculating sequencing coverage depth, offering a quicker alternative to samtools/sambamba depth by processing BAM and CRAM files.</span><span><br /></span><span><br /></span><span>8. Bedtools: A versatile toolkit for genomics, enabling operations like intersect, merge, count, and shuffle on genomic intervals across formats such as BAM, BED, GFF/GTF, and VCF.</span><span><br /></span><span><br /></span><span>9. Datamash: A command-line tool for basic numeric, textual, and statistical operations on input data streams. It supports operations such as grouping, sorting, transposing, and performing arithmetic calculations on tabular data.</span><span><br /></span><span><br /></span><span>10.</span><span> </span><a href="http://gwf.app/" target="_self">gwf.app</a><span>: A pragmatic alternative to Snakemake. Developed at</span><span> </span><a href="https://www.linkedin.com/company/aarhus-university-denmark-/" target="_self"><span>Aarhus University</span></a><span>, this flexible, generic workflow tool builds and runs large scientific workflows.</span></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44377/mitochondrial-genome-assembly-tools</guid>
	<pubDate>Wed, 06 Sep 2023 00:37:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44377/mitochondrial-genome-assembly-tools</link>
	<title><![CDATA[Mitochondrial genome assembly tools !]]></title>
	<description><![CDATA[<p>Mitochondrial genome assembly tools are specialized software and algorithms designed to accurately reconstruct the mitochondrial genome (mitogenome) from sequencing data, typically obtained through techniques like next-generation sequencing (NGS). The mitochondrial genome is relatively small compared to the nuclear genome, making it an ideal target for assembly. Here are some commonly used mitochondrial genome assembly tools:</p><p><strong>MitoFinder:</strong> Mitofinder is a pipeline to assemble mitochondrial genomes and annotate mitochondrial genes from trimmed read sequencing data.</p><p><strong>MitoHiFi:</strong> a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads</p><p>MITObim: MITObim is a tool specifically developed for the iterative assembly of mitochondrial genomes. It starts with a reference mitogenome and iteratively refines the assembly using the read data.</p><p><strong>MITOS:</strong> MITOS is a web-based platform that provides a pipeline for annotating mitochondrial genomes. It integrates multiple software tools for assembly, annotation, and visualization of mitogenomes.</p><p><strong>MIRA:</strong> MIRA (Mimicking Intelligent Read Assembly) is a versatile genome assembly tool that can be used for mitochondrial genome assembly. It supports various sequencing technologies and allows for reference-based or de novo assembly.</p><p><strong>NOVOPlasty:</strong> NOVOPlasty is a user-friendly tool designed for de novo assembly of organelle genomes, including mitochondria. It utilizes a seed-and-extend algorithm and is suitable for both short-read and long-read data.</p><p><strong>MITOS2:</strong> MITOS2 is an updated version of the MITOS pipeline, which automates the annotation of mitochondrial genomes. It provides improved accuracy and additional features for mitochondrial genome analysis.</p><p><strong>GetOrganelle:</strong> While primarily designed for chloroplast genome assembly, GetOrganelle can also be used for mitochondrial genome assembly. It is particularly useful for dealing with high-throughput sequencing data.</p><p><strong>SPAdes:</strong> SPAdes (St. Petersburg genome assembler) is a versatile genome assembly tool that can be employed for mitochondrial genome assembly, especially when dealing with complex datasets that may contain nuclear mitochondrial DNA sequences (numts).</p><p><strong>IDBA-UD:</strong> IDBA-UD (Iterative De Bruijn Graph De Novo Assembler) is another de novo assembly tool that can be used for mitochondrial genome assembly, especially in cases with relatively low coverage.</p><p><strong>Velvet:</strong> Velvet is a de novo assembly tool that can be applied to mitochondrial genome assembly, especially when working with short-read data.</p><p>When selecting a mitochondrial genome assembly tool, it's important to consider the specific characteristics of your sequencing data, such as read length and coverage, as well as the complexity of the mitochondrial genome. Additionally, some tools are better suited for specific organisms or research objectives, so choosing the right tool will depend on your particular project requirements.</p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</guid>
	<pubDate>Thu, 31 Aug 2023 02:43:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44371/steps-to-find-all-the-repeats-in-the-genome</link>
	<title><![CDATA[Steps to find all the repeats in the genome !]]></title>
	<description><![CDATA[<div><p>To find repeats in a genome from 2 to 9 length using a Perl script, you can use the RepeatMasker tool with the "--length" option<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. Here's a step-by-step guide:</p></div><div><ol>
<li>Install RepeatMasker: First, you need to install RepeatMasker on your system. You can download it from the RepeatMasker website<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
</ol></div><div><ol>
<li>Prepare the genome sequence: Make sure you have the genome sequence in a FASTA file format. Let's assume the file is named "genome.fasta".</li>
</ol><blockquote><p>./RepeatMasker -pa &lt;number_of_processors&gt; -nolow -norna -no_is -div &lt;divergence_value&gt; -lib RepeatMaskerLib.embl -gff -xsmall -small -poly -species &lt;species_name&gt; -dir &lt;output_directory&gt; -length &lt;min_length&gt;-&lt;max_length&gt; genome.fasta</p></blockquote><div><p>Replace the following placeholders with appropriate values:</p><ul>
<li><code>&lt;number_of_processors&gt;</code>: The number of processors/threads you want to use for parallel processing.</li>
<li><code>&lt;divergence_value&gt;</code>: The divergence value for the species you are analyzing. You can find divergence values for different species in the RepeatMasker documentation<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>.</li>
<li><code>&lt;species_name&gt;</code>: The name of the species you are analyzing.</li>
<li><code>&lt;output_directory&gt;</code>: The directory where you want the output files to be saved.</li>
<li><code>&lt;min_length&gt;</code>&nbsp;and&nbsp;<code>&lt;max_length&gt;</code>: The minimum and maximum lengths of the repeats you want to find (in this case, 2 and 9).</li>
</ul></div><div><ol>
<li>Analyze the output: RepeatMasker will generate several output files, including a .out file. You can parse this file to extract the information you need. There is a Perl tool called "one_code_to_find_them_all.pl" that can help you parse RepeatMasker output files<a href="https://mobilednajournal.biomedcentral.com/articles/10.1186/1759-8753-5-13" target="_blank">[0]</a>. You can download it from the source provided.</li>
</ol></div><div><ol>
<li>Use the provided Perl script: Once you have the "one_code_to_find_them_all.pl" script, you can run it to conveniently parse the RepeatMasker output files. Here's an example of how to use it:</li>
</ol><blockquote><p>perl one_code_to_find_them_all.pl --rm &lt;RepeatMasker_out_file&gt; --length &lt;length_file&gt;</p></blockquote></div><p>&nbsp;</p></div><div><div><p>Replace&nbsp;<code>&lt;RepeatMasker_out_file&gt;</code>&nbsp;with the path to your RepeatMasker .out file, and&nbsp;<code>&lt;length_file&gt;</code>&nbsp;with the path to a file containing the lengths of the reference elements.</p></div><div><p>This script will generate several output files, including .log.txt and .copynumber.csv, which contain quantitative information about the identified repeat elements.</p></div><div><p>Remember to adjust the parameters and options according to your specific needs and the characteristics of your genome.</p></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44352/bioinformatics-tools-for-genome-assembly</guid>
	<pubDate>Mon, 24 Jul 2023 07:04:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44352/bioinformatics-tools-for-genome-assembly</link>
	<title><![CDATA[Bioinformatics tools for genome assembly !]]></title>
	<description><![CDATA[<p>There are numerous genome assembly tools available, each with its strengths and weaknesses. Here is a list of some widely used genome assembly tools as of my last update in September 2021:</p><ol>
<li>
<p><span>SPAdes:</span> An assembler specifically designed for single-cell and multi-cell bacterial genomes, as well as small eukaryotic genomes.</p>
</li>
<li>
<p><span>ABySS:</span> A parallelized assembler for large genomes that uses de Bruijn graphs.</p>
</li>
<li>
<p><span>Velvet:</span> Another de Bruijn graph-based assembler optimized for short-read sequencing data.</p>
</li>
<li>
<p><span>SOAPdenovo:</span> A de Bruijn graph-based assembler designed for short reads, widely used for assembling large and complex genomes.</p>
</li>
<li>
<p><span>MaSuRCA:</span> A hybrid assembler that combines data from multiple sequencing technologies, such as Illumina and PacBio.</p>
</li>
<li>
<p><span>Canu:</span> A long-read assembler optimized for PacBio and Oxford Nanopore sequencing data.</p>
</li>
<li>
<p><span>Flye:</span> A long-read assembler suitable for bacterial and small eukaryotic genomes.</p>
</li>
<li>
<p><span>SMARTdenovo:</span> An assembler designed for long reads, particularly suited for PacBio data.</p>
</li>
<li>
<p><span>SPAdes Long Read (SPAdesLR):</span> An extension of SPAdes for long-read data, such as those from PacBio or Nanopore.</p>
</li>
<li>
<p><span>Minia:</span> An assembler optimized for low memory consumption, suitable for small and medium-sized genomes.</p>
</li>
<li>
<p><span>Unicycler:</span> A hybrid assembler that combines short and long reads for circular bacterial genome assembly.</p>
</li>
<li>
<p><span>wtdbg2:</span> A de Bruijn graph assembler for long reads, efficient for very large genomes.</p>
</li>
<li>
<p><span>Shasta:</span> A long-read assembler that uses the Overlap-Layout-Consensus approach, suitable for PacBio and Nanopore data.</p>
</li>
<li>
<p><span>Sparc:</span> An assembler designed to handle noisy long reads from Nanopore sequencing.</p>
</li>
<li>
<p><span>CANA:</span> An assembler for metagenomic data, particularly for complex and diverse microbial communities.</p>
</li>
<li>
<p><span>Ra</span> Assembler: A metagenome assembler for long reads, designed for highly complex metagenomic samples.</p>
</li>
</ol><p>Please note that the field of bioinformatics is constantly evolving, and new assembly tools may have emerged since my last update. Additionally, the performance of these tools can vary depending on the characteristics of the sequencing data and the genome being assembled. When selecting an assembly tool, consider the specific requirements of your project, the available data types, and the computational resources at your disposal. Always refer to the respective tool's documentation and publications for the most up-to-date information and recommendations.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44284/tools-for-geospatial-data-analysis</guid>
	<pubDate>Wed, 22 Mar 2023 02:10:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44284/tools-for-geospatial-data-analysis</link>
	<title><![CDATA[Tools for Geospatial data analysis !]]></title>
	<description><![CDATA[<div><div><div><div><div><div><div><div><div><div><p>Geospatial data is becoming increasingly important in many fields, including urban planning, environmental science, public health, and more. These tools can help you work with data from a variety of sources, including satellite imagery, GPS data, and other forms of spatial data. They can help you visualize data, perform complex analysis, and even create maps and other visualizations.</p><p>The list includes some of the most popular and widely used geospatial tools available in Python. These tools can help you work with data from a variety of sources and in a variety of formats. Some of the tools are focused on visualization, such as Cartopy, Folium, and Contextily, which allow you to create interactive maps and other visualizations. Other tools are more focused on data manipulation and analysis, such as Fiona, GeoPandas, and Rasterio, which allow you to manipulate and analyze spatial data in a variety of ways.</p><p>The list also includes some tools for working with specific types of geospatial data. For example, the H3 library is designed specifically for working with hexagonal grids, while PySAL is focused on spatial econometrics and spatial analysis. Whether you are a data scientist, GIS specialist, or geospatial enthusiast, these tools are sure to enhance your work and help you achieve your goals.</p><p>In summary, this list is an excellent resource for anyone working with geospatial data in Python. It contains a wide range of tools for working with different types of data, and can help you visualize data, perform complex analysis, and create maps and other visualizations. If you're looking to enhance your skills in geospatial analysis, this list is definitely worth checking out.</p></div></div></div><div><p>These tools are:</p><ul>
<li>ArcGIS - <a href="https://lnkd.in/dgC6sKJH" target="_new">https://lnkd.in/dgC6sKJH</a></li>
<li>Cartopy - <a href="https://lnkd.in/dc8ijXRg" target="_new">https://lnkd.in/dc8ijXRg</a></li>
<li>Contextily - <a href="https://lnkd.in/dTdQsmKX" target="_new">https://lnkd.in/dTdQsmKX</a></li>
<li>Descartes - <a href="https://lnkd.in/dCJykxwW" target="_new">https://lnkd.in/dCJykxwW</a></li>
<li>Fiona - <a href="https://lnkd.in/d8sJ3Q5a" target="_new">https://lnkd.in/d8sJ3Q5a</a></li>
<li>Folium - <a href="https://lnkd.in/dfSsE-MB" target="_new">https://lnkd.in/dfSsE-MB</a></li>
<li>GDAL - <a href="https://lnkd.in/dYBJBaAY" target="_new">https://lnkd.in/dYBJBaAY</a></li>
<li>Geohash - <a href="https://lnkd.in/d_NxJ4_M" target="_new">https://lnkd.in/d_NxJ4_M</a></li>
<li>GeoJSON - <a href="https://lnkd.in/daGs2WYq" target="_new">https://lnkd.in/daGs2WYq</a></li>
<li>GeoPandas - <a href="https://lnkd.in/dBTFKKV3" target="_new">https://lnkd.in/dBTFKKV3</a></li>
<li>Geopy - <a href="https://lnkd.in/dfAzR8Xa" target="_new">https://lnkd.in/dfAzR8Xa</a></li>
<li>Gevent - <a href="http://www.gevent.org/" target="_new">http://www.gevent.org</a></li>
<li>H3 - <a href="https://h3geo.org/docs/" target="_new">https://h3geo.org/docs/</a></li>
<li>OSMnx - <a href="https://lnkd.in/dm3pHgUS" target="_new">https://lnkd.in/dm3pHgUS</a></li>
<li>PyQGIS - <a href="https://lnkd.in/dShWyWVr" target="_new">https://lnkd.in/dShWyWVr</a></li>
<li>PySAL - <a href="https://pysal.org/" target="_new">https://pysal.org</a></li>
<li>Pydeck - <a href="https://lnkd.in/dGBFu-iw" target="_new">https://lnkd.in/dGBFu-iw</a></li>
<li>Pyproj - <a href="https://lnkd.in/dNG9fdkm" target="_new">https://lnkd.in/dNG9fdkm</a></li>
<li>RTree - <a href="https://lnkd.in/dURMiYpU" target="_new">https://lnkd.in/dURMiYpU</a></li>
<li>Rasterio - <a href="https://lnkd.in/dEMC6ve6" target="_new">https://lnkd.in/dEMC6ve6</a></li>
<li>Scikit-mobility - <a href="https://lnkd.in/dpHhaX2J" target="_new">https://lnkd.in/dpHhaX2J</a></li>
<li>Shapely - <a href="https://lnkd.in/d568datK" target="_new">https://lnkd.in/d568datK</a></li>
</ul><p>These tools offer a wide range of capabilities for working with geospatial data, from visualizing and manipulating data to performing complex analysis and modeling. Whether you are a data scientist, GIS specialist, or geospatial enthusiast, these tools are sure to enhance your work and help you achieve your goals.</p></div></div></div></div></div></div></div></div>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/44236/type-of-ssr</guid>
	<pubDate>Thu, 09 Mar 2023 04:35:41 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/44236/type-of-ssr</link>
	<title><![CDATA[Type of SSR]]></title>
	<description><![CDATA[<div><div><div><div><div><div><div><div><div><div><p>Types of SSRs (simple sequence repeats), SSRs are short DNA sequences consisting of a tandem repeat of a few nucleotides, typically 2-6 nucleotides in length. There are different types of SSRs based on the length and pattern of the repeated sequence, as well as the presence or absence of interruptions of non-repeated nucleotides within the repeat array. The four types of SSRs are:</p><ol>
<li>
<p>Perfect SSR: This is the simplest type of SSR, where the same repeat motif is present adjacent to each other without any interruption of any other nucleotide. For example, a perfect SSR with the repeat motif "CAT" would be "CATCATCATCAT", where the "CAT" sequence is repeated four times.</p>
</li>
<li>
<p>Imperfect SSR: This type of SSR contains repeat motifs that are interrupted by one or a few non-repeat nucleotides. For example, an imperfect SSR with the repeat motif "CAT" would be "CATCATGGCATCATCAT", where the "CAT" sequence is repeated twice, but interrupted by "GG".</p>
</li>
<li>
<p>Compound perfect SSR: This type of SSR contains two or more repeat motifs lying adjacent to each other, separated by no or very few intervening nucleotides. For example, a compound perfect SSR with the repeat motifs "CAT" and "GTC" would be "CATCATCATGTCGTC", where the "CAT" sequence is repeated three times, followed by the "GTC" sequence repeated twice.</p>
</li>
<li>
<p>Compound imperfect SSR: This type of SSR contains two or more repeat motifs interrupted by several non-repeat nucleotides. For example, a compound imperfect SSR with the repeat motifs "CAT" and "GTC" would be "CATCATCATNNNNNNNGTCGTCGTC", where the "CAT" sequence is repeated three times, interrupted by several non-repeat nucleotides, followed by the "GTC" sequence repeated three times.</p>
</li>
</ol></div></div></div></div></div></div></div></div></div></div>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/43977/read-simulators</guid>
	<pubDate>Fri, 30 Sep 2022 06:48:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/43977/read-simulators</link>
	<title><![CDATA[Read Simulators]]></title>
	<description><![CDATA[<h1>Short Read Simulators</h1><p>With the popularity of next-generation sequencing (NGS) technologies, many NGS read simulators have been developed. Currently, many of the popular short read simulators are designed to simulate reads mimicking many Illumina, 454 and SOLiD platforms. Listed below are some popular short read simulators. Links to their publications are provided as well.</p><ol>
<li><a href="https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0003373" target="_blank">MetaSim</a></li>
<li><a href="https://github.com/lh3/wgsim" target="_blank">wgsim</a></li>
<li><a href="https://github.com/timmassingham/simNGS" target="_blank">SimNGS</a></li>
<li><a href="https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0049110" target="_blank">ArtificialFastqGenerator</a></li>
<li id="e943"><a href="https://academic.oup.com/bioinformatics/article/35/3/521/5055123" target="_blank">InSilicoSeq</a></li>
</ol><h1>Long Read Simulators</h1><p id="d469">With the advancements in sequencing technologies, scientists have shown an increasing interest in using third-generation sequencing (TGS) technologies. Currently, many of the popular long read simulators are designed to simulate reads mimicking the two main TGS technologies; (1)&nbsp;<em>Pacific Biosciences (PacBio)</em>&nbsp;and (2)&nbsp;<em>Oxford Nanopore (ONT)</em>. Listed below are some of the popular and recently introduced PacBio and ONT simulators. Links to their publications are provided as well.</p><h2><span>PacBio Simulators</span></h2><ol>
<li><a href="https://academic.oup.com/bioinformatics/article/29/1/119/273243" target="_blank">PBSIM</a></li>
<li><a href="https://academic.oup.com/bioinformatics/article/32/24/3829/2525710" target="_blank">LongISLND</a></li>
<li><a href="https://academic.oup.com/bioinformatics/article/32/17/2704/2450740" target="_blank">SimLoRD</a></li>
<li><a href="https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2208-0" target="_blank">NPBSS</a></li>
<li id="fed0"><a href="https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-2901-7" target="_blank">PaSS</a></li>
</ol><h2><span>ONT Simulators</span></h2><ol>
<li id="f145"><a href="https://academic.oup.com/gigascience/article/6/4/gix010/3051934" target="_blank">NanoSim</a></li>
<li id="c6f5"><a href="https://ieeexplore.ieee.org/document/8621253" target="_blank">Nanopore SimulatION</a></li>
<li><a href="https://academic.oup.com/bioinformatics/article/34/17/2899/4962495" target="_blank">DeepSimulator</a></li>
<li><a href="https://academic.oup.com/bioinformatics/article/36/8/2578/5698265" target="_blank">DeepSimulator1.5</a></li>
</ol>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>

</channel>
</rss>