<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/44930?offset=40</link>
	<atom:link href="https://bioinformaticsonline.com/related/44930?offset=40" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/7986/list-of-bioinformatics-open-source-projectssoftware</guid>
	<pubDate>Tue, 21 Jan 2014 14:28:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/7986/list-of-bioinformatics-open-source-projectssoftware</link>
	<title><![CDATA[List of bioinformatics open source projects/software.]]></title>
	<description><![CDATA[<p>Open source software is software that can be freely used, changed, and shared (in modified or unmodified form) by anyone. Open source software is made by many people, and distributed under licenses that comply with the Open Source Definition.The Open Source Initiative (OSI) is a global non-profit that supports and promotes the open source movement. Followings are the OS bioinformatics projects/software :</p><p><strong>.NET Bio</strong></p><p>http://blogs.msdn.com/b/msr_er/archive/2011/10/18/microsoft-biology-foundation-evolves-into-new-toolkit-net-bio.aspx</p><p>A language-neutral bioinformatics toolkit built using the Microsoft 4.0 .NET Framework to help developers, researchers, and scientists.</p><p><strong>AMPHORA</strong> ("AutoMated Phylogenomic infeRence Application")</p><p>http://wolbachia.biology.virginia.edu/WuLab/Software.html</p><p><a href="http://en.wikipedia.org/wiki/Metagenomics" title="Metagenomics">Metagenomics</a> analysis software</p><p><strong>Anduril</strong></p><p>http://www.anduril.org/anduril/site/</p><p>Component-based <a href="http://en.wikipedia.org/wiki/Workflow" title="Workflow">workflow</a> framework for data analysis</p><p>Armadillo workflow platform</p><p>Tool for designing and executing phylogenetic workflows</p><p><strong>AutoDock</strong></p><p>http://autodock.scripps.edu/</p><p>suite of automated docking tools</p><p><strong>Biochemical Algorithms Library (BALL)</strong></p><p>http://www.ball-project.org/</p><p>C++ library and framework for molecular modeling and visualization designed for rapid prototyping</p><p><strong>Bio4j</strong></p><p>http://bio4j.com/</p><p>Bio4j is a <a href="http://en.wikipedia.org/wiki/Bioinformatics" title="Bioinformatics">bioinformatics</a> platform and <a href="http://en.wikipedia.org/wiki/Chart" title="Chart">graph</a> based <a href="http://en.wikipedia.org/wiki/Database" title="Database">database</a> built around most data available in <a href="http://en.wikipedia.org/wiki/UniProt" title="UniProt">UniProt</a> KB(<a href="http://en.wikipedia.org/wiki/Swiss-Prot" title="Swiss-Prot">Swiss-Prot</a> + <a href="http://en.wikipedia.org/wiki/TrEMBL" title="TrEMBL">TrEMBL</a>), <a href="http://en.wikipedia.org/wiki/Gene_Ontology" title="Gene Ontology">Gene Ontology</a> (GO), <a href="http://en.wikipedia.org/w/index.php?title=UniRef&amp;action=edit&amp;redlink=1" title="UniRef (page does not exist)">UniRef</a> (50,90,100), <a href="http://en.wikipedia.org/wiki/RefSeq" title="RefSeq">RefSeq</a>, <a href="http://en.wikipedia.org/wiki/National_Center_for_Biotechnology_Information" title="National Center for Biotechnology Information">NCBI</a> taxonomy, and Expasy Enzyme DB</p><p><strong>Bioclipse</strong></p><p>www.bioclipse.net</p><p>Visual platform for <a href="http://en.wikipedia.org/wiki/Cheminformatics" title="Cheminformatics">chemo</a>- and <a href="http://en.wikipedia.org/wiki/Bioinformatics" title="Bioinformatics">bioinformatics</a> based on the <a href="http://en.wikipedia.org/wiki/Eclipse_%28software%29" title="Eclipse (software)">Eclipse</a> Rich Client Platform (RCP).</p><p><strong>Bioconductor</strong></p><p>http://www.bioconductor.org/</p><p><a href="http://en.wikipedia.org/wiki/R_%28programming_language%29" title="R (programming language)">R (programming language)</a> language toolkit</p><p><strong>Bioinformatics Learning Tutorial (BLT)</strong></p><p>http://sourceforge.net/projects/biotutorial/</p><p>Educational <a href="http://en.wikipedia.org/wiki/Interactive_tutorials" title="Interactive tutorials">interactive tutorials</a> and 3D animations for Replication, Transcription, and Translation</p><p><strong>BioHaskell</strong></p><p>http://biohaskell.org/</p><p><a href="http://en.wikipedia.org/wiki/Haskell_%28programming_language%29" title="Haskell (programming language)">Haskell (programming language)</a></p><p><strong>BioJava</strong></p><p>http://biojava.org/wiki/Main_Page</p><p><a href="http://en.wikipedia.org/wiki/Java_%28programming_language%29" title="Java (programming language)">Java (programming language)</a></p><p><strong>BioMOBY</strong></p><p>http://biomoby.org/</p><p>registry of <a href="http://en.wikipedia.org/wiki/Web_services" title="Web services">web services</a></p><p><strong>BioPerl</strong></p><p>http://www.bioperl.org/wiki/Main_Page</p><p><a href="http://en.wikipedia.org/wiki/Perl" title="Perl">Perl</a> language toolkit</p><p><strong>BioPHP</strong></p><p>http://www.biophp.org/</p><p><a href="http://en.wikipedia.org/wiki/PHP" title="PHP">PHP</a> language toolkit</p><p><strong>Biopython</strong></p><p>http://biopython.org/wiki/Main_Page</p><p><a href="http://en.wikipedia.org/wiki/Python_%28programming_language%29" title="Python (programming language)">Python</a> language toolkit</p><p><strong>BioRails</strong></p><p>https://github.com/biorails</p><p>a <a href="http://en.wikipedia.org/wiki/Data_management_system" title="Data management system">data management system</a> designed to support researchers in <a href="http://en.wikipedia.org/wiki/Drug_discovery" title="Drug discovery">drug discovery</a></p><p><strong>BioRuby</strong></p><p>http://bioruby.org/</p><p><a href="http://en.wikipedia.org/wiki/Ruby_%28programming_language%29" title="Ruby (programming language)">Ruby</a> language toolkit</p><p><strong>BioSmalltalk</strong></p><p>https://code.google.com/p/biosmalltalk/</p><p><a href="http://en.wikipedia.org/wiki/Smalltalk_%28programming_language%29" title="Smalltalk (programming language)">Smalltalk</a> language toolkit</p><p><strong>BioUno</strong></p><p>http://www.biouno.org/</p><p><a href="http://en.wikipedia.org/w/index.php?title=BioUno&amp;action=edit&amp;redlink=1" title="BioUno (page does not exist)">BioUno</a> is a project that applies <a href="http://en.wikipedia.org/wiki/Continuous_Integration" title="Continuous Integration">Continuous Integration</a> tools and techniques in <a href="http://en.wikipedia.org/wiki/Bioinformatics" title="Bioinformatics">Bioinformatics</a>. It uses <a href="http://en.wikipedia.org/wiki/Jenkins_%28software%29" title="Jenkins (software)">Jenkins</a> and its plug-in API to create <a href="http://en.wikipedia.org/wiki/Bioinformatics_workflow_management_system" title="Bioinformatics workflow management system">biology workflows</a> and manage <a href="http://en.wikipedia.org/wiki/Computer_clusters" title="Computer clusters">computer clusters</a>.</p><p><strong>caCORE</strong></p><p>&nbsp;</p><p>ontologic representation environment</p><p><strong>caArray</strong></p><p>https://cabig-stage.nci.nih.gov/community/tools/caArray</p><p>ontologic representation environment</p><p><strong>EMBOSS</strong></p><p>http://emboss.sourceforge.net/</p><p>Suite of packages for sequencing, searching, etc.</p><p><strong>Gaggle</strong></p><p>https://www.gaggle.net/</p><p>A framework for interoperability between systems biology software</p><p><strong>Galaxy</strong></p><p>http://galaxyproject.org/</p><p><a href="http://en.wikipedia.org/wiki/Scientific_workflow_system" title="Scientific workflow system">Scientific workflow</a> and <a href="http://en.wikipedia.org/wiki/Data_integration" title="Data integration">data integration</a> system</p><p><strong>GenePattern</strong></p><p>http://www.broadinstitute.org/cancer/software/genepattern/</p><p><a href="http://en.wikipedia.org/wiki/Scientific_workflow_system" title="Scientific workflow system">Scientific workflow system</a> that provides access to more than 150 genomic analysis tools</p><p><strong>GeWorkbench</strong></p><p>http://wiki.c2b2.columbia.edu/workbench/index.php/Home</p><p>Genomic <a href="http://en.wikipedia.org/wiki/Data_integration" title="Data integration">data integration</a> platform</p><p><strong>GMOD</strong></p><p>http://www.gmod.org/wiki/Main_Page</p><p>Toolkit for addressing many common challenges at biological databases.</p><p><strong>GeneProf</strong></p><p>http://www.geneprof.org/GeneProf/</p><p>A web-based, bioinformatics software suite for the analysis of functional genomics experiments, e.g. RNA-seq or ChIP-seq.</p><p><strong>GeneTalk</strong></p><p>http://www.gene-talk.de/</p><p>Tool for filtering sequence variants in <a href="http://en.wikipedia.org/wiki/Variant_Call_Format" title="Variant Call Format">VCF</a> files. Network for scientists and clinicians for expertise and knowledge exchange. Database of annotations aboute sequence variants with clinically relevant information.</p><p><strong>GenGIS</strong></p><p>http://kiwi.cs.dal.ca/GenGIS/Main_Page</p><p>Application that allows users to combine digital map data with information about biological sequences collected from the environment.</p><p><strong>GenomeSpace</strong></p><p>http://www.genomespace.org/</p><p>Centralized web application that provides data format transformations and facilitates connections with other bioinformatics tools</p><p><strong>GENtle</strong></p><p>http://directory.fsf.org/wiki/GENtle</p><p>An equivalent to the proprietary <a href="http://en.wikipedia.org/wiki/Vector_NTI" title="Vector NTI">Vector NTI</a>, a tool to analyze and edit <a href="http://en.wikipedia.org/wiki/DNA" title="DNA">DNA</a> sequence files</p><p><strong>Integrated Genome Browser</strong></p><p>http://bioviz.org/igb/</p><p><a href="http://en.wikipedia.org/wiki/Java_%28software_platform%29" title="Java (software platform)">Java</a>-based desktop <a href="http://en.wikipedia.org/wiki/Genome_browser" title="Genome browser">genome browser</a></p><p><strong>Integrative Genomics Viewer (IGV)</strong></p><p>http://www.broadinstitute.org/igv/</p><p>High-performance desktop tool for interactive visual exploration of diverse genomic data</p><p><strong>IntAct</strong></p><p>http://www.ebi.ac.uk/intact/</p><p>molecular interaction database</p><p><strong>InterMine</strong></p><p>http://intermine.github.io/intermine.org/</p><p>Extensive data warehouse system for the analysis and integration of biological datasets</p><p><strong>Java Treeview</strong></p><p>http://jtreeview.sourceforge.net/</p><p>microarray data viewer</p><p><strong>LabKey Server</strong></p><p>http://labkey.com/</p><p>platform for integrating, analyzing and sharing data</p><p><strong>OpenClinica</strong></p><p>https://www.openclinica.com/</p><p>software for capturing and managing data in clinical trials</p><p><a href="http://www.biomedcentral.com/1471-2164/13/512">PromKappa</a></p><p>http://xbioinformatics.wordpress.com/tag/promkappa/</p><p>PromKappa (Promoter analysis by Kappa) software program used for promoter pattern generation and promoter analysis.</p><p><strong>MeV: Multi-Experiment Viewer</strong></p><p>http://www.tm4.org/mev.html</p><p>a desktop application for the analysis, visualization and data-mining of large-scale genomic data</p><p><strong>PathVisio</strong></p><p>http://www.pathvisio.org/</p><p>a desktop software for drawing, analysis and visualization of biological pathways</p><p>REDCRAFT</p><p>software for determining tertiary protein structure given assigned Residual Dipolar Coupling data</p><p>SAM Tools</p><p>Data format (SAM) and accompanying tool suite, for storing large nucleotide sequence alignments</p><p><a href="http://en.wikipedia.org/wiki/Staden_Package" title="Staden Package">Staden Package</a></p><p>Sequence assembly, editing and analysis, primarily consisting of gap4, gap5 and spin.</p><p><a href="http://en.wikipedia.org/wiki/STAMP" title="STAMP">STAMP</a></p><p>Software package for analyzing metagenomic profiles that promotes &lsquo;best practices&rsquo; in choosing appropriate statistical techniques and reporting results.</p><p><a href="http://supfam.org/supraHex">supraHex</a></p><p>An open-source R/Bioconductor package for omics data analysis using a supra-hexagonal map</p><p><a href="http://en.wikipedia.org/wiki/Taverna_workbench" title="Taverna workbench">Taverna workbench</a></p><p>Tool for designing and executing workflows</p><p>TGAC Browser</p><p>Genome Browser, visualisation solutions for big data in the genomic era</p><p>T-REX WebServer</p><p>Bioinformatics and phylogenetics webserver (NJ, PhyML, RAxML, MAFFT, MUSCLE, Newick viewer, <a href="http://en.wikipedia.org/wiki/Horizontal_gene_transfer" title="Horizontal gene transfer">Horizontal gene transfer</a> detection, Reticulograms, Substitution models)</p><p><a href="http://en.wikipedia.org/wiki/UGENE" title="UGENE">UGENE</a></p><p>integrated bioinformatics tools</p><p>Visomics</p><p>bioinformatics tools for omics data</p><p>Genome Analysis Toolkit 1.0 (GATK 1.0)</p><p>a software package to analyse next-generation resequencing data</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/42974/list-of-bioinformatics-packages-for-ngs-analysis</guid>
	<pubDate>Sat, 20 Mar 2021 00:28:51 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/42974/list-of-bioinformatics-packages-for-ngs-analysis</link>
	<title><![CDATA[List of bioinformatics packages for NGS analysis !]]></title>
	<description><![CDATA[<p>Package suites gather software packages and installation tools for specific languages or platforms. We have some for bioinformatics software.</p><ul>
<li><a href="https://github.com/Bioconductor">Bioconductor</a>&nbsp;&ndash; A plethora of tools for analysis and comprehension of high-throughput genomic data, including 1500+ software packages. [&nbsp;<a href="https://link.springer.com/article/10.1186/gb-2004-5-10-r80">paper-2004</a>&nbsp;|&nbsp;<a href="https://www.bioconductor.org/">web</a>&nbsp;]</li>
<li><a href="https://github.com/biopython/biopython">Biopython</a>&nbsp;&ndash; Freely available tools for biological computing in Python, with included cookbook, packaging and thorough documentation. Part of the&nbsp;<a href="http://open-bio.org/">Open Bioinformatics Foundation</a>. Contains the very useful&nbsp;<a href="https://biopython.org/DIST/docs/api/Bio.Entrez-module.html">Entrez</a>&nbsp;package for API access to the NCBI databases. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/19304878">paper-2009</a>&nbsp;|&nbsp;<a href="https://biopython.org/">web</a>&nbsp;]</li>
<li><a href="https://github.com/bioconda">Bioconda</a>&nbsp;&ndash; A channel for the&nbsp;<a href="http://conda.pydata.org/docs/intro.html">conda package manager</a>&nbsp;specializing in bioinformatics software. Includes a repository with 3000+ ready-to-install (with&nbsp;<code>conda install</code>) bioinformatics packages. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/29967506">paper-2018</a>&nbsp;|&nbsp;<a href="https://bioconda.github.io/">web</a>&nbsp;]</li>
<li><a href="https://github.com/BioJulia">BioJulia</a>&nbsp;&ndash; Bioinformatics and computational biology infastructure for the Julia programming language. [&nbsp;<a href="https://biojulia.net/">web</a>&nbsp;]</li>
<li><a href="https://github.com/rust-bio/rust-bio">Rust-Bio</a>&nbsp;&ndash; Rust implementations of algorithms and data structures useful for bioinformatics. [&nbsp;<a href="http://bioinformatics.oxfordjournals.org/content/early/2015/10/06/bioinformatics.btv573.short?rss=1">paper-2016</a>&nbsp;]</li>
<li><a href="https://github.com/seqan/seqan3">SeqAn</a>&nbsp;&ndash; The modern C++ library for sequence analysis.</li>
</ul>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/8174/the-2014-cemm-phd-program</guid>
  <pubDate>Wed, 05 Feb 2014 06:03:15 -0600</pubDate>
  <link></link>
  <title><![CDATA[The 2014 CeMM PhD Program]]></title>
  <description><![CDATA[
<p>For our next PhD Program starting in October 2014 we are looking for exceptionally motivated PhD candidates with a keen interest in genomics and medicine and a strong interest to work in teams.</p>

<p>The 2014 CeMM PhD Program will focus on two thematic areas: INFECTION and CANCER, that are built on the pillars of epigenetics, bioinformatics and systems biology, chemical biology and the mechanism of action of drugs, high-throughput genetics, genomics and proteomics, and molecular and cell biology.</p>

<p>The choice of this strategic focus rests on the synergies between immunology, infection and cancer in pathophysiological and technological terms. It furthermore reflects the strength of the current CeMM faculty, itself built around the historical and contemporary expertise in immunology and cancer of the Medical University of Vienna.</p>

<p>As a CeMM PhD student you will get the chance to work at the cutting edge of interdisciplinary molecular medicine research and be trained by the entire CeMM and associated faculty to become one of the scientists shaping the future of molecular medicine.<br />Requirements</p>

<p>To be eligible to enroll in the CeMM PhD Program all candidates are required to have a bachelor’s or master’s degree in medicine, biology, chemistry, bioinformatics, mathematics or any scientific/technical, subject-relevant degree. Candidates do not need to have completed their degree at the time of application, however they must have obtained their final degree certificate by mid-September. The working language at CeMM is English, so excellent written and oral communication skills in English are required.<br />Timeline</p>

<p>    Applications open on 20th January and close on 20th March 2014.<br />    Two references are required to be submitted through the online system by 31st March 2014.<br />    All complete candidate applications are reviewed by the CeMM Faculty in early April.<br />    Selected candidates are invited to a Skype panel interview in late April.<br />    Shortlisted candidates are then invited to Vienna in May for a full interview process, including an opportunity to introduce yourself through a presentation and interview rounds, meet research group members, and attend an informal dinner to get to know the Faculty members and learn more about their research.<br />    Positions are offered by CeMM Faculty in June.<br />    Start of PhD Program: 1st October 2014 .</p>

<p>Contact</p>

<p>Binia Maria Günther, BEd BA<br />Human Resources Manager<br />bguenther@cemm.oeaw.ac.at</p>

<p>Catherine Lloyd, Ph.D.<br />PhD and Postdoc Program Manager<br />clloyd@cemm.oeaw.ac.at</p>

<p>More Info: www.cemm.oeaw.ac.at/phd-program/application/</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/23121/senior-sas-programmer-urgent-role-permanant-welwyn-garden-city-uk</guid>
  <pubDate>Fri, 03 Jul 2015 08:14:23 -0500</pubDate>
  <link></link>
  <title><![CDATA[Senior SAS Programmer - URGENT ROLE - Permanant - Welwyn Garden City - UK]]></title>
  <description><![CDATA[
<p>SAS Programmer URGENTLY required !! My client is looking for an experienced Senior SAS Programmer, to join their bubbly dynamic team in Welwyn Garden City. You must have experience within SAS and/or R programming language. I am looking for someone with a background within either Life Sciences, Statistics, Computer Science, Bioinformatics etc. I am looking for someone with leadership qualities, you must have excellent analyst skills. Please call Dareen Evans on 01772 278050 or email your cv to dareen.evans@itworkshealth.co.uk</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/8123/jrf-manit</guid>
  <pubDate>Sun, 02 Feb 2014 03:07:58 -0600</pubDate>
  <link></link>
  <title><![CDATA[JRF @ MANIT]]></title>
  <description><![CDATA[
<p>MAULANA AZAD NATIONAL INSTITUTE OF TECHNOLOGY BHOPAL</p>

<p>No. CSE/14/1038</p>

<p>Walk in Interview for the post of JRF under TEQIP-II</p>

<p>SN Department – Qualification Post Graduation – Time</p>

<p>1 Bio-Informatics &amp; Mathematics M.Tech Bio-informatics/M.Sc.* Maths  10.00 AM</p>

<p>2 Biological Sciences M.Sc.* in any branch of Biological Sciences 10.30 AM</p>

<p>3 Chemical Engineering M.Tech Chemical Engineering 11.00 AM</p>

<p>4 Chemistry M.Sc.* Chemistry 11.30 AM</p>

<p>5 Civil Engineering M.Tech Structure/GeoTech. /Water -Resources/Hydraulics/Environment/Transport 12.00 Noon</p>

<p>6 GIS M.Tech GIS/Civil 12.30 PM</p>

<p>7 Computer Science &amp; Engineering M.Tech CSE/Information Security 01.00 PM</p>

<p>8 Electrical Engineering M.Tech Electrical Derives 01.30 PM</p>

<p>9 Electronics &amp; Communication M.Tech Digital Communication 02.00 PM</p>

<p>10 MSME M.Tech Material Science/ Mechanical/Metallurgy 02.30 PM</p>

<p>11 Physics M.Sc.* Physics 03.00 PM</p>

<p>* M.Sc. with NET/GATE qualified</p>

<p>Resume along with one passport size photograph and relevant documents are required at the time of interview</p>

<p>Amount of Fellowship: Rs 18000/-month+ HRA</p>

<p>Duration: 31st Dec 2014 (End of TEQIP-II project)</p>

<p>Date of Interview: 7th  February 2014</p>

<p>Venue Institute Committee Room</p>

<p>Advertisement:</p>

<p>http://www.manit.ac.in/manitbhopal/Year2014/Recruitment/Advertisement%20JRF.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/34552/edit-distance-application-in-bioinformatics</guid>
	<pubDate>Thu, 07 Dec 2017 08:46:51 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/34552/edit-distance-application-in-bioinformatics</link>
	<title><![CDATA[Edit distance application in bioinformatics !]]></title>
	<description><![CDATA[<p>There are other popular measures of&nbsp;<a href="https://en.wikipedia.org/wiki/Edit_distance" title="Edit distance">edit distance</a>, which are calculated using a different set of allowable edit operations. For instance,</p><ul>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance" title="Damerau&ndash;Levenshtein distance">Damerau&ndash;Levenshtein distance</a>&nbsp;allows insertion, deletion, substitution, and the&nbsp;<a href="https://en.wikipedia.org/wiki/Transposition_(mathematics)" title="Transposition (mathematics)">transposition</a>&nbsp;of two adjacent characters;</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Longest_common_subsequence_problem" title="Longest common subsequence problem">longest common subsequence</a>&nbsp;(LCS) distance allows only insertion and deletion, not substitution;</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Hamming_distance" title="Hamming distance">Hamming distance</a>&nbsp;allows only substitution, hence, it only applies to strings of the same length.</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Jaro_distance" title="Jaro distance">Jaro distance</a>&nbsp;allows only&nbsp;<a href="https://en.wikipedia.org/wiki/Transposition_(mathematics)" title="Transposition (mathematics)">transposition</a>.</li>
</ul><p>&nbsp;</p><pre><span>use</span> Text<span>::</span>Levenshtein <span>qw</span><span>(</span>distance<span>);</span>

 <span>print</span> <span>distance</span><span>(</span><span>"foo"</span><span>,</span><span>"four"</span><span>);</span>
 <span># prints "2"</span>

 <span>my</span> <span>@words</span>     <span>=</span> <span>qw</span><span>/ four foo bar /</span><span>;</span>
 <span>my</span> <span>@distances</span> <span>=</span> <span>distance</span><span>(</span><span>"foo"</span><span>,</span><span>@words</span><span>);</span>

 <span>print</span> <span>"@distances"</span><span>;</span>
 <span># prints "2 0 3"</span><br /><br /><br /></pre><pre><span>use</span> Algorithm<span>::</span>LCSS <span>qw</span><span>(</span> LCSS CSS CSS_Sorted <span>);</span>
    <span>my</span> <span>$lcss_ary_ref</span> <span>=</span> <span>LCSS</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>  <span># ref to array</span>
    <span>my</span> <span>$lcss_string</span>  <span>=</span> <span>LCSS</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>    <span># string</span>
    <span>my</span> <span>$css_ary_ref</span> <span>=</span> <span>CSS</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>    <span># ref to array of arrays</span>
    <span>my</span> <span>$css_str_ref</span> <span>=</span> <span>CSS</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>      <span># ref to array of strings</span>
    <span>my</span> <span>$css_ary_ref</span> <span>=</span> <span>CSS_Sorted</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>  <span># ref to array of arrays</span>
    <span>my</span> <span>$css_str_ref</span> <span>=</span> <span>CSS_Sorted</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>    <span># ref to array of strings<br /><br /><br /><br /></span></pre><p>There are many different modules on CPAN for calculating the edit distance between two strings. Here's just a selection.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshteinXS">Text::LevenshteinXS</a>&nbsp;and&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3AXS">Text::Levenshtein::XS</a>&nbsp;are both versions of the Levenshtein algorithm that require a C compiler, but will be a lot faster than this module.</p><p>The Damerau-Levenshtein edit distance is like the Levenshtein distance, but in addition to insertion, deletion and substitution, it also considers the transposition of two adjacent characters to be a single edit. The module&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3ADamerau">Text::Levenshtein::Damerau</a>&nbsp;defaults to using a pure perl implementation, but if you've installed&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3ADamerau%3A%3AXS">Text::Levenshtein::Damerau::XS</a>&nbsp;then it will be a lot quicker.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3AWagnerFischer">Text::WagnerFischer</a>&nbsp;is an implementation of the Wagner-Fischer edit distance, which is similar to the Levenshtein, but applies different weights to each edit type.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ABrew">Text::Brew</a>&nbsp;is an implementation of the Brew edit distance, which is another algorithm based on edit weights.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3AFuzzy">Text::Fuzzy</a>&nbsp;provides a number of operations for partial or fuzzy matching of text based on edit distance.&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3AFuzzy%3A%3APP">Text::Fuzzy::PP</a>&nbsp;is a pure perl implementation of the same interface.</p><p><a href="http://search.cpan.org/perldoc?String%3A%3ASimilarity">String::Similarity</a>&nbsp;takes two strings and returns a value between 0 (meaning entirely different) and 1 (meaning identical). Apparently based on edit distance.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ADice">Text::Dice</a>&nbsp;calculates&nbsp;<a href="https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient">Dice's coefficient</a>&nbsp;for two strings. This formula was originally developed to measure the similarity of two different populations in ecological research.</p><pre><span>&nbsp;</span></pre>]]></description>
	<dc:creator>Neel</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/8287/post-doc-in-computational-genetics-and-genomics-at-ceinge-biotecnologie-avanzate-naples-italy</guid>
  <pubDate>Tue, 11 Feb 2014 08:06:47 -0600</pubDate>
  <link></link>
  <title><![CDATA[Post doc in Computational Genetics and Genomics at CEINGE Biotecnologie Avanzate, Naples, Italy]]></title>
  <description><![CDATA[
<p>We are seeking one motivated scientist to analyze genomics and transcriptomics data of a large collection of neuroblastoma tumors. The successful candidate will be part of a team of researchers with extensive expertise in genome cancer study. He/she will be involved in the analysis of DNA-seq, RNA-seq, ChIP-seq data using available methods running in R and UNIX environment.</p>

<p>Qualifications</p>

<p>PhD or Post-Graduated Master degree is required. Successful candidates will have some expertise in data analysis of NGS data by using methods running in R and UNIX environment. Familiarity with genome databases and browsers is required.</p>

<p>Application</p>

<p>Candidates should send a CV and a brief personal statement focusing on their skills and interests related to the research project.</p>

<p>Contacts</p>

<p>Start date: 1° April 2014<br />Salary on grant: 25,000 euros per year.<br />Contact Person (Referent): Mario Capasso<br />Ref. Email: mario.capasso@unina.it and achille.iolascon@unina.it<br />Tel: +39 081 3737889</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/35534/awk-for-bioinformatician-and-computational-biologist</guid>
	<pubDate>Tue, 06 Feb 2018 14:54:35 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/35534/awk-for-bioinformatician-and-computational-biologist</link>
	<title><![CDATA[Awk for Bioinformatician and computational biologist]]></title>
	<description><![CDATA[<p>Awk is a programming language which allows easy manipulation of structured data and is mostly used for pattern scanning and processing. It searches one or more files to see if they contain lines that match with the specified patterns and then perform associated actions. The basic syntax is:</p><blockquote><p><br />awk '/pattern1/ {Actions}<br /> /pattern2/ {Actions}' file</p></blockquote><p><br />The working of Awk is as follows<br />Awk reads the input files one line at a time.<br />For each line, it matches with given pattern in the given order, if matches performs the corresponding action.<br />If no pattern matches, no action will be performed.<br />In the above syntax, either search pattern or action are optional, But not both.<br />If the search pattern is not given, then Awk performs the given actions for each line of the input.<br />If the action is not given, print all that lines that matches with the given patterns which is the default action.<br />Empty braces with out any action does nothing. It wont perform default printing operation.<br />Each statement in Actions should be delimited by semicolon.<br />Say you have data.tsv with the following contents:</p><p><br />$ cat data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />By default Awk prints every line from the file.</p><p><br />$ awk '{print;}' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />We print the line which matches the pattern contig3</p><p><br />$ awk '/contig3/' data/test.tsv<br />contig3 ACTTATATATATATA<br />Awk has number of builtin variables. For each record i.e line, it splits the record delimited by whitespace character by default and stores it in the $n variables. If the line has 5 words, it will be stored in $1, $2, $3, $4 and $5. $0 represents the whole line. NF is a builtin variable which represents the total number of fields in a record.</p><p><br />$ awk '{print $1","$2;}' data/test.tsv<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT</p><p>$ awk '{print $1","$NF;}' data/test.tsv<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT</p><p><br />Awk has two important patterns which are specified by the keyword called BEGIN and END. The syntax is as follows:</p><blockquote><p>BEGIN { Actions before reading the file}<br />{Actions for everyline in the file} <br />END { Actions after reading the file }</p></blockquote><p><br />For example,<br />$ awk 'BEGIN{print "Header,Sequence"}{print $1","$2;}END{print "-------"}' data/test.tsv<br />Header,Sequence<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT<br />------- <br />We can also use the concept of a conditional operator in print statement of the form print CONDITION ? PRINT_IF_TRUE_TEXT : PRINT_IF_FALSE_TEXT. For example, in the code below, we identify sequences with lengths &gt; 14:</p><p>$ awk '{print (length($2)&gt;14) ? $0"&gt;14" : $0"&lt;=14";}' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG&gt;14<br />contig2 ACTTTATATATT&lt;=14<br />contig3 ACTTATATATATATA&gt;14<br />contig4 ACTTATATATATATA&gt;14<br />contig5 ACTTTATATATT&lt;=14<br />We can also use 1 after the last block {} to print everything (1 is a shorthand notation for {print $0} which becomes {print} as without any argument print will print $0 by default), and within this block, we can change $0, for example to assign the first field to $0 for third line (NR==3), we can use:</p><p>$ awk 'NR==3{$0=$1}1' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT<br />You can have as many blocks as you want and they will be executed on each line in the order they appear, for example, if we want to print $1 three times (here we are using printf instead of print as the former doesn't put end-of-line character),</p><p>$ awk '{printf $1"\t"}{printf $1"\t"}{print $1}' data/test.tsv<br />contig1 contig1 contig1<br />contig2 contig2 contig2<br />contig3 contig3 contig3<br />contig4 contig4 contig4<br />contig5 contig5 contig5 <br />Although, we can also skip executing later blocks for a given line by using next keyword:</p><p>$ awk '{printf $1"\t"}NR==3{print "";next}{print $1}' data/test.tsv<br />contig1 contig1<br />contig2 contig2<br />contig3 <br />contig4 contig4<br />contig5 contig5</p><p>$ awk 'NR==3{print "";next}{printf $1"\t"}{print $1}' data/test.tsv<br />contig1 contig1<br />contig2 contig2</p><p>contig4 contig4<br />contig5 contig5<br />You can also use getline to load the contents of another file in addition to the one you are reading, for example, in the statement given below, the while loop will load each line from test.tsv into k until no more lines are to be read:</p><p>$ awk 'BEGIN{while((getline k &lt;"data/test.tsv")&gt;0) print "BEGIN:"k}{print}' data/test.tsv<br />BEGIN:contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />BEGIN:contig2 ACTTTATATATT<br />BEGIN:contig3 ACTTATATATATATA<br />BEGIN:contig4 ACTTATATATATATA<br />BEGIN:contig5 ACTTTATATATT<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />You can also store data in the memory with the syntax VARIABLE_NAME[KEY]=VALUE which you can later use through for (INDEX in VARIABLE_NAME) command:</p><p>$ awk '{i[$1]=1}END{for (j in i) print j"&lt;="i[j]}' data/test.tsv<br />contig1&lt;=1<br />contig2&lt;=1<br />contig3&lt;=1<br />contig4&lt;=1<br />contig5&lt;=1</p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/8317/new-version-of-modeller-913</guid>
	<pubDate>Thu, 13 Feb 2014 09:07:57 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/8317/new-version-of-modeller-913</link>
	<title><![CDATA[New version of Modeller, 9.13]]></title>
	<description><![CDATA[<p>The new version of Modeller, 9.13, is now available for download! Please see the download page at <a href="http://www.facebook.com/l.php?u=http%3A%2F%2Fsalilab.org%2Fmodeller%2F&amp;h=mAQG5wo_Z&amp;enc=AZOoq2B7BxT95AT3Mw3za3VlbmRFke43YMI5vAjCAbBlIcf3bptn8pmFC1Idxrssy98117S03IgdcNmEWcQBi9bmi8Or_ut1D1yybt1ZonvPoCT3_LOglcYV7o6bEaa442_6LhbjefEaelkq0aq6dl0w&amp;s=1" target="_blank">http://salilab.org/modeller/</a> for more information.</p><p><img src="http://salilab.org/modeller/gifs/modeller.jpg" alt="image" width="848" height="272" style="border: 0px; border: 0px;"><br /> <br /> If you have a license key for Modeller 8 or 9, there is no need to reregister for Modeller 9.13 - the same license key will work. (It won't <span>do any harm to reregister if you want to, though!)<br /> <br /> 9.13 is primarily a bugfix release relative to the last public release(9.12). Major user-visible changes include:<br /> <br /> # Modeller now includes a variety of SOAP (statistically optimized atomic potential) scores for assessing proteins, loops, and interfaces.<br /> <br /> # The Lennard-Jones interaction energy is now artificially truncated at very short distance; this makes simulations with poor starting conditions much less likely to 'blow up'.<br /> <br /> # model.get_insertions(), model.get_deletions() and model.loops() now have an include_termini option; if False, residue ranges that include chain termini are excluded from the output.<br /> <br /> See the Modeller manual for a full change log: <a href="http://salilab.org/modeller/9.13/manual/node39.html" target="_blank">http://salilab.org/modeller/9.13/manual/node39.html</a><br /> <br /> If you encounter bugs in Modeller 9.13, please see <a href="http://salilab.org/modeller/9.13/manual/node10.html" target="_blank">http://salilab.org/modeller/9.13/manual/node10.html</a> for information on how to report them.</span></p><p><span>Reference:</span></p><p><span>http://salilab.org/modeller/</span></p>]]></description>
	<dc:creator>Radha Agarkar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/36211/project-based-approach-to-improve-bioinformatics-education-with-skilled-and-meaningful-access-to-omics-data</guid>
	<pubDate>Wed, 11 Apr 2018 13:31:42 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/36211/project-based-approach-to-improve-bioinformatics-education-with-skilled-and-meaningful-access-to-omics-data</link>
	<title><![CDATA[Project-based approach to improve bioinformatics education with skilled and meaningful access to omics data]]></title>
	<description><![CDATA[<p>Pine Biotech has been collaborating with Loyola University of New Orleans on piloting a new approach to bioinformatics education using the intuitive and logic-drive bioinformatics platform T-BioInfo.</p><p>https://edu.t-bio.info/collaborative-model-bioinformatics-education-combining-biologically-inspired-bioinformatics-project-based-learning/</p>]]></description>
	<dc:creator>eliabrodsky</dc:creator>
</item>

</channel>
</rss>