<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/14927?offset=390</link>
	<atom:link href="https://bioinformaticsonline.com/related/14927?offset=390" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26306/busco</guid>
	<pubDate>Sun, 07 Feb 2016 16:02:39 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26306/busco</link>
	<title><![CDATA[BUSCO]]></title>
	<description><![CDATA[<p>Assessing genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs</p>
<p>More at http://busco.ezlab.org/</p><p>Address of the bookmark: <a href="http://busco.ezlab.org/" rel="nofollow">http://busco.ezlab.org/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26325/crossmap</guid>
	<pubDate>Mon, 08 Feb 2016 15:47:00 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26325/crossmap</link>
	<title><![CDATA[CrossMap]]></title>
	<description><![CDATA[<p>CrossMap is a program for convenient conversion of genome coordinates (or annotation files) between <em>different assemblies</em> (such as Human <a href="http://www.ncbi.nlm.nih.gov/assembly/2928/">hg18 (NCBI36)</a> &lt;&gt; <a href="http://www.ncbi.nlm.nih.gov/assembly/2758/">hg19 (GRCh37)</a>, Mouse <a href="http://www.ncbi.nlm.nih.gov/assembly/165668/">mm9 (MGSCv37)</a> &lt;&gt; <a href="http://www.ncbi.nlm.nih.gov/assembly/327618/">mm10 (GRCm38)</a>).</p>
<p>It supports most commonly used file formats including SAM/BAM, Wiggle/BigWig, BED, GFF/GTF, VCF.</p>
<p>CrossMap is designed to liftover genome coordinates between assemblies. It&rsquo;s <em>not</em> a program for aligning sequences to reference genome.</p>
<p>We <em>do not</em> recommend using CrossMap to convert genome coordinates between species.</p>
<p>More at http://crossmap.sourceforge.net/</p><p>Address of the bookmark: <a href="http://crossmap.sourceforge.net/" rel="nofollow">http://crossmap.sourceforge.net/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/26432/summer-2016</guid>
  <pubDate>Sun, 21 Feb 2016 06:17:55 -0600</pubDate>
  <link></link>
  <title><![CDATA[Summer 2016]]></title>
  <description><![CDATA[
<p>REU at Fordham University- Summer 2016</p>

<p>An NSF-funded REU to study Y-chromosome diversity and sex-biased dispersal in wild brown rats (Rattus norvegicus) is available in the Munshi-South Lab at Fordham University. Our lab is currently investigating rat evolution at scales ranging from landscape genetics of individual cities to global patterns of diversity. Development of resources for investigating Y-chromosome diversity will support many of these studies. The REU student will work with the lab to bioinformatically identify Y-chromosome SNPs, design SNPtype assays,<br />extract DNA, genotype samples, and analyze data.</p>

<p>We seek applicants interested in bioinformatics, evolutionary biology, and related disciplines.  Applicants must have taken a college-level genetics course.  This REU will require attention to detail, reliability, independence, and critical thinking.</p>

<p>This position is based at Fordham University's field station, the Louis Calder Center, in Armonk, NY. The Calder Center is located approximately 25 miles north of New York City in a protected woodland area. Housing<br />will be provided at the Calder Center for the duration of the REU (May 23 to Aug 12, 2016). Additionally, the student will receive a $6,000 stipend. The selected student will participate in professional development activities through the Calder Centers REU program, including presentation of results at a research colloquium at the end of the summer.</p>

<p>To apply, please send a one page personal statement about your scientific interests and how this REU will support your professional goals, unofficial transcripts including a list of Spring 2016 courses, and names of two professional references (including title, address, phone number, and email address) as a single pdf (with your last name in the file name) to Dr. Jason Munshi-South (jmunshisouth@fordham.edu).</p>

<p>Applications are due March 4th, 2016.</p>

<p>Jason Munshi-South</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26543/breakseq2</guid>
	<pubDate>Mon, 29 Feb 2016 17:45:38 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26543/breakseq2</link>
	<title><![CDATA[BreakSeq2]]></title>
	<description><![CDATA[<p>Ultrafast and accurate nucleotide-resolution analysis of structural variants</p>
<p>More at http://bioinform.github.io/breakseq2/</p>
<p>Download BreakSeq2</p>
<p>Latest version: https://github.com/bioinform/breakseq2/archive/2.2.tar.gz<br><br>For other versions, see "releases". https://github.com/bioinform/breakseq2/releases</p><p>Address of the bookmark: <a href="http://bioinform.github.io/breakseq2/" rel="nofollow">http://bioinform.github.io/breakseq2/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/27555/phd-at-institute-of-life-sciences-bhubaneswar</guid>
  <pubDate>Mon, 30 May 2016 03:36:04 -0500</pubDate>
  <link></link>
  <title><![CDATA[PhD at INSTITUTE OF LIFE SCIENCES, Bhubaneswar]]></title>
  <description><![CDATA[
<p>INSTITUTE OF LIFE SCIENCES</p>

<p>Bhubaneswar 751023</p>

<p>Advt No. 07/2016</p>

<p>Institute of Life Sciences (ILS), Bhubaneswar, an autonomous Institute of the Department of Biotechnology, Ministry of Science &amp; Technology, Government of India engaged in advanced research invites applications from Indian nationals for the Ph.D. program. The main focus of the projects will be computational biology in the following areas.</p>

<p>S. No. Area of Research Principal investigator</p>

<p>1. Computational Cancer Biology Dr. Anshuman Dixit</p>

<p>2. Immunogenomics &amp; Systems Biology Dr. Sunil Kumar Raghav</p>

<p>3. Chromatin remodeling and hematopoiesis Dr. Punit Prasad</p>

<p>Candidates are strongly encouraged to visit ILS webpage for detailed information, regarding the research activities of the above mentioned scientists.</p>

<p>Essential Qualifications:</p>

<p>(a) Eligibility: M.Sc., M.V.Sc., M.Pharm., M.S. Pharma. (with NET/GATE/GPAT/BINC/any other equivalent national level exam) or M.Tech with minimum of 60% marks (or equivalent grade point). Those awaiting final result may also apply.</p>

<p>Applications received after the last date will not be accepted. The envelope should clearly be superscribed with “Application for Ph.D. program (computational biology)”. Short-listed candidates selected for the interview will be published in the Institute website (www.ils.res.in).</p>

<p>Application Fees: Applicants except SC/ST candidates are required to send a non-refundable D.D. for Rs.100/- in favour of “Director, Institute of Life Sciences, Bhubaneswar” payable at Bhubaneswar along with duly filled-in application form by the date mentioned below. Director, ILS reserves the right to withdraw the procedure without assigning any reasons thereof.</p>

<p>Important dates: </p>

<p>Last date of receiving applications: 24th June 2016 </p>

<p>Date of display of short-listed candidates and instructions on the Institute website: 30th June 2016 </p>

<p>Date of interview: The interview will be organized on 25th July 2016</p>

<p>Advertisement: https://www.ils.res.in/wp-content/uploads/2016/05/advt07-16.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26852/awesome-bioinformatics-pipelines</guid>
	<pubDate>Wed, 30 Mar 2016 21:50:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26852/awesome-bioinformatics-pipelines</link>
	<title><![CDATA[Awesome bioinformatics pipelines !]]></title>
	<description><![CDATA[<p><span>A curated list of awesome pipeline toolkits ...</span></p>
<p><span>https://github.com/pditommaso/awesome-pipeline</span></p><p>Address of the bookmark: <a href="https://github.com/pditommaso/awesome-pipeline" rel="nofollow">https://github.com/pditommaso/awesome-pipeline</a></p>]]></description>
	<dc:creator>Jitendra Prajapati</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/2002/ibl-laboratory</guid>
  <pubDate>Mon, 12 Aug 2013 02:02:29 -0500</pubDate>
  <link></link>
  <title><![CDATA[IBL laboratory]]></title>
  <description><![CDATA[
<p>The IBL laboratory focuses on the multi-disciplinary analyses of the global responses of model microorganisms, cyanobacteria (mainly Synechocystis PCC6803) and yeasts (mainly Saccharomyces cerevisae) to environmental stresses triggered by oxidative agents, heavy metals, or drastic changes in nutrients availability. The genome-wide responses studied with the "omics" techniques (transcriptomics, proteomics, metabolomics and genetics) generate a wealth of experimental data, which are processed, archived, integrated and represented as working models through bioinformatics and mathematics. </p>

<p>Link : http://www-dsv.cea.fr/en/instituts/institut-de-biologie-et-de-technologies-de-saclay-ibitec-s/unites-de-recherche/service-de-biologie-integrative-et-genetique-moleculaire-sbigem/laboratoire-de-biologie-integrative-lbi/presentation__1</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/1212/computational-proteomics-lets-remember-the-basics</guid>
	<pubDate>Thu, 01 Aug 2013 17:24:20 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/1212/computational-proteomics-lets-remember-the-basics</link>
	<title><![CDATA[Computational Proteomics : Lets remember the basics]]></title>
	<description><![CDATA[<p>I spend some of my valuable time in computational drug designing sector. I remember my initial proteomics days, playing with interactive protein visualization software and dreaming big. Fortunately or unfortunately, I switched to genomics and handling the genomic floods in Petabytes which is expected to be in Brontobytes in coming years. Did I mention Brontobytes ??? Let me call to my server personnel &hellip; it gonna tsunami !!!!!</p><p>Today, refreshing my old memories I decided to blog about the basic knowledge of biochemistry and computational proteomics&nbsp;skills, but after I found several article on internet saying exactly what I had wanted to say I thought I might as well just redirect BOL's blog readers there instead:</p><p>Here is the list of website and videos links which provide a good resource for you basic chemistry need:</p><p><a href="http://tecreativ.blogspot.co.uk/2012/09/funny-shortcut-remember-periodic-table.html"></a><a href="http://tecreativ.blogspot.co.uk/2012/09/funny-shortcut-remember-periodic-table.html"></a><a href="http://tecreativ.blogspot.co.uk/2012/09/funny-shortcut-remember-periodic-table.html"></a><a href="http://tecreativ.blogspot.co.uk/2012/09/funny-shortcut-remember-periodic-table.html">http://tecreativ.blogspot.co.uk/2012/09/funny-shortcut-remember-periodic-table.html</a></p><p>This blog have some specific hindi word to remember entire periodic table. I really like</p><p>Group 14 (C Si Ge Sn Pb) -&gt; Sentence &ldquo;<strong>C</strong>hemistry&nbsp;<strong>Si</strong>r&nbsp;<strong>G</strong>iv<strong>e</strong>s&nbsp;<strong>S</strong>a<strong>n</strong>ki&nbsp;<strong>P</strong>ro<strong>b</strong>lems&rdquo;</p><p>Sanki is a hindi word which mean crazy :P</p><p>I found this link useful as well&nbsp;<a href="http://www.wikihow.com/Memorise-the-Periodic-Table"></a><a href="http://www.wikihow.com/Memorise-the-Periodic-Table"></a><a href="http://www.wikihow.com/Memorise-the-Periodic-Table"></a><a href="http://www.wikihow.com/Memorise-the-Periodic-Table">http://www.wikihow.com/Memorise-the-Periodic-Table</a></p><p>The eagle genomics group provide an element of bioinformatics in periodic tables. Yes you got it, this is not periodic table rather bioinformatics tools with periodicals</p><p><a href="http://elements.eaglegenomics.com/"></a><a href="http://elements.eaglegenomics.com/"></a><a href="http://elements.eaglegenomics.com/"></a><a href="http://elements.eaglegenomics.com/">http://elements.eaglegenomics.com/</a></p><p>You can also try this video links, which provide you an overview with tricks on periodic tables:</p><p><a href="http://www.youtube.com/watch?v=fLSfgNxoVGk"></a><a href="http://www.youtube.com/watch?v=fLSfgNxoVGk"></a><a href="http://www.youtube.com/watch?v=fLSfgNxoVGk"></a><a href="http://www.youtube.com/watch?v=fLSfgNxoVGk">http://www.youtube.com/watch?v=fLSfgNxoVGk</a></p><p><a href="http://www.youtube.com/user/periodicvideos"></a><a href="http://www.youtube.com/user/periodicvideos"></a><a href="http://www.youtube.com/user/periodicvideos"></a><a href="http://www.youtube.com/user/periodicvideos">http://www.youtube.com/user/periodicvideos</a></p><p>For drug design educational material, software, tools, databses, viewer, file format and many more stuff at one place&nbsp;<a href="http://www.allfordrugs.com/drug-design/.%C2%A0I"></a><a href="http://www.allfordrugs.com/drug-design/"></a><a href="http://www.allfordrugs.com/drug-design/"></a><a href="http://www.allfordrugs.com/drug-design/">http://www.allfordrugs.com/drug-design/</a>&nbsp;I highly recommend you all computational drug designer to bookmark this page for future studies as well.</p><p>I just remember one of my mini project in which I use my flash knowledge (flash .. oh ya flash) to explain amino acids in interactive and user friendly manner. I can&rsquo;t provide It right now, but promise you to provide a link in near future. I hope that you will enjoy my flashy creative skills :).</p><p>Moreover, I found some of very interesting tricks to remember all amino acids chemical formulae on youtube at</p><p><a href="http://www.youtube.com/watch?v=gqrWb0fmzQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=gqrWb0fmzQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=gqrWb0fmzQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=gqrWb0fmzQ&amp;list=PL6132651E70BB5575">http://www.youtube.com/watch?v=gqrWb0fmzQ&amp;list=PL6132651E70BB5575</a></p><p><a href="http://www.youtube.com/watch?v=C2GfoGXfySQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=C2GfoGXfySQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=C2GfoGXfySQ&amp;list=PL6132651E70BB5575"></a><a href="http://www.youtube.com/watch?v=C2GfoGXfySQ&amp;list=PL6132651E70BB5575">http://www.youtube.com/watch?v=C2GfoGXfySQ&amp;list=PL6132651E70BB5575</a></p><p><br />Key points for computer added drug designers?<br />1. A shortage of biochemistry skills means that you absolutely nowhere in understanding the key concept and do research.<br />2. Keep handy with complex mathematical formula, before merely running tools or software.<br />3. Dig it better and deeper guys .. design it.</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/4552/imtech-lab</guid>
  <pubDate>Sun, 15 Sep 2013 09:41:04 -0500</pubDate>
  <link></link>
  <title><![CDATA[IMTECH Lab]]></title>
  <description><![CDATA[
<p>Computer Aided Protein Structure Prediction; Identification of Vaccine<br />Candidates (T-Epitope prediction); Analysis of Nucleotide/Protein Sequences; Development of Web Server/</p>

<p>Software; Creation of Public Domain Resources in Biology<br />Present Status::</p>

<p>Developing prediction methods for gene, beta-turn, secondary structure and MHC-binding sites.<br />Area of Interest ::</p>

<p>Comparison of force field simulations. Analysis of DNA-protein interactions using molecular mechanics methods.Drug Target Identification using in silico biology.</p>

<p>More @ http://www.imtech.res.in/bic/index.php?option=com_content&amp;view=article&amp;id=65</p>

<p>PIs: http://www.imtech.res.in/bic/index.php?option=com_content&amp;view=article&amp;id=69</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/37590/parallel-processing-with-perl</guid>
	<pubDate>Sat, 25 Aug 2018 11:32:40 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/37590/parallel-processing-with-perl</link>
	<title><![CDATA[Parallel Processing with Perl !]]></title>
	<description><![CDATA[<p>Here is a small tutorial on how to make best use of multiple processors for bioinformatics analysis. One best way is using perl threads and forks. Knowing how these threads and forks work is very important before implementing them. Getting to know how these work would be really useful before reading this tutorial.</p><p>Many times in bioinformatics we need to deal with huge datasets which&nbsp; are more than 100GB size. The traditional way to analysis a file is using the while loop</p><p>while (FILE){</p><p>Do something;</p><p>}</p><p>This is very slow(since we are using only one processor) and if we have 500 million lines in the dataset it takes more than a day to iterate through the whole dataset. So how do we make best use of all our processors and get the work done quickly?</p><p>Here is a very simple and efficient technique with perl which i have been using. I am&nbsp; more inclined towards using perl fork than perl threads.</p><p>One of the oldest way to fork is</p><blockquote><p>my $fork = fork();<br />if($fork){&nbsp;&nbsp;&nbsp;<br />push (@childs,$fork);&nbsp;<br />}<br />elseif($fork==0){<br /><strong>your code here;</strong><br />exit(0);<br />}<br />else{die &ldquo;Couldnt fork : $!&rdquo;;}</p><p>## wait for the child process to finish<br />foreach(@childs){<br />my $tmp=waitid($_,0);<br />}</p></blockquote><p>what a fork does is it creates a child process and takes the variables and code with it to analyze it separately (detached from the parent process) and thus a separate process is created( which usually runs on a separate processor). Thats it!! One big disadvantage of forking is its very difficult to share variables among the different processes. I will show you how to do it easily but still it has its own drawbacks.</p><blockquote><p>Okie, now if you really do not want to use fork in your code, that&rsquo;s okie too..There are many useful modules which do it for you very efficiently. One really useful module is Parallel::ForkManager. You can use Parallel::ForkManager to manage the number of forks you want to generate (number of processors you want to use).</p><p><strong>Simple usage:</strong><br />use Parallel::ForkManager;<br />my $max_processors=8;<br />my $fork= new Parallel::ForkManager($max_processors);<br />foreach (@dna) {<br />$fork-&gt;start and next; # do the fork<br /><strong>you code here;</strong><br />$fork-&gt;finish; # do the exit in the child process<br />}<br />$pm-&gt;wait_all_children;</p></blockquote><p>so you will be generating 8 forks which do the same thing for your each element of array. when one child finishes, Parallel::ForkManager generates a new one and thus you will be using all your processors to analyze the data. Now, if you have generated 8 child processes and want to write the data to one file. You need to lock the file to do this, because you will have problems with the buffering. You can lock the file using flock command.</p><blockquote><p>open (my $QUAL, &ldquo;myfile.txt&rdquo;);<br />flock $QUAL, LOCK_EX or die &ldquo;cant lock file $!&rdquo;;<br />print $QUAL &ldquo;$output&rdquo;;<br />flock $QUAL, LOCK_UN or die &ldquo;$!&rdquo;;<br />close $QUAL;</p></blockquote><p>I would not suggest using flock when dealing with multiple processes because it will decrease the processing efficiency( each child process must wait for the lock to be released by the other child process). Instead, I would suggest each fork writing to a separate file and after the processing just concatenating them.</p><p><strong>Putting it all together, If you have 100GB data you can do this</strong></p><blockquote><p><strong>step 1</strong>&nbsp;: split the dataset equally according to number of processors you have. this may take a few hours(about 2-3 hrs for 100GB file)<br />You can use unix &ldquo;split&rdquo; command for this<br />for example:<br />my $number_split=int($number_of_entries_in_your_dataset/$max_processors);<br />my $split_Files=`split -l $number_split &ldquo;your_file.fasta&rdquo; &ldquo;file_name&rdquo;`;</p><p><strong>step2</strong>: open you directory comtaining you split files and start Parallel::ForkManager.<br /><strong>For example:</strong><br />opendir(DIRECTORY, $split_files_directory) or die $!; ### open the directory<br />my $fork= new Parallel::ForkManager($max_processors);<br />while (my $file = readdir(DIRECTORY)) { ### read the directory<br />if($file=~/^\./){next;}<br />print $file,&rdquo;\n&rdquo;;<br />########## Start fork ##########<br />my $pid= $super_fork-&gt;start and next;<br /><strong>Whatever you want to do with the split file ;</strong><br /><strong>analyze my piece of $file;</strong><br />######### end fork ###############<br />$super_fork-&gt;finish;<br />}<br />$super_fork-&gt;wait_all_children;</p></blockquote><p>So basically each processor will be active with its piece of data (split file) and thus you have created 8 processes at one time which run without interfering with the other process. I again will not suggest writing output from each child process to one file(for reasons above). Write output from each fork to a separate file and finally concatenate them. Thats it, you have just increased your program speed by 8 times!! Isnt it easy?</p><p><strong>Note:</strong><br />You may worry about concatenation of the output each child generates, since it does take some time(remember 100GB). I think now you can use a mysql database LOAD DATA LOCAL INFILE command to load all the files into a single table(Should take about 3hrs for 100Gb dataset) and then export the whole table into one file. This should be faster than just concatenating them using &ldquo;cat&rdquo; command.(correct me if I am wrong)</p><p>Or much simpler way is to use pipes</p><p>cat output_dir/* | my_pipe or my_pipe &lt;(file1) final_file;</p><p>Thats it guys!! Enjoy programming and please do comment. I am not a computer scientist so forgive me for any mistakes and if any please report them. Thank you.</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>