<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/32485?offset=420</link>
	<atom:link href="https://bioinformaticsonline.com/related/32485?offset=420" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2457/rdataminingcom-r-and-data-mining</guid>
	<pubDate>Thu, 15 Aug 2013 18:37:23 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2457/rdataminingcom-r-and-data-mining</link>
	<title><![CDATA[Rdatamining.com : R and Data Mining]]></title>
	<description><![CDATA[<p>This website presents examples, documents and resources on data mining with R. <br>Documents on using R for data mining are available to download for non-commercial personal use, including&nbsp;R Reference card for Data Mining, R and Data Mining: Examples and Case Studies and Time Series Analysis and Mining with R.</p><p>Address of the bookmark: <a href="http://www.rdatamining.com/" rel="nofollow">http://www.rdatamining.com/</a></p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/44400/pevzner-lab</guid>
  <pubDate>Thu, 02 Nov 2023 05:39:26 -0500</pubDate>
  <link></link>
  <title><![CDATA[Pevzner Lab !]]></title>
  <description><![CDATA[
<p>The laboratory works on genome sequencing, immunoproteogenomics, antibiotics sequencing, and comparative genomics - computational technologies that enabled new applications and allowed scientists to attack biological problems that remained beyond the reach of previous techniques.</p>

<p>https://bioalgorithms.ucsd.edu/research4.html</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/2573/most-commonly-used-awk-by-bioinformatician</guid>
	<pubDate>Mon, 19 Aug 2013 01:12:38 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/2573/most-commonly-used-awk-by-bioinformatician</link>
	<title><![CDATA[Most Commonly used Awk by Bioinformatician]]></title>
	<description><![CDATA[<p style="text-align: center;">&nbsp;</p><p>Awk is a programming language that is specifically designed for quickly manipulating space delimited data. Although you can achieve all its functionality with Perl, awk is simpler in many practical cases.</p><p>Why awk? You can replace a pipeline of 'stuff | grep | sed | cut...' with a single call to awk. For a simple script, most of the timelag is in loading these apps into memory, and it's much faster to do it all with one. This is ideal for something like an openbox pipe menu where you want to generate something on the fly. You can use awk to make a neat one-liner for some quick job in the terminal, or build an awk section into a shell script. You can find a lot of online tutorials, but here I will only show a few examples which cover most of bioinformatician daily uses of awk.</p><p>choose rows where column 3 is larger than column 5:</p><p>awk '$3&gt;$5' input.txt &gt; output.txt</p><p>extract column 2,4,5:</p><p>awk '{print $2,$4,$5}' input.txt &gt; output.txt</p><p>awk 'BEGIN{OFS="\t"}{print $2,$4,$5}' input.txt</p><p>show rows between 20th and 80th:</p><p>awk 'NR&gt;=20&amp;&amp;NR&lt;=80' input.txt &gt; output.txt</p><p>calculate the average of column 2:</p><p>awk '{x+=$2}END{print x/NR}' input.txt</p><p>regex (egrep):</p><p>awk '/^test[0-9]+/' input.txt</p><p>calculate the sum of column 2 and 3 and put it at the end of a row or replace the first column:</p><p>awk '{print $0,$2+$3}' input.txt</p><p>awk '{$1=$2+$3;print}' input.txt</p><p>join two files on column 1:</p><p>awk 'BEGIN{while((getline&lt;"file1.txt")&gt;0)l[$1]=$0}$1 in l{print $0"\t"l[$1]}' file2.txt &gt; output.txt</p><p>count number of occurrence of column 2 (uniq -c):</p><p>awk '{l[$2]++}END{for (x in l) print x,l[x]}' input.txt</p><p>apply "uniq" on column 2, only printing the first occurrence (uniq):</p><p>awk '!($2 in l){print;l[$2]=1}' input.txt</p><p>count different words (wc):</p><p>awk '{for(i=1;i!=NF;++i)c[$i]++}END{for (x in c) print x,c[x]}' input.txt</p><p>deal with simple CSV:</p><p>awk -F, '{print $1,$2}'</p><p>substitution (sed is simpler in this case):</p><p>awk '{sub(/test/, "no", $0);print}' input.txt</p><p>&nbsp;</p><p>OK now here's where to read this stuff properly explained. roll</p><p>Two thorough tutorials:</p><p>http://www.gnu.org/software/gawk/manual/gawk.html</p><p>http://www.grymoire.com/Unix/Awk.html</p><p>A famous list of useful one-liners - though they're short, many are quite tricky:</p><p>http://www.pement.org/awk/awk1line.txt</p><p>And some nice explanations of those one-liners. After reading this you'll have a pretty good grasp!</p><p>http://www.catonmat.net/blog/awk-one-li &hellip; -part-one/</p><p>http://www.catonmat.net/blog/ten-awk-ti &hellip; -pitfalls/</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/4288/new-born-babies-get-ready-to-know-their-whole-genome-soon</guid>
	<pubDate>Thu, 05 Sep 2013 07:24:02 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/4288/new-born-babies-get-ready-to-know-their-whole-genome-soon</link>
	<title><![CDATA[New born babies get ready to know their whole genome soon!!!]]></title>
	<description><![CDATA[<p>USA launch a pilot projects to examine medical information of newborn baby, which are being funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) and the National Human Genome Research Institute (NHGRI), both parts of the National Institutes of Health.</p><p>Awards of $5 million to four grantees have been made in fiscal year 2013 under the Genomic Sequencing and Newborn Screening Disorders research program. The program will be funded at $25 million over five years, as funds are made available.</p><p>"Hundreds of US babies will be pioneers in genomic medicine through a&nbsp;US$25-million programme to sequence their genomes&nbsp;soon after they are born."</p><p><strong>Source</strong>:</p><p><a href="http://blogs.nature.com/news/2013/09/scientists-to-sequence-hundreds-of-newborns-genomes.html">http://blogs.nature.com/news/2013/09/scientists-to-sequence-hundreds-of-newborns-genomes.html</a></p><p><a href="http://www.genome.gov/27554919">http://www.genome.gov/27554919</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/4106/phd-at-national-institute-for-research-in-reproductive-health</guid>
  <pubDate>Fri, 30 Aug 2013 04:50:35 -0500</pubDate>
  <link></link>
  <title><![CDATA[PhD at National Institute for Research in Reproductive Health]]></title>
  <description><![CDATA[
<p>National Institute for Research in Reproductive Health</p>

<p>(Indian Council of Medical Research )<br />Jehangir Merwanji Street, Parel, Mumbai 400 012</p>

<p>Advertisement No. 1/NIRRH/Ph.D. 2013<br />Admission to Ph.D. Programme – 2013</p>

<p>National Institute for Research in Reproductive Health, Mumbai, a premier institute of the Indian Council of Medical Research, conducts basic, clinical and operational research in different areas of reproductive health. The thrust areas of research include: Fertility Regulation, Infertility and Reproductive Disorders, Reproductive Tract Infections, Maternal and Child Health, Osteoporosis, Genetic Disorders, Stem Cell Biology, Structural Biology, Bioinformatics and Reproductive Toxicology. Institute is affiliated to the University of Mumbai for the award of Ph.D. degree in Applied Biology, Biochemistry, Life Sciences and Biotechnology. The institute invites applications from young and bright students for enrollment in Ph.D. programme.</p>

<p>More at http://www.nirrh.res.in/announcements/phd_program_2013.htm</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33976/goldgenomes-online-database</guid>
	<pubDate>Wed, 26 Jul 2017 07:49:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33976/goldgenomes-online-database</link>
	<title><![CDATA[GOLD:Genomes Online Database]]></title>
	<description><![CDATA[<p><span>GOLD</span><span>:Genomes Online Database, is a World Wide Web resource for comprehensive access to information regarding genome and metagenome sequencing projects, and their associated metadata, around the world.</span></p>
<p>https://gold.jgi.doe.gov/</p><p>Address of the bookmark: <a href="https://gold.jgi.doe.gov/" rel="nofollow">https://gold.jgi.doe.gov/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/2742/baumbach-lab</guid>
  <pubDate>Wed, 21 Aug 2013 10:56:35 -0500</pubDate>
  <link></link>
  <title><![CDATA[Baumbach Lab]]></title>
  <description><![CDATA[
<p>The Computational Biology research group was established in October 2012 at the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark (SDU). It emerged from the Computational Systems Biology group, founded in March 2010 at the Max Planck Institute for Informatics (MPII) and the Cluster of Excellence for Multimodel Computing and Interaction (MMCI) at Saarland University, Saarbrücken, Germany.<br />​<br />The group is headed by Prof. Dr. Jan Baumbach and currently hosts nine PhD students and one postdoctoral fellow at both, IMADA/SDU and MMCI/MPII.</p>

<p>More at &gt;&gt; http://www.baumbachlab.net/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34519/bandage-interactive-visualization-of-de-novo-genome-assemblies</guid>
	<pubDate>Mon, 04 Dec 2017 10:09:37 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34519/bandage-interactive-visualization-of-de-novo-genome-assemblies</link>
	<title><![CDATA[Bandage: interactive visualization of de novo genome assemblies]]></title>
	<description><![CDATA[<p>Bandage (a Bioinformatics Application for Navigating&nbsp;<em>De&nbsp;novo</em>&nbsp;Assembly Graphs Easily) is a tool for visualizing assembly graphs with connections. Users can zoom in to specific areas of the graph and interact with it by moving nodes, adding labels, changing colors and extracting sequences. BLAST searches can be performed within the Bandage graphical user interface and the hits are displayed as highlights in the graph. By displaying connections between contigs, Bandage presents new possibilities for analyzing&nbsp;<em>de novo</em>&nbsp;assemblies that are not possible through investigation of contigs alone.</p>
<p><strong>Availability and implementation:</strong>&nbsp;Source code and binaries are freely available at&nbsp;<a href="https://github.com/rrwick/Bandage" target="pmc_ext">https://github.com/rrwick/Bandage</a>. Bandage is implemented in C++ and supported on Linux, OS X and Windows. A full feature list and screenshots are available at&nbsp;<a href="http://rrwick.github.io/Bandage" target="pmc_ext">http://rrwick.github.io/Bandage</a>.</p><p>Address of the bookmark: <a href="http://rrwick.github.io/Bandage/" rel="nofollow">http://rrwick.github.io/Bandage/</a></p>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2839/look-up-a-biological-numbers</guid>
	<pubDate>Fri, 23 Aug 2013 03:27:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2839/look-up-a-biological-numbers</link>
	<title><![CDATA[Look up a biological numbers]]></title>
	<description><![CDATA[<p><strong>Did you ever need to look up a number</strong><span>&nbsp;like the volume of a cell or the cellular concentration of ATP, only to find yourself spending much more time than you wanted on the Internet or flipping through textbooks - all without much success?&nbsp;</span><br><br><span>Well, it didn&rsquo;t happen only to you. It is often surprising how difficult it can be to find concrete biological numbers, even for properties that have been measured numerous times. To help solve this for one and all, BioNumbers (</span><strong>the database of key numbers in molecular biology</strong><span>) was created. Along with the numbers, you'll find the relevant&nbsp;</span><strong>references to the original literature</strong><span>, useful comments, and related numbers.&nbsp;</span></p>
<p><span><span>To cite BioNumbers please refer to: Milo et al. Nucl. Acids Res. (2010) 38: D750-D753. When using a specific entry from the database it is highly recommended that you also specify the BioNumbers 6 digit ID, e.g. "BNID 100986, Milo et al 2010".&nbsp;</span></span></p><p>Address of the bookmark: <a href="http://bionumbers.hms.harvard.edu/" rel="nofollow">http://bionumbers.hms.harvard.edu/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</guid>
	<pubDate>Tue, 26 Dec 2017 22:23:39 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</link>
	<title><![CDATA[Magic-BLAST: a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome.]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p>
<p>Magic-BLAST incorporates within the NCBI BLAST code framework ideas developed in the NCBI Magic pipeline, in particular hit extensions by local walk and jump&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26109056">(http://www.ncbi.nlm.nih.gov/pubmed/26109056)</a>, and recursive clipping of mismatches near the edges of the reads, which avoids accumulating artefactual mismatches near splice sites and is needed to distinguish short indels from substitutions near the edges.</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>