<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/44541?offset=20</link>
	<atom:link href="https://bioinformaticsonline.com/related/44541?offset=20" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/33869/import-r-data</guid>
	<pubDate>Wed, 12 Jul 2017 08:30:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/33869/import-r-data</link>
	<title><![CDATA[Import R Data]]></title>
	<description><![CDATA[<p>It is often necessary to import sample textbook data into R before you start working on your homework.</p><div id="node-69"><div><p><strong>Excel File</strong></p><p>Quite frequently, the sample data is in&nbsp;<span>Excel&nbsp;</span>format, and needs to be imported into R prior to use. For this, we can use the function&nbsp;<span>read.xls&nbsp;</span>from the&nbsp;<span>gdata&nbsp;</span>package. It reads from an Excel spreadsheet and returns a&nbsp;<a href="http://www.r-tutor.com/r-introduction/data-frame">data frame</a>. The following shows how to load an Excel spreadsheet named&nbsp;<span>"mydata.xls"</span>. This method requires Perl runtime to be present in the system.</p><blockquote><div id="listing-68"><span><a></a></span>&gt;&nbsp;library(gdata)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;gdata&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.xls)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.xls("mydata.xls")&nbsp;&nbsp;#&nbsp;read&nbsp;from&nbsp;first&nbsp;sheet</div></blockquote><p>Alternatively, we can use the function&nbsp;<span>loadWorkbook&nbsp;</span>from the&nbsp;<span>XLConnect&nbsp;</span>package to read the entire workbook, and then load the worksheets with&nbsp;<span>readWorksheet</span>. The&nbsp;<span>XLConnect&nbsp;</span>package requires Java to be pre-installed.</p><blockquote><div id="listing-69"><span><a></a></span>&gt;&nbsp;library(XLConnect)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;XLConnect&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;wk&nbsp;=&nbsp;loadWorkbook("mydata.xls")&nbsp;<br /><span><a></a></span>&gt;&nbsp;df&nbsp;=&nbsp;readWorksheet(wk,&nbsp;sheet="Sheet1")</div></blockquote><p>&nbsp;</p><h4><a></a>Minitab File</h4><p>If the data file is in&nbsp;<span>Minitab Portable Worksheet&nbsp;</span>format, it can be opened with the function&nbsp;<span>read.mtp&nbsp;</span>from the&nbsp;<span>foreign&nbsp;</span>package. It returns a&nbsp;<a href="http://www.r-tutor.com/r-introduction/list">list</a>&nbsp;of components in the Minitab worksheet.</p><blockquote><div id="listing-70"><span><a></a></span>&gt;&nbsp;library(foreign)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;the&nbsp;foreign&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.mtp)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.mtp("mydata.mtp")&nbsp;&nbsp;#&nbsp;read&nbsp;from&nbsp;.mtp&nbsp;file</div></blockquote><p>&nbsp;</p><h4><a></a>SPSS File</h4><p>For the data files in&nbsp;<span>SPSS&nbsp;</span>format, it can be opened with the function&nbsp;<span>read.spss&nbsp;</span>also from the&nbsp;<span>foreign&nbsp;</span>package. There is a&nbsp;<span>"to.data.frame"&nbsp;</span>option for choosing whether a data frame is to be returned. By default, it returns a list of components instead.</p><blockquote><div id="listing-71"><span><a></a></span>&gt;&nbsp;library(foreign)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;the&nbsp;foreign&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.spss)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.spss("myfile",&nbsp;to.data.frame=TRUE)</div></blockquote><p>&nbsp;</p><h4><a></a>Table File</h4><p>A data table can resides in a text file. The cells inside the table are separated by blank characters. Here is an example of a table with 4 rows and 3 columns.</p><blockquote><div id="listing-72"><span><a></a></span>100&nbsp;&nbsp;&nbsp;a1&nbsp;&nbsp;&nbsp;b1&nbsp;<br /><span><a></a></span>200&nbsp;&nbsp;&nbsp;a2&nbsp;&nbsp;&nbsp;b2&nbsp;<br /><span><a></a></span>300&nbsp;&nbsp;&nbsp;a3&nbsp;&nbsp;&nbsp;b3&nbsp;<br /><span><a></a></span>400&nbsp;&nbsp;&nbsp;a4&nbsp;&nbsp;&nbsp;b4</div></blockquote><p>Now copy and paste the table above in a file named&nbsp;<span>"mydata.txt"&nbsp;</span>with a text editor. Then load the data into the workspace with the function&nbsp;<span>read.table</span>.</p><blockquote><div id="listing-73"><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.table("mydata.txt")&nbsp;&nbsp;#&nbsp;read&nbsp;text&nbsp;file&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;print&nbsp;data&nbsp;frame&nbsp;<br /><span><a></a></span>&nbsp;&nbsp;&nbsp;V1&nbsp;V2&nbsp;V3&nbsp;<br /><span><a></a></span>1&nbsp;100&nbsp;a1&nbsp;b1&nbsp;<br /><span><a></a></span>2&nbsp;200&nbsp;a2&nbsp;b2&nbsp;<br /><span><a></a></span>3&nbsp;300&nbsp;a3&nbsp;b3&nbsp;<br /><span><a></a></span>4&nbsp;400&nbsp;a4&nbsp;b4</div></blockquote><p>For further detail of the function&nbsp;<span>read.table</span>, please consult the R documentation.</p><blockquote><div id="listing-74"><span><a></a></span>&gt;&nbsp;help(read.table)</div></blockquote><p>&nbsp;</p><h4><a></a>CSV File</h4><p>The sample data can also be in&nbsp;<span>comma separated values&nbsp;</span>(CSV) format. Each cell inside such data file is separated by a special character, which usually is a comma, although other characters can be used as well.</p><p>The first row of the data file should contain the column names instead of the actual data. Here is a sample of the expected format.</p><blockquote><div id="listing-75"><span><a></a></span>Col1,Col2,Col3&nbsp;<br /><span><a></a></span>100,a1,b1&nbsp;<br /><span><a></a></span>200,a2,b2&nbsp;<br /><span><a></a></span>300,a3,b3</div></blockquote><p>After we copy and paste the data above in a file named&nbsp;<span>"mydata.csv"&nbsp;</span>with a text editor, we can read the data with the function&nbsp;<span>read.csv</span>.</p><blockquote><div id="listing-76"><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.csv("mydata.csv")&nbsp;&nbsp;#&nbsp;read&nbsp;csv&nbsp;file&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;<br /><span><a></a></span>&nbsp;&nbsp;Col1&nbsp;Col2&nbsp;Col3&nbsp;<br /><span><a></a></span>1&nbsp;&nbsp;100&nbsp;&nbsp;&nbsp;a1&nbsp;&nbsp;&nbsp;b1&nbsp;<br /><span><a></a></span>2&nbsp;&nbsp;200&nbsp;&nbsp;&nbsp;a2&nbsp;&nbsp;&nbsp;b2&nbsp;<br /><span><a></a></span>3&nbsp;&nbsp;300&nbsp;&nbsp;&nbsp;a3&nbsp;&nbsp;&nbsp;b3</div></blockquote><p>In various European locales, as the comma character serves as the decimal point, the function&nbsp;<span>read.csv2&nbsp;</span>should be used instead. For further detail of the&nbsp;<span>read.csv&nbsp;</span>and&nbsp;<span>read.csv2&nbsp;</span>functions, please consult the R documentation.</p><blockquote><div id="listing-77"><span><a></a></span>&gt;&nbsp;help(read.csv)</div></blockquote><p>&nbsp;</p><h4><a></a>Working Directory</h4><p>Finally, the code samples above assume the data files are located in the R&nbsp;<span>working</span>&nbsp;<span>directory</span>, which can be found with the function&nbsp;<span>getwd</span>.</p><blockquote><div id="listing-78"><span><a></a></span>&gt;&nbsp;getwd()&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;get&nbsp;current&nbsp;working&nbsp;directory</div></blockquote><p>You can select a different working directory with the function&nbsp;<span>setwd()</span>, and thus avoid entering the full path of the data files.</p><blockquote><div id="listing-79"><span><a></a></span>&gt;&nbsp;setwd("")&nbsp;&nbsp;&nbsp;#&nbsp;set&nbsp;working&nbsp;directory</div></blockquote><p>Note that the forward slash should be used as the path separator even on Windows platform.</p><blockquote><div id="listing-80"><span><a></a></span>&gt;&nbsp;setwd("C:/MyDoc")</div></blockquote></div></div>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/28200/machine-learning</guid>
	<pubDate>Fri, 01 Jul 2016 12:57:12 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/28200/machine-learning</link>
	<title><![CDATA[Machine Learning !!!]]></title>
	<description><![CDATA[<p>In machine learning, computers apply&nbsp;<strong>statistical learning</strong>&nbsp;techniques to automatically identify patterns in data. These techniques can be used to make highly accurate predictions.</p>
<p><em>Keep scrolling.</em>&nbsp;Using a data set about homes, we will create a machine learning model to distinguish homes in New York from homes in San Francisco.</p><p>Address of the bookmark: <a href="http://www.r2d3.us/visual-intro-to-machine-learning-part-1/" rel="nofollow">http://www.r2d3.us/visual-intro-to-machine-learning-part-1/</a></p>]]></description>
	<dc:creator>Gudiya Pal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/29658/bookmarks-biostatistics-materials-and-books</guid>
	<pubDate>Tue, 08 Nov 2016 07:42:42 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/29658/bookmarks-biostatistics-materials-and-books</link>
	<title><![CDATA[Bookmarks Biostatistics materials and books]]></title>
	<description><![CDATA[<p>Biostatistics did not spring fully formed from the brow of R. A. Fisher, but evolved over many years. This process is continuing, although it may not be obvious from the outside. It has been ten years since the first edition of this book appeared (and rather longer since it was begun). Over this time, new areas of biostatistics have been developed and emphases and interpretations have changed</p>
<p>Please bookmarks your favourate biostatistics&nbsp;books in commend sectons ...</p><p>Address of the bookmark: <a href="http://www.cos.ufrj.br/~bioestatistica/livros/Introduction%20to%20Biostatistics.pdf" rel="nofollow">http://www.cos.ufrj.br/~bioestatistica/livros/Introduction%20to%20Biostatistics.pdf</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40611/deepvariant-an-analysis-pipeline-that-uses-a-deep-neural-network-to-call-genetic-variants-from-next-generation-dna-sequencing-data</guid>
	<pubDate>Sat, 25 Jan 2020 13:28:09 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40611/deepvariant-an-analysis-pipeline-that-uses-a-deep-neural-network-to-call-genetic-variants-from-next-generation-dna-sequencing-data</link>
	<title><![CDATA[DeepVariant : an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.]]></title>
	<description><![CDATA[<p><span>DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.</span></p>
<p><span><span>DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. DeepVariant relies on&nbsp;</span><a href="https://github.com/google/nucleus">Nucleus</a><span>, a library of Python and C++ code for reading and writing data in common genomics file formats (like SAM and VCF) designed for painless integration with the&nbsp;</span><a href="https://www.tensorflow.org/">TensorFlow</a><span>&nbsp;machine learning framework.</span></span></p>
<p><span><a href="https://ai.googleblog.com/2017/12/deepvariant-highly-accurate-genomes.html">https://ai.googleblog.com/2017/12/deepvariant-highly-accurate-genomes.html</a></span></p>
<p><span><a href="https://www.biorxiv.org/content/10.1101/092890v6">https://www.biorxiv.org/content/10.1101/092890v6</a></span></p>
<p><span><img src="https://4.bp.blogspot.com/-2KlXZO60sWE/WiGc8qlZfxI/AAAAAAAACOs/s1pNiKI8jsAvJLr1E_po5udDO8eObm_awCLcBGAs/s640/image3.png" width="640" height="427" alt="image" style="border: 0px;"></span></p><p>Address of the bookmark: <a href="https://github.com/google/deepvariant" rel="nofollow">https://github.com/google/deepvariant</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41209/juicebox-visualization-and-analysis-software-for-hi-c-data</guid>
	<pubDate>Fri, 21 Feb 2020 00:33:38 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41209/juicebox-visualization-and-analysis-software-for-hi-c-data</link>
	<title><![CDATA[Juicebox: Visualization and analysis software for Hi-C data]]></title>
	<description><![CDATA[<p>Juicebox is visualization software for Hi-C data. This distribution includes the source code for Juicebox,&nbsp;<a href="https://github.com/theaidenlab/juicer/wiki/Download">Juicer Tools</a>, and&nbsp;<a href="https://aidenlab.org/assembly/">Assembly Tools</a>.&nbsp;<a href="https://github.com/theaidenlab/juicebox/wiki/Download">Download Juicebox here</a>, or use&nbsp;<a href="https://aidenlab.org/juicebox">Juicebox on the web</a>. Detailed documentation is available&nbsp;<a href="https://github.com/theaidenlab/juicebox/wiki">on the wiki</a>. Instructions below pertain primarily to usage of command line tools and the Juicebox jar files.</p>
<p>Juicebox can now be used to visualize and interactively (re)assemble genomes. Check out the Juicebox Assembly Tools Module website&nbsp;<a href="https://aidenlab.org/assembly">https://aidenlab.org/assembly</a>&nbsp;for more details on how to use Juicebox for assembly.</p>
<p>GUI at&nbsp;<a href="https://aidenlab.org/juicebox/">https://aidenlab.org/juicebox/</a></p><p>Address of the bookmark: <a href="https://github.com/aidenlab/Juicebox" rel="nofollow">https://github.com/aidenlab/Juicebox</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40583/trelliscope-flexibly-visualize-large-complex-data-in-great-detail-from-within-the-r-statistical-programming-environment</guid>
	<pubDate>Tue, 21 Jan 2020 04:22:49 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40583/trelliscope-flexibly-visualize-large-complex-data-in-great-detail-from-within-the-r-statistical-programming-environment</link>
	<title><![CDATA[Trelliscope: flexibly visualize large, complex data in great detail from within the R statistical programming environment.]]></title>
	<description><![CDATA[<p>Trelliscope provides a way to flexibly visualize large, complex data in great detail from within the R statistical programming environment. Trelliscope is a component in the<span>&nbsp;</span><a href="http://deltarho.org/docs-trelliscope/deltarho.org">DeltaRho</a><span>&nbsp;</span>environment.</p>
<p>For those familiar with<span>&nbsp;</span><a href="http://cm.bell-labs.com/cm/ms/departments/sia/project/trellis/">Trellis Display</a>,<span>&nbsp;</span><a href="http://docs.ggplot2.org/0.9.3.1/facet_wrap.html">faceting in ggplot</a>, or the notion of<span>&nbsp;</span><a href="http://en.wikipedia.org/wiki/Small_multiple">small multiples</a>, Trelliscope provides a scalable way to break a set of data into pieces, apply a plot method to each piece, and then arrange those plots in a grid and interactively sort, filter, and query panels of the display based on metrics of interest. With Trelliscope, we are able to create multipanel displays on data with a very large number of subsets and view them in an interactive and meaningful way.</p><p>Address of the bookmark: <a href="http://deltarho.org/docs-trelliscope/#introduction" rel="nofollow">http://deltarho.org/docs-trelliscope/#introduction</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/21443/a-guide-for-complete-r-beginners-getting-data-into-r</guid>
	<pubDate>Tue, 24 Feb 2015 20:15:08 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/21443/a-guide-for-complete-r-beginners-getting-data-into-r</link>
	<title><![CDATA[A guide for complete R beginners :- Getting data into R]]></title>
	<description><![CDATA[<p>For a beginner this can be is the hardest part, it is also the most important to get right.</p><p>It is possible to create a vector by typing data directly into R using the combine function &lsquo;c&rsquo;</p><blockquote><p><strong>x </strong></p></blockquote><p>same as</p><blockquote><p><strong>x </strong></p></blockquote><p>creates the vector x with the numbers between 1 and 5.</p><p>You can see what is in an object at any time by typing its name;</p><blockquote><p><strong>x</strong></p></blockquote><p>will produce the output<strong> &lsquo;[1] 1 2 3 4 5&prime;</strong></p><p>Note that names need to be quoted</p><blockquote><p><strong>daysofweek </strong><strong>&larr; c(&lsquo;Monday&rsquo;, &lsquo;Tuesday&rsquo;, &lsquo;Wednesday&rsquo;, &lsquo;Thursday&rsquo;, &lsquo;Friday&rsquo;);</strong></p></blockquote><p>Usually however you want to input from a file. We have touched on the &lsquo;read.table&rsquo; function already.</p><blockquote><p><strong>mydata </strong></p></blockquote><p>Now <strong>mydata</strong> is a data frame with multiple vectors</p><p>each vector can be identified by the default syntax</p><p>#if any of these are typed it will print to screen</p><blockquote><p><strong>mydata$V1 mydata$V2 mydata$V3 </strong></p></blockquote><p>By default the function assumes certain things from the file</p><ul>
<li>The file is a plain text file (there are function to read excel files: <em>not covered here</em>)</li>
<li>columns are separated by any number of tabs or spaces</li>
<li>there is the same number of data points in each column</li>
<li>there is no header row (labels for the columns)</li>
<li>there is no column with names for the rows** [I&rsquo;ll explain].</li>
</ul><p><span style="text-decoration: underline;">If any of these are false, we need to tell that to the function</span></p><p>If it has a header column</p><blockquote><p><strong>mydata <em>header=T also works</em></strong></p></blockquote><p>Note that there is a comma between different parts of the functions arguments</p><p>If there is one less column in the header row, then R assumes that the 1<sup>st</sup> column of data after the header are the row names</p><p>Now the vectors (columns) are identified by their name</p><p>#if any of these are typed it will print to screen</p><blockquote><p><strong>mydata$A mydata$B mydata$C </strong></p></blockquote><p># Summary about the whole data frame</p><blockquote><p><strong>summary(mydata)</strong></p></blockquote><p># Summary information of column A</p><blockquote><p><strong>summary(mydata$A) </strong></p></blockquote><p>We can shortcut having to type the data frame each time by attaching it</p><blockquote><p><strong>attach(mydata)</strong></p></blockquote><p># summary of column B as &lsquo;mydata&rsquo; is attached</p><blockquote><p><strong>summary(B)</strong></p></blockquote><p><span style="text-decoration: underline;">Two other important options for </span><em><span style="text-decoration: underline;">read.table</span></em></p><p>If is is separated only by tabs and has a header</p><blockquote><p><strong>mydata </strong></p></blockquote><p>Really useful if you have spaces in the contents of some columns, so R does not mess up reading the columns . However if the columns or of an uneven length it will tell you.</p><p>If you know that the file has uneven columns</p><blockquote><p><strong>mydata </strong></p></blockquote><p>This causes R to fill empty spaces in a columns with &lsquo;NA&rsquo; .</p><p>The last two examples will still work with our file and give the same result as with only headers=T</p><p><span style="text-decoration: underline;">Graphs</span></p><p>to get an idea of what R is capable of type</p><blockquote><p><strong>demo(graphics)</strong></p></blockquote><p>steps through the examples, and the code is printed to the screen</p><p>We will work with simpler examples that have immediate use to biologists.</p><p>Remember to get more information about the options to a function type &lsquo;?function&rsquo;</p><p><span style="text-decoration: underline;">Histogram of A</span><span style="text-decoration: underline;"></span></p><blockquote><p><strong>hist(mydata$A)</strong></p></blockquote><p>If there was more data we could increase the number of vertical columns with the option, breaks=50 (or another relevant number).</p><blockquote><p><strong>boxplot(mydata)</strong></p></blockquote><p>We can get rid of the need to type the data frame each time by using the <strong>attach</strong> function</p><p># if not already done so</p><blockquote><p><strong>attach(mydata) </strong></p><p><strong>boxplot(mydata$A, mydata$B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p>same as</p><blockquote><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p><span style="text-decoration: underline;">Scatter plot</span></p><p># if not already done so</p><blockquote><p><strong>attach(mydata) </strong></p><p><strong>plot(A,B) # or plot(mydata$A, mydata$B)</strong></p></blockquote><p><strong><span style="text-decoration: underline;">SAVING an image</span></strong></p><p>Windows users (Rgui) RIGHT click on image and select which you want.</p><p><span style="text-decoration: underline;">These instructions work for everyone.</span></p><p>You need to create a new device of the type of file you need, then send the data to that device</p><p>to save as a png file (easy to load into the likes of powerpoint, also great for web applications.</p><blockquote><p><strong>png(&lsquo;filename&rsquo;) </strong></p><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p>or to save as a pdf</p><blockquote><p><strong>pdf(&lsquo;filename&rsquo;) </strong></p><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p><span style="text-decoration: underline;">Note</span></p><ul>
<li>Nothing will appear on screen, the output is going to the file</li>
<li>Also it may not be saved immediately but will once the device (or R) is turned quit.</li>
</ul><p>To quit R type</p><p><strong>q() # </strong>If you save your session, next time you start R, you will have your data preloaded.</p><p>Or if you want to remain in R</p><blockquote><pre><strong>dev.off() #</strong>turns of the png (or pdf etc) device, thus forces the data to save</pre></blockquote>]]></description>
	<dc:creator>Archana Malhotra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/37586/julia-programming-language-a-python-and-r-rival</guid>
	<pubDate>Sat, 25 Aug 2018 04:46:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/37586/julia-programming-language-a-python-and-r-rival</link>
	<title><![CDATA[Julia Programming Language, a Python and R rival]]></title>
	<description><![CDATA[<p>Big data has grown to become one of the most lucrative fields. In fact, data scientists are some of the most sought people. They are usually hired to analyze, control and parse large chunks of data. Implementing these actions using traditional techniques is not a walk in the park. This is why most data scientists prefer using programming languages such as R and Python. However, there is one more programming language that can do the job. That is Julia programming language.</p><p>What Is Julia Language?</p><p>Julia is a programming language that came into the limelight in 2012. It is a general-purpose programming language that was designed for solving scientific computations. Julia was meant to be an alternative to Python, R and other programming languages that were mainly used for manipulating data. This is because it has numerous features that can minimize the complexities of numerical computations.&nbsp;</p><p>Julia optimizes on the best features of Python and R while at the same time overlooks their weaknesses. This explains why it is viewed as an alternative to these programming languages. For instance, it utilizes the readability and simplicity of Python then performs faster.</p><p>Julia is the most preferred programming language for data scientists and mathematicians. This is because its core features are similar to the ones that are used on most data software. Also, the language is ideal for these two subjects because its syntax is similar to the standard mathematical formulas.</p><p>Key Features Of Julia Language<br />Uses JIT Compilation<br />Parallelism<br />Dynamic Typing<br />Simple Syntax<br />Allows Metaprogramming<br />Accessible to Libraries<br />-1-Array Indexing</p><p>Julia Vs Python And R Programming Languages<br />1. Speed<br />Julia is faster than both Python and R. This is a very critical aspect that is given special attention in the big data programming. The high speed of Julia is because of JIT compilers. You will need to install external libraries on Python to achieve similar speed.</p><p>2. Syntax<br />Julia has a math-friendly syntax. The syntax of this programming language is similar to the mathematical formulas hence can be used to perform mathematical and scientific computations. This syntax makes it easier to learn than Python.</p><p>3. Parallelism<br />Although both Python and R use parallelism, Julia uses a top-level parallelism. Julia allows the processor to perform to the optimum level than what Python and R can achieve.</p><p>4. Versatility<br />Julia programming language is more versatile than Python and R. It allows a programmer to move from different codes and functions with ease.</p><p>The only area that Python and R are superior to Julia is in terms of community. Given that Julia is a new programming language, it has a small community as compared to others which have been around for years.</p><p>In overall Julia programming language is a better alternative that you can use to handle Big data projects. Despite having a small community, it is one of those programming languages that you can easily learn.</p>]]></description>
	<dc:creator>Radha Agarkar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/27070/venn-diagrams-on-r-studio</guid>
	<pubDate>Mon, 25 Apr 2016 16:22:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/27070/venn-diagrams-on-r-studio</link>
	<title><![CDATA[Venn Diagrams on R Studio]]></title>
	<description><![CDATA[<h3>First step: Install &amp; load &ldquo;VennDiagram&rdquo; package.</h3>
<pre><code><span># install.packages('VennDiagram')</span>
<span>library</span><span>(</span><span>VennDiagram</span><span>)</span>
</code></pre>
<h3>Second step: Load data</h3>
<p>Add filepath if &ldquo;catdoge.csv&rdquo; is not in working-directory.</p>
<pre><code><span>d</span> <span>&lt;-</span> <span>read.csv</span><span>(</span><span>"catdoge.csv"</span><span>)</span></code><br><br></pre><p>Address of the bookmark: <a href="http://rstudio-pubs-static.s3.amazonaws.com/13301_6641d73cfac741a59c0a851feb99e98b.html" rel="nofollow">http://rstudio-pubs-static.s3.amazonaws.com/13301_6641d73cfac741a59c0a851feb99e98b.html</a></p>]]></description>
	<dc:creator>Jitendra Prajapati</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/13226/you-and-your-friend-have-similar-dna</guid>
	<pubDate>Sun, 27 Jul 2014 20:44:05 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/13226/you-and-your-friend-have-similar-dna</link>
	<title><![CDATA[You and your friend have similar DNA !!!]]></title>
	<description><![CDATA[<p>New research out of Massachusetts claims that people often choose friends that are similar to them in genetics and they are more accurate than you might suppose. A study published on PNAS&nbsp;http://www.pnas.org/content/111/Supplement_3/10796.full found that people are apt to pick friends who are genetically similar to themselves - so much so that friends tend to be as alike at the genetic level as a person's fourth cousin.</p><div style="text-align: center;"><img src="http://i.kinja-img.com/gawker-media/image/upload/s--CwLwHa43--/18fbmlokxcmqcjpg.jpg" alt="image" width="300" height="271" style="border: 0px; border: 0px;"></div><p>Scientists with a long-running Framingham Heart Study looked at 1,932 people (examination of about 1.5 million markers of genetic variations), comparing unrelated friends to unrelated strangers. They found that friends shared about 1% of their genes &mdash; a percentage much higher than those shared with strangers.This new findings made it clear that people have more DNA in common with those who are selected as friends than with strangers in the same population.&nbsp;</p><p>The genes that lined up the most were olfactory genes, which deal with smell. The ones that lined up the least were immune system genes. The researchers weren't sure why that happened :/. Olfactory genes might be a straightforward explanation: People who like the same smells tend to be drawn to similar environments, where they meet others with the same tendencies.</p><p>Reference:</p><p>http://www.pnas.org/content/111/Supplement_3/10796.full</p><p>Image : http://i.kinja-img.com</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>