<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Import R Data]]></title>
	<link>https://bioinformaticsonline.com/pages/view/33869/import-r-data?</link>
	<atom:link href="https://bioinformaticsonline.com/pages/view/33869/import-r-data?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/33869/import-r-data</guid>
	<pubDate>Wed, 12 Jul 2017 08:30:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/33869/import-r-data</link>
	<title><![CDATA[Import R Data]]></title>
	<description><![CDATA[<p>It is often necessary to import sample textbook data into R before you start working on your homework.</p><div id="node-69"><div><p><strong>Excel File</strong></p><p>Quite frequently, the sample data is in&nbsp;<span>Excel&nbsp;</span>format, and needs to be imported into R prior to use. For this, we can use the function&nbsp;<span>read.xls&nbsp;</span>from the&nbsp;<span>gdata&nbsp;</span>package. It reads from an Excel spreadsheet and returns a&nbsp;<a href="http://www.r-tutor.com/r-introduction/data-frame">data frame</a>. The following shows how to load an Excel spreadsheet named&nbsp;<span>"mydata.xls"</span>. This method requires Perl runtime to be present in the system.</p><blockquote><div id="listing-68"><span><a></a></span>&gt;&nbsp;library(gdata)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;gdata&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.xls)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.xls("mydata.xls")&nbsp;&nbsp;#&nbsp;read&nbsp;from&nbsp;first&nbsp;sheet</div></blockquote><p>Alternatively, we can use the function&nbsp;<span>loadWorkbook&nbsp;</span>from the&nbsp;<span>XLConnect&nbsp;</span>package to read the entire workbook, and then load the worksheets with&nbsp;<span>readWorksheet</span>. The&nbsp;<span>XLConnect&nbsp;</span>package requires Java to be pre-installed.</p><blockquote><div id="listing-69"><span><a></a></span>&gt;&nbsp;library(XLConnect)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;XLConnect&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;wk&nbsp;=&nbsp;loadWorkbook("mydata.xls")&nbsp;<br /><span><a></a></span>&gt;&nbsp;df&nbsp;=&nbsp;readWorksheet(wk,&nbsp;sheet="Sheet1")</div></blockquote><p>&nbsp;</p><h4><a></a>Minitab File</h4><p>If the data file is in&nbsp;<span>Minitab Portable Worksheet&nbsp;</span>format, it can be opened with the function&nbsp;<span>read.mtp&nbsp;</span>from the&nbsp;<span>foreign&nbsp;</span>package. It returns a&nbsp;<a href="http://www.r-tutor.com/r-introduction/list">list</a>&nbsp;of components in the Minitab worksheet.</p><blockquote><div id="listing-70"><span><a></a></span>&gt;&nbsp;library(foreign)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;the&nbsp;foreign&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.mtp)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.mtp("mydata.mtp")&nbsp;&nbsp;#&nbsp;read&nbsp;from&nbsp;.mtp&nbsp;file</div></blockquote><p>&nbsp;</p><h4><a></a>SPSS File</h4><p>For the data files in&nbsp;<span>SPSS&nbsp;</span>format, it can be opened with the function&nbsp;<span>read.spss&nbsp;</span>also from the&nbsp;<span>foreign&nbsp;</span>package. There is a&nbsp;<span>"to.data.frame"&nbsp;</span>option for choosing whether a data frame is to be returned. By default, it returns a list of components instead.</p><blockquote><div id="listing-71"><span><a></a></span>&gt;&nbsp;library(foreign)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;load&nbsp;the&nbsp;foreign&nbsp;package&nbsp;<br /><span><a></a></span>&gt;&nbsp;help(read.spss)&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;documentation&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.spss("myfile",&nbsp;to.data.frame=TRUE)</div></blockquote><p>&nbsp;</p><h4><a></a>Table File</h4><p>A data table can resides in a text file. The cells inside the table are separated by blank characters. Here is an example of a table with 4 rows and 3 columns.</p><blockquote><div id="listing-72"><span><a></a></span>100&nbsp;&nbsp;&nbsp;a1&nbsp;&nbsp;&nbsp;b1&nbsp;<br /><span><a></a></span>200&nbsp;&nbsp;&nbsp;a2&nbsp;&nbsp;&nbsp;b2&nbsp;<br /><span><a></a></span>300&nbsp;&nbsp;&nbsp;a3&nbsp;&nbsp;&nbsp;b3&nbsp;<br /><span><a></a></span>400&nbsp;&nbsp;&nbsp;a4&nbsp;&nbsp;&nbsp;b4</div></blockquote><p>Now copy and paste the table above in a file named&nbsp;<span>"mydata.txt"&nbsp;</span>with a text editor. Then load the data into the workspace with the function&nbsp;<span>read.table</span>.</p><blockquote><div id="listing-73"><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.table("mydata.txt")&nbsp;&nbsp;#&nbsp;read&nbsp;text&nbsp;file&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;print&nbsp;data&nbsp;frame&nbsp;<br /><span><a></a></span>&nbsp;&nbsp;&nbsp;V1&nbsp;V2&nbsp;V3&nbsp;<br /><span><a></a></span>1&nbsp;100&nbsp;a1&nbsp;b1&nbsp;<br /><span><a></a></span>2&nbsp;200&nbsp;a2&nbsp;b2&nbsp;<br /><span><a></a></span>3&nbsp;300&nbsp;a3&nbsp;b3&nbsp;<br /><span><a></a></span>4&nbsp;400&nbsp;a4&nbsp;b4</div></blockquote><p>For further detail of the function&nbsp;<span>read.table</span>, please consult the R documentation.</p><blockquote><div id="listing-74"><span><a></a></span>&gt;&nbsp;help(read.table)</div></blockquote><p>&nbsp;</p><h4><a></a>CSV File</h4><p>The sample data can also be in&nbsp;<span>comma separated values&nbsp;</span>(CSV) format. Each cell inside such data file is separated by a special character, which usually is a comma, although other characters can be used as well.</p><p>The first row of the data file should contain the column names instead of the actual data. Here is a sample of the expected format.</p><blockquote><div id="listing-75"><span><a></a></span>Col1,Col2,Col3&nbsp;<br /><span><a></a></span>100,a1,b1&nbsp;<br /><span><a></a></span>200,a2,b2&nbsp;<br /><span><a></a></span>300,a3,b3</div></blockquote><p>After we copy and paste the data above in a file named&nbsp;<span>"mydata.csv"&nbsp;</span>with a text editor, we can read the data with the function&nbsp;<span>read.csv</span>.</p><blockquote><div id="listing-76"><span><a></a></span>&gt;&nbsp;mydata&nbsp;=&nbsp;read.csv("mydata.csv")&nbsp;&nbsp;#&nbsp;read&nbsp;csv&nbsp;file&nbsp;<br /><span><a></a></span>&gt;&nbsp;mydata&nbsp;<br /><span><a></a></span>&nbsp;&nbsp;Col1&nbsp;Col2&nbsp;Col3&nbsp;<br /><span><a></a></span>1&nbsp;&nbsp;100&nbsp;&nbsp;&nbsp;a1&nbsp;&nbsp;&nbsp;b1&nbsp;<br /><span><a></a></span>2&nbsp;&nbsp;200&nbsp;&nbsp;&nbsp;a2&nbsp;&nbsp;&nbsp;b2&nbsp;<br /><span><a></a></span>3&nbsp;&nbsp;300&nbsp;&nbsp;&nbsp;a3&nbsp;&nbsp;&nbsp;b3</div></blockquote><p>In various European locales, as the comma character serves as the decimal point, the function&nbsp;<span>read.csv2&nbsp;</span>should be used instead. For further detail of the&nbsp;<span>read.csv&nbsp;</span>and&nbsp;<span>read.csv2&nbsp;</span>functions, please consult the R documentation.</p><blockquote><div id="listing-77"><span><a></a></span>&gt;&nbsp;help(read.csv)</div></blockquote><p>&nbsp;</p><h4><a></a>Working Directory</h4><p>Finally, the code samples above assume the data files are located in the R&nbsp;<span>working</span>&nbsp;<span>directory</span>, which can be found with the function&nbsp;<span>getwd</span>.</p><blockquote><div id="listing-78"><span><a></a></span>&gt;&nbsp;getwd()&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#&nbsp;get&nbsp;current&nbsp;working&nbsp;directory</div></blockquote><p>You can select a different working directory with the function&nbsp;<span>setwd()</span>, and thus avoid entering the full path of the data files.</p><blockquote><div id="listing-79"><span><a></a></span>&gt;&nbsp;setwd("")&nbsp;&nbsp;&nbsp;#&nbsp;set&nbsp;working&nbsp;directory</div></blockquote><p>Note that the forward slash should be used as the path separator even on Windows platform.</p><blockquote><div id="listing-80"><span><a></a></span>&gt;&nbsp;setwd("C:/MyDoc")</div></blockquote></div></div>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>

</channel>
</rss>