<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/44252?offset=90</link>
	<atom:link href="https://bioinformaticsonline.com/related/44252?offset=90" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/21443/a-guide-for-complete-r-beginners-getting-data-into-r</guid>
	<pubDate>Tue, 24 Feb 2015 20:15:08 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/21443/a-guide-for-complete-r-beginners-getting-data-into-r</link>
	<title><![CDATA[A guide for complete R beginners :- Getting data into R]]></title>
	<description><![CDATA[<p>For a beginner this can be is the hardest part, it is also the most important to get right.</p><p>It is possible to create a vector by typing data directly into R using the combine function &lsquo;c&rsquo;</p><blockquote><p><strong>x </strong></p></blockquote><p>same as</p><blockquote><p><strong>x </strong></p></blockquote><p>creates the vector x with the numbers between 1 and 5.</p><p>You can see what is in an object at any time by typing its name;</p><blockquote><p><strong>x</strong></p></blockquote><p>will produce the output<strong> &lsquo;[1] 1 2 3 4 5&prime;</strong></p><p>Note that names need to be quoted</p><blockquote><p><strong>daysofweek </strong><strong>&larr; c(&lsquo;Monday&rsquo;, &lsquo;Tuesday&rsquo;, &lsquo;Wednesday&rsquo;, &lsquo;Thursday&rsquo;, &lsquo;Friday&rsquo;);</strong></p></blockquote><p>Usually however you want to input from a file. We have touched on the &lsquo;read.table&rsquo; function already.</p><blockquote><p><strong>mydata </strong></p></blockquote><p>Now <strong>mydata</strong> is a data frame with multiple vectors</p><p>each vector can be identified by the default syntax</p><p>#if any of these are typed it will print to screen</p><blockquote><p><strong>mydata$V1 mydata$V2 mydata$V3 </strong></p></blockquote><p>By default the function assumes certain things from the file</p><ul>
<li>The file is a plain text file (there are function to read excel files: <em>not covered here</em>)</li>
<li>columns are separated by any number of tabs or spaces</li>
<li>there is the same number of data points in each column</li>
<li>there is no header row (labels for the columns)</li>
<li>there is no column with names for the rows** [I&rsquo;ll explain].</li>
</ul><p><span style="text-decoration: underline;">If any of these are false, we need to tell that to the function</span></p><p>If it has a header column</p><blockquote><p><strong>mydata <em>header=T also works</em></strong></p></blockquote><p>Note that there is a comma between different parts of the functions arguments</p><p>If there is one less column in the header row, then R assumes that the 1<sup>st</sup> column of data after the header are the row names</p><p>Now the vectors (columns) are identified by their name</p><p>#if any of these are typed it will print to screen</p><blockquote><p><strong>mydata$A mydata$B mydata$C </strong></p></blockquote><p># Summary about the whole data frame</p><blockquote><p><strong>summary(mydata)</strong></p></blockquote><p># Summary information of column A</p><blockquote><p><strong>summary(mydata$A) </strong></p></blockquote><p>We can shortcut having to type the data frame each time by attaching it</p><blockquote><p><strong>attach(mydata)</strong></p></blockquote><p># summary of column B as &lsquo;mydata&rsquo; is attached</p><blockquote><p><strong>summary(B)</strong></p></blockquote><p><span style="text-decoration: underline;">Two other important options for </span><em><span style="text-decoration: underline;">read.table</span></em></p><p>If is is separated only by tabs and has a header</p><blockquote><p><strong>mydata </strong></p></blockquote><p>Really useful if you have spaces in the contents of some columns, so R does not mess up reading the columns . However if the columns or of an uneven length it will tell you.</p><p>If you know that the file has uneven columns</p><blockquote><p><strong>mydata </strong></p></blockquote><p>This causes R to fill empty spaces in a columns with &lsquo;NA&rsquo; .</p><p>The last two examples will still work with our file and give the same result as with only headers=T</p><p><span style="text-decoration: underline;">Graphs</span></p><p>to get an idea of what R is capable of type</p><blockquote><p><strong>demo(graphics)</strong></p></blockquote><p>steps through the examples, and the code is printed to the screen</p><p>We will work with simpler examples that have immediate use to biologists.</p><p>Remember to get more information about the options to a function type &lsquo;?function&rsquo;</p><p><span style="text-decoration: underline;">Histogram of A</span><span style="text-decoration: underline;"></span></p><blockquote><p><strong>hist(mydata$A)</strong></p></blockquote><p>If there was more data we could increase the number of vertical columns with the option, breaks=50 (or another relevant number).</p><blockquote><p><strong>boxplot(mydata)</strong></p></blockquote><p>We can get rid of the need to type the data frame each time by using the <strong>attach</strong> function</p><p># if not already done so</p><blockquote><p><strong>attach(mydata) </strong></p><p><strong>boxplot(mydata$A, mydata$B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p>same as</p><blockquote><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p><span style="text-decoration: underline;">Scatter plot</span></p><p># if not already done so</p><blockquote><p><strong>attach(mydata) </strong></p><p><strong>plot(A,B) # or plot(mydata$A, mydata$B)</strong></p></blockquote><p><strong><span style="text-decoration: underline;">SAVING an image</span></strong></p><p>Windows users (Rgui) RIGHT click on image and select which you want.</p><p><span style="text-decoration: underline;">These instructions work for everyone.</span></p><p>You need to create a new device of the type of file you need, then send the data to that device</p><p>to save as a png file (easy to load into the likes of powerpoint, also great for web applications.</p><blockquote><p><strong>png(&lsquo;filename&rsquo;) </strong></p><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p>or to save as a pdf</p><blockquote><p><strong>pdf(&lsquo;filename&rsquo;) </strong></p><p><strong>boxplot(A, B, name=c(&ldquo;Value A&rdquo;, &ldquo;Value B&rdquo;) , ylab=&ldquo;Count of Something&rdquo;)</strong></p></blockquote><p><span style="text-decoration: underline;">Note</span></p><ul>
<li>Nothing will appear on screen, the output is going to the file</li>
<li>Also it may not be saved immediately but will once the device (or R) is turned quit.</li>
</ul><p>To quit R type</p><p><strong>q() # </strong>If you save your session, next time you start R, you will have your data preloaded.</p><p>Or if you want to remain in R</p><blockquote><pre><strong>dev.off() #</strong>turns of the png (or pdf etc) device, thus forces the data to save</pre></blockquote>]]></description>
	<dc:creator>Archana Malhotra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35635/ete-3-reconstruction-analysis-and-visualization-of-phylogenomic-data</guid>
	<pubDate>Mon, 19 Feb 2018 06:46:15 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35635/ete-3-reconstruction-analysis-and-visualization-of-phylogenomic-data</link>
	<title><![CDATA[ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data]]></title>
	<description><![CDATA[<p><span>ETE v3, featuring numerous improvements in the underlying library of methods, and providing a novel set of standalone tools to perform common tasks in comparative genomics and phylogenetics. </span></p>
<p><span>The new features include </span></p>
<p><span>(i) building gene-based and supermatrix-based phylogenies using a single command, </span></p>
<p><span>(ii) testing and visualizing evolutionary models, </span></p>
<p><span>(iii) calculating distances between trees of different size or including duplications, and </span></p>
<p><span>(iv) providing seamless integration with the NCBI taxonomy database. </span></p>
<p><span>ETE is freely available at&nbsp;</span><a href="http://etetoolkit.org/" target="">http://etetoolkit.org</a></p><p>Address of the bookmark: <a href="http://etetoolkit.org" rel="nofollow">http://etetoolkit.org</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38304/lordfast-sensitive-and-fast-alignment-search-tool-for-long-noisy-read-sequencing-data</guid>
	<pubDate>Tue, 27 Nov 2018 04:43:57 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38304/lordfast-sensitive-and-fast-alignment-search-tool-for-long-noisy-read-sequencing-data</link>
	<title><![CDATA[lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data]]></title>
	<description><![CDATA[<p><span>lordFAST is a sensitive tool for mapping long reads with high error rates. lordFAST is specially designed for aligning reads from PacBio sequencing technology but provides the user the ability to change alignment parameters depending on the reads and application.</span></p>
<p>lordFAST, a novel long-read mapper that is specifically designed to align reads generated by PacBio and potentially other SMS technologies to a reference. lordFAST not only has higher sensitivity than the available alternatives, it is also among the fastest and has a very low memory footprint.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://github.com/vpc-ccg/lordfast" rel="nofollow">https://github.com/vpc-ccg/lordfast</a></p>]]></description>
	<dc:creator>BioJoker</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40613/genome-in-a-bottle-giab-consortium</guid>
	<pubDate>Sat, 25 Jan 2020 13:50:52 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40613/genome-in-a-bottle-giab-consortium</link>
	<title><![CDATA[Genome in a Bottle (GIAB) Consortium]]></title>
	<description><![CDATA[<p><span>The</span><a href="http://www.genomeinabottle.org/"> Genome in a Bottle (GIAB) Consortium</a><span> is a public-private-academic consortium hosted by </span><a href="http://www.nist.gov/" target="_blank">NIST</a><span> to develop the technical infrastructure (reference standards, reference methods, and reference data) to enable translation of whole human genome sequencing to clinical practice. </span></p>
<p><span><a href="https://www.nist.gov/news-events/news/2016/09/nist-releases-new-family-standardized-genomes">https://www.nist.gov/news-events/news/2016/09/nist-releases-new-family-standardized-genomes</a></span></p><p>Address of the bookmark: <a href="https://jimb.stanford.edu/giab/" rel="nofollow">https://jimb.stanford.edu/giab/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42581/autogluon-automl-for-text-image-and-tabular-data</guid>
	<pubDate>Thu, 07 Jan 2021 05:33:17 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42581/autogluon-automl-for-text-image-and-tabular-data</link>
	<title><![CDATA[AutoGluon: AutoML for Text, Image, and Tabular Data]]></title>
	<description><![CDATA[<p><span>AutoGluon automates machine learning tasks enabling you to easily achieve strong predictive performance in your applications. With just a few lines of code, you can train and deploy high-accuracy machine learning and deep learning models on text, image, and tabular data.</span></p><p>Address of the bookmark: <a href="https://github.com/awslabs/autogluon" rel="nofollow">https://github.com/awslabs/autogluon</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44742/nasa-open-science-data-repository</guid>
	<pubDate>Wed, 18 Dec 2024 11:54:47 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44742/nasa-open-science-data-repository</link>
	<title><![CDATA[NASA Open Science Data Repository]]></title>
	<description><![CDATA[<p><span>The NASA Open Science Data Repository (OSDR) enables access to space-related data from experiments and missions that investigate biological and health responses of terrestrial life to spaceflight. The goal of OSDR is to enable multi-modal and multi-hierarchical fundamental space life science data be reused toward basic science, applied science, and operational outcomes for space exploration and knowledge discovery. These data include &lsquo;omics, phenotypic, physiological, behavioral, hardware, environmental telemetry; raw, processed; tabular, text, code, bioimaging, and video.</span></p>
<p><span>https://www.nasa.gov/reference/osdr-data-processing/</span></p><p>Address of the bookmark: <a href="https://www.nasa.gov/osdr/" rel="nofollow">https://www.nasa.gov/osdr/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/991/master-thesis-trans-membrane-topology-prediction-through-markov-based-decoders</guid>
	<pubDate>Wed, 17 Jul 2013 16:16:17 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/991/master-thesis-trans-membrane-topology-prediction-through-markov-based-decoders</link>
	<title><![CDATA[Master Thesis: Trans-membrane topology prediction through Markov based decoders]]></title>
	<description><![CDATA[<p dir="ltr"><span>Abstract:</span></p><p dir="ltr"><span></span><span>Background/Motivation: </span></p><p dir="ltr"><span>The dearth of structural information on alpha helical membrane protein (MPs) has hindered thus far the development of reliable knowledge &ndash;based potentials that can be used for automatic prediction of trans-membrane (TM) protein structure. While algorithm for identification of TM segments is available, modelling of the domains of alpha helical MPs involves assembling the segments into a bundle. This requires the correct assignment of the buried and lipid-exposed faces of the TM domains.</span><span>&nbsp;</span></p><p dir="ltr"><span>Results: </span><span><span><span>In a cross validated test on single sequences, our trans-membrane MM, correctly predicts the entire topology for 77% of the sequences in a standard dataset of 86 proteins with supervised topology. These results compare favorably with existing methods.</span></span></span><span>&nbsp;</span></p><p dir="ltr"><span><strong>Source Code</strong>: Matlab</span></p><p dir="ltr"><span></span><span>Conclusion/Implementation</span><span><span><span>: Here discriminant data mining approach was used to predict the location and orientation of alpha helices in membrane-spanning proteins. It is based on a first order Markov model (MM) with an architecture that corresponds closely to the biological systems. The model is enriched with three types of states for the loop on the cytoplasmic side (outer loop), loop for the non-cytoplasmic side (inner side), and trans-membrane part. The closed association between the biological and Markov states allows us to infer which part of the model architecture are important to capture the information which encodes the membrane topology, and gain a better understanding of the mechanism and constraints involved. Predictor Model was established by various &nbsp;Markov decoder , and assignment of the membrane helix boundaries was apparent.</span></span></span></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/991" length="161792" type="application/vnd.ms-powerpoint" />
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/40770/scientist-bioinformatics-positions</guid>
  <pubDate>Thu, 30 Jan 2020 06:53:40 -0600</pubDate>
  <link></link>
  <title><![CDATA[Scientist Bioinformatics Positions]]></title>
  <description><![CDATA[
<p>Bioinformatics-Multi_Omics_Integration</p>

<p>https://www.researchgate.net/job/939073_Senior_Scientist_Bioinformatics-Multi_Omics_Integration</p>

<p> <br />Senior_Scientist_Bioinformatics-Transcriptomics_Analysis     </p>

<p>https://www.researchgate.net/job/939075_Senior_Scientist_Bioinformatics-Transcriptomics_Analysis-Belgium_France_Switzerland_The_Netherlands</p>

<p>Senior Scientist Bioinformatics - Network Analytics</p>

<p>https://www.researchgate.net/job/939070_Senior_Scientist_Bioinformatics-Network_Analytics_Belgium_France_Switzerland_the_Netherlands</p>

<p>Team Leader Bioinformatics Data Sciences - Mechelen, Belgium</p>

<p>https://www.researchgate.net/job/938787_Team_Leader_Bioinformatics_Data_Sciences-Mechelen_Belgium</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/39939/automatic-predictive-model-constructor-apmc</guid>
	<pubDate>Mon, 16 Sep 2019 09:43:21 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/39939/automatic-predictive-model-constructor-apmc</link>
	<title><![CDATA[Automatic Predictive Model Constructor - APMC]]></title>
	<description><![CDATA[<div><div><div><div><div><div><div><div><div><div><div><div><div><div><div>I would like to invite everyone interested in the subject of machine learning in life science, to test <strong>APMC</strong> module,</div><div>it`s a fully automatic tool (created by students) to simply create and develop supervised machine learning models</div><div>for classification and regression purposes. Links to tool, instruction and documentation bellow:</div><div><span style="font-size: 12.8px;"></span></div><ul>
<li><span style="font-size: 12.8px;">APMC:&nbsp;</span><a href="https://gene-calc.pl/apmc?fbclid=IwAR1j51l7qXsL3BuMPb-P5yQhwkmDCiVdoP-qodeCrbu2DbWtxtihRJ0n9-g" target="_blank">https://gene-calc.pl/apmc</a></li>
<li><span>How to use:&nbsp;</span><a href="https://l.facebook.com/l.php?u=https%3A%2F%2Fgene-calc.pl%2Fapmc%2Fhow-to-use%3Ffbclid%3DIwAR3tCwJiegeuVn_ZZ-YPD7lB7UrqGWaab_zItU30MvFKZiuheSEiGUxyZ9Y&amp;h=AT1x8z09NwNUiLjTgNw8Vzg9OLsEjnpHESvjOescfLF-mzjMMqTBnkh5AqHRkOaXwjVHetdQtQO7mgstwke6ivUz-hzT-ifo5TrMBuMm8XMTmvhz7nyDdKmQZ38yyXW942J_47Oj5YxYxWaMDreugIU2ytT2yvxvgKi-FgNo4N7mvYoj_1A5eCuNxHWuGA3voYn0GAWSSR96ZK4gsj3pvqBcCK9Zi2Fo8IoBNK9JZIbtnV9fdvZLMEUryCoWEceZkMX-76jmGinOXss5L3AGp_6oSUr_aFus73B4q5PXMbKubUoU4inr-0kVoO0werx5YNPWdgXtpiyD6TKXQIhI6lDtyi2jx645A5CKqW-nARPqKwa-Iwtt-KGoNyHvcSnhvfLPK9n4Lhs8W6PK9ZeobOqHwm4y1C1my-N4dvlmvGBWTgSj_E31e0GIhYxvI9Uk3nREVnMw3lfD20BTmwL-wfhSidm8Lue_Akn1Flpfcl0jP1DBpkcwJ3OMxDVA82bL4lcsGmyLGedXjrpKAiVGF3R_e57r9EeI5bWyrbYZGTaHJdOGJBQSvplDir_AfH9Pr5NSRVZOStr13e6XxUIXhCiR58Qua_yuQOsNYBKGN5OP7XAL0DeFIKmI" target="_blank">https://gene-calc.pl/apmc/how-to-use</a></li>
<li><span>Documentation:&nbsp;</span><a href="https://gene-calc.pl/apmc/documentation?fbclid=IwAR1_2agQ8vnqDw0DudUI5UJq3_ip0EFwWR3zyccOynaDlbzkfFmYXnPtFXI" target="_blank">https://gene-calc.pl/apmc/documentation</a></li>
</ul></div></div></div></div></div></div></div></div></div></div></div></div></div></div>]]></description>
	<dc:creator>Jan Bińkowski</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44483/baclife-an-automated-genome-mining-tool-for-identification-of-lifestyle-associated-genes</guid>
	<pubDate>Fri, 15 Mar 2024 04:59:14 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44483/baclife-an-automated-genome-mining-tool-for-identification-of-lifestyle-associated-genes</link>
	<title><![CDATA[bacLIFE: an automated genome mining tool for identification of lifestyle associated genes]]></title>
	<description><![CDATA[<p style="margin-top: 0px; margin-bottom: 16px; color: #1f2328; font-size: 16px; font-style: normal; font-weight: 400; text-align: start; background-color: #ffffff;" dir="auto">bacLIFE is a streamlined computational workflow that annotates bacterial genomes and performs large-scale comparative genomics to predict bacterial lifestyles and to pinpoint candidate genes, denominated<span>&nbsp;</span><strong style="font-weight: var(--base-text-weight-semibold, 600);">lifestyle-associated genes (LAGs)</strong>, and biosynthetic gene clusters associated with each lifestyle detected. This whole process is divided into different modules:</p>
<ul style="margin-top: 0px; margin-bottom: 16px; color: #1f2328; font-size: 16px; font-style: normal; font-weight: 400; text-align: start; background-color: #ffffff;" dir="auto">
<li><strong style="font-weight: var(--base-text-weight-semibold, 600);">Clustering module</strong><span>&nbsp;</span>Predicts, clusters and annotates the genes of every input genome</li>
<li style="margin-top: 0.25em;"><strong style="font-weight: var(--base-text-weight-semibold, 600);">Lifestyle prediction</strong><span>&nbsp;</span>Employs a machine learning model to forecast bacterial lifestyle or other specified metadata</li>
<li style="margin-top: 0.25em;"><strong style="font-weight: var(--base-text-weight-semibold, 600);">Analitical module (Shiny app)</strong><span>&nbsp;</span>Results from the previous modules are embedded in a user-friendly interface for comprehensive and interactive comparative genomics.</li>
</ul>
<p style="margin-top: 0px; margin-bottom: 16px; color: #1f2328; font-size: 16px; font-style: normal; font-weight: 400; text-align: start; background-color: #ffffff;" dir="auto">You can find the complete wiki here [<a href="https://github.com/Carrion-lab/bacLIFE/wiki/bacLIFE-wiki">https://github.com/Carrion-lab/bacLIFE/wiki/bacLIFE-wiki</a>]</p><p>Address of the bookmark: <a href="https://github.com/Carrion-lab/bacLIFE" rel="nofollow">https://github.com/Carrion-lab/bacLIFE</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>

</channel>
</rss>