<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/27321?offset=1000</link>
	<atom:link href="https://bioinformaticsonline.com/related/27321?offset=1000" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/43605/installing-elgg-on-ubuntu</guid>
	<pubDate>Thu, 25 Nov 2021 01:45:59 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/43605/installing-elgg-on-ubuntu</link>
	<title><![CDATA[Installing ELGG on Ubuntu !]]></title>
	<description><![CDATA[<p>Follow this:</p><h3>Settings file</h3><ul>
<li>
<div>
<div>
<div>Error</div>
<div>
<p>Your web server does not have permission to create the settings.php file in your installation directory. You have two choices:</p>
<p>1. Change the permissions on the elgg-config directory of your Elgg installation</p>
<p>2. Copy the file elgg-config/settings.example.php to elgg-config/settings.php and follow the instructions in it for setting your database parameters.</p>
</div>
</div>
</div>
</li>
</ul><p>&nbsp;</p><p>To resolve this --&nbsp;</p><p>Change the permision&nbsp;</p><p>chmod 777 elgg-config</p><h3>Web server</h3><ul>
<li>
<div>
<div>
<div>Warning</div>
<div>
<p>Your server does not support automatic testing of the rewrite rules and your browser does not support checking via JavaScript. You can continue the installation, but you may experience problems with your site. You can manually test the rewrite rules by clicking this link:&nbsp;test. You will see the word success if the rules are working.</p>
</div>
</div>
</div>
</li>
</ul><p>create a .httaccess</p><p>and provide detail in it</p><p><span style="text-decoration: underline;">Installation of Elgg</span><span></span></p><p>&nbsp;</p><p>(Note: Elgg requires&nbsp;<a href="http://www.cs.oswego.edu/~odendahl/misc/howto/mysql/">installing an instance of MySQL</a>.)</p><p>&nbsp;</p><p><span style="text-decoration: underline;">Steps to make adjustment and disposal of trial installations easier</span>.</p><p>&nbsp;</p><p><span style="text-decoration: underline;">MySQL</span></p><p>At this point, we assume you've set up your&nbsp;<tt>MySQL</tt>&nbsp;database</p><div><ol>
<li>Start your&nbsp;<tt>mysqld</tt>&nbsp;server daemon.<br />(For the sake of this example, we'll assume we're running on port 9999; you shouldn't actually use this port because only one daemon can run on any port number.)</li>
<li>Login to the server as&nbsp;<tt>root</tt>&nbsp;user, using the&nbsp;<tt>mysql</tt>&nbsp;client.</li>
<li>Create a database to store Elgg's tables. We'll call the database&nbsp;<tt>elggalpha</tt>.<br /><tt>mysql&gt; create database elggalpha;</tt></li>
<li>Grant access to a user &mdash; for this example we'll call him/her&nbsp;<tt>elggalphauser</tt>.
<pre>mysql&gt; grant all privileges on elggalpha.* to 'elggalphauser'@'moxie'
    -&gt; identified by 'secretpassword';
</pre>
</li>
</ol></div><p>&nbsp;</p><p><span style="text-decoration: underline;">Elgg</span></p><p>&nbsp;</p><div><ol>
<li>Type&nbsp;<tt>umask 022</tt></li>
<li>Change into your&nbsp;<tt>public_html</tt>&nbsp;directory (<em>aka</em>&nbsp;folder).<br /><tt>cd public_html</tt></li>
<li>make a directory called&nbsp;<tt>elgg</tt><br /><tt>mkdir elgg</tt></li>
<li>Change into the&nbsp;<tt>elgg</tt>&nbsp;subdirectory of&nbsp;<tt>public_html</tt>.<br /><tt>cd elgg</tt></li>
<li>download elgg into this directory</li>
<li>you'll end up with a file named<br /><tt>/home/<em>your-user-id</em>/public_html/elgg/elgg-X.X.X.X.zip</tt><br />(It's not literally&nbsp;<tt>X.X.X.X</tt>, it might be&nbsp;<tt>elgg-1.8.0.1.zip</tt>, for example.)</li>
<li>unzip elgg<br /><tt>unzip -q elgg-1.8.0.1.zip</tt></li>
<li>you'll end up with a directory (folder) named<br /><tt>/home/<em>your-user-id</em>/public_html/elgg/elgg-X.X.X.X</tt></li>
<li>make a symbolic link (<em>aka</em>&nbsp;shortcut) to this<br /><tt>ln -s elgg-X.X.X.X alpha</tt><br />(For example,&nbsp;<tt>ln -s elgg-1.8.0.1 alpha</tt>)</li>
<li>Give user&nbsp;<tt>nobody</tt>&nbsp;access to this directory<br /><tt>/usr/misc/bin/acl.sh -r -u nobody elgg-1.8.0.1</tt></li>
<li>create a data directory for&nbsp;<tt>elgg</tt>&nbsp;(get into your home directory first):
<pre>cd
mkdir -p elgg/alpha/data
chmod -R 755 elgg/alpha/data
</pre>
</li>
<li>Change into the&nbsp;<tt>elgg/alpha</tt>&nbsp;directory.<br /><tt>cd elgg/alpha</tt></li>
<li>Give user&nbsp;<tt>nobody</tt>&nbsp;access to this subdirectory&nbsp;<tt>data</tt><br /><tt>/usr/misc/bin/acl.sh -u nobody data</tt></li>
<li>In your browser, navigate to your elgg installation<br /><tt>http://moxie.cs.oswego.edu/~<em>your-user-id</em>/elgg/alpha/</tt><br />You'll have a screen which includes this message:
<pre>Welcome

Installing Elgg has 6 simple steps and reading this welcome is the first one!

If you haven't already, read through the installation instructions included with Elgg (or click the instructions link at the bottom of the page).

If you are ready to proceed, click the Next button.
</pre>
</li>
<li>Follow configuration process:
<ul>
<li>The first screen probably will contain a success message in green and a failure message in pink:
<pre><span>Requirements check</span>

Your server failed the requirements check. After you have fixed the below issues, refresh this
page. Check the troubleshooting links at the bottom of this page if you need further assistance.
<span>PHP</span></pre>
<div>Your server's PHP satisfies all of Elgg's requirements.</div>
<pre>
<span>Web server</span></pre>
<div>We think your server is running the Apache web server. The rewrite test failed and the most likely cause is that AllowOverride is not set to All for Elgg's directory. This prevents Apache from processing the .htaccess file which contains the rewrite rules. A less likely cause is Apache is configured with an alias for your Elgg directory and you need to set the RewriteBase in your .htaccess. There are further instructions in the .htaccess file in your Elgg directory.</div>
<pre>
<span>Database</span></pre>
<div>The database requirements are checked when Elgg loads its database.</div>
</li>
<li>To correct this, replace the&nbsp;<tt>.htaccess</tt>&nbsp;file that Elgg created with one that you can edit:
<pre>cp -ip .htaccess temp-htaccess
rm .htaccess
mv temp-htaccess .htaccess
</pre>
</li>
<li>Edit the&nbsp;<tt>.htaccess</tt>&nbsp;file: Go to approximately line 101 (where it says&nbsp;<tt>#RewriteBase /</tt>) and add the line:<br /><tt>RewriteBase /~<em>your-user-id</em>/elgg/alpha/</tt></li>
<li>Be sure to save the edited file.
<p>&nbsp;</p>
<p>&nbsp;</p>
</li>
<li>Click the&nbsp;<span>Refresh</span>&nbsp;button.
<p>&nbsp;</p>
<p>&nbsp;</p>
</li>
<li>If this hasn't fixed things, seek professional help.
<p>&nbsp;</p>
<p>&nbsp;</p>
</li>
<li>Click the&nbsp;<span>Next</span>&nbsp;button.
<p>&nbsp;</p>
<p>&nbsp;</p>
</li>
<li>Fill in the parameters appropriate to your MySQL installation.
<pre><span>Database installation</span>

If you haven't already created a database for Elgg, do that now. Then fill in the values below to
initialize the Elgg database.

<span>Database Username</span></pre>
<div>elggalphauser</div>
<pre>User that has full priviledges to the MySQL database that you created for Elgg

<span>Database Password</span></pre>
<div>secretpassword</div>
<pre>Password for the above database user account

<span>Database Name</span></pre>
<div>elggalpha</div>
<pre>Name of the Elgg database

<span>Database Host</span></pre><sup>&Dagger;</sup>
<div>moxie:9999</div>
<pre>Hostname of the MySQL server (usually localhost)
<strong>&Dagger;You might need to use <tt>127.0.0.1:9999</tt> for the host
    if you haven't set up MySQL as we've done.</strong>



<span>Database Table Prefix</span></pre>
<div>elgg_</div>
<pre>The prefix given to all of Elgg's tables (usually elgg_)
</pre>
<div><span>Next</span></div>
<p>&nbsp;</p>
<p>&nbsp;</p>
</li>
<li>Continue filling in forms as requested.
<pre>Configure site</pre>
<div>Database has been installed.</div>
<pre>
We need some information about the site as we configure Elgg. If you haven't created a data directory for Elgg, you need to do so now.

<span>Site Name</span></pre>
<div>My Elgg Site version Alpha</div>
<pre>The name of your new Elgg site

<span>Site Email Address</span></pre>
<div><em>your-user-id</em>@oswego.edu</div>
<pre>Email address used by Elgg for communication with users

<span>Site URL</span></pre>
<div>http://moxie.cs.oswego.edu/~<em>your-user-id</em>/elgg/alpha/</div>
<pre>The address of the site (Elgg usually guesses this correctly)

<span>Elgg Install Directory</span></pre>
<div>/home/<em>your-user-id</em>/public_html/elgg/elgg.1.8.0.1/</div>
<pre>The directory where you put the Elgg code (Elgg usually guesses this correctly)

<span>Data Directory</span></pre>
<div>/home/<em>your-user-id</em>/elgg/alpha/data</div>
<pre>The directory that you created for Elgg to save files (the permissions on this directory are checked
when you click Next)

<span>Default Site Access</span></pre>
<div>Public</div>
<pre>The default access level for new user created content
</pre>
</li>
<li>
<pre><span>Create admin account</span></pre>
<div>Site settings have been saved.</div>
<pre>
It is now time to create an administrator's account.

<span>Display Name</span></pre>
<div>&nbsp;</div>
<pre>The name that is displayed on the site for this account

<span>Email Address</span></pre>
<div>&nbsp;</div>
<pre>
<span>Username</span></pre>
<div>&nbsp;</div>
<pre>Account username used for logging in

<span>Password</span></pre>
<div>&nbsp;</div>
<pre>Account password must be at least 6 characters long

<span>Password Again</span></pre>
<div>&nbsp;</div>
<pre>Retype password to confirm</pre>
</li>
</ul>
</li>
</ol></div>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/12593/visiting-scientist-computational-genomics-two-positions</guid>
  <pubDate>Mon, 07 Jul 2014 22:53:41 -0500</pubDate>
  <link></link>
  <title><![CDATA[Visiting Scientist - Computational Genomics (two positions)]]></title>
  <description><![CDATA[
<p>Scientific/Managerial &amp; International Recruitment</p>

<p>ICRISAT seeks applications from Indian nationals Visiting Scientist-Computational Genomics (2 positions), to be part of a team of Centre of Excellence in Genomics (CEG), (www.icrisat.org/ceg) to work on legume genomics projects.  The positions will be based at ICRISAT’s Headquarters in Patancheru, Hyderabad, India.</p>

<p>ICRISAT is a non-profit, non-political organization that conducts agricultural research for development in Asia and sub-Saharan Africa with a wide array of partners throughout the world. Covering 6.5 million square kilometers of land in 55 countries, the semi-arid tropics is home to over 2 billion people, with 650 million of these are the poorest of the poor. ICRISAT and its partners help empower those living in the semi-arid tropics, especially smallholder farmers, to overcome poverty, hunger, malnutrition and a degraded environment through more efficient and profitable agriculture. ICRISAT is headquartered in Greater Hyderabad, Andhra Pradesh, India and belongs to the Consortium of Centers supported by the Consultative Group on International Agricultural Research (CGIAR).</p>

<p>The Job: Responsibilities for these positions include:</p>

<p>    Analyzing and handling large-scale next generation sequencing DNA and RNA data<br />    Data mining and development of pipelines and troubleshooting<br />    Genome diversity analysis such as SNPs, Indels, Structural Variations, population structure<br />    Genome wide association study (GWAS) related analysis- LD analysis, hapmap and trait mapping<br />    Expression analysis based on RNA-Seq data, annotation, gene ontology and metabolic pathway analysis<br />    Epigenome analysis, small RNA identification<br />    Gene family analysis, sequence level protein analysis, orthology/paralogy and molecular modelling<br />    Compiling and analysis of results, writing reports and research papers</p>

<p>The Person:  Ph.D. or MSc/MTech/PGDCA with two years research experience in Biotechnology, Computational biology, Agricultural/ Plant Biotechnology, Genetics, Molecular Biology or related discipline. Good knowledge of programming/scripting in at least two of following languages: Perl, C, C++, R, Shell Scripting and Python is plus.</p>

<p>How to apply: Please apply latest by 20 July 2014.  The application should include the name of the position applied for, a letter of motivation, a full Curriculum Vita (CV), and the names and contact information of three references that are knowledgeable of the candidate’s professional qualifications and work experience. Technical details and more information about these positions can be obtained from R.K.VARSHNEY@CGIAR.ORG. All applications will be acknowledged, however only short listed candidates will be contacted.</p>

<p>Apply here https://recruit.zoho.com/ats/Portal.na?digest=T642sgLYWZOStExJ77cPrcM*sIMGZETWw4yPxngbmHA-</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33219/dbcan-a-web-server-and-database-for-automated-carbohydrate-active-enzyme-annotation</guid>
	<pubDate>Mon, 29 May 2017 05:39:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33219/dbcan-a-web-server-and-database-for-automated-carbohydrate-active-enzyme-annotation</link>
	<title><![CDATA[dbCAN: a web server and DataBase for automated Carbohydrate-active enzyme ANnotation]]></title>
	<description><![CDATA[<p><a href="http://csbl.bmb.uga.edu/dbCAN/index.php">dbCAN</a>&nbsp;is a web server and&nbsp;<span style="text-decoration: underline;">D</span>ata<span style="text-decoration: underline;">B</span>ase for&nbsp;<a href="http://csbl.bmb.uga.edu/dbCAN/annotate.php"><strong>automated&nbsp;<span style="text-decoration: underline;">C</span>arbohydrate-active enzyme&nbsp;<span style="text-decoration: underline;">AN</span>notation</strong></a>, funded by the&nbsp;<a href="http://bioenergycenter.org/">BioEnergy Science Center of the DOE</a>. Similar resources on the web include&nbsp;<a href="http://www.cazy.org/" target="_blank">CAZy database</a>&nbsp;and&nbsp;<a href="http://cricket.ornl.gov/cgi-bin/cat.cgi" target="_blank">CAT</a>. All data in dbCAN are generated based on the family classification from&nbsp;<a href="http://www.cazy.org/" target="_blank">CAZy database</a>&nbsp;while it has the following&nbsp;<strong><span style="text-decoration: underline;">unique features</span></strong>&nbsp;compared with CAZy database and CAT:</p>
<ul>
<li>dbCAN provides the capability of&nbsp;<a href="http://csbl.bmb.uga.edu/dbCAN/annotate.php">automated and comprehensive CAZyme annotation</a>&nbsp;of a given genome submitted by the user;</li>
<li>dbCAN provides an explicitly defined&nbsp;<span style="text-decoration: underline;">signature domain</span>&nbsp;for each and every CAZyme family along with its location in all the relevant full-length CAZyme proteins in all sequenced&nbsp;<a href="http://csbl.bmb.uga.edu/dbCAN/genome.php">genomes</a>;</li>
<li>dbCAN provides the most complete set of&nbsp;<span style="text-decoration: underline;">metagenomic CAZyme</span>&nbsp;genes published so far and represents the first step towards discovering novel CAZyme catalysts in metagenomes;</li>
<li>dbCAN provides a&nbsp;<span style="text-decoration: underline;">subfamily classification</span>&nbsp;of the existing CAZyme families based on sequence similarities;</li>
<li>dbCAN make all pre-computed data freely available to the public, including sequence alignments,&nbsp;<a href="http://csbl.bmb.uga.edu/dbCAN/download/">hidden markov models (HMMs)</a>&nbsp;and phylogenies of the signature domain regions in each and every CAZyme family and subfamily.</li>
</ul>
<p><a href="http://csbl.bmb.uga.edu/dbCAN/help.php">dbCAN</a>&nbsp;is updated regularly when&nbsp;<a href="http://www.cazy.org/" target="_blank">CAZy database</a>&nbsp;created new families based on latest literature.</p><p>Address of the bookmark: <a href="http://csbl.bmb.uga.edu/dbCAN/index.php" rel="nofollow">http://csbl.bmb.uga.edu/dbCAN/index.php</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/11035/bioinformatics-jrfsrf-position-at-nii</guid>
  <pubDate>Sun, 25 May 2014 16:54:04 -0500</pubDate>
  <link></link>
  <title><![CDATA[Bioinformatics JRF/SRF position at NII]]></title>
  <description><![CDATA[
<p>NATIONAL INSTITUTE OF IMMUNOLOGY, NEW DELHI-110067</p>

<p>Applications are invited for the position of Senior Research Fellow for the following time-bound sponsored project as per the details given below:</p>

<p>1. BTIS project on, “Bioinformatics Center-National Infrastructural Facility in the Area of Immunology” funded by DBT</p>

<p>Senior Research Fellow (P) (One Position only)</p>

<p>Dr. Debasisa Mohanty<br />Staff Scientist-VI<br />deb@nii.res.in</p>

<p>Qualifications: M.Sc in Biological Sciences or Biotechnology with at least 04 years of Research experience in Bioinformatics or computational Biology after the master’s degree is essential.</p>

<p>Emoluments: The selected candidates will draw consolidated emoluments as per Institute Rules, depending upon qualifications &amp; experience</p>

<p>Rs. 18,000/- per month consolidated plus 30% HRA if Leading to Ph.D/NET/GATE Qualified otherwise Rs. 14,000/- per month + 30% HRA.</p>

<p>Job description: The candidate should be well versed in programming in PERL/C++/HTML/CGI, web server and portal development, computational analysis of<br />protein structure &amp; function, molecular dynamics simulations and use of high performance computing systems.</p>

<p>GENERAL TERMS AND CONDITIONS:-</p>

<p>1. The candidates selected for the above posts will be on contract for one year or duration of the project whichever is shorter, at a time.<br />2. No hostel/ housing facility will be provided.<br />3. Number of posts may vary and shall be need based. Advertisement is no commitment.<br />4. Applicants may clearly mention the category they belong to i.e. SC/ST/OBC/PH and attach documentary proof of the same.<br />5. No TA/DA will be paid for attending the interview, if called for.<br />6. Apart from sending application in the prescribed format given below, candidates should send complete Curriculum Vitae along with the names of three referees. Curriculum Vitae should contain details of the experimental expertise.</p>

<p>HOW TO APPLY Interested candidates may apply directly, STRICTLY IN THE PRESCRIBED FORMAT GIVEN BELOW, through e-mail, to the Investigator of the project, clearly indicating the name of the project along with their complete C.V., e-mail id, fax numbers, telephone numbers. Only Short listed candidates will be called for interview and they required to submit attested copies of all their certificates and a Demand Draft of Rs 100/- drawn on Canara Bank or Indian Bank payable at Delhi/New Delhi in favour of the Director, NII (SC / ST and PH candidates are exempted subject to submission of documentary proof), at the time of interview.</p>

<p>LAST DATE OF RECEIPT OF APPLICATIONS: 06th June, 2014</p>

<p>Advertisement</p>

<p>www1.nii.res.in/sites/default/files/projectappointment-Dr.Mohanty-6June2014.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38908/busca-an-integrative-web-server-to-predict-subcellular-localization-of-proteins</guid>
	<pubDate>Thu, 07 Feb 2019 14:08:11 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38908/busca-an-integrative-web-server-to-predict-subcellular-localization-of-proteins</link>
	<title><![CDATA[BUSCA: an integrative web server to predict subcellular localization of proteins]]></title>
	<description><![CDATA[<p><span>BUSCA (Bologna Unified Subcellular Component Annotator) is a web-server for predicting protein subcellular localization. BUSCA integrates different tools to predict localization-related protein features (DeepSig, TPpred3, PredGPI and ENSEMBLE3.0) as well as tools for discriminating subcellular localization of both globular and membrane proteins (BaCelLo, MemLoci and SChloro).</span></p><p>Address of the bookmark: <a href="http://busca.biocomp.unibo.it/" rel="nofollow">http://busca.biocomp.unibo.it/</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/13014/bioinformatics-jrf-vacancy-at-icgeb-new-delhi</guid>
  <pubDate>Wed, 23 Jul 2014 16:07:15 -0500</pubDate>
  <link></link>
  <title><![CDATA[Bioinformatics JRF vacancy at ICGEB, New Delhi]]></title>
  <description><![CDATA[
<p>Junior Research Fellow for a DBT sponsored project entitled "Computational and experimental characterization of stage specific arginine methylation in P. falciparum proteome". </p>

<p>Candidates should have a 1st class MSc/MTech/BTech degree in Bioinformatics. Please send complete CV, quoting Application for RMETH-JRF-2014, by email to Dr. Dinesh Gupta: dinesh@icgeb.res.in</p>

<p>Closing date for applications: 6 August 2014</p>

<p>More at http://www.icgeb.org/tl_files/Vacancies/JRF.pdf</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41565/csar-web-a-web-server-of-contig-scaffolding-using-algebraic-rearrangements</guid>
	<pubDate>Fri, 10 Apr 2020 04:39:36 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41565/csar-web-a-web-server-of-contig-scaffolding-using-algebraic-rearrangements</link>
	<title><![CDATA[CSAR-web: a web server of contig scaffolding using algebraic rearrangements]]></title>
	<description><![CDATA[<p><span>CSAR-web is a web-based tool that allows the users to efficiently and accurately scaffold (i.e. order and orient) the contigs of a target draft genome based on a complete or incomplete reference genome from a related organism.&nbsp;</span></p>
<p><span><span>CSAR-web can serve as a convenient and useful scaffolding tool allowing the users to efficiently and accurately scaffold their draft genomes according to a complete or incomplete reference genome.&nbsp;</span></span></p><p>Address of the bookmark: <a href="http://genome.cs.nthu.edu.tw/CSAR-web" rel="nofollow">http://genome.cs.nthu.edu.tw/CSAR-web</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/11313/linux-sort-commands-for-bioinformatics</guid>
	<pubDate>Sat, 31 May 2014 15:41:16 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/11313/linux-sort-commands-for-bioinformatics</link>
	<title><![CDATA[Linux Sort Commands for Bioinformatics]]></title>
	<description><![CDATA[<p>Almost all the scripting languages such as Perl, Python etc have built-in sort, but unfortunately none of them are as flexible as sort command. But one when it come to space efficiency GNU sort stands at the top. It can sort a 20Gb file with less than 2Gb memory. It is not trivial to implement so powerful a sort by yourself.</p><p>sort a space-delimited file based on its first column, then the second if the first is the same, and so on:<br />sort input.txt</p><p>sort a huge file (GNU sort ONLY):<br />sort -S 1500M -t $HOME/tmp input.txt &gt; sorted.txt</p><p>sort starting from the third column, skipping the first two columns:<br />sort +2 input.txt</p><p>sort the second column as numbers, descending order; if identical, sort the 3rd as strings, ascending order:<br />sort -k2,2nr -k3,3 input.txt</p><p>sort starting from the 4th character at column 2, as numbers:<br />sort -k2.4n input.txt</p><p>More Linxu sort command information<br /><br />If you have any sort commands you'd like to share, please add them to our comments section below. For more help, you can also type:<br /><br />man sort<br /><br />or<br /><br />sort --help<br /><br />on your Unix/Linux system.</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43907/htop-explained</guid>
	<pubDate>Wed, 06 Jul 2022 01:28:09 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43907/htop-explained</link>
	<title><![CDATA[htop explained]]></title>
	<description><![CDATA[<p>For the longest time I did not know what everything meant in htop.</p>
<p>I thought that load average&nbsp;<code>1.0</code>&nbsp;on my two core machine means that the CPU usage is at 50%. That's not quite right. And also, why does it say&nbsp;<code>1.0</code>?</p>
<p>I decided to look everything up and document it here.</p><p>Address of the bookmark: <a href="https://peteris.rocks/blog/htop/" rel="nofollow">https://peteris.rocks/blog/htop/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/11399/next-generation-sequencing-in-r-or-bioconductor-environment</guid>
	<pubDate>Mon, 02 Jun 2014 18:03:09 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/11399/next-generation-sequencing-in-r-or-bioconductor-environment</link>
	<title><![CDATA[Next generation sequencing in R or bioconductor environment]]></title>
	<description><![CDATA[<p>There are many R software and bioconductor packages for NGS data analysis, some of them are as follows</p><h3><a name="TOC-Biostrings" id="TOC-Biostrings"></a>Biostrings</h3><p>The Biostrings package from Bioconductor provides an advanced environment for efficient sequence management and analysis in R. It contains many speed and memory effective string containers, string matching algorithms, and other utilities, for fast manipulation of large sets of biological sequences. The objects and functions provided by Biostrings form the basis for many other sequence analysis packages. <a href="http://bioconductor.org/packages/release/bioc/html/Biostrings.html">Documentation</a></p><div><div style="text-align: left;"><div style="color: #000000;"><h4><a name="TOC-IRanges-Overview" id="TOC-IRanges-Overview"></a>IRanges Overview</h4><p>IRanges provides the low-level infrastructure and containers for handling sets of integer ranges within Bioconductor's BioC-Seq domain. Its classes and methods provide support for many more high-level packages like GenomicRanges, ShortRead, Rsamtools, etc. <a href="http://bioconductor.org/packages/release/bioc/html/IRanges.html">Documentation</a></p><div style="text-align: right;"><div style="text-align: left;"><h4><a name="TOC-GenomicRanges-Overview" id="TOC-GenomicRanges-Overview"></a>GenomicRanges Overview</h4><p>The <em>GenomicRanges</em> package serves as the foundation for representing genomic locations within the Bioconductor project. It is built upon the <em>IRanges</em> infrastructure and defines three major data containers - <em>GRanges, GRangesList</em> and <em>GappedAlignments</em> - which are supporting other important BioC-Seq packages including <em>ShortRead, Rsamtools, rtracklayer, GenomicFeatures</em> and <em>BSgenome</em>.&nbsp; Compared to the IRanges container, the GRanges/<em>GRangesList</em> classes are more flexible and extensible to store additional information about sequence ranges, such as chromosome identifiers (sequence space), strand information and annotation data. <a href="http://bioconductor.org/packages/release/bioc/html/GenomicRanges.html">Documentation</a></p></div></div></div></div><h3><a name="TOC-Motif-Discovery" id="TOC-Motif-Discovery"></a>Motif Discovery</h3><h4><a name="TOC-cosmo" id="TOC-cosmo"></a>cosmo</h4><p>The cosmo package allows to search a set of unaligned DNA sequences for a shared motif that may function as transcription factor binding site. The algorithm extends the popular motif discovery tool MEME (Bailey and Elkan, 1995) in that it allows the search to be supervised by specifying a set of constraints that the motif to be discovered must satisfy. <a href="http://bioconductor.org/packages/release/bioc/html/cosmo.html">Documentation</a></p></div><div>
<p><span></span><span></span></p>
<div style="color: #0000ff;"><h4><a name="TOC-BCRANK" id="TOC-BCRANK"></a>BCRANK</h4><p>BCRANK is a method that takes a ranked list of genomic regions as input and outputs short DNA sequences that are overrepresented in some part of the list. The algorithm was developed for detecting transcription factor (TF) binding sites in a large number of enriched regions from high-throughput ChIP-chip or ChIP-seq experiments, but it can be applied to any ranked list of DNA sequences. Documentation</p>
<p><a href="http://bioconductor.org/packages/release/bioc/html/BCRANK.html"></a></p>
<p>rGADEM: <a href="http://bioconductor.org/packages/devel/bioc/html/rGADEM.html">Documentation</a></p><p>MotIV: <a href="http://bioconductor.org/packages/devel/bioc/html/MotIV.html">Documentation</a></p></div><h3><a name="TOC-ShortRead" id="TOC-ShortRead"></a>ShortRead</h3><p>The ShortRead package provides input, quality control, filtering, parsing, and manipulation functionality for short read sequences produced by high throughput sequencing technologies. While support is provided for many sequencing technologies, this package is primairly focused on Solexa/Illumina reads. <a href="http://bioconductor.org/packages/release/bioc/html/ShortRead.html">Documentation</a></p><h3><a name="TOC-Rsamtools" id="TOC-Rsamtools"></a>Rsamtools</h3><p>Rsamtools provides functions for parsing and inspecting samtools BAM formatted binary alignment data. SAM/BAM is quickly becoming a universal standard alignment format, and is now supported by a wide variety of alignment tools. <a href="http://bioconductor.org/help/bioc-views/2.7/bioc/html/Rsamtools.html">Documentation</a></p>
<p><a href="http://samtools.sourceforge.net/">Samtools Website</a><br /> <a href="http://bio-bwa.sourceforge.net/">BWA (Burrows-Wheeler Alignment) Website</a><br /><span style="color: #0000ff;"></span></p>
<div style="color: #000000;">&nbsp;</div></div><div>
<p><span style="color: #000000;">Additional tools for SNP analysis:&nbsp;</span></p>
<p><a href="http://bioconductor.org/help/bioc-views/release/bioc/html/snpMatrix.html">snpMatrix</a></p><h3><a name="TOC-BSgenome" id="TOC-BSgenome"></a>BSgenome</h3><p>BSgenome provides an object oriented infrastructure for interacting with a Biostring based genome sequence. BSgenome packages exist for many common genomes, and can be created to represent custom genomes. See the "How to forge a BSgenome data package" Vignette for instructions to create a new BSgenome package if a prebuilt package does not exist for your organism. <a href="http://bioconductor.org/packages/release/bioc/html/BSgenome.html">Documentation</a></p><h3><a name="TOC-rtracklayer" id="TOC-rtracklayer"></a>rtracklayer</h3><p>rtracklayer provides an interface for exporting annotation feature data to various genome browsers and file formats (such as GFF). See the Small RNA Profiling exercise for an example of using rtracklayer to visualize alignment coverage. <a href="http://bioconductor.org/packages/release/bioc/html/rtracklayer.html">Documentation</a></p><h3><a name="TOC-biomaRt" id="TOC-biomaRt"></a>biomaRt</h3><p>The biomaRt package, provides an interface to a growing collection of databases implementing the BioMart software suite (http:// www.biomart.org). The package enables online retrieval of large amounts of data in a uniform way without the need to know the underlying database schemas. This data is retrieved automatically via the Internet, so it's recommended that you cache the data locally, or check versions if your code will be adversely affected by updates to these data. <a href="http://bioconductor.org/packages/release/bioc/html/biomaRt.html">Documentation</a></p><h3><a name="TOC-ChIP-Seq-Analysis-Packages" id="TOC-ChIP-Seq-Analysis-Packages"></a>ChIP-Seq Analysis Packages</h3><p>Bioconductor provides various packages for analyzing and visualizing ChIP-Seq data. Only a small selection of these packages is introduced here. Additional useful introductions to this topic are: <a href="http://www.bioconductor.org/workshops/2009/SeattleJan09/ChIP-seq/">BioC ChIP-seq Case Study</a> and BioC <a href="http://www.bioconductor.org/help/course-materials/2009/SeattleNov09/ChIP-seq/">ChIP-Seq</a>.</p><h4><a name="TOC-chipseq" id="TOC-chipseq"></a>chipseq</h4><p>The chipseq package combines a variety of HT-Seq packages to a pipeline for ChIP-Seq data analysis. <a href="http://bioconductor.org/packages/release/bioc/html/chipseq.html">Documentation</a></p><h4><a name="TOC-BayesPeak" id="TOC-BayesPeak"></a>BayesPeak</h4><p>BayesPeak is a peak calling package for identifying DNA binding sites of proteins in ChIP-Seq experiments. Its algorithm uses hidden Markov models (HMM) and Bayesian statistical methods. The following sample code introduces the identification of peaks with the BayesPeak package as well as the incorporation of read coverage information obtained by the chipseq package. <a href="http://bioconductor.org/packages/release/bioc/html/BayesPeak.html">Documentation</a> [ <a href="http://www.biomedcentral.com/1471-2105/10/299">Publication</a> ]</p><h4><a name="TOC-PICS" id="TOC-PICS"></a>PICS</h4><p>The PICS package applies probabilistic inference to aligned-read ChIP-Seq data in order to identify regions bound by transcription factors. PICS identifies enriched regions by modeling local concentrations of directional reads, and uses DNA fragment length prior information to discriminate closely adjacent binding events via a Bayesian hierarchical t-mixture model. The following sample code uses the test data set from the above BayesPeak package in order to compare the results from both methods by identifying their consensus peak set. <a href="http://www.bioconductor.org/packages/release/bioc/html/PICS.html">Documentation</a> [ <a href="http://www.hubmed.org/display.cgi?uids=20528864">Publication</a> ]</p><h4><a name="TOC-ChIPpeakAnno" id="TOC-ChIPpeakAnno"></a>ChIPpeakAnno</h4><p>The ChIPpeakAnno package provides. batch annotation of the peaks identified from either ChIP-seq or ChIP-chip experiments. It includes functions to retrieve the sequences around peaks, obtain enriched Gene Ontology (GO) terms, find the nearest gene, exon, miRNA or custom features such as most conserved elements and other transcription factor binding sites supplied by users. The package leverages the biomaRt, IRanges, Biostrings, BSgenome, GO.db, multtest and stat packages. <a href="http://bioconductor.org/packages/release/bioc/html/ChIPpeakAnno.html">Documentation</a></p><h4><a name="TOC-Additional-ChIP-Seq-Packages" id="TOC-Additional-ChIP-Seq-Packages"></a>Additional ChIP-Seq Packages</h4><p>DiffBind: <a href="http://www.bioconductor.org/packages/release/bioc/html/DiffBind.html">Documentation</a></p><p>MOSAICS: <a href="http://bioconductor.org/packages/devel/bioc/html/mosaics.html">Documentation</a></p><p>iSeq: <a href="http://bioconductor.org/packages/release/bioc/html/iSeq.html">Documentation</a></p><p>ChIPseqR: <a href="http://bioconductor.org/packages/release/bioc/html/ChIPseqR.html">Documentation</a></p><p>ChiPsim: <a href="http://bioconductor.org/packages/release/bioc/html/ChIPsim.html">Documentation</a></p><p>CSAR: <a href="http://www.bioconductor.org/packages/devel/bioc/html/CSAR.html">Documentation</a></p><p>ChIP-Seq Pipeline: <a href="http://www.bioconductor.org/packages/release/bioc/html/PICS.html">PICS</a>, rGADEM and MotIV (<a href="http://www.rglab.org/pics-and-bioconductor/">developer web site</a>)</p><p>SPP: <a href="http://compbio.med.harvard.edu/Supplements/ChIP-seq/">ChIP-seq processing pipeline</a></p><p><a href="http://compbio.med.harvard.edu/Supplements/ChIP-seq/tutorial.html">SPP Tutorial</a></p><p><a href="http://liulab.dfci.harvard.edu/MACS/index.html">MACS</a></p><p><a href="http://gmdd.shgmo.org/Computational-Biology/ChIP-Seq/download/SIPeS">SIPeS</a></p><h3><a name="TOC-RNA-Seq-Analysis" id="TOC-RNA-Seq-Analysis"></a>RNA-Seq Analysis</h3><h4><a name="TOC-Counting-Reads-that-Overlap-with-Annotation-Ranges-" id="TOC-Counting-Reads-that-Overlap-with-Annotation-Ranges-"></a>Counting Reads that Overlap with Annotation Ranges&nbsp;</h4><p>The GenomicRanges package provides support for importing into R short read alignment data in BAM format (via Rsamtools) and associating them with genomic feature ranges, such as exons or genes. This way one can quantify the number of reads aligning to annotated genomic regions. The package defines general purpose containers for storing genomic intervals as well as more specialized containers for storing alignments against a reference genome. The two main functions for read counting provided by this infrastructure are <span>countOverlaps <span style="color: #000000;"><span>and</span></span> summarizeOverlaps</span>. For their proper usage, it is important to read the corresponding <a href="http://www.bioconductor.org/packages/devel/bioc/vignettes/GenomicRanges/inst/doc/summarizeOverlaps.pdf">PDF manual</a>. <a href="http://bioconductor.org/packages/release/bioc/html/GenomicRanges.html">Documentation</a></p><h4><a name="TOC-Differential-Gene-Expression-Analysis-with-DESeq" id="TOC-Differential-Gene-Expression-Analysis-with-DESeq"></a>Differential Gene Expression Analysis with DESeq</h4><p>The DESeq package contains functions to call differentially expressed genes (DEGs) in count tables based on a model using the negative binomial distribution. It expects as input a data frame with the raw read counts per region/gene of interest (rows) for each test sample (columns).&nbsp; Such a count table can be imported into R or generated from BAM alignment files using the <span>countOverlaps</span> function as introduced above. <a href="http://www.bioconductor.org/packages/release/bioc/html/DESeq.html">Documentation</a></p><h4><a name="TOC-Differential-Gene-Expression-Analysis-with-edgeR" id="TOC-Differential-Gene-Expression-Analysis-with-edgeR"></a>Differential Gene Expression Analysis with edgeR</h4><p>The edgeR package uses empirical Bayes estimation and exact tests based on the negative binomial distribution to call differentially expressed genes (DEGs) in count data.&nbsp;</p>
<p><a href="http://www.bioconductor.org/packages/release/bioc/html/edgeR.html">Documentation</a></p>
<p><span style="color: #000000;">A variety of additional R packages are available for normalizing RNA-Seq read count data and identifying differentially expressed genes (DEG): <br /> </span></p><p><a href="http://bioconductor.org/packages/devel/bioc/html/easyRNASeq.html">easyRNASeq</a> (simplifies read counting per genome feature)</p><p><a href="http://www.bioconductor.org/packages/release/bioc/html/DEXSeq.html">DEXSeq</a> (Inference of differential exon usage);&nbsp;<a href="http://www.bioconductor.org/packages/release/data/experiment/html/parathyroidSE.html">parathyroidSE</a> explains how to generate exon read counts in R</p><p><a href="http://bioconductor.org/packages/release/bioc/html/DEGseq.html">DEGseq</a></p><p><a href="http://www.bioconductor.org/packages/release/bioc/html/baySeq.html">baySeq</a> (also see: <a href="http://www.bioconductor.org/packages/release/bioc/html/segmentSeq.html">segmentSeq</a>)</p><p><a href="http://bioconductor.org/packages/release/bioc/html/Genominator.html">Genominator</a> (<a href="http://www.hubmed.org/display.cgi?uids=20167110">Bullard et al. 2010</a>)</p><div style="text-align: right;"><div style="text-align: left;"><h4><a name="TOC-Detection-of-Alternative-Splice-Junctions" id="TOC-Detection-of-Alternative-Splice-Junctions"></a>Detection of Alternative Splice Junctions</h4>
<p><span style="color: #000000;">Another utility of RNA-Seq experiments is the analysis of splice junctions. The following software suggestions provide this utility:</span></p>
<p><a href="http://woldlab.caltech.edu/rnaseq/">ERANGE<br /> </a><a href="http://tophat.cbcb.umd.edu/">TopHat</a></p><p><a href="http://biogibbs.stanford.edu/%7Ekinfai/SpliceMap/">SpliceMap</a></p><p><a href="http://solidsoftwaretools.com/gf/project/splitseek/">SplitSeek</a></p><h3><a name="TOC-DNA-Methylation-Data-Analysis" id="TOC-DNA-Methylation-Data-Analysis"></a>DNA-Methylation Data Analysis</h3><div><ul>
<li><span style="font-size: 10pt;"><a href="http://www.bioconductor.org/help/course-materials/2012/BiocEurope2012/mattia_pelizzola_methylPipe.pdf">methylPipe</a></span></li>
<li><span style="font-size: 10pt;"><a href="http://www.bioconductor.org/packages/devel/bioc/html/bsseq.html">bsseq</a></span></li>
<li><a href="http://www.bioconductor.org/packages/devel/bioc/html/BiSeq.html">BiSeq</a></li>
<li>Much more under <a href="http://www.bioconductor.org/packages/devel/BiocViews.html#___DNAMethylation">BiocViews</a></li>
</ul></div></div></div><h3><a name="TOC-HT-Seq-Data-Visualization" id="TOC-HT-Seq-Data-Visualization"></a>HT-Seq Data Visualization</h3>
<p><a href="http://www.bioconductor.org/packages/release/bioc/html/ggbio.html">ggbio</a>: ggplot2 extension for genomics data (<a href="http://tengfei.github.com/ggbio/">online manual</a>) <a href="http://www.bioconductor.org/packages/devel/bioc/html/Gviz.html">Gviz</a>:&nbsp;Plotting data and annotation information along genomic coordinates <a href="http://bioconductor.org/packages/release/bioc/html/HilbertVis.html">HilbertVis</a>: Hilbert genome plots</p>
<p><a href="http://bioconductor.org/packages/release/bioc/html/GenomeGraphs.html">GenomeGraphs</a>: Plotting genomic information from Ensembl</p><p><a href="http://www.hubmed.org/display.cgi?uids=18507856">TileQC</a>: Flow Cell Quality Visualization</p><p><a href="http://bioconductor.org/packages/release/bioc/html/rtracklayer.html">rtracklayer</a>: R interface to genome browsers</p><p><a href="http://genoplotr.r-forge.r-project.org/">genoPlotR</a>: Plotting maps of genes and genomes</p><p><a href="http://bioconductor.org/packages/release/bioc/html/Genominator.html">Genominator</a>: Tools for storing, accessing, analyzing and visualizing genomic data.</p><p>&nbsp;</p><p>To install all packages</p><blockquote><p>source("http://bioconductor.org/biocLite.R")<br />biocLite()<br />biocLite(c("ShortRead", "Biostrings", "IRanges", "BSgenome", "rtracklayer", "biomaRt", "chipseq", "ChIPpeakAnno", "Rsamtools", "BayesPeak", "PICS", "GenomicRanges", "DESeq", "edgeR", "leeBamViews", "GenomicFeatures", "BSgenome.Celegans.UCSC.ce2"))</p></blockquote></div>]]></description>
	<dc:creator>John Parker</dc:creator>
</item>

</channel>
</rss>