<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/5307?</link>
	<atom:link href="https://bioinformaticsonline.com/related/5307?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</guid>
	<pubDate>Tue, 07 Aug 2018 10:01:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37509/vcftools-perform-common-tasks-with-vcf-files-such-as-file-validation-file-merging-intersecting-complements</link>
	<title><![CDATA[VCFtools: perform common tasks with VCF files such as file validation, file merging, intersecting, complements]]></title>
	<description><![CDATA[<p>VCFtools contains a Perl API (<a href="http://vcftools.sourceforge.net/perl_module.html#Vcf.pm">Vcf.pm</a>) and a number of Perl scripts that can be used to perform common tasks with VCF files such as file validation, file merging, intersecting, complements, etc. The Perl tools support all versions of the VCF specification (3.2, 3.3, 4.0, 4.1 and 4.2), nevertheless, the users are encouraged to use the latest versions VCFv4.1 or VCFv4.2. The VCFtools in general have been used mainly with diploid data, but the Perl tools aim to support polyploid data as well. Run any of the Perl scripts with the&nbsp;<strong>--help</strong>&nbsp;switch to obtain more help.</p>
<p>Many of the&nbsp;<strong>Perl scripts require that the VCF files are compressed by&nbsp;<span>bgzip</span>&nbsp;and indexed by&nbsp;<span>tabix</span></strong>&nbsp;(both tools are part of the tabix package, available for&nbsp;<a href="https://sourceforge.net/projects/samtools/files/tabix/">download here</a>). The VCF files can be compressed and indexed using the following commands</p>
<p>bgzip my_file.vcf<br>tabix -p vcf my_file.vcf.gz</p>
<p>&nbsp;</p>
<p>http://vcftools.sourceforge.net/perl_module.html</p><p>Address of the bookmark: <a href="http://vcftools.sourceforge.net/perl_module.html" rel="nofollow">http://vcftools.sourceforge.net/perl_module.html</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2727/download-mutliple-fasta-file-from-ncbi-in-one-go</guid>
	<pubDate>Wed, 21 Aug 2013 08:13:30 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2727/download-mutliple-fasta-file-from-ncbi-in-one-go</link>
	<title><![CDATA[Download mutliple fasta file from NCBI in one GO!!]]></title>
	<description><![CDATA[<p>if you have less time, then use three ways mentioned in bookmark link to extract/download all fasta sequences in single click given that you already have a list of GIs or accession IDs .</p>
<p>Alternatively, use one liner perl script:</p>
<p>perl -ne 'if(/^&gt;(\S+)/){$c=$i{$1}}$c?print:chomp;$i{$_}=1 if @ARGV' GIs.txt &gt;sequence.fasta</p>
<p>where GIs.txt contains&nbsp;a list of GIs or accession IDs.</p>
<p>(from :<a href="http://edwards.sdsu.edu/labsite/index.php/robert?start=5">http://edwards.sdsu.edu/labsite/index.php/robert?start=5</a>)</p><p>Address of the bookmark: <a href="http://edwards.sdsu.edu/labsite/index.php/robert/380-ncbi-sequence-or-fasta-batch-download-using-entrez" rel="nofollow">http://edwards.sdsu.edu/labsite/index.php/robert/380-ncbi-sequence-or-fasta-batch-download-using-entrez</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40871/nanopore-adaptor</guid>
	<pubDate>Mon, 03 Feb 2020 00:10:29 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40871/nanopore-adaptor</link>
	<title><![CDATA[Nanopore adaptor !]]></title>
	<description><![CDATA[<p>Porechop is a tool for finding and removing adapters from&nbsp;<a href="https://nanoporetech.com/">Oxford Nanopore</a>&nbsp;reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity.</p>
<p>Porechop also supports demultiplexing of Nanopore reads that were barcoded with the&nbsp;<a href="https://store.nanoporetech.com/native-barcoding-kit-1d.html">Native Barcoding Kit</a>,&nbsp;<a href="https://store.nanoporetech.com/pcr-barcoding-kit-96.html">PCR Barcoding Kit</a>&nbsp;or&nbsp;<a href="https://store.nanoporetech.com/rapid-barcoding-sequencing-kit.html">Rapid Barcoding Kit</a>.</p>
<p><span>The known Nanopore adapters that Porechop looks for are defined</span></p>
<p><a href="https://github.com/rrwick/Porechop/blob/master/porechop/adapters.py">https://github.com/rrwick/Porechop/blob/master/porechop/adapters.py</a></p>
<p>They are:</p>
<ul>
<li>Ligation kit adapters</li>
<li>Rapid kit adapters</li>
<li>PCR kit adapters</li>
<li>Barcodes</li>
<li>Native barcoding</li>
<li>Rapid barcoding</li>
</ul><p>Address of the bookmark: <a href="https://github.com/rrwick/Porechop/blob/master/porechop/adapters.py" rel="nofollow">https://github.com/rrwick/Porechop/blob/master/porechop/adapters.py</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/35144/converting-fastq-to-fasta</guid>
	<pubDate>Fri, 12 Jan 2018 03:49:09 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/35144/converting-fastq-to-fasta</link>
	<title><![CDATA[Converting FASTQ to FASTA]]></title>
	<description><![CDATA[<div id="block-system-main"><div><div><div><div><div><div><p>There are several ways you can convert fastq to fasta sequences. Some methods are listed below.</p><h3>Using SED</h3><p><span><code><span>sed</span></code></span>&nbsp;can be used to selectively print the desired lines from a file, so if you print the first and 2rd line of every 4 lines, you get the sequence header and sequence needed for fasta format.</p><pre>sed -n '1~4s/^@/&gt;/p;2~4p' INFILE.fastq &gt; OUTFILE.fasta
</pre><h3>Using PASTE</h3><p>You can linerize every 4 lines in a tabular format and print first and second field using&nbsp;<span><code>paste</code></span></p><pre>cat INFILE.fastq | paste - - - - |cut -f 1, 2| sed 's/@/&gt;/'g | tr -s "/t" "/n" &gt; OUTFILE.fasta
</pre><h3>EMBOSS:seqret</h3><p>Standard script that can be used for many purposes. One such use is fastq-fasta conversion</p><pre>seqret -sequence reads.fastq -outseq reads.fasta
</pre><p><span><code><span>awk</span></code></span>&nbsp;can be used for conversion as follows:</p><h3>Using AWK</h3><pre>cat infile.fq | awk '{if(NR%4==1) {printf("&gt;%s\n",substr($0,2));} else if(NR%4==2) print;}' &gt; file.fa
</pre><h3>FASTX-toolkit</h3><p><span><code>fastq_to_fasta</code></span>&nbsp;is available in the FASTX-toolkit that scales really well with the huge datasets</p><pre>fastq_to_fasta -h
usage: fastq_to_fasta [-h] [-r] [-n] [-v] [-z] [-i INFILE] [-o OUTFILE]
# Remember to use -Q33 for illumina reads!
version 0.0.6
       [-h]         = This helpful help screen.
       [-r]         = Rename sequence identifiers to numbers.
       [-n]         = keep sequences with unknown (N) nucleotides.
                   Default is to discard such sequences.
       [-v]         = Verbose - report number of sequences.
                   If [-o] is specified,  report will be printed to STDOUT.
                   If [-o] is not specified (and output goes to STDOUT),
                   report will be printed to STDERR.
       [-z]         = Compress output with GZIP.
       [-i INFILE]  = FASTA/Q input file. default is STDIN.
       [-o OUTFILE] = FASTA output file. default is STDOUT.
</pre><h3>Bioawk</h3><p>Another option to convert fastq to fasta format using&nbsp;<span><code>bioawk</code></span></p><pre>bioawk -c fastx '{print "&gt;"$name"\n"$seq}' input.fastq &gt; output.fasta
</pre><h3>Seqtk</h3><p>From the same developer, there is another option using a tool called&nbsp;<span><code>seqtk</code></span></p><pre>seqtk seq -a input.fastq &gt; output.fasta
</pre><p>Note that you can use either compressed or uncompressed files for this tool</p></div></div></div></div></div></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/396/bioinformatics-introduction-to-perl</guid>
	<pubDate>Thu, 11 Jul 2013 09:49:37 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/396/bioinformatics-introduction-to-perl</link>
	<title><![CDATA[Bioinformatics: Introduction to PERL]]></title>
	<description><![CDATA[<p>This course is aimed at those new to programming and provides an introduction to programming using <strong>Perl</strong>. By the end of this course, attendees should be able to write simple <strong>Perl</strong> programs and to understand more complex <strong>Perl</strong> programs written by others. The course will be taught using the online <a href="http://sofiarobb.com/learning-perl-toc/" title="http://sofiarobb.com/learning-perl-toc/">Learning Perl</a> materials created by <a href="http://stajich.bioinformatics.ucr.edu/members/sofia-robb" title="http://stajich.bioinformatics.ucr.edu/members/sofia-robb">Sofia Robb</a> of the <a href="http://www.ucr.edu/" title="http://www.ucr.edu/">University of California Riverside</a>. Further information is <a href="http://ruddles.bio.cam.ac.uk/%7Edpjudge/Descriptions/PERL.php" title="http://ruddles.bio.cam.ac.uk/~dpjudge/Descriptions/PERL.php">available</a>.</p>]]></description>
	<dc:creator>Archana Malhotra</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/1182/installing-perl-gd-module</guid>
	<pubDate>Mon, 22 Jul 2013 14:02:01 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/1182/installing-perl-gd-module</link>
	<title><![CDATA[Installing Perl GD Module]]></title>
	<description><![CDATA[<div><p>In comparative genome analysis work, we usually compare more than two genomes and looks for syntenic regions amongst them. In my research I used Evolution Highway (RH) <a href="http://eh-demo.ncsa.uiuc.edu/">http://eh-demo.ncsa.uiuc.edu/</a>, which is a collaborative project designed to provide a visual means for simultaneously comparing genomes of multiple amniote species. The tool removes the burden of manually aligning these maps and allows cognitive skills to be used toward something more valuable than preparation and transformation of data. In addition to EH, attractive Circos (<a href="http://circos.ca/">http://circos.ca/</a>) is also very popular for this kind of analysis.</p><p>The EH is available online, and can be easily access and use, whereas Circos installation is not entirely straightforward. One of the most difficult parts of the installation involves installing the GD library. Since there weren't good instructions for installing this library on the internet I decided to post instructions here in case they are useful to anyone else.</p><p><strong>Following are the steps to install GD modules in Mac OS</strong><br /><br />1. Setup<br /><br />Create a folder for the files:<br /><br />$ mkdir -p /SourceCache<br />$ cd /SourceCache<br /><br />Get and unpack the required Jpeg-6b and GD libraries:<br />Download Jpeg-6b (<a href="http://code.google.com/p/google-desktop-for-linux-mirror/downloads/detail?name=jpeg-6b.tar.gz&amp;can=2&amp;q">http://code.google.com/p/google-desktop-for-linux-mirror/downloads/detail?name=jpeg-6b.tar.gz&amp;can=2&amp;q</a>)<br />Download GD (<a href="http://search.cpan.org/%7Elds/GD-2.46/">http://search.cpan.org/~lds/GD-2.46/</a>)<br /><br />Place the "tar.gz" files in "/SourceCache" and double click to unpack.<br /><br />2. Install libjpeg<br /><br />Copy the "config.sub" and "config.guess" files to "/SourceCache". Note that your "config.sub" and ""config.guess" files may be in a slightly different location. The commands below show where they were on my machine:<br /><br />$ cd /SourceCache/jpeg-6b/src<br />$ cp /usr/share/libtool/config/config.sub .<br />$ cp /usr/share/libtool/config/config.guess .<br /><br />Configure libjpeg as follows. Note that this was installed on a 64 bit machine. However, this method may configure it in a 32 bit format. This may not be the best way to configure the installation but it works.<br /><br />$ .configure --enable-shared<br />$ make<br /><br />Check to see if the following directories exist on your machine. Create the missing directories in the following manner:<br /><br />$ mkdir -p /usr/local/include<br />$ mkdir -p /usr/local/bin<br />$ mkdir -p /usr/local/lib<br />$ mkdir -p /usr/local/man/man1<br /><br />Finish making and installing libjpeg:<br /><br />$ make install<br /><br />3. Install GD<br /><br />$ cd /SourceCache/GD-2.46/GD/<br />$ perl Makefile.PL<br />$ make<br />$ make test (optional)<br />$ make html (optional)<br />$ make install</p><p><strong>Other way for Mac OS</strong><br />The easiest way to get a lot of these is with a program called Fink, which is similar in nature to the CPAN installer, but installs common GNU utilities. Fink is available from &lt;<a href="http://sourceforge.net/projects/fink/%3E">http://sourceforge.net/projects/fink/&gt;</a>.<br /><br />Follow the instructions for setting up Fink. Once it's installed, you'll want to run the following as root: fink install gd<br /><br />It will prompt you for a number of dependencies, type 'y' and hit enter to install all of the dependencies. Then watch it work.<br /><br />To prevent creating conflicts with the software that Apple installs by default, Fink creates its own directory tree at /sw where it installs most of the software that it installs. This means your libraries and headers for libgd will be at /sw/lib and /sw/include instead of /usr/lib and /usr/local/include. Because of these changed locations for the libraries, the Perl GD module will not install directly via CPAN, because it looks for the specific paths instead of getting them from your environment. But there's a way around that :-)<br /><br />Instead of typing "install GD" at the cpan&gt; prompt, type look GD. This should go through the motions of downloading the latest version of the GD module, then it will open a shell and drop you into the build directory. Apply below patch to the Makefile.PL file (save the patch into a file and use the command patch &lt; patchfile.)<br /><br />Then, run these commands to finish the installation of the GD module:<br /><br />perl Makefile.PL<br />make<br />make test<br />make install<br />And don't forget to run exit to get back to CPAN.</p><p>&nbsp;</p><p><strong>Install on MS Window, using PPM</strong></p><p>C:\Documents and Settings\Owner&gt;ppm<br />PPM interactive shell (2.2.0) - type 'help' for available commands.<br />PPM&gt; install GD<br />Install package 'GD?' (y/N): y<br />Installing package 'GD'...<br />Downloading <a href="http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW">http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW</a>. ...<br />Installing C:\Perl\site\lib\auto\GD\GD.bs<br />Installing C:\Perl\site\lib\auto\GD\GD.dll<br />Installing C:\Perl\site\lib\auto\GD\GD.exp<br />Installing C:\Perl\site\lib\auto\GD\GD.lib<br />Installing C:\Perl\html\site\lib\GD.html<br />Installing C:\Perl\site\lib\GD.pm<br />Installing C:\Perl\site\lib\qd.pl<br />Installing C:\Perl\site\lib\auto\GD\autosplit.ix<br />PPM&gt;<br /><br /><br />If you can't install it from ppm. You can download it:<br /><a href="http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW">http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW</a>.<br /><br /><br />BTW,All Perl 5.6.1 Modules are located at:<br /><br /><a href="http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW">http://ppm.ActiveState.com/PPMPackages/5.6plus/MSW</a>.</p><p>&nbsp;</p><p><strong>Install the Perl GD Module on Linux</strong><br /><br />$ sudo perl -MCPAN -e shell<br /><br />Since it was the first time I had run this command on this particular machine I had to answer a lot of questions but simply selected the defaults for everything as this usually works for me. Once in the CPAN shell I entered<br /><br />$ install Bundle::CPAN<br /><br />and selected all of the defaults again. Once the CPAN bundle had finished installing I tried to install GD::Graph by typing<br /><br />$ install GD::Graph<br /><br />but it failed with hundreds of errors &ndash; the first of which was<br /><br />GD.xs:7:16: error: gd.h: No such file or directory<br /><br />This was fixed with the following apt-get command (in the bash shell)<br /><br />$ sudo apt-get install libgd2-xpm-dev<br /><br />back in the CPAN shell I still couldn&rsquo;t get GD::Graph to build and I guessed this was because of some left over files from the failed build. I don&rsquo;t know the command to clean things up inside the CPAN shell and am too lazy to read the docs so I simply went into the .cpan/build directory in my home directory and deleted anything that started with GD &ndash; eg<br /><br />$ rm -rf GD-2.35-HC_vkB<br /><br />$ rm -rf GDGraph-1.44-Evfibe<br /><br />and so on. Those strings at the end (VkB and so on) look random so they might be different on your machine. Then I went back into the CPAN shell and ran<br /><br />$ install GD::Graph<br /><br />There were a few dependencies which the script fetched and installed for me but everything worked smoothly.</p><p>Manual and other Perl Module instalation are mentioned in my previous blog @ <a href="http://bioinformaticsonline.com/blog/view/710/how-to-install-perl-modules-manually-using-cpan-command-and-other-quick-ways">http://bioinformaticsonline.com/blog/view/710/how-to-install-perl-modules-manually-using-cpan-command-and-other-quick-ways</a></p></div>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2376/citrus-perl</guid>
	<pubDate>Wed, 14 Aug 2013 14:57:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2376/citrus-perl</link>
	<title><![CDATA[Citrus Perl]]></title>
	<description><![CDATA[<p>Citrus Perl is a binary distribution of Perl created for GUI application developers. The distribution includes <a href="http://wxperl.sourceforge.net">wxPerl</a>, the Perl wrapper for <a href="http://www.wxwidgets.org">wxWidgets</a>. Where supported by the operating system wxWidgets is available as a package for the 2.8.x stable branch and the 2.9.x development branch.</p><p>Address of the bookmark: <a href="http://www.citrusperl.com/" rel="nofollow">http://www.citrusperl.com/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/4037/perl-and-bioperl-tutorials</guid>
	<pubDate>Wed, 28 Aug 2013 05:51:38 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/4037/perl-and-bioperl-tutorials</link>
	<title><![CDATA[Perl and BioPerl Tutorials]]></title>
	<description><![CDATA[<p>This bookmark is created to store the useful Perl and BioPerl tutorial links at one place. Feel free to share and add more useful tutorial links here ....&nbsp;</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="http://cbb.sjtu.edu.cn/course/database/beginning.pdf" rel="nofollow">http://cbb.sjtu.edu.cn/course/database/beginning.pdf</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/19792/irishgrid-irish-grid-mapping-system</guid>
	<pubDate>Fri, 26 Dec 2014 07:53:24 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/19792/irishgrid-irish-grid-mapping-system</link>
	<title><![CDATA[irishgrid: Irish Grid Mapping System]]></title>
	<description><![CDATA[<p>Perl module for creating geographic 10km-square maps using either SVG or PNG (with GD library) output format.</p>
<p>Originally design to map the location of objects in a 10 km map IrishGrid includes:</p>
<ul>
<li>native support of the Irish Grid System (see <a href="http://www.osi.ie/">http://www.osi.ie/</a>)</li>
<li>optimize for speed (there's as less as possible data to conversion)</li>
<li>customized color functions</li>
</ul>
<p>https://code.google.com/p/irishgrid/downloads/detail?name=irishgrid.pl</p><p>Address of the bookmark: <a href="https://code.google.com/p/irishgrid/" rel="nofollow">https://code.google.com/p/irishgrid/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/22567/rosalind-problem-solution-with-perl</guid>
	<pubDate>Tue, 09 Jun 2015 23:35:18 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/22567/rosalind-problem-solution-with-perl</link>
	<title><![CDATA[Rosalind Problem Solution with Perl]]></title>
	<description><![CDATA[<p>Rosalind is a platform for learning bioinformatics and programming through problem solving. <a href="http://rosalind.info/problems/list-view/?location=bioinformatics-textbook-track">Take a tour</a> to get the hang of how Rosalind works.</p><p>Bioinformatics Textbook Track</p><p>Find more about Rosalind puzzle at http://rosalind.info/problems/list-view/?location=bioinformatics-textbook-track</p><p>I will provide solution of all the Rosalind problem with Perl for community.</p><p>Check out the right sidebar for more links ...</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>