<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Rahul Nayak's bookmarks]]></title>
	<link>https://bioinformaticsonline.com/bookmarks/owner/rahul?offset=0</link>
	<atom:link href="https://bioinformaticsonline.com/bookmarks/owner/rahul?offset=0" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43863/snakemake-tutorials</guid>
	<pubDate>Mon, 09 May 2022 05:20:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43863/snakemake-tutorials</link>
	<title><![CDATA[Snakemake Tutorials !]]></title>
	<description><![CDATA[<p>A lesson introducing the Snakemake workflow system for bioinformatics analysis.</p>
<blockquote>
<h2 id="prerequisites">Prerequisites<a href="https://carpentries-incubator.github.io/snakemake-novice-bioinformatics/index.html#prerequisites"></a></h2>
<p>This is an intermediate lesson and assumes learners have already done some bioinformatics:</p>
<ul>
<li>Familiarity with the BASH command shell, including concepts like pipes, variables and loops.</li>
<li>Knowledge of bioinformatics fundamentals like the FASTQ file format and transcriptome sequencing, in order to understand the example workflow.</li>
</ul>
<p>No previous knowledge of Snakemake or workflow systems is required.</p>
<p>https://carpentries-incubator.github.io/snakemake-novice-bioinformatics/index.html</p>
</blockquote><p>Address of the bookmark: <a href="https://carpentries-incubator.github.io/snakemake-novice-bioinformatics/aio/index.html" rel="nofollow">https://carpentries-incubator.github.io/snakemake-novice-bioinformatics/aio/index.html</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43859/mumco-is-a-simple-bash-script-that-uses-whole-genome-alignment-information-provided-by-mummer-v4-to-detect-variants</guid>
	<pubDate>Wed, 27 Apr 2022 04:34:12 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43859/mumco-is-a-simple-bash-script-that-uses-whole-genome-alignment-information-provided-by-mummer-v4-to-detect-variants</link>
	<title><![CDATA[MUM&amp;Co is a simple bash script that uses Whole Genome Alignment information provided by MUMmer (v4) to detect variants.]]></title>
	<description><![CDATA[<p dir="auto">MUM&amp;Co is able to detect:<br>Deletions, insertions, tandem duplications and tandem contractions (&gt;=50bp &amp; &lt;=150kb)<br>Inversions (&gt;=1kb) and translocations (&gt;=10kb)</p><p>Address of the bookmark: <a href="https://github.com/SAMtoBAM/MUMandCo" rel="nofollow">https://github.com/SAMtoBAM/MUMandCo</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43856/puffaligner-a-fast-efficient-and-accurate-aligner-based-on-the-pufferfish-index</guid>
	<pubDate>Thu, 21 Apr 2022 05:41:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43856/puffaligner-a-fast-efficient-and-accurate-aligner-based-on-the-pufferfish-index</link>
	<title><![CDATA[PuffAligner: a fast, efficient and accurate aligner based on the Pufferfish index]]></title>
	<description><![CDATA[<p><span>PuffAligner, a fast, accurate and versatile aligner built on top of the Pufferfish index. PuffAligner is able to produce highly sensitive alignments, similar to those of Bowtie2, but much more quickly. While exhibiting similar speed to the ultrafast STAR aligner, PuffAligner requires considerably less memory to construct its index and align reads. PuffAligner strikes a desirable balance with respect to the time, space and accuracy tradeoffs made by different alignment tools and provides a promising foundation on which to test new alignment ideas over large collections of sequences.</span></p><p>Address of the bookmark: <a href="https://github.com/COMBINE-lab/pufferfish/tree/cigar-strings" rel="nofollow">https://github.com/COMBINE-lab/pufferfish/tree/cigar-strings</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43828/understanding-hifi-reads</guid>
	<pubDate>Thu, 24 Mar 2022 19:48:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43828/understanding-hifi-reads</link>
	<title><![CDATA[Understanding HiFi Reads !]]></title>
	<description><![CDATA[<p><span>While little public data is available for either of the new synthetic long read approaches, Illumina showed an example comparison earlier this year at the&nbsp;</span><a href="https://www.festivalofgenomics.com/rami-mehio" target="_blank">Festival of Genomics &amp; Biodata conference</a><span>&nbsp;(FoG 2022). In the IGV screenshot presented (below), synthetic Infinity reads &ndash; labeled &ldquo;Longas&rdquo; &ndash; are at the top, followed by standard Illumina short reads, and PacBio HiFi reads labeled &ldquo;CCS&rdquo; depicted at the bottom:</span></p><p>Address of the bookmark: <a href="http://pacb.com/blog/the-hifi-difference-true-long-reads-vs-synthetic-long-reads/" rel="nofollow">http://pacb.com/blog/the-hifi-difference-true-long-reads-vs-synthetic-long-reads/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43826/tiara-deep-learning-based-classification-system-for-eukaryotic-sequences</guid>
	<pubDate>Mon, 14 Mar 2022 23:02:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43826/tiara-deep-learning-based-classification-system-for-eukaryotic-sequences</link>
	<title><![CDATA[Tiara: deep learning-based classification system for eukaryotic sequences]]></title>
	<description><![CDATA[<p><span>With a large number of metagenomic datasets becoming available, eukaryotic metagenomics emerged as a new challenge. The proper classification of eukaryotic nuclear and organellar genomes is an essential step toward a better understanding of eukaryotic diversity.</span></p><p>Address of the bookmark: <a href="https://academic.oup.com/bioinformatics/article/38/2/344/6375939" rel="nofollow">https://academic.oup.com/bioinformatics/article/38/2/344/6375939</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43815/kebabs-package-provides-functionality-for-kernel-based-analysis-of-biological-sequences-via-support-vector-machine-svm-based-methods</guid>
	<pubDate>Fri, 04 Mar 2022 00:14:11 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43815/kebabs-package-provides-functionality-for-kernel-based-analysis-of-biological-sequences-via-support-vector-machine-svm-based-methods</link>
	<title><![CDATA[kebabs: package provides functionality for kernel based analysis of biological sequences via Support Vector Machine (SVM) based methods]]></title>
	<description><![CDATA[<p><span>The&nbsp;</span><tt>kebabs</tt><span>&nbsp;package provides functionality for kernel based analysis of biological sequences via Support Vector Machine (SVM) based methods. Biological sequences include DNA, RNA, and amino acid (AA) sequences. Sequence kernels define similarity measures between sequences. The package implements some of the most important kernels for sequence analysis in a very flexible and efficient way and extends the standard position-independent functionality of these kernels in a novel way to take the position of patterns in the sequences into account for the similarity measure.</span></p>
<p>http://www.bioinf.jku.at/software/kebabs/</p>
<p>http://bioconductor.org/packages/release/bioc/vignettes/kebabs/inst/doc/kebabs.pdf</p><p>Address of the bookmark: <a href="http://www.bioinf.jku.at/software/kebabs/" rel="nofollow">http://www.bioinf.jku.at/software/kebabs/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43810/seqfu-a-suite-of-utilities-for-the-robust-and-reproducible-manipulation-of-sequence-files</guid>
	<pubDate>Tue, 01 Mar 2022 03:13:33 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43810/seqfu-a-suite-of-utilities-for-the-robust-and-reproducible-manipulation-of-sequence-files</link>
	<title><![CDATA[SeqFu: A Suite of Utilities for the Robust and Reproducible Manipulation of Sequence Files]]></title>
	<description><![CDATA[<p>A general-purpose program to manipulate and parse information from FASTA/FASTQ files, supporting gzipped input files. Includes functions to&nbsp;<em>interleave</em>&nbsp;and&nbsp;<em>de-interleave</em>&nbsp;FASTQ files, to&nbsp;<em>rename</em>&nbsp;sequences and to&nbsp;<em>count</em>&nbsp;and print&nbsp;<em>statistics</em>&nbsp;on sequence lengths. SeqFu is available for Linux and MacOS.</p>
<ul>
<li>A compiled program delivering high performance analyses</li>
<li>Supports FASTA/FASTQ files, also Gzip compressed</li>
<li>A growing collection of handy utilities, also for quick inspection of the datasets</li>
</ul>
<p>Can be easily&nbsp;<a href="https://telatin.github.io/seqfu2/installation">installed</a>&nbsp;via conda:</p>
<div>
<div>
<pre><code>conda <span>install</span> <span>-c</span> bioconda seqfu</code></pre>
</div>
</div><p>Address of the bookmark: <a href="https://telatin.github.io/seqfu2/" rel="nofollow">https://telatin.github.io/seqfu2/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43725/comparative-genomics-workshops</guid>
	<pubDate>Tue, 25 Jan 2022 20:39:58 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43725/comparative-genomics-workshops</link>
	<title><![CDATA[Comparative Genomics Workshops !]]></title>
	<description><![CDATA[<p><span>This meeting's objective was to obtain a big picture look at the current state of the field of comparative&nbsp;genomics with a focus on commonalities across genomic investigations into humans, model organisms&nbsp;(both traditional and non-traditional), agricultural species, wildlife species and microbes.</span></p>
<p>https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution</p><p>Address of the bookmark: <a href="https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution" rel="nofollow">https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43722/crossmap-program-for-genome-coordinates-conversion-between-different-assemblies</guid>
	<pubDate>Tue, 25 Jan 2022 17:59:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43722/crossmap-program-for-genome-coordinates-conversion-between-different-assemblies</link>
	<title><![CDATA[CrossMap: program for genome coordinates conversion between different assemblies]]></title>
	<description><![CDATA[<p><span>CrossMap is a program for genome coordinates conversion between&nbsp;</span><em>different assemblies</em><span>&nbsp;(such as&nbsp;</span><a href="http://www.ncbi.nlm.nih.gov/assembly/2928/">hg18 (NCBI36)</a><span>&nbsp;&lt;=&gt;&nbsp;</span><a href="http://www.ncbi.nlm.nih.gov/assembly/2758/">hg19 (GRCh37)</a><span>). It supports commonly used file formats including&nbsp;</span><a href="https://samtools.github.io/hts-specs/SAMv1.pdf">BAM</a><span>,&nbsp;</span><a href="https://en.wikipedia.org/wiki/CRAM_(file_format)">CRAM</a><span>,&nbsp;</span><a href="https://en.wikipedia.org/wiki/SAM_(file_format)">SAM</a><span>,&nbsp;</span><a href="https://genome.ucsc.edu/goldenPath/help/wiggle.html">Wiggle</a><span>,&nbsp;</span><a href="https://genome.ucsc.edu/goldenPath/help/bigWig.html">BigWig</a><span>,&nbsp;</span><a href="https://genome.ucsc.edu/FAQ/FAQformat.html#format1">BED</a><span>,&nbsp;</span><a href="https://genome.ucsc.edu/FAQ/FAQformat.html#format3">GFF</a><span>,&nbsp;</span><a href="https://genome.ucsc.edu/FAQ/FAQformat.html#format4">GTF</a><span>,&nbsp;</span><a href="https://docs.gdc.cancer.gov/Data/File_Formats/MAF_Format/">MAF</a><span>&nbsp;</span><a href="https://samtools.github.io/hts-specs/VCFv4.2.pdf">VCF</a><span>, and&nbsp;</span><a href="https://sites.google.com/site/gvcftools/home/about-gvcf">gVCF</a><span>.</span></p><p>Address of the bookmark: <a href="http://crossmap.sourceforge.net/" rel="nofollow">http://crossmap.sourceforge.net/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43714/hiv-genome-database</guid>
	<pubDate>Fri, 21 Jan 2022 05:40:15 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43714/hiv-genome-database</link>
	<title><![CDATA[HIV genome database !]]></title>
	<description><![CDATA[<p>HIV resources</p>
<p>https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html</p><p>Address of the bookmark: <a href="https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html" rel="nofollow">https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>