<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: All]]></title>
	<link>https://bioinformaticsonline.com/snippets?offset=210</link>
	<atom:link href="https://bioinformaticsonline.com/snippets?offset=210" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/42027/bash-script-to-get-intergenic-region-from-genome-files</guid>
	<pubDate>Sat, 08 Aug 2020 20:09:51 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/42027/bash-script-to-get-intergenic-region-from-genome-files</link>
	<title><![CDATA[Bash script to get intergenic region from genome files !]]></title>
	<description><![CDATA[<code>#For the intergenic region, we will require the size of the chromosomes.

wget http://xxx.chrom.sizes
cat xxx.chrom.sizes | sed &#039;s/^chr//&#039; | sed &#039;s/Cp/Pt/&#039; &gt; tmp
mv tmp xxx.chrom.sizes

gunzip -c genome_file.gtf.gz |
awk &#039;BEGIN{OFS=&quot;\t&quot;;} $3==&quot;gene&quot; {print $1,$4-1,$5}&#039; |
bedtools sort -g xxx.chrom.sizes |
bedtools complement -i stdin -g xxx.chrom.sizes |
gzip &gt; my_intergenic.bed.gz</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/42026/bash-script-to-extract-intronic-fragments</guid>
	<pubDate>Sat, 08 Aug 2020 20:07:46 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/42026/bash-script-to-extract-intronic-fragments</link>
	<title><![CDATA[Bash script to extract intronic fragments !]]></title>
	<description><![CDATA[<code>#To obtain introns, we simply need the gene and exonic coordinates; 
#by subtracting the exonic regions from the genic region, we have the intronic region.

gunzip -c genome_file.gtf.gz |
awk &#039;BEGIN{OFS=&quot;\t&quot;;} $3==&quot;gene&quot; {print $1,$4-1,$5}&#039; |
bedtools sort |
bedtools subtract -a stdin -b my_exon.bed.gz |
gzip &gt; my_intron.bed.gz</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/42025/bash-script-to-get-exon-fragments-from-genome-files</guid>
	<pubDate>Sat, 08 Aug 2020 20:05:53 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/42025/bash-script-to-get-exon-fragments-from-genome-files</link>
	<title><![CDATA[Bash script to get exon fragments from genome files !]]></title>
	<description><![CDATA[<code>#Exons are already defined in the GTF file, so we simply need to print lines that are marked exonic.

gunzip -c genome_file.gtf.gz |
awk &#039;BEGIN{OFS=&quot;\t&quot;;} $3==&quot;exon&quot; {print $1,$4-1,$5}&#039; |
bedtools sort |
bedtools merge -i - | gzip &gt; my_exon.bed.gz</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/42007/script-to-extract-the-cluster-detail</guid>
	<pubDate>Mon, 27 Jul 2020 00:20:25 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/42007/script-to-extract-the-cluster-detail</link>
	<title><![CDATA[Script to extract the cluster detail !]]></title>
	<description><![CDATA[<code>$ lsb_release -a
No LSB modules are available.
Distributor ID:	Ubuntu
Description:	Ubuntu 18.04.1 LTS
Release:	18.04
Codename:	bionic
$ cat /proc/cpuinfo | grep -i &#039;model name&#039; | head -n 1
model name	: Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz</code>]]></description>
	<dc:creator>BioJoker</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41986/perl-one-liner-to-print-only-non-uppercase-letters</guid>
	<pubDate>Tue, 21 Jul 2020 21:25:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41986/perl-one-liner-to-print-only-non-uppercase-letters</link>
	<title><![CDATA[Perl One-Liner to print only non-uppercase letters]]></title>
	<description><![CDATA[<code>#Go through file and only print words that do not have any uppercase letters.
perl -ne &#039;print unless m/[A-Z]/&#039; dna.fa &gt; dnaOnlyLowercase.fa

#To lowercase everything 
perl -pne &#039;tr/[A-Z]/[a-z]/&#039; dnaUpperCase.fa &gt;dnawithoutuppercase.fa;</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41974/command-to-sort-the-bed-file</guid>
	<pubDate>Thu, 16 Jul 2020 07:51:42 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41974/command-to-sort-the-bed-file</link>
	<title><![CDATA[Command to sort the bed file !]]></title>
	<description><![CDATA[<code>#Command to sort the bed file

sort -V -k1,1 -k2,2 test.bed</code>]]></description>
	<dc:creator>BioHack</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41967/reformat-the-multifasta-for-sequence-length</guid>
	<pubDate>Mon, 13 Jul 2020 08:43:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41967/reformat-the-multifasta-for-sequence-length</link>
	<title><![CDATA[Reformat the multifasta for sequence length !]]></title>
	<description><![CDATA[<code>#awk oneliner to reformat the multifasta sequences

awk &#039;!/^&gt;/ {printf &quot;%s&quot;, $0; n = &quot;\n&quot;} /^&gt;/ {print n $0; n = &quot;&quot;}&#039; file.fasta | fold -w 100</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41964/get-gc-across-the-entire-cds</guid>
	<pubDate>Sun, 12 Jul 2020 05:30:24 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41964/get-gc-across-the-entire-cds</link>
	<title><![CDATA[get GC across the entire CDS !]]></title>
	<description><![CDATA[<code>#look at GC across the entire CDS.

gffread -x - -g &lt;genome.fasta&gt; &lt;annotation.gff&gt; | \
seqtk comp - | \
awk -v OFS=&quot;\t&quot; &#039;{ print $1, &quot;0&quot;, $2, ($4 + $5) / $2 }&#039;</code>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41947/onliner-to-split-the-multifasta-to-singlefasta-files</guid>
	<pubDate>Sat, 04 Jul 2020 22:09:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41947/onliner-to-split-the-multifasta-to-singlefasta-files</link>
	<title><![CDATA[Onliner to split the multifasta to singlefasta files !]]></title>
	<description><![CDATA[<code>#Split the multifasta to singlefasta 
# Multi fasta 
#Single fasta

awk &#039;$0 ~ &quot;^&gt;&quot; { match($1, /^&gt;([^:]+)/, id); filename=id[1]} {print &gt;&gt; filename&quot;.fa&quot;}&#039; sequence.fasta</code>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/snippets/view/41932/sequence-ids-conversion-files</guid>
	<pubDate>Fri, 03 Jul 2020 05:20:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/snippets/view/41932/sequence-ids-conversion-files</link>
	<title><![CDATA[Sequence Ids conversion files !]]></title>
	<description><![CDATA[<code>ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/

Name	Size	Date Modified
ARCHIVE/		02/01/2020, 05:30:00
ASN_BINARY/		03/07/2020, 07:49:00
GENE_INFO/		03/07/2020, 07:48:00
0 B	10/02/2012, 05:30:00
15.1 kB	30/06/2020, 23:01:00
expression/		06/03/2017, 05:30:00
2.0 GB	03/07/2020, 07:44:00
61.8 MB	03/07/2020, 07:44:00
21.4 MB	03/07/2020, 07:44:00
45.1 MB	03/07/2020, 07:44:00
864 MB	03/07/2020, 07:45:00
279 kB	03/07/2020, 07:45:00
83.4 MB	03/07/2020, 07:45:00
572 MB	03/07/2020, 07:46:00
715 MB	03/07/2020, 07:47:00
30.2 MB	03/07/2020, 07:47:00
232 MB	03/07/2020, 14:38:00
1.2 kB	06/09/2011, 05:30:00
11.6 kB	16/05/2020, 01:32:00
770 kB	03/07/2020, 14:38:00
special_requests/		18/04/2020, 00:15:00
737 B	09/06/2011, 05:30:00

ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2go.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2accession.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2ensembl.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2pubmed.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2refseq.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_group.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_history.gz
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_neighbors.gz</code>]]></description>
	<dc:creator>Surabhi Chaudhary</dc:creator>
</item>

</channel>
</rss>