<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/38472?offset=1310</link>
	<atom:link href="https://bioinformaticsonline.com/related/38472?offset=1310" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/1467/biopython-cookbook</guid>
	<pubDate>Thu, 08 Aug 2013 06:43:02 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/1467/biopython-cookbook</link>
	<title><![CDATA[BioPython Cookbook]]></title>
	<description><![CDATA[<p>If you are planning to start learning BioPython ( it does not bite but&nbsp;swallow :P just kidding) then this online cookbook will be really helpful for you.</p><p>http://biopython.org/DIST/docs/tutorial/Tutorial.html</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/32730/ncbi-prokaryotic-genome-annotation-pipeline</guid>
	<pubDate>Tue, 16 May 2017 08:56:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/32730/ncbi-prokaryotic-genome-annotation-pipeline</link>
	<title><![CDATA[NCBI Prokaryotic Genome Annotation Pipeline]]></title>
	<description><![CDATA[<p>NCBI Prokaryotic Genome Annotation Pipeline is designed to annotate bacterial and archaeal genomes (chromosomes and plasmids).</p>
<p>Genome annotation is a multi-level process that includes prediction of protein-coding genes, as well as other functional genome units such as structural RNAs, tRNAs, small RNAs, pseudogenes, control regions, direct and inverted repeats, insertion sequences, transposons and other mobile elements.</p>
<p>NCBI has developed an automatic prokaryotic genome annotation pipeline that combines&nbsp;<em>ab initio</em>&nbsp;gene prediction algorithms with homology based methods. The first version of NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP;&nbsp;<a href="https://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=pubmed&amp;dopt=Abstract&amp;list_uids=18416670">see Pubmed Article</a>) developed in 2005 has been replaced with an upgraded version that is capable of processing a larger data volume. You can find a more detailed description of the new version of&nbsp;the pipeline in&nbsp;<a href="https://www.ncbi.nlm.nih.gov/books/NBK174280/">NCBI Handbook chapter</a>. NCBI's annotation pipeline depends on several internal databases and is not currently available for download or use outside of the NCBI environment.</p>
<p>https://www.ncbi.nlm.nih.gov/genome/annotation_prok/</p><p>Address of the bookmark: <a href="https://www.ncbi.nlm.nih.gov/genome/annotation_prok/" rel="nofollow">https://www.ncbi.nlm.nih.gov/genome/annotation_prok/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/1514/list-of-pharmacogenomics-companies-worldwide</guid>
	<pubDate>Fri, 09 Aug 2013 13:24:47 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/1514/list-of-pharmacogenomics-companies-worldwide</link>
	<title><![CDATA[List of pharmacogenomics companies worldwide]]></title>
	<description><![CDATA[<div><div><p>Pharmacogenomics are the most promising area of research. Here is the list of some Pharmacogenomics companies worldwide. Feel free to add more pharmacogenomics companies if not mentioned in here.</p><p>Great Pharmacogenomics companies <br /><a href="http://www.aruplab.com/">www.aruplab.com</a> <br /><a href="http://www.clarientinc.com/">www.clarientinc.com</a> <br /><a href="http://www.cns-hts.com/">www.cns-hts.com</a> <br /><a href="http://www.dnanow.com/">www.dnanow.com</a> <br /><a href="http://www.dnavision.be/">www.dnavision.be</a> <br /><a href="http://www.dnavision.com/">www.dnavision.com</a> <br /><a href="http://www.dxsdiagnostics.com/">www.dxsdiagnostics.com</a> <br /><a href="http://www.entrogen.com/">www.entrogen.com</a> <br /><a href="http://www.exiqon.com/">www.exiqon.com</a> <br /><a href="http://www.gene.com/">www.gene.com</a> <br /><a href="http://www.genomichealth.com/">www.genomichealth.com</a> <br /><a href="http://www.genoptix.com/">www.genoptix.com</a> <br /><a href="http://www.genpathdiagnostics.com/">www.genpathdiagnostics.com</a> <br /><a href="http://www.gentris.com/">www.gentris.com</a> <br /><a href="http://www.immunicon.com/">www.immunicon.com</a> <br /><a href="http://www.ingenuity.com/">www.ingenuity.com</a> <br /><a href="http://www.lab21.com/">www.lab21.com</a> <br /><a href="http://www.labcorp.com/">www.labcorp.com</a> <br /><a href="http://www.lion-ag.de/">www.lion-ag.de</a> <br /><a href="http://www.lynxgen.com/">www.lynxgen.com</a> <br /><a href="http://www.mayoclinic.com/">www.mayoclinic.com</a> <br /><a href="http://www.mesoscale.com/">www.mesoscale.com</a> <br /><a href="http://www.microcide.com/">www.microcide.com</a> <br /><a href="http://www.mitokor.com/">www.mitokor.com </a> <br /><a href="http://www.monarchlifesciences.com/">www.monarchlifesciences.com</a> <br /><a href="http://www.mplnet.com/">www.mplnet.com</a> <br /><a href="http://www.orchidbio.com/">www.orchidbio.com</a> <br /><a href="http://www.pebio.com/">www.pebio.com</a> <br /><a href="http://www.phenomenome.com/">www.phenomenome.com</a> <br /><a href="http://www.phenopath.com/">www.phenopath.com</a> <br /><a href="http://www.ppgx.com/">www.ppgx.com</a> <br /><a href="http://www.prometheuslabs.com/">www.prometheuslabs.com</a> <br /><a href="http://www.protogene.com/">www.protogene.com</a> <br /><a href="http://www.questdiagnostics.com/">www.questdiagnostics.com</a> <br /><a href="http://www.rigelinc.com/">www.rigelinc.com</a> <br /><a href="http://www.rii.com/">www.rii.com</a> <br /><a href="http://www.saladax.com/">www.saladax.com</a> <br /><a href="http://www.tmdlab.com/">www.tmdlab.com</a> <br /><a href="http://www.transgenomic.com/">www.transgenomic.com</a> <br /><a href="http://www.twt.com/">www.twt.com</a> <br /><a href="http://www.uslabs.net/">www.uslabs.net</a> <br /><a href="http://www.variagenics.com/">www.variagenics.com</a> <br /><br />Great Equipment Companies for Genomics <br /><a href="http://www.affymetrix.com/">www.affymetrix.com</a> <br /><a href="http://www.illumina.com/">www.illumina.com</a> <br /><a href="http://www.iontorrent.com/">www.iontorrent.com</a> <br /><a href="http://www.sequenom.com/">www.sequenom.com</a> <br /><a href="http://www.appliedbiosystems.com/">www.appliedbiosystems.com</a> <br /><a href="http://www.454.com/">www.454.com</a> <br /><a href="http://www.appliedbiosystems.com/">www.appliedbiosystems.com</a><br /><br />Genomics in India <br /><a href="http://www.ganitlabs.in/">www.ganitlabs.in</a> <br /><a href="http://www.sandor.co.in/">www.sandor.co.in</a> <br /><a href="http://www.igib.res.in/">www.igib.res.in</a> <br /><a href="http://www.genotypic.co.in/">www.genotypic.co.in</a> <br /><a href="http://www.ocimumbio.com/">www.ocimumbio.com</a> <br /><a href="http://www.abcgenomics.com/">www.abcgenomics.com</a> <br /><a href="http://www.xcelrisgenomics.com/">www.xcelrisgenomics.com</a> <br /><a href="http://www.ayugen.com/">www.ayugen.com</a> <br /><a href="http://www.geneombiotech.com/">www.geneombiotech.com</a> <br /><br /> Large Global Whole Genome Companies <br /><a href="http://www.decode.com/">www.decode.com</a> <br /><a href="http://www.23andme.com/">www.23andme.com</a> <br /><a href="http://www.navigenics.com/">www.navigenics.com</a><br />www.pathway.com<br /><br /> Global companies offering genomics services <br /><a href="http://www.asuragen.com/">www.asuragen.com</a> <br /><a href="http://www.baseclear.com/">www.baseclear.com</a> <br /><a href="http://www.agtcenter.com/">www.agtcenter.com</a> <br /><a href="http://www.ambrygen.com/">www.ambrygen.com</a> <br /><a href="http://www.arosab.com/">www.arosab.com</a> <br /><a href="http://www.agrf.org.au/">www.agrf.org.au</a> <br /><a href="http://www.beckmangenomics.com/">www.beckmangenomics.com</a> <br /><a href="http://www.genomics.cn/">www.genomics.cn</a> <br /><a href="http://www.bsf.a-star.edu.sg/">www.bsf.a-star.edu.sg</a> <br /><a href="http://www.cbm.fvg.it/">www.cbm.fvg.it</a> <br /><a href="http://www.cincinnatichildrens.org/">www.cincinnatichildrens.org</a> <br /><a href="http://www.cofactorgenomics.com/">www.cofactorgenomics.com</a> <br /><a href="http://www.covance.com/">www.covance.com</a> <br /><a href="http://www.dnalandmarks.ca/">www.dnalandmarks.ca</a> <br /><a href="http://www.dnavision.com/">www.dnavision.com</a> <br /><a href="http://www.expressionanalysis.com/">www.expressionanalysis.com</a> <br /><a href="http://www.fasteris.com/">www.fasteris.com</a> <br /><a href="http://www.gatc-biotech.com/">www.gatc-biotech.com</a> <br /><a href="http://www.genesdiffusion.com/">www.genesdiffusion.com</a> <br /><a href="http://www.geneseek.com/">www.geneseek.com</a> <br /><a href="http://www.geneticvisions.com/">www.geneticvisions.com</a> <br /><a href="http://www.geneworks.com.au/">www.geneworks.com.au</a> <br /><a href="http://www.genizon.com/">www.genizon.com</a> <br /><a href="http://www.genoskan.dk/uk">www.genoskan.dk/uk</a> <br /><a href="http://www.gpbio.jp/">www.gpbio.jp</a> <br /><a href="http://www.igatechnology.com/">www.igatechnology.com</a> <br /><a href="http://www.igenixinc.com/">www.igenixinc.com</a> <br /><a href="http://www.auxologico.it/">www.auxologico.it</a> <br /><a href="http://www.lifeandbrain.com/">www.lifeandbrain.com</a> <br /><a href="http://www.macrogen.co.kr/eng">www.macrogen.co.kr/eng</a> <br /><a href="http://www.gqinnovationcenter.com/">www.gqinnovationcenter.com</a> <br /><a href="http://www.mftservices.de/">www.mftservices.de</a> <br /><a href="http://www.ncgr.org/">www.ncgr.org</a> <br /><a href="http://www.ramaciotti.unsw.edu.au/">www.ramaciotti.unsw.edu.au</a> <br /><a href="http://www.rikengenesis.jp/">www.rikengenesis.jp</a> <br /><a href="http://www.sabiosciences.com/">www.SABiosciences.com</a> <br /><a href="http://www.sequensysbio.com/">www.sequensysbio.com</a> <br /><a href="http://www.servicexs.com/">www.servicexs.com</a> <br /><a href="http://www.snp-genetics.com/">www.snp-genetics.com</a> <br /><a href="http://www.takara-bio.com/">www.takara-bio.com</a> <br /><a href="http://www.gen-probe.com/">www.gen-probe.com</a> <br /><a href="http://www.traitgenetics.com/">www.traitgenetics.com</a></p></div></div>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/1886/interpretomics</guid>
	<pubDate>Sun, 11 Aug 2013 10:24:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/1886/interpretomics</link>
	<title><![CDATA[InterpretOmics]]></title>
	<description><![CDATA[<p>InterpretOmics, a big data analytics startup that focuses on life sciences, has received angel funding of around Rs 10 crore from a group of investors including Singapore's information technology and shipping company, Amarante.</p><p>http://www.interpretomics.co/</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/34375/the-10th-north-east-bioinformatics-network-nebinet-annual-coordinators-meet</guid>
	<pubDate>Sat, 18 Nov 2017 15:02:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/34375/the-10th-north-east-bioinformatics-network-nebinet-annual-coordinators-meet</link>
	<title><![CDATA[The 10th North East Bioinformatics Network (NEBINet) Annual Coordinators' Meet]]></title>
	<description><![CDATA[<p>The 10th North East Bioinformatics Network (NEBINet) Annual Coordinators' Meet organised by the Bioinformatics Centre, St Edmund's College, Shillong and sponsored by the Department of Biotechnology, Government of India, was held at St Edmund's College Auditorium here on Thursday. Meghalaya Governor Ganga Prasad graced the inaugural programme as chief guest. <br />In his inaugural address, the Governor said the panorama of scientific scenario has greatly changed over the years, the thrust areas have undergone a metamorphosis but the conceptual underpinning of the basic sciences still continues. <br />"Of late, the activity of basic research has been intricately intertwined with technology. And we are determined to carry forward this change, for it is through technology that science can actually reach the masses in our country and afar, and the changing times have also inculcated a culture of cross-departmental and interdisciplinary research. Science and technology has always played a pivotal role in taking a nation towards greater heights by ways of innovations and inventions," he added. <br />Prasad also hoped that discussions, suggestions and sharing of innovative ideas during the two-day 10th NEBINet Annual Coordinators' Meet will open up new avenues to make substantial advancement in Biological Sciences which will provide a platform for proper and effective delivery mechanism for the common man. <br />During the inaugural function, Advisor of Department of Biotechnology Dr T Madhan Mohan gave an overview of the NEBINet and Bioinformatics programme. <br />President of Epygen Biotech FZ LLC, Dubai, UAE, Dr Debayan Ghosh, delivered the keynote address. <br />St Edmund's College governing body secretary Brother Simon Coelho and St Edmund's College Principal Dr Sylvanus Lamare also spoke during the function.</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/2054/postdoc-positions-mammalian-transcriptome-evolution-at-sib</guid>
  <pubDate>Mon, 12 Aug 2013 19:58:33 -0500</pubDate>
  <link></link>
  <title><![CDATA[Postdoc Positions - Mammalian Transcriptome Evolution at SIB]]></title>
  <description><![CDATA[
<p>BIOINFORMATICS POSTDOC IN FUNCTIONAL EVOLUTIONARY GENOMICS</p>

<p>Center for Integrative Genomics, University of Lausanne, Switzerland</p>

<p>Two postdoctoral positions (2 years with possible extensions up to 5 years) are available immediately in the evolutionary genomics group of Henrik Kaessmann.</p>

<p>We are seeking highly qualified and enthusiastic applicants with strong skills in computational biology/bioinformatics, preferably also with experience in data mining and comparative or evolutionary genome analysis.</p>

<p>We have been interested in a range of topics related to the functional evolution of genomes from primates (e.g., the emergence of new genes and their functions) and other mammals (e.g., the origin and evolution of mammalian sex chromosomes). In the framework of a recently launched series of projects, a large amount of transcriptome and genome (e.g., epigenome) data are being produced by the wet lab unit of the group using next generation sequencing technologies for a unique collection of tissues from representative mammals and outgroup species (e.g., birds). Topics of current projects based on these data include the origins and/or evolution of protein-coding genes, alternative splicing, microRNAs, long noncoding RNAs, and dosage compensation.</p>

<p>The postdoctoral fellow will perform integrated evolutionary/bioinformatics analyses based on data produced in the lab and available genomic data. The specific project will be developed together with the candidate.</p>

<p>The language of the institute is English, and its members form an international group that is rapidly expanding. The institute is located in Lausanne, a beautiful city at Lake Geneva.</p>

<p>For more information on the group and our institute more generally, please refer to our website: http://www.unil.ch/cig/page7858_en.html</p>

<p>Please submit a CV, statement of research interest, and names of three references to: Henrik Kaessmann (Henrik.Kaessmann@unil.ch).</p>

<p>Webpage : http://www.unil.ch/cig/page7858.html</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/2534/bioinformatician-needs-ten-heads</guid>
	<pubDate>Sat, 17 Aug 2013 10:30:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/2534/bioinformatician-needs-ten-heads</link>
	<title><![CDATA[Bioinformatician needs ten heads !!!]]></title>
	<description><![CDATA[<p>Bioinformatics demands more and ... lots more knowledge. In this case Ravan, a mythological character from the Ramayan, can only be a real bioinformatician. :) :P</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/2534" length="90547" type="image/jpeg" />
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/35552/the-brent-lab</guid>
  <pubDate>Fri, 09 Feb 2018 10:55:27 -0600</pubDate>
  <link></link>
  <title><![CDATA[The Brent Lab]]></title>
  <description><![CDATA[
<p>The Brent Lab is developing and applying computational methods for mapping gene regulation networks, modeling them quantitatively, and engineering new behaviors into them.</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2253/best-practices-in-bioinformatics-training-for-life-scientists</guid>
	<pubDate>Tue, 13 Aug 2013 15:47:34 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2253/best-practices-in-bioinformatics-training-for-life-scientists</link>
	<title><![CDATA[Best practices in bioinformatics training for life scientists]]></title>
	<description><![CDATA[<p>Among life scientists, from clinicians to environmental researchers, a common theme is the need not just to use, and gain familiarity with, bioinformatics tools and resources but also to understand their underlying fundamental theoretical and practical concepts.</p>
<p>Find the detail paper at http://bib.oxfordjournals.org/content/early/2013/06/25/bib.bbt043.full</p><p>Address of the bookmark: <a href="http://bib.oxfordjournals.org/content/early/2013/06/25/bib.bbt043.full" rel="nofollow">http://bib.oxfordjournals.org/content/early/2013/06/25/bib.bbt043.full</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/36197/bioinformatics-oneliner</guid>
	<pubDate>Tue, 10 Apr 2018 04:13:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/36197/bioinformatics-oneliner</link>
	<title><![CDATA[Bioinformatics OneLiner]]></title>
	<description><![CDATA[<p>To remove all line ends (\n) from a Unix text file:</p><pre>sed ':a;N;$!ba;s/\n//g' filename.txt &gt; newfilename_oneline.txt</pre><p>To get average for a column of numbers (here the second column $2):</p><pre>awk '{ sum += $2; n++ } END { if (n &gt; 0) print sum / n; }'</pre><p>To get sequence length for all sequences in a fasta file:</p><pre>awk '/^&gt;/ {if (seqlen){print seqlen}; print ;seqlen=0;next; } { seqlen = seqlen +length($0)}END{print seqlen}' \<br />filename.fasta</pre><p>To copy (move, rename, etc) files based on their list in a text file:</p><pre>cat file_list.txt | while read line; do cp "$line" complete_dataset/"$line"; done</pre><p>To split bam files into sets with mapped and unmapped reads:</p><pre>samtools view -F4 sample.bam &gt; sample.mapped.sam<br />samtools view -f4 sample.bam &gt; sample.unmapped.sam</pre><p>To gzip all your fastq files using gnu parallel and gzip:</p><pre>parallel gzip ::: *.fastq</pre><p>To gzip all your fastq files using pigz:</p><pre>pigz *.fastq</pre><p>To count all sequences in a fasta file:</p><pre>grep "^&gt;" yourfile.fasta -c</pre><p>To count all sequences in all fasta files in your current directory:</p><pre>for a in *.fasta; do ls $a; grep "^&gt;" -c $a; done</pre><p>To keep only one copy of duplicated lines:</p><pre>awk '!seen[$0]++'</pre><p>To sum assembly size from SPAdes contigs.fasta or scaffolds.fasta file:</p><pre>grep "^&gt;" scaffolds.fasta | cut -f 4 -d '_' | paste -sd+ | bc</pre><p>To remove everything after the first space at each line, e.g. to to simplify fasta headers:</p><pre>cut -d' ' -f1 &lt; your_file</pre><p>To count reads in a all .fastq.gz files in your current folder (fast, using gnu parallel):</p><pre>parallel "echo {} &amp;&amp; gunzip -c {} | wc -l | awk '{d=\$1; print d/4;}'" ::: *.gz</pre><p>To count reads in a all .fastq.gz files in your current folder:</p><pre>zcat *.gz | echo $((`wc -l`/4))</pre><p>To count reads in a all .fastq files in your current folder:</p><pre>cat *.fastq | echo $((`wc -l`/4))</pre><p>To count base pairs in a all .fastq.gz files in your current folder:</p><pre>zcat *.fastq.gz | paste - - - - | cut -f 2 | tr -d '\n' | wc -c </pre><p>To split multifasta file into many fasta files:</p><pre>awk '/^&gt;/ {OUT=substr($0,2) ".fa"}; {print &gt;&gt; OUT; close(OUT)}' Input_File</pre><p>To convert Illumina FASTQ 1.3 to 1.8:</p><pre>sed -e '4~4y/@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\\]^_`abcdefghi/!"#$%&amp;'\''()*+,-.\/0123456789:;&lt;=&gt;?@ABCDEFGHIJ/' f.fastq</pre><p>To convert FASTQ to FASTA:</p><pre>sed -n '1~4s/^@/&gt;/p;2~4p' </pre><p>To get fastq read length distribution:</p><pre>cat reads.fastq | awk '{if(NR%4==2) print length($1)}' | sort | uniq -c</pre><p>To deinterleave interleaved fastq file:</p><pre>cat myf.fq | paste - - - - - - - - | tee &gt;(cut -f 1-4 | tr "\t" "\n" &gt; myfile_1.fq) | cut -f 5-8 | \<br />tr "\t" "\n" &gt; myf2.fq </pre><p>To filter and sort contig identifiers from SPAdes assembly (e.g. here lenght &gt;= 4000 + coverage &gt;=100):</p><pre>grep "^&gt;" scaffolds.fasta | sed s"/_/ /"g | awk '{ if ($4 &gt;= 4000 &amp;&amp; $6 &gt;= 100) print $0 }' | sort -k 4 -n | \<br />sed s"/ /_/"g</pre><p>To append something to all headers of your fasta files:</p><pre>sed 's/&gt;.*/&amp;YOURSTRING/' filename.fasta &gt; new_filename.fasta</pre><p>To replace/squeeze multiple adjacent spaces by only one space:&nbsp;</p><pre>tr -s " " &lt; file</pre><p>To filter fastq based on length (here larger than or equal to 21, but smaller than or equal to 25.</p><pre>cat your.fastq | paste - - - - | awk 'length($2)&nbsp; &gt;= 21 &amp;&amp; length($2) &lt;= 25' | sed 's/\t/\n/g' &gt; filtered.fastq</pre><p>To print difference between the last and first row in 5th column:</p><pre>awk '{if (!first){first=$5;}; last=$5;} END {print last-first}' myfile.txt</pre><p>To sample only 200 first bases from all sequences in a multifasta file (e.g. from assembly scaffolds.fasta file here):</p><pre>awk '/^&gt;/{ seqlen=0; print; next; } seqlen &lt; 200 { if (seqlen + length($0) &gt; 200) $0 = substr($0, 1, 200-seqlen);\<br /> seqlen += length($0); print }' scaffolds.fasta &gt; 200bp_scaffolds.fasta</pre><p>&nbsp;To pipe a compressed fasta file directly into makeblastdb.</p><pre>gunzip -c fasta.gz | makeblastdb -in -</pre><p>To remove sequences with duplicate fasta headers from a fasta file.</p><pre>awk '/^&gt;/{f=!d[$1];d[$1]=1}f' in.fasta &gt; out.fasta</pre>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>