The AWK utility is an interpreted programming language typically used as a data extraction and reporting tool. It is a standard feature of most Unix-like operating systems.(Source: Wikipedia)
I love awk but recommend you to try bioawk . Bioawk is a modified version of awk which will parse some common sequence formats. https://github.com/lh3/bioawk
Print line of a tab-delimited file when the 8th field is 10090:awk -F "\t" '$8 == 10090 { print $0 }' myFilePrint fields 1, 2, 3 from a tab-delimited file where the 4th field contains a '99':awk -F "\t" '$4 ~ /99/ {print $1"\t"$2"\t"$3}' myFile
Aakhsyan webpage explain commonly used sed and awk for bioinformatics at
http://raunakms.wordpress.com/2013/06/08/sed-and-awk-for-bioinformatics/
Handy OneLiner at http://bioinformatics.whatheblog.com/2010/03/handy-one-liners-awk/
BioUnix...
This MIT wiki page demonstration shows how to perform some basic bioinformatics tasks using simple UNIX commands.
http://rous.mit.edu/index.php/Unix_commands_applied_to_bioinformatics
Awk is a programming language that is specifically designed for quickly manipulating space delimited data. Although you can achieve all its functionality with Perl, awk is simpler in many practical cases.
Why awk? You can replace a pipeline...