<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/2759?offset=690</link>
	<atom:link href="https://bioinformaticsonline.com/related/2759?offset=690" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	
<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/6012/project-junior-research-fellow-ccmb</guid>
  <pubDate>Fri, 01 Nov 2013 10:38:22 -0500</pubDate>
  <link></link>
  <title><![CDATA[Project Junior Research Fellow @ CCMB]]></title>
  <description><![CDATA[
<p>Project positions available on a purely temporary basis - Oct/2013</p>

<p>1. Project Junior Research Fellow / Project Assistant</p>

<p>Last Date: 11th Nov 2013</p>

<p>Qualification: B.Tech (Comp. Sci.), B.Tech/M.Tech (Bioinformatics), MCA, M.Sc. (Mathematics/Statistics)</p>

<p>Desirable Qualifications: Programming in FORTRAN/C/Perl, web application technologies</p>

<p>Upper age limit: 28</p>

<p>Stipend: Rs.12000 / Rs.16000 (as sanctioned by the funding agency)</p>

<p>General terms and conditions:</p>

<p>    Positions are purely temporary and co-terminous with the project.</p>

<p>    Prevailing HRDG (CSIR) guidelines are applicable to these positions.</p>

<p>    All categories of applicants are required to submit online application.</p>

<p>    Enhancement of stipend from Project JRF to Project SRF will be with the due recommendation of the Principal Investigator and approval of the Director, on the evaluation of the 3-member Standing Committee consisting of a Chairperson at the level of Chief Scientist, the Coordinator of the JRFs/RAs/PDFs and the Principal Investigator of the Project.</p>

<p>    The age relaxation as per HRDG (CSIR) norms: SC/ST/OBC/Women/Physically Handicapped persons – five years.</p>

<p>    The stipend will normally be fixed at Rs.22000/- for Research Associates/Post Doc. Fellows. However, a selected RA/PDF may be placed at a higher starting stipend if there is ample justification and such a recommendation is made by the Selection Committee. Based on a justified recommendation by the PI and approval of the Director, a person receiving the stipend at the lower rate may be moved to the higher rate, subject to availability of funds in the project.</p>

<p>    Recruitment will be based on an initial screening against qualification and experience criteria and on the suitability of the candidates to the nature of the research project. This screening will be followed by a written test and/or interview. After completing this process, candidates will be shortlisted and appointed to specific project subjects as and when appropriate positions become available. The pool of selected candidates will be valid for six months.</p>

<p>    The remunerations indicated are the maximum admissible; they depend upon the availability of funds and are subject to the conditions applicable to projects from different funding agencies at the time of recruitment.</p>

<p>Apply : http://www.ccmb.res.in/positions/projects/temp_positions.php</p>

<p>Form download : http://www.ccmb.res.in/positions/projects/oct-2013/pdf_download.php</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/9586/list-of-bioinformatics-companies-and-genomics-service-providers</guid>
	<pubDate>Wed, 02 Apr 2014 06:52:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/9586/list-of-bioinformatics-companies-and-genomics-service-providers</link>
	<title><![CDATA[List of bioinformatics companies and genomics service providers]]></title>
	<description><![CDATA[<p>Please check out the link for bioinformatics and genomics companies.&nbsp;</p><p>Address of the bookmark: <a href="http://grouthbio.com/Genome_Software_Service.php" rel="nofollow">http://grouthbio.com/Genome_Software_Service.php</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/6233/edwards-lab</guid>
  <pubDate>Sun, 10 Nov 2013 15:07:08 -0600</pubDate>
  <link></link>
  <title><![CDATA[Edwards Lab]]></title>
  <description><![CDATA[
<p>We study the evolutionary biology of birds and relatives, combining field, museum and genomics approaches to understand the basis of avian diversity, evolution and behavior. Our guiding approaches include population genetics, which provides a quantitative framework for studying speciation, geographic variation and genome evolution; systematics, which acknowledges that the focal species of any study has relatives that are behaviorally and ecologically no less interesting; and natural history, which gives meaning to the genes and genomic patterns we study.</p>

<p>Lab page: http://www.oeb.harvard.edu/faculty/edwards/index.html</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/18187/bioinformatician-for-a-lab-at-the-weizmann-institute-of-science-israel</guid>
  <pubDate>Mon, 13 Oct 2014 04:38:28 -0500</pubDate>
  <link></link>
  <title><![CDATA[Bioinformatician for a lab at the Weizmann Institute of Science, Israel]]></title>
  <description><![CDATA[
<p>We are looking for enthusiastic, motivated and talented people, at all career stages (MSc, PhD, postdoctoral fellows), to join the lab! Bioinformaticians in particular are invited to apply. <br />Our lab focuses on understanding molecular mechanisms of protein modifications in cancer and immune regulation. <br />We employ advanced high-throughput proteomic and genomic methods, cell biology, biochemistry, immunology, in-vivo models as well as systems biology and bioinformatics to study the biology of PTMs in health and disease. Read more here: http://yifatmerbl.com.</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/6560/the-graveley-lab</guid>
  <pubDate>Tue, 19 Nov 2013 18:02:48 -0600</pubDate>
  <link></link>
  <title><![CDATA[The Graveley Lab]]></title>
  <description><![CDATA[
<p>Research in the Graveley lab is primarily focused on the regulation of alternative splicing and small RNA mediated gene regulation. These are fascinating and extraordinarily important mechanisms by which genes can be regulated. Our long-term goals are to understand how these processes are regulated at a mechanistic level and to understand the logic of these processes in significant biological settings. To achieve these goals, we strive to think outside the box to creatively attack the problems being addressed using a wide variety of approaches that include biochemistry, genetics, imaging, deep sequencing, large-scale RNAi screening and bioinformatics.</p>

<p>Lab page @ http://graveleylab.cam.uchc.edu/Graveley/index.html</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/22891/17-marie-curie-phd-position-available-immediately</guid>
  <pubDate>Tue, 23 Jun 2015 06:52:06 -0500</pubDate>
  <link></link>
  <title><![CDATA[17 Marie Curie PhD positions available immediately]]></title>
  <description><![CDATA[
<p>Please see the following webpage:<br />http://medhealth.leeds.ac.uk/info/1450/scholarships/1795/marie_curie_phd_training_network</p>

<p>The closing date for applications is 26 June 2015.</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/7153/phd-student-in-computational-systems-biology</guid>
  <pubDate>Tue, 10 Dec 2013 18:46:05 -0600</pubDate>
  <link></link>
  <title><![CDATA[Ph.D. student in Computational Systems Biology]]></title>
  <description><![CDATA[
<p>Ph.D. student in Computational Systems Biology</p>

<p>Location : The Luxembourg Centre for Systems Biomedicine (LCSB) at the University of Luxembourg, Luxembourg, Luxembourg<br />Deadline for applications : December 31st, 2013.<br />Description :</p>

<p>The Luxembourg Centre for Systems Biomedicine (LCSB) was created within the Health Technologies Initiative from the Government of Luxembourg as one of the research priorities of the University of Luxembourg. The LCSB is an Interdisciplinary Centre of the University that combines experimental and computational approaches to analyse complex biological systems and disease processes. The Computational Biology Group (CBG) provides the LCSB with a solid infrastructure for developing theoretical frameworks for the computational modelling of biomedical problems, especially in the area of network biology in the context of cellular programming/reprogramming. The CBG includes researchers with theoretical, computational and wet lab backgrounds, thereby providing an unusually interdisciplinary environment.<br />The Computational Biology Group seeks a highly skilled Ph.D. student to work on an exciting project on the reconstruction and analysis of an integrated gene regulatory network model to elucidate key mechanisms of cellular reprogramming. The model will rely on the integration and mining of diverse transcriptomics and epigenomics data from different cell types of the Central Nervous System. The Ph.D. student is expected to collaborate with other members of the CBG to develop a computational methodology aimed at designing, in silico, cellular reprogramming events, with a focus on the nervous system. This project will be carried out in collaboration with Prof. Noel Buckley's lab at King's College London.<br />Requirements of the ideal candidate:<br />Master's degree in Bioinformatics, Computer Science, Biology or a related discipline<br />Prior experience in mathematical modelling of biological networks, especially in network inference and analysis<br />Excellent working knowledge of English.<br />We offer:<br />Full contract for Ph.D. 
student for three years with possibility of renewal<br />Opportunity to do applied research on medical problems within a highly dynamic research institution (LCSB) and in collaboration with internationally recognized partners<br />An exciting international environment<br />A very competitive salary</p>

<p>For further information, please contact:</p>

<p>Prof. Dr. Antonio del Sol<br />E-mail: antonio.delsol@uni.lu</p>

<p>Applications should contain the following documents:<br />A detailed curriculum vitae<br />A cover letter mentioning the reference number<br />A description of past research experience and future interests<br />Names and addresses of three referees</p>

<p>All applications should be sent, preferably electronically, by December 31st, 2013 to the following address:</p>

<p>Luxembourg Centre for Systems Biomedicine (LCSB)<br />University of Luxembourg<br />7, avenue des Hauts-Fourneaux<br />L-4362 Esch-sur-Alzette<br />Tel: +352-466644-6982 (Office)<br />Email: antonio.delsol@uni.lu<br />http://www.lcsb.lu</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/34552/edit-distance-application-in-bioinformatics</guid>
	<pubDate>Thu, 07 Dec 2017 08:46:51 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/34552/edit-distance-application-in-bioinformatics</link>
	<title><![CDATA[Edit distance application in bioinformatics!]]></title>
	<description><![CDATA[<p>There are other popular measures of&nbsp;<a href="https://en.wikipedia.org/wiki/Edit_distance" title="Edit distance">edit distance</a>, which are calculated using a different set of allowable edit operations. For instance,</p><ul>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance" title="Damerau&ndash;Levenshtein distance">Damerau&ndash;Levenshtein distance</a>&nbsp;allows insertion, deletion, substitution, and the&nbsp;<a href="https://en.wikipedia.org/wiki/Transposition_(mathematics)" title="Transposition (mathematics)">transposition</a>&nbsp;of two adjacent characters;</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Longest_common_subsequence_problem" title="Longest common subsequence problem">longest common subsequence</a>&nbsp;(LCS) distance allows only insertion and deletion, not substitution;</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Hamming_distance" title="Hamming distance">Hamming distance</a>&nbsp;allows only substitution, hence, it only applies to strings of the same length;</li>
<li>the&nbsp;<a href="https://en.wikipedia.org/wiki/Jaro_distance" title="Jaro distance">Jaro distance</a>&nbsp;allows only&nbsp;<a href="https://en.wikipedia.org/wiki/Transposition_(mathematics)" title="Transposition (mathematics)">transposition</a>.</li>
</ul><p>&nbsp;</p><pre><span>use</span> Text<span>::</span>Levenshtein <span>qw</span><span>(</span>distance<span>);</span>

 <span>print</span> <span>distance</span><span>(</span><span>"foo"</span><span>,</span><span>"four"</span><span>);</span>
 <span># prints "2"</span>

 <span>my</span> <span>@words</span>     <span>=</span> <span>qw</span><span>/ four foo bar /</span><span>;</span>
 <span>my</span> <span>@distances</span> <span>=</span> <span>distance</span><span>(</span><span>"foo"</span><span>,</span><span>@words</span><span>);</span>

 <span>print</span> <span>"@distances"</span><span>;</span>
 <span># prints "2 0 3"</span></pre><pre><span>use</span> Algorithm<span>::</span>LCSS <span>qw</span><span>(</span> LCSS CSS CSS_Sorted <span>);</span>
    <span>my</span> <span>$lcss_ary_ref</span> <span>=</span> <span>LCSS</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>  <span># ref to array</span>
    <span>my</span> <span>$lcss_string</span>  <span>=</span> <span>LCSS</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>    <span># string</span>
    <span>my</span> <span>$css_ary_ref</span> <span>=</span> <span>CSS</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>    <span># ref to array of arrays</span>
    <span>my</span> <span>$css_str_ref</span> <span>=</span> <span>CSS</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>      <span># ref to array of strings</span>
    <span>my</span> <span>$css_ary_ref</span> <span>=</span> <span>CSS_Sorted</span><span>(</span> <span>\</span><span>@SEQ1</span><span>,</span> <span>\</span><span>@SEQ2</span> <span>);</span>  <span># ref to array of arrays</span>
    <span>my</span> <span>$css_str_ref</span> <span>=</span> <span>CSS_Sorted</span><span>(</span> <span>$STR1</span><span>,</span> <span>$STR2</span> <span>);</span>    <span># ref to array of strings</span></pre><p>There are many different modules on CPAN for calculating the edit distance between two strings. Here's just a selection.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshteinXS">Text::LevenshteinXS</a>&nbsp;and&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3AXS">Text::Levenshtein::XS</a>&nbsp;are both versions of the Levenshtein algorithm that require a C compiler, but will be a lot faster than the pure-Perl Text::Levenshtein module used above.</p><p>The Damerau-Levenshtein edit distance is like the Levenshtein distance, but in addition to insertion, deletion and substitution, it also considers the transposition of two adjacent characters to be a single edit. The module&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3ADamerau">Text::Levenshtein::Damerau</a>&nbsp;defaults to using a pure-Perl implementation, but if you've installed&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3ALevenshtein%3A%3ADamerau%3A%3AXS">Text::Levenshtein::Damerau::XS</a>&nbsp;then it will be a lot quicker.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3AWagnerFischer">Text::WagnerFischer</a>&nbsp;is an implementation of the Wagner-Fischer edit distance, which is similar to the Levenshtein distance, but applies different weights to each edit type.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ABrew">Text::Brew</a>&nbsp;is an implementation of the Brew edit distance, which is another algorithm based on edit weights.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3AFuzzy">Text::Fuzzy</a>&nbsp;provides a number of operations for partial or fuzzy matching of text based on edit distance.&nbsp;<a href="http://search.cpan.org/perldoc?Text%3A%3AFuzzy%3A%3APP">Text::Fuzzy::PP</a>&nbsp;is a pure-Perl implementation of 
the same interface.</p><p><a href="http://search.cpan.org/perldoc?String%3A%3ASimilarity">String::Similarity</a>&nbsp;takes two strings and returns a value between 0 (meaning entirely different) and 1 (meaning identical). Apparently based on edit distance.</p><p><a href="http://search.cpan.org/perldoc?Text%3A%3ADice">Text::Dice</a>&nbsp;calculates&nbsp;<a href="https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient">Dice's coefficient</a>&nbsp;for two strings. This formula was originally developed to measure the similarity of two different populations in ecological research.</p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/6836/research-fellow-mendel-laboratory</guid>
  <pubDate>Tue, 26 Nov 2013 00:07:57 -0600</pubDate>
  <link></link>
  <title><![CDATA[Research Fellow @ Mendel laboratory]]></title>
  <description><![CDATA[
<p>IRCCS Casa Sollievo della Sofferenza – Mendel laboratory is seeking one talented bioinformatician (Rome)<br />Start date: immediate</p>

<p>Duration: 1 year</p>

<p>Funding Source: Institutional<br />Salary on grant: B2 (€ 22.000/year gross)<br />Contact Person (Referent): Tommaso Mazza<br />Ref. E-Mail: t.mazza@css-mendel.it<br />Tel: +39 06 44160526<br />Fax: +39 06 44160548</p>

<p>Job Description: The bioinformatics unit at IRCCS Casa Sollievo della Sofferenza - Mendel laboratory in Rome is looking for one young PhD bioinformatician with specific experience and/or interest in the analysis of transcriptomic data.</p>

<p>The candidate will be mainly in charge of developing research on a range of current applications and projects dealing with microarray, RNA-Seq and miRNA-Seq data. The main activities will be: (i) data analysis (short-read mapping, variant calling and annotation, functional enrichment analysis of gene expression data); (ii) network analysis and simulation (artificial knockout, redundancy and lethality analysis, gene set essentiality); (iii) development of ad hoc software solutions/routines on clusters of CPUs and GPUs.</p>

<p>An appropriate background (training in Biology / Computer Science / Statistics or a mix of the three) and a strong interest in high-throughput data analysis will be valued as highly as specific experience in the above-mentioned fields.<br />Knowledge of molecular modeling and simulation and of one of these languages: Python, Perl, R, Java, C++, C# is a strong plus. Good knowledge of scientific English will be positively evaluated for this position, together with good presentation and teamwork skills.</p>

<p>A CV with one professional reference and details of educational background and of biological, bioinformatic and/or data analysis skills and experience should be sent by email for preliminary selection to: Tommaso Mazza, CSS-Mendel: t.mazza@css-mendel.it</p>

<p>Context<br />Casa Sollievo della Sofferenza is an institute for hospitalization, care, and scientific research located in San Giovanni Rotondo, Italy. It integrates clinical assistance (with inpatient and outpatient facilities) and research. It has an affiliate institute, CSS-Mendel, located in Rome. Between the two sites, it employs over 100 researchers who focus on genetics. The Center is equipped with state-of-the-art genomics technology (SOLiD 5500XL next generation sequencer, Illumina MiSeq, Affymetrix/Agilent microarray platforms, etc.) as well as a dedicated high performance computing facility, a non-conventional GPU workstation, and short- and long-term disk storage.</p>

<p>Applications<br />Candidates should send:<br />• a cover letter explaining the role they would like to undertake within the Center, even if it is not listed in this job advertisement, stating clearly why they would be a good fit for the proposed role, and what they would bring to the Center in terms of expertise, ideas and talent;<br />• a CV including a list of publications;<br />• a list of referees.</p>

<p>More at http://www.css-mendel.it/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/35534/awk-for-bioinformatician-and-computational-biologist</guid>
	<pubDate>Tue, 06 Feb 2018 14:54:35 -0600</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/35534/awk-for-bioinformatician-and-computational-biologist</link>
	<title><![CDATA[Awk for bioinformaticians and computational biologists]]></title>
	<description><![CDATA[<p>Awk is a programming language that allows easy manipulation of structured data and is mostly used for pattern scanning and processing. It searches one or more files for lines that match the specified patterns and then performs the associated actions. The basic syntax is:</p><blockquote><p><br />awk '/pattern1/ {Actions}<br /> /pattern2/ {Actions}' file</p></blockquote><p><br />Awk works as follows:<br />Awk reads the input files one line at a time.<br />Each line is tested against the given patterns in order; whenever a pattern matches, the corresponding action is performed.<br />If no pattern matches, no action is performed.<br />In the above syntax, either the search pattern or the action is optional, but not both.<br />If the search pattern is not given, Awk performs the given actions for each line of the input.<br />If the action is not given, Awk prints every line that matches the given pattern, which is the default action.<br />Empty braces without any action do nothing; they do not perform the default printing operation.<br />Statements within Actions should be separated by semicolons.<br />Say you have data/test.tsv with the following contents:</p><p><br />$ cat data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />With no pattern, a bare print statement prints every line of the file.</p><p><br />$ awk '{print;}' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />To print only the lines that match the pattern contig3:</p><p><br />$ awk '/contig3/' data/test.tsv<br />contig3 ACTTATATATATATA<br />Awk has a number of built-in variables. For each record (i.e. line), it splits the record on whitespace by default and stores the fields in the $n variables. 
If the line has 5 words, they are stored in $1, $2, $3, $4 and $5. $0 represents the whole line. NF is a built-in variable that holds the total number of fields in a record, so $NF is the last field.</p><p><br />$ awk '{print $1","$2;}' data/test.tsv<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT</p><p>$ awk '{print $1","$NF;}' data/test.tsv<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT</p><p><br />Awk has two important patterns, specified by the keywords BEGIN and END. The syntax is as follows:</p><blockquote><p>BEGIN { Actions before reading the file}<br />{Actions for every line in the file} <br />END { Actions after reading the file }</p></blockquote><p><br />For example,<br />$ awk 'BEGIN{print "Header,Sequence"}{print $1","$2;}END{print "-------"}' data/test.tsv<br />Header,Sequence<br />contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2,ACTTTATATATT<br />contig3,ACTTATATATATATA<br />contig4,ACTTATATATATATA<br />contig5,ACTTTATATATT<br />------- <br />We can also use the conditional operator in a print statement, in the form print CONDITION ? PRINT_IF_TRUE_TEXT : PRINT_IF_FALSE_TEXT. For example, in the code below, we identify sequences with lengths &gt; 14:</p><p>$ awk '{print (length($2)&gt;14) ? 
$0"&gt;14" : $0"&lt;=14";}' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG&gt;14<br />contig2 ACTTTATATATT&lt;=14<br />contig3 ACTTATATATATATA&gt;14<br />contig4 ACTTATATATATATA&gt;14<br />contig5 ACTTTATATATT&lt;=14<br />We can also use 1 after the last block {} to print everything (1 is shorthand for {print $0}, which is equivalent to {print} because print without arguments prints $0 by default). Within a block we can also change $0; for example, to assign the first field to $0 for the third line (NR==3), we can use:</p><p>$ awk 'NR==3{$0=$1}1' data/test.tsv<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT<br />You can have as many blocks as you want, and they are executed on each line in the order they appear. For example, to print $1 three times (here we use printf instead of print because the former does not append a newline):</p><p>$ awk '{printf $1"\t"}{printf $1"\t"}{print $1}' data/test.tsv<br />contig1 contig1 contig1<br />contig2 contig2 contig2<br />contig3 contig3 contig3<br />contig4 contig4 contig4<br />contig5 contig5 contig5 <br />We can also skip the remaining blocks for a given line by using the next keyword:</p><p>$ awk '{printf $1"\t"}NR==3{print "";next}{print $1}' data/test.tsv<br />contig1 contig1<br />contig2 contig2<br />contig3 <br />contig4 contig4<br />contig5 contig5</p><p>$ awk 'NR==3{print "";next}{printf $1"\t"}{print $1}' data/test.tsv<br />contig1 contig1<br />contig2 contig2</p><p>contig4 contig4<br />contig5 contig5<br />You can also use getline to load the contents of another file in addition to the one you are reading. For example, in the statement below, the while loop loads each line from test.tsv into k until there are no more lines to read:</p><p>$ awk 'BEGIN{while((getline k &lt;"data/test.tsv")&gt;0) print "BEGIN:"k}{print}' data/test.tsv<br />BEGIN:contig1 
ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />BEGIN:contig2 ACTTTATATATT<br />BEGIN:contig3 ACTTATATATATATA<br />BEGIN:contig4 ACTTATATATATATA<br />BEGIN:contig5 ACTTTATATATT<br />contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG<br />contig2 ACTTTATATATT<br />contig3 ACTTATATATATATA<br />contig4 ACTTATATATATATA<br />contig5 ACTTTATATATT <br />You can also store data in memory using the syntax VARIABLE_NAME[KEY]=VALUE and later iterate over it with for (INDEX in VARIABLE_NAME). Note that the order in which the for-in loop visits the keys is unspecified, so the output lines may appear in a different order:</p><p>$ awk '{i[$1]=1}END{for (j in i) print j"&lt;="i[j]}' data/test.tsv<br />contig1&lt;=1<br />contig2&lt;=1<br />contig3&lt;=1<br />contig4&lt;=1<br />contig5&lt;=1</p>]]></description>
	<dc:creator>Poonam Mahapatra</dc:creator>
</item>

</channel>
</rss>