BOL: Related items

Senior Research Fellow @ Indian Agricultural Statistics Research Institute

Thu, 23 Jan 2014 06:22:15 -0600

Indian Agricultural Statistics Research Institute
Library Avenue, Pusa, New Delhi – 110012

Walk-in-Interview

A walk-in interview will be held on February 11, 2014 at 10:00 A.M. at IASRI, New Delhi for the project “Whole Genome Sequencing and Development of Allied Genomics Resources in Two Commercially Important Fish - Labeo rohita and Clarias batrachus”, funded by the Department of Biotechnology, Ministry of Science and Technology, Government of India, New Delhi, for the following posts. The appointment will be on a contractual basis up to September 2016 or until the termination of the project, whichever is earlier, and the incumbent shall not have any claim for regular appointment under ICAR.

Senior Research Fellow: Two posts

Post-Graduation in Bioinformatics/ Agricultural Statistics/ Statistics/ Computer Science/ Computer Application/ Biotechnology or equivalent with 1st Division

Knowledge of Statistical Analysis/Bioinformatics tools/computer programming for computational genomics.

Emoluments for Research Associate: Consolidated Rs. 16000/- per month + 30% HRA (first two years) and Rs. 18000/- per month + 30% HRA (third year)

Age Limit: Not more than 35 years as on the date of interview (relaxation of 5 years for SC/ST/women candidates and 3 years for OBC candidates).

Interested candidates are requested to appear for the walk-in interview on the date and time specified above in Room No. 106, Training Cum Administrative Block of the Institute, along with their application giving bio-data, attested copies of certificates, degrees, testimonials, etc., and one passport size photograph. Original certificates/degrees must be produced at the time of interview. No T.A./D.A. will be paid for appearing in the interview.

Advertisement: http://www.iasri.res.in/employment/2014/srf_cabin.pdf

Senior SAS Programmer - URGENT ROLE - Permanent - Welwyn Garden City - UK

Fri, 03 Jul 2015 08:14:23 -0500

SAS Programmer URGENTLY required! My client is looking for an experienced Senior SAS Programmer to join their bubbly, dynamic team in Welwyn Garden City. You must have experience with the SAS and/or R programming languages. I am looking for someone with a background in Life Sciences, Statistics, Computer Science, Bioinformatics or similar, with leadership qualities and excellent analytical skills. Please call Dareen Evans on 01772 278050 or email your CV to dareen.evans@itworkshealth.co.uk

JRF @ Institute of Cytology & Preventive Oncology

Sat, 01 Feb 2014 13:47:29 -0600

The Institute of Cytology & Preventive Oncology (ICPO), initially established as the Cytology Research Centre (CRC) by the Indian Council of Medical Research (ICMR) in 1979, came into existence in 1989 when the CRC was elevated to the level of an Institute. ICPO was instituted with the main aim of promoting research on the cancers most prevalent in India, with an emphasis on their early detection and prevention.

Candidates with the qualifications mentioned below may appear for a walk-in interview at ICPO on 5th Feb 2014 between 10.00 AM and 12.00 PM under the NIF project entitled "Prediction of drug targets of chemical constituents present within non-codified medicinal plants" under Dr Subhash M. Agarwal, Scientist C.

Position : JRF
No of Post : One
Pay : Rs 12000/- + 30% HRA

Desired Profile : M.Sc in Bioinformatics with a good academic record. Candidates with experience in database development and scripting will be preferred.
Age Limit : Below 28 years
Period : 2 months

Interested candidates may send their applications with bio-data by email (smagarwal@gmail.com) or post addressed to Dr Subhash M Agarwal, Scientist C, Bioinformatics Division, Institute of Cytology and Preventive Oncology (ICPO) I-7, Sector 39, Noida-201301 so as to reach latest by 04.02.14

Deadline : 04.02.14

http://icmr.nic.in/icmrnews/icpo_jrf.pdf

Edit distance application in bioinformatics!

Neel — Thu, 07 Dec 2017 08:46:51 -0600

Besides the Levenshtein distance, there are other popular measures of edit distance, which are calculated using a different set of allowable edit operations. For instance,

the Damerau–Levenshtein distance allows insertion, deletion, substitution, and the transposition of two adjacent characters;
the longest common subsequence (LCS) distance allows only insertion and deletion, not substitution;
the Hamming distance allows only substitution, hence it only applies to strings of the same length;
the Jaro distance allows only transposition.
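
To make the difference between these operation sets concrete, here is a minimal sketch in Python (not the Perl modules used in the rest of this post) of four of the measures, all with unit costs. The function names are made up for illustration. The transposition variant shown is the restricted "optimal string alignment" form, which is a common simplification and not the unrestricted Damerau–Levenshtein distance.

```python
def levenshtein(a: str, b: str) -> int:
    """Insertion, deletion and substitution, each with cost 1."""
    prev = list(range(len(b) + 1))          # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        curr = [i]                          # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def osa_distance(a: str, b: str) -> int:
    """Levenshtein plus transposition of two adjacent characters
    (restricted / optimal string alignment variant)."""
    d = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(len(a) + 1):
        d[i][0] = i
    for j in range(len(b) + 1):
        d[0][j] = j
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1, d[i - 1][j - 1] + cost)
            # adjacent characters swapped: count the swap as one edit
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def lcs_distance(a: str, b: str) -> int:
    """Insertion and deletion only: len(a) + len(b) - 2 * len(LCS)."""
    prev = [0] * (len(b) + 1)               # LCS lengths for the previous row
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if ca == cb else max(prev[j], curr[j - 1]))
        prev = curr
    return len(a) + len(b) - 2 * prev[-1]


def hamming(a: str, b: str) -> int:
    """Substitution only, so both strings must have the same length."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs strings of equal length")
    return sum(x != y for x, y in zip(a, b))
```

For example, levenshtein("ACTG", "ACGG") is 1 (one substitution), while lcs_distance("ACTG", "ACGG") is 2, because without substitution the T has to be deleted and a G inserted. Likewise osa_distance("ACTG", "ACGT") is 1, where plain Levenshtein gives 2.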

use Text::Levenshtein qw(distance);

 print distance("foo","four");
 # prints "2"

 my @words     = qw/ four foo bar /;
 my @distances = distance("foo",@words);

 print "@distances";
 # prints "2 0 3"

use Algorithm::LCSS qw( LCSS CSS CSS_Sorted );
    my $lcss_ary_ref = LCSS( \@SEQ1, \@SEQ2 );  # ref to array
    my $lcss_string  = LCSS( $STR1, $STR2 );    # string
    my $css_ary_ref = CSS( \@SEQ1, \@SEQ2 );    # ref to array of arrays
    my $css_str_ref = CSS( $STR1, $STR2 );      # ref to array of strings
    my $sorted_ary_ref = CSS_Sorted( \@SEQ1, \@SEQ2 );  # ref to array of arrays
    my $sorted_str_ref = CSS_Sorted( $STR1, $STR2 );    # ref to array of strings

There are many different modules on CPAN for calculating the edit distance between two strings. Here's just a selection.

Text::LevenshteinXS and Text::Levenshtein::XS are both versions of the Levenshtein algorithm that require a C compiler, but will be a lot faster than the pure-Perl Text::Levenshtein used above.

The Damerau-Levenshtein edit distance is like the Levenshtein distance, but in addition to insertion, deletion and substitution, it also considers the transposition of two adjacent characters to be a single edit. The module Text::Levenshtein::Damerau defaults to using a pure perl implementation, but if you've installed Text::Levenshtein::Damerau::XS then it will be a lot quicker.

Text::WagnerFischer is an implementation of the Wagner-Fischer edit distance, which is similar to the Levenshtein, but applies different weights to each edit type.

Text::Brew is an implementation of the Brew edit distance, which is another algorithm based on edit weights.

Text::Fuzzy provides a number of operations for partial or fuzzy matching of text based on edit distance. Text::Fuzzy::PP is a pure perl implementation of the same interface.

String::Similarity takes two strings and returns a value between 0 (meaning entirely different) and 1 (meaning identical). Apparently based on edit distance.

Text::Dice calculates Dice's coefficient for two strings. This formula was originally developed to measure the similarity of two different populations in ecological research.
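
As a rough illustration of Dice's coefficient, here is a small Python sketch. It assumes the two "populations" are the multisets of character bigrams of each string, which is a common choice but not necessarily what Text::Dice does internally; the function names are hypothetical.

```python
from collections import Counter


def bigrams(s: str) -> Counter:
    """Multiset of overlapping two-character substrings of s."""
    return Counter(s[i:i + 2] for i in range(len(s) - 1))


def dice_coefficient(a: str, b: str) -> float:
    """2 * |X & Y| / (|X| + |Y|) over the bigram multisets X and Y."""
    x, y = bigrams(a), bigrams(b)
    total = sum(x.values()) + sum(y.values())
    if total == 0:
        # Strings shorter than 2 characters have no bigrams (assumed convention)
        return 1.0 if a == b else 0.0
    overlap = sum((x & y).values())   # multiset intersection
    return 2 * overlap / total
```

With this definition, "night" and "nacht" share one bigram ("ht") out of four each, giving 2 * 1 / 8 = 0.25, and identical strings score 1.0.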

Peng Lab

Tue, 18 Feb 2014 13:53:46 -0600

Peng Lab at Janelia Farm Research Campus, Howard Hughes Medical Institute focuses on data mining for bioinformatics and computational molecular biology, particularly, bioimage data mining and informatics. These bioimages include cellular and molecular images and related medical images.

* Analysis of Gene Expression Pattern Images: high-performance image analysis and mining for different model organisms, such as fruitfly, C. elegans, and mouse;
* Feature/Model Learning: developing algorithms and software

Location: Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia, USA.

http://research.janelia.org/peng/

Awk for bioinformaticians and computational biologists

Poonam Mahapatra — Tue, 06 Feb 2018 14:54:35 -0600

Awk is a programming language that allows easy manipulation of structured data and is mostly used for pattern scanning and processing. It searches one or more files for lines that match the specified patterns and then performs the associated actions. The basic syntax is:

awk '/pattern1/ {Actions}
/pattern2/ {Actions}' file

Awk works as follows:
Awk reads the input files one line at a time.
For each line, it tests the patterns in the given order and performs the corresponding action for each pattern that matches.
If no pattern matches, no action is performed.
In the above syntax, either the search pattern or the action is optional, but not both.
If the search pattern is not given, Awk performs the given actions for each line of the input.
If the action is not given, Awk prints every line that matches the given pattern, which is the default action.
Empty braces without any action do nothing; the default printing is not performed.
Statements within Actions should be separated by semicolons.

Say you have data/test.tsv with the following contents:

$ cat data/test.tsv
contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2 ACTTTATATATT
contig3 ACTTATATATATATA
contig4 ACTTATATATATATA
contig5 ACTTTATATATT

With no pattern, the print action runs on every line of the file.

$ awk '{print;}' data/test.tsv
contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2 ACTTTATATATT
contig3 ACTTATATATATATA
contig4 ACTTATATATATATA
contig5 ACTTTATATATT

With a pattern and no action, Awk prints the lines that match, here the pattern contig3:

$ awk '/contig3/' data/test.tsv
contig3 ACTTATATATATATA

Awk has a number of built-in variables. For each record (i.e. line), it splits the record on whitespace by default and stores the fields in the $n variables. If the line has 5 words, they are stored in $1, $2, $3, $4 and $5. $0 represents the whole line. NF is a built-in variable holding the total number of fields in a record, so $NF refers to the last field.

$ awk '{print $1","$2;}' data/test.tsv
contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2,ACTTTATATATT
contig3,ACTTATATATATATA
contig4,ACTTATATATATATA
contig5,ACTTTATATATT

$ awk '{print $1","$NF;}' data/test.tsv
contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2,ACTTTATATATT
contig3,ACTTATATATATATA
contig4,ACTTATATATATATA
contig5,ACTTTATATATT

Awk has two important special patterns, specified by the keywords BEGIN and END. The syntax is as follows:

BEGIN { Actions before reading the file}
{ Actions for every line in the file }
END { Actions after reading the file }

For example,
$ awk 'BEGIN{print "Header,Sequence"}{print $1","$2;}END{print "-------"}' data/test.tsv
Header,Sequence
contig1,ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2,ACTTTATATATT
contig3,ACTTATATATATATA
contig4,ACTTATATATATATA
contig5,ACTTTATATATT
-------

We can also use the conditional operator inside a print statement, in the form print CONDITION ? PRINT_IF_TRUE_TEXT : PRINT_IF_FALSE_TEXT. For example, the code below flags sequences longer than 14 characters:

$ awk '{print (length($2)>14) ? $0">14" : $0"<=14";}' data/test.tsv
contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG>14
contig2 ACTTTATATATT<=14
contig3 ACTTATATATATATA>14
contig4 ACTTATATATATATA>14
contig5 ACTTTATATATT<=14

A bare 1 after the last block prints every line: 1 is a pattern that is always true, and a pattern with no action falls back to the default action {print $0} (print with no argument prints $0). Blocks that run earlier can modify $0 before it is printed. For example, to replace the third line (NR==3) with its first field:

$ awk 'NR==3{$0=$1}1' data/test.tsv
contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2 ACTTTATATATT
contig3
contig4 ACTTATATATATATA
contig5 ACTTTATATATT

You can have as many blocks as you want, and they are executed on each line in the order they appear. For example, to print $1 three times (here we use printf instead of print because printf does not append an end-of-line character):

$ awk '{printf "%s\t", $1}{printf "%s\t", $1}{print $1}' data/test.tsv
contig1 contig1 contig1
contig2 contig2 contig2
contig3 contig3 contig3
contig4 contig4 contig4
contig5 contig5 contig5

We can also skip the remaining blocks for a given line with the next keyword:

$ awk '{printf "%s\t", $1}NR==3{print "";next}{print $1}' data/test.tsv
contig1 contig1
contig2 contig2
contig3
contig4 contig4
contig5 contig5

$ awk 'NR==3{print "";next}{printf "%s\t", $1}{print $1}' data/test.tsv
contig1 contig1
contig2 contig2

contig4 contig4
contig5 contig5

You can also use getline to read another file in addition to the one being processed. In the statement below, the while loop in the BEGIN block reads each line of data/test.tsv into k until no lines remain:

$ awk 'BEGIN{while((getline k <"data/test.tsv")>0) print "BEGIN:"k}{print}' data/test.tsv
BEGIN:contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
BEGIN:contig2 ACTTTATATATT
BEGIN:contig3 ACTTATATATATATA
BEGIN:contig4 ACTTATATATATATA
BEGIN:contig5 ACTTTATATATT
contig1 ACTGTCTGTCACTGTGTTGTGATGTTGTGTGTG
contig2 ACTTTATATATT
contig3 ACTTATATATATATA
contig4 ACTTATATATATATA
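contig5 ACTTTATATATT

You can also store data in memory in an associative array with the syntax VARIABLE_NAME[KEY]=VALUE and later iterate over its keys with for (INDEX in VARIABLE_NAME). The order in which for...in visits the keys is not specified, so the output order may differ from the one shown: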

$ awk '{i[$1]=1}END{for (j in i) print j"<="i[j]}' data/test.tsv
contig1<=1
contig2<=1
contig3<=1
contig4<=1
contig5<=1

Assistant Professor @ King Saud University Riyadh

Fri, 21 Feb 2014 05:57:18 -0600

Qualifications: Candidates must have a Ph.D. and a strong background in Molecular and Cellular Biology, protein expression, FACS, or computational biology, and ability to work collaboratively.

This position will have a significant focus on providing analytical support for next generation sequencing data analysis – Exome sequencing, Targeted sequencing, as well as high-throughput genotyping on the Illumina platform.

Job location:

Genome Research Chair
King Saud University, Riyadh-11451
KSA

Interested candidates may forward their CV to grcksu@gmail.com

Project-based approach to improve bioinformatics education with skilled and meaningful access to omics data

eliabrodsky — Wed, 11 Apr 2018 13:31:42 -0500

Pine Biotech has been collaborating with Loyola University of New Orleans on piloting a new approach to bioinformatics education using the intuitive and logic-driven bioinformatics platform T-BioInfo.

https://edu.t-bio.info/collaborative-model-bioinformatics-education-combining-biologically-inspired-bioinformatics-project-based-learning/

SRF position in Computational Systems Biology, Computational Biology Group, IIIT-Delhi

Sun, 23 Feb 2014 20:56:08 -0600

An opportunity to perform research in a DST-supported project that involves building mathematical models to understand the functional relationship between circadian rhythms and memory formation under stressful conditions. In this project, a mathematical model of circadian rhythms based on gene regulatory mechanisms will be unified with a mathematical model of the calcium signal transduction pathway to understand and predict the formation of fear memory under stressful conditions. The research scholar will work full time on this project to build new models and is expected to contribute significantly to preparing the results for publication and presentation, and to contribute to grant proposals.

Required Qualifications: Masters in physics/chemistry/mathematics (or) MTech in bioengineering, chemical (or) Masters in any traditional field of science with outstanding performance throughout the program. Candidate should have cleared GATE/UGC-CSIR examinations. Applicant should have done basic mathematics courses like calculus, differential equations, numerical analysis etc in their degree program and have obtained good grades in those courses. Knowledge of MATLAB and C or at least one traditional programming language is absolutely necessary. Strong inclination to understand biological concepts is a must for this research work as this project is about modeling biological systems.

Salary: A fixed salary of Rs 18000 PM including HRA will be paid.

Last date for application: This advertisement is open until a suitable candidate is found for the project.

Preferred Qualifications:

* Expertise in dynamical systems theory, bifurcation theory, numerical simulations, parameter estimation.
* Independence and high motivation for carrying out interdisciplinary research.
* Excellent communication skills and ability to work independently.
* Good working habits.

Interested candidates should submit both curriculum vitae and statement of interest in PDF format to sriramk@iiitd.ac.in and should clearly mention in the subject "Application for SRF".

Biologist versus computational biologist!

Abhimanyu Singh — Mon, 29 Oct 2018 04:23:24 -0500

This is how it works :)