www.bx.psu.edu - We describe a new method for predicting the ancestral order and orientation of those intervals from their observed adjacencies in modern species. We combine the results from this method with data from chromosome painting experiments to produce a map...
The School of Biotechnology offers a curriculum that reflects the multidisciplinary nature of Biotechnology, integrating theoretical and applied science in undergraduate and graduate courses. The school has six departments with about 300 employees,...
http://etetoolkit.org - ETE v3, featuring numerous improvements in the underlying library of methods, and providing a novel set of standalone tools to perform common tasks in comparative genomics and phylogenetics.
The new features include
(i) building gene-based and...
The purpose of this cheat sheet is to introduce biologist and bioinformatician to the frequently used tools for NGS analysis as well as giving experience in writing one-liners.
File System ls — list items in current directory ls...
bitbucket.org - S-plot2 creates an interactive, two-dimensional heatmap capturing the similarities and dissimilarities in nucleotide usage between genomic sequences (partial or complete). In S-plot2, whole eukaryotic chromosomes and smaller prokaryotic genomes can...
To remove all line ends (\n) from a Unix text file:
sed ':a;N;$!ba;s/\n//g' filename.txt > newfilename_oneline.txt
To get average for a column of numbers (here the second column $2):
awk '{ sum += $2; n++ } END { if (n > 0) print sum / n;...
jimb.stanford.edu - The Genome in a Bottle (GIAB) Consortium is a public-private-academic consortium hosted by NIST to develop the technical infrastructure (reference standards, reference methods, and reference data) to enable translation of whole human genome...
cov-lineages.org - The Pango nomenclature is being used by researchers and public health agencies worldwide to track the transmission and spread of SARS-CoV-2, including variants of concern. This website documents all current Pango lineages and their spread, as well...
Here is a small tutorial on how to make best use of multiple processors for bioinformatics analysis. One best way is using perl threads and forks. Knowing how these threads and forks work is very important before implementing them. Getting to know...