http://www.codons.org/ - %MM calculates whether a given gene sequence encodes amino acids using the most common codons possible, the least common codons possible, or (most typically) some combination of these extremes. See our PLoS ONE paper for more details on how the...
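As a rough illustration of the idea described above, the following sketch scores a stretch of codons between -100 (rarest possible codons for each amino acid) and +100 (most common possible codons). It is not the tool's own code: the function name `percent_min_max`, the `TOY_USAGE` table, and its frequencies are all invented for illustration, and the scaling against the per-amino-acid average is an assumption about how such a score can be built.

```python
from collections import defaultdict

# Hypothetical toy table: codon -> (amino acid, usage frequency).
# The numbers are made up for illustration, not real codon usage data.
TOY_USAGE = {
    "GCT": ("A", 15.0), "GCC": ("A", 26.0), "GCA": ("A", 20.0), "GCG": ("A", 34.0),
    "AAA": ("K", 33.0), "AAG": ("K", 10.0),
    "GAA": ("E", 40.0), "GAG": ("E", 18.0),
}


def percent_min_max(seq, usage=TOY_USAGE):
    """Score a coding sequence from -100 (rarest codons) to +100 (commonest)."""
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]

    # Collect the frequencies of all synonymous codons per amino acid.
    by_aa = defaultdict(list)
    for aa, freq in usage.values():
        by_aa[aa].append(freq)

    actual = maximum = minimum = average = 0.0
    for codon in codons:
        aa, freq = usage[codon]  # raises KeyError for codons not in the table
        freqs = by_aa[aa]
        actual += freq
        maximum += max(freqs)
        minimum += min(freqs)
        average += sum(freqs) / len(freqs)

    if actual >= average:
        # Commoner than average: scale toward the best possible choice.
        span = maximum - average
        return 100.0 * (actual - average) / span if span else 0.0
    # Rarer than average: scale toward the worst possible choice.
    span = average - minimum
    return -100.0 * (average - actual) / span if span else 0.0


if __name__ == "__main__":
    print(percent_min_max("GCGAAAGAA"))  # only the commonest toy codons
    print(percent_min_max("GCTAAGGAG"))  # only the rarest toy codons
```

This scores the whole input at once; to profile a gene, the same function could be called on successive windows of codons.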
When you're just starting out with biological programming, it's easy to run into complex problems that make you wonder how anyone has ever managed to write a program.
doua.prabi.fr - One code to find them all is a set of Perl scripts that extract useful information about transposable elements from RepeatMasker output, retrieve their sequences, and report some quantitative information.
Assemble RepeatMasker hits into complete TE copies,...
PhD opportunity at Université de Liège - Belgium
The Bioinformatics and Systems Biology Unit of Université de Liège (Belgium) is looking for a highly motivated master's student with programming skills for a PhD thesis project (4 years, fully...
github.com - This code is used to scaffold your assemblies using Hi-C data. This version implements some improvements to the original SALSA algorithm. If you want to use the old version, it can be found in the old_salsa branch.
To use the latest version,...
bioinformatics.oxfordjournals.org - An ultra–high-performance protein–protein docking software for heterogeneous supercomputers
Summary: The application of protein–protein docking in large-scale interactome analysis is a major challenge in structural bioinformatics...
ml.ssu.ac.kr - gSearch compares sequence variants in the Genome Variation Format (GVF) or Variant Call Format (VCF) with a pre-compiled annotation or with variants in other genomes. Its search algorithms are subsequently optimized and implemented in a...
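To make the comparison described above concrete, here is a minimal sketch of matching VCF records against a pre-compiled annotation. It is not gSearch's algorithm, which the excerpt does not describe: it swaps in a plain dictionary lookup keyed on (chromosome, position, ref, alt), and the helper names, sample records, and annotation labels are all hypothetical.

```python
def parse_vcf_variants(lines):
    """Return (chrom, pos, ref, alt) tuples from VCF text lines."""
    variants = []
    for line in lines:
        # Skip blank lines, meta-information (##) and the header (#CHROM).
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        chrom, pos, _id, ref, alts = fields[:5]
        # ALT may hold several comma-separated alleles; emit one key each.
        for alt in alts.split(","):
            variants.append((chrom, int(pos), ref, alt))
    return variants


def annotate(variants, annotation):
    """Pair each variant with its annotation, or None if it is not listed."""
    return [(variant, annotation.get(variant)) for variant in variants]


if __name__ == "__main__":
    # Made-up records and labels, for illustration only.
    sample_vcf = [
        "##fileformat=VCFv4.2",
        "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
        "chr1\t100\t.\tA\tG\t50\tPASS\t.",
        "chr2\t200\t.\tC\tT,G\t60\tPASS\t.",
    ]
    toy_annotation = {
        ("chr1", 100, "A", "G"): "example_label_1",
        ("chr2", 200, "C", "G"): "example_label_2",
    }
    for variant, label in annotate(parse_vcf_variants(sample_vcf), toy_annotation):
        print(variant, label)
```

A hash lookup like this only finds exact matches; comparing against other genomes or handling differently normalized indels would need more than this sketch does.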
github.com - LRCstats is an open-source pipeline for benchmarking correction algorithms for DNA long reads produced by third-generation sequencing technology, such as machines from Pacific Biosciences. The reads produced by third generation...