github.com - Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The...
bioinfo.ut.ee - FastGT is a program package for whole-genome genotyping of genome variants directly from raw sequencing reads. It is written in C and runs in Linux. FastGT uses a list of variant-specific k-mer pairs that are unique in human genome, counts the...
github.com - HASLR, a hybrid assembler which uses both second and third generation sequencing reads to efficiently generate accurate genome assemblies. Our experiments show that HASLR is not only the fastest assembler but also the one with the lowest number of...
github.com - This tool extracts heterozygous kmer pairs from kmer count databases and performs gymnastics with them. We are able to disentangle genome structure by comparing the sum of kmer pair coverages (CovA + CovB) to their relative coverage (CovB / (CovA +...
milkweedgenome.org - Some of the useful bioinformatics scripts.
For example ... contig-stats.pl is a Perl script that will automatically describe features of a sequence assembly.
http://milkweedgenome.org/?q=scripts
Solved with perl http://rosalind.info/problems/1a/
#Find the most frequent k-mers in a string.#Given: A DNA string Text and an integer k.#Return: All most frequent k-mers in Text (in any order).use strict;use warnings;my...
The question at http://rosalind.info/problems/1d/
Script are moved to http://bioinformaticsonline.com/snippets/view/34633/clump-finding-problem-solved-with-perl
Once your research group is ready to make a larger investment and hire a bioinformatician to gain a competitive edge, there are several key traits to seek out in potential candidates. The best bioinformatician are:-