github.com - The k-mer Weighted Inner Product.
This software implements a de novo, alignment-free measure of sample genetic dissimilarity that operates on raw sequencing reads. It is able to calculate the genetic dissimilarity between samples without any...
www.zbh.uni-hamburg.de - Tallymer is based on enhanced suffix arrays. This gives a much larger flexibility concerning the choice of the k-mer size. Tallymer can process large data sizes of several billion bases. We used it in a variety of applications to study the...
scikit-learn.org - Machine Learning in Python
Simple and efficient tools for data mining and data analysis. Accessible to everybody, and reusable in various contexts. Built on NumPy, SciPy, and matplotlib. Open source, commercially usable - BSD license.
liulab-dfci.github.io - This is the course material for STAT115/215 BIO/BST282 at Harvard University.
Xiaole Shirley Liu (lead instructor), Joshua Starmer, Martin Hemberg, Ting Wang, Feng Yue
Ming Tang, Yang Liu, Jack Kang, Scarlett Ge, Jiazhen Rong, Phillip Nicol, Maartin De Vries
We...
This is a comprehensive guide to preparing for your next coding interview. It's great for recent graduates and has questions and practice materials structured around traditional big tech interview formats. While it does not include the latest...
A while ago we posted a request for online resources on must-read papers in evolutionary biology, for a literature club. We received many answers in the meantime.
Awk is a programming language that allows easy manipulation of structured data and is mostly used for pattern scanning and processing. It searches one or more files to see if they contain lines that match the specified patterns and then...