github.com - Cactus is a reference-free whole-genome multiple alignment program. The principal algorithms are described here: https://doi.org/10.1101/gr.123356.111
Cactus uses substantial resources. For primate-sized genomes (3 gigabases each), you should...
github.com - The EDTA package was designed to filter out false discoveries in raw TE candidates and generate a high-quality non-redundant TE library for whole-genome TE annotations. Selection of initial search programs was based on benchmarking on the...
gvolante.riken.jp - gVolante provides an online interface for completeness assessment of user’s original or publicly available sequence datasets as well as for browsing results of completeness assessment performed on publicly available genome and...
https://lnkd.in/f5aUbBM - Major Machine Learning Types Explained in 11 Pages. Credit: McKinsey
For more applied resources, see:
1. Data Science Process https://lnkd.in/fMHtxYP
2. Data Visualization in Business https://lnkd.in/fYUCzgC
3. Understand How to answer...
github.com - It is designed to work with patterned data. Famous examples of problems related to patterned data are:
recovering signals in networks after a stimulation (cascade network reverse engineering),
analysing periodic signals.
github.com - jackalope simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants...
msaprobs.sourceforge.net - MSAProbs is a well-established state-of-the-art multiple sequence alignment algorithm for protein sequences. The design of MSAProbs is based on a combination of pair hidden Markov models and partition functions to calculate posterior...
ccb.jhu.edu - The program gffcompare can be used to compare, merge, annotate and estimate accuracy of one or more GFF files (the “query” files), when compared with a reference annotation (also provided as GFF).
github.com - Simka is a de novo comparative metagenomics tool. Simka represents each dataset as a k-mer spectrum and computes several classical ecological distances between them.
Developer: Gaëtan Benoit, PhD, former member of...
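
The k-mer spectrum idea in the Simka entry can be illustrated with a toy sketch. This is not Simka's code or API: the function names `kmer_spectrum` and `bray_curtis`, the choice of k = 4, and the example reads are all assumptions made for illustration, and Bray-Curtis is used here only as one example of a classical ecological distance.

```python
from collections import Counter


def kmer_spectrum(reads, k):
    """Count every substring of length k across a list of sequence strings."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def bray_curtis(a, b):
    """Abundance-based Bray-Curtis dissimilarity between two k-mer spectra."""
    # Sum the smaller of the two counts for every k-mer present in both spectra.
    shared = sum(min(a[kmer], b[kmer]) for kmer in a.keys() & b.keys())
    total = sum(a.values()) + sum(b.values())
    return 1.0 - 2.0 * shared / total if total else 0.0


# Hypothetical toy datasets; real metagenomic samples contain millions of reads.
sample_a = ["ACGTACGT", "ACGTTT"]
sample_b = ["ACGTACGA", "TTTTTT"]

spec_a = kmer_spectrum(sample_a, k=4)
spec_b = kmer_spectrum(sample_b, k=4)
distance = bray_curtis(spec_a, spec_b)
print(f"Bray-Curtis dissimilarity: {distance:.3f}")
```

The value ranges from 0 (identical spectra) to 1 (no shared k-mers). A production tool has to count k-mers at a scale where an in-memory `Counter` is not practical, so this only conveys the shape of the computation.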