GATB Library. The Genome Analysis Toolbox with de-Bruijn graph. A large part of tools developed by the GenScale team are based on this library.These methods enable the analysis of data sets of any size on multi-core desktop...
github.com - VariantBam is a tool to extract/count specific sets of sequencing reads from next-generational sequencing files. To save money, disk space and I/O, one may not want to store an entire BAM on disk. In many cases, it would be more efficient to store...
github.com - jackalope simply and efficiently simulates (i) variants from reference genomes and (ii) reads from both Illumina and Pacific Biosciences (PacBio) platforms. It can either read reference genomes from FASTA files or simulate new ones. Genomic variants...
console.cloud.google.com - List of publically available databases on google server.
More at https://software.broadinstitute.org/gatk/download/bundle
ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/VCF/GATK/.
ftp://ftp.broadinstitute.org/bundle/hg38/hg38bundle/
ecogenomics.github.io - CheckM provides a set of tools for assessing the quality of genomes recovered from isolates, single cells, or metagenomes. It provides robust estimates of genome completeness and contamination by using collocated sets of genes that are ubiquitous...
bioinf.shenwei.me - FASTA and FASTQ are basic and ubiquitous formats for storing nucleotide and protein sequences. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. Existing tools only...
github.com - MetaPred2CS Web server is a meta-predictor based on Support Vector Machine (SVM) that combines 6 individual sequence based protein-protein interaction prediction methods to predict prokaryotic two-component...
www.khanacademy.org - Topics
Displaying and describing data
Modeling distributions of data
Describing relationships in quantitative data
Designing studies
Probability
Random variables
Sampling distributions
Confidence intervals (one sample)
Significance tests...
gehlenborglab.shinyapps.io - UpSetR generates static UpSet plots. The UpSet technique visualizes set intersections in a matrix layout and introduces aggregates based on groupings and queries. The matrix layout enables the effective representation of associated data, such...