Bioinformatics is now at the heart of modern biology and medicine. From decoding genomes and predicting antimicrobial resistance, to developing personalized medicine and advancing evolutionary research, computational skills are no longer optional...
Bioinformatics is a rapidly growing field at the intersection of biology, computer science, mathematics, and statistics. As data volumes increase, as well as the diversity of data types (genomics, proteomics, metabolomics, imaging, single‑cell...
A package for windowed principal component analysis. WinPCA performs principal component analyses (PCA) in sliding windows along chromosomes. Both hard-called genotypes (input: VCF or TSV) or genotype likelihoods (input: VCF, TSV or BEAGLE)...
termal, a fast, interactive, terminal-based viewer for multiple sequence alignments (MSAs), designed for use on remote systems such as high-performance computing (HPC)...
HiTE is a Python software that uses a dynamic boundary adjustment approach to detect and annotate full-length Transposable Elements in Genome Assemblies. In comparison to other tools, HiTE demonstrates superior performance in detecting a...
This is PeGAS, a powerful bioinformatic tool designed for the seamless quality control, assembly, and annotation of Illumina paired-end reads specific to pathogenic bacteria. This tool integrates state-of-the-art open-source software to provide a...
geNomad is a tool that identifies virus and plasmid genomes from nucleotide sequences. It provides state-of-the-art classification performance and can be used to quickly find mobile genetic elements from genomes, metagenomes, or metatranscriptomes.
Jaeger is a tool that utilizes homology-free machine learning to identify phage genome sequences that are hidden within metagenomes. It is capable of detecting both phages and prophages within metagenomic assemblies.
dna2bit is a software tool developed in C++11, leveraging the capabilities of OpenMP for parallel computing and the popcount technique for efficient bit manipulation. It has been thoroughly tested using the g++ and clang compilers on both Linux and...
GrAnnoT is an annotation transfer tool for pangenome graphs. It can transfer linear genome annotations to a pangenome graph containing the genome, and also transfer the pangenome graph's annotations on the genomes it contains. It also outputs...
AlfaPang constructs variation graphs, leveraging its alignment-free and reference-free approach, based solely on intrinsic sequence properties. This design allows AlfaPang's runtime and memory usage to scale linearly with the size of input...
Firecrawl is an API service that takes a URL, crawls it, and converts it into clean markdown or structured data. We crawl all accessible subpages and give you clean data for each. No sitemap required. Check out our documentation.
Pst....
FANTASIA is an advanced pipeline for the automatic functional annotation of protein sequences using state-of-the-art protein language models. It integrates deep learning embeddings and in-memory similarity searches, retrieving reference vectors from...
Jaeger is a tool that utilizes homology-free machine learning to identify phage genome sequences that are hidden within metagenomes. It is capable of detecting both phages and prophages within metagenomic assemblies.
The performance of the Jaeger...
dna2bit: an ultra-fast and accurate genomic distance estimation software
dna2bit is a software tool developed in C++11, leveraging the capabilities of OpenMP for parallel computing and the popcount technique for efficient bit manipulation.
2,438,386 bacterial genomes at your fingertips consistently processed & characterized, enriched with metadata, accessible via a flexible search engine.
BakRep (Denglish blend of Bakterien & Repository) simplifies access to this data. It...
If you’re starting out in bioinformatics or looking to sharpen your computational biology skills, having the right learning resources makes all the difference. Here’s my curated list of 10 must-read books — from beginner-friendly introductions to...
Single Nucleotide Polymorphisms (SNPs) are the most common type of genetic variation in humans—and many other organisms. A single base change in the DNA sequence (for example, an A instead of a G) can influence everything from our eye color to...
In statistics and bioinformatics, you’ll often see results reported with p-values, FDR, and q-values (q-scores). But what do these terms mean, and how are they different? Let’s break them down with simple definitions and a step-by-step example.