The laboratory is focused on the discovery and analysis of structural variants (SVs) from genomic sequence data. As part of the 1000 Genomes Project and other endeavors, we have helped produce initial fine-scale maps using a variety of SV discovery...
www.geneontology.org - The GO knowledgebase is composed of two primary components:
the Gene Ontology (GO), which provides the logical structure of the biological functions (‘terms’) and their relationships to one another, manifested as a directed...
github.com - The Gene NEighborhood Scoring Tool (G-NEST) combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all window sizes. Primary author of the final code: William F. Martin....
NVIDIA and the Arc Institute have introduced Evo 2, a groundbreaking AI model designed to understand, predict, and generate DNA sequences. This marks a major advancement in computational biology, offering scientists an unprecedented tool to decode...
github.com - The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without...
github.com - OMArk is software for proteome (protein-coding gene repertoire) quality assessment. It provides measures of proteome completeness, characterizes the consistency of all protein-coding genes with regard to their homologs, and identifies the presence...
github.com - Progressive Cactus is a whole-genome alignment package.
Distribution package for the Progressive Cactus multiple genome aligner. Dependencies are linked as submodules.
https://github.com/glennhickey/progressiveCactus