www.nature.com - Validated a widely accessible approach that can be used to establish functional causality for noncoding sequence variants identified by GWASs.
https://www.nature.com/articles/nm.3975
Scientists have reconstructed the genome of an ancient human who lived nearly 5,700 years ago in Southern Denmark from the birch pitch- an ancient tar-like substance.
github.com - genome simulation across a population with zeta-distributed allele frequency, snps, insertions, deletions, and multi-nucleotide polymorphisms
More at https://github.com/ekg/mutatrix
./mutatrix -S sample -P test/ -p 2 -n 10 reference.fasta
github.com - RefKA, a reference-based approach for long read genome assembly. This approach relies on breaking up a closely related reference genome into bins, aligning k-mers unique to each bin with PacBio reads, and then assembling each bin in parallel...
github.com - Often, genome assembly projects have illumina whole genome sequencing reads available for the assembled individual. The k-mer spectrum of this read set can be used for independently evaluating assembly quality without the need of a high quality...
github.com - Despite marked recent improvements in long-read sequencing technology, the assembly of diploid genomes remains a difficult task. A major obstacle is distinguishing between alternative contigs that represent highly heterozygous regions. If primary...
github.com - HapSolo, that identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. HapSolo evaluates candidate primary assemblies using BUSCO scores and then distinguishes among candidate assemblies using...
Lab focus on understanding how organisms adapt to their environments. They combine omics approaches with detailed molecular and phenotypic analyses to get a comprehensive picture of adaptation. Our aim at being internationally recognized as a...
www.ncbi.nlm.nih.gov - Install and use the NCBI Datasets command line tools
The NCBI Datasets datasets command line tools are datasets and dataformat .
Use datasets to download biological sequence data across all domains of life from...