BOL: Related items

5,700-year-old human genome!

Jit — Thu, 19 Dec 2019 11:22:18 -0600

A landmark in genomics: scientists have done something that has never been done before.

Scientists have reconstructed the genome of an ancient human who lived nearly 5,700 years ago in southern Denmark from a piece of birch pitch, an ancient tar-like substance.

By sequencing the sample, researchers recovered not only ancient human DNA but also microbial DNA reflecting the oral microbiome of the person who chewed the pitch, along with plant and animal DNA that may have come from a meal she had recently eaten.

The DNA sample is comparable in quality to well-preserved teeth and skull bones. The DNA suggests that the chewer was a female, most likely with dark skin, dark brown hair and blue eyes.

https://www.nature.com/articles/s41467-019-13549-9

Artistic reconstruction. (Tom Björklund)

More at https://gizmodo.com/scientists-reconstruct-lola-after-finding-her-dna-in-1840481633

mutatrix: a population genome simulator

Jit — Tue, 28 Jan 2020 04:06:58 -0600

Genome simulation across a population with zeta-distributed allele frequencies, SNPs, insertions, deletions, and multi-nucleotide polymorphisms.

More at https://github.com/ekg/mutatrix

./mutatrix -S sample -P test/ -p 2 -n 10 reference.fasta
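
To give a feel for what a zeta-distributed allele frequency looks like, here is a minimal standard-library sketch. It is not mutatrix's own code: the function name, the exponent `a=2.0`, and the choice of 20 haplotypes (for example, ten diploid individuals) are assumptions made for illustration. Each variant is assigned a carrier count k with probability proportional to k^-a, truncated at the number of haplotypes, so most variants come out rare.

```python
import random

def truncated_zeta_counts(n_variants, n_haplotypes, a=2.0, seed=0):
    """Draw, for each variant, how many haplotypes carry it.

    P(k) is proportional to k**-a for 1 <= k <= n_haplotypes
    (a zeta/Zipf law truncated at the population size).
    The exponent a=2.0 is an illustrative assumption.
    """
    rng = random.Random(seed)
    ks = list(range(1, n_haplotypes + 1))
    weights = [k ** -a for k in ks]
    return rng.choices(ks, weights=weights, k=n_variants)

n_haplotypes = 20  # hypothetical: ten diploid individuals
counts = truncated_zeta_counts(n_variants=1000, n_haplotypes=n_haplotypes)
freqs = [c / n_haplotypes for c in counts]

# Singletons (variants seen on one haplotype) should dominate.
print("singletons:", counts.count(1), "doubletons:", counts.count(2))
```

Under this kind of distribution most simulated variants are singletons, which roughly mirrors the excess of rare alleles seen in real population data.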

Address of the bookmark: https://github.com/ekg/mutatrix

China’s BGI says it can sequence a genome for just $100

Neel — Sat, 29 Feb 2020 04:49:43 -0600

Using technology originally acquired in the US, the Chinese gene giant BGI Group says it will make genome sequencing cheaper than ever, breaking the $100 barrier for the first time.

The Shenzhen company says the low cost will be possible with an “extreme” DNA sequencing system it plans to offer that is capable of decoding the genomes of 100,000 people a year.

Ref: https://www.technologyreview.com/s/615289/china-bgi-100-dollar-genome/

HASLR: a hybrid assembler which uses both second and third generation sequencing reads

BioStar — Mon, 04 May 2020 02:04:03 -0500

HASLR is a hybrid assembler that uses both second- and third-generation sequencing reads to efficiently generate accurate genome assemblies. Our experiments show that HASLR is not only the fastest assembler but also the one with the lowest number of misassemblies on all the samples compared to the other tested assemblers. Furthermore, the generated assemblies are on par with the other tools in terms of contiguity and accuracy on most of the samples. Availability: HASLR is an open source tool available at https://github.com/vpc-ccg/haslr.

Address of the bookmark: https://github.com/vpc-ccg/haslr

Sequence Ontology Bioinformatics Analysis (SOBA) tool to provide a simple statistical and graphical summary of an annotated genome

BioStar — Wed, 22 Jul 2020 10:11:13 -0500

We have developed the Sequence Ontology Bioinformatics Analysis (SOBA) tool to provide a simple statistical and graphical summary of an annotated genome. We envisage its use during annotation jamborees, genome comparison and for use by developers for rapid feedback during annotation software development and testing. SOBA also provides annotation consistency feedback to ensure correct use of terminology within annotations, and guides users to add new terms to the Sequence Ontology when required. SOBA is available at http://www.sequenceontology.org/cgi-bin/soba.cgi.

More at https://pubmed.ncbi.nlm.nih.gov/20494974/

Address of the bookmark: http://www.sequenceontology.org/cgi-bin/soba.cgi

Svardal lab

Sat, 20 Feb 2021 10:01:19 -0600

The Svardal lab is interested in how the astonishing natural diversity we see on earth came into being, which forces shaped it, and how it is changing today. To that end, they try to understand the process of evolution through mathematical models and the analysis of genome sequencing data.

Genomes, and in particular differences between them, are a crucial source of information to understand evolution and biology in general. They provide a record of the evolutionary past of populations, their relatedness patterns, their demography, and their adaptations.

More at https://www.uantwerpen.be/en/staff/hannes-svardal/svardal-lab/

HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding

Jit — Sat, 08 May 2021 21:25:00 -0500

HapSolo identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. It evaluates candidate primary assemblies using BUSCO scores and then distinguishes among them using a cost function. The cost function can be defined by the user, but by default it considers the number of missing, duplicated and single BUSCO genes within the assembly. HapSolo performs hill climbing to minimize cost over thousands of candidate assemblies.
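
The hill-climbing idea described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not HapSolo's implementation: `busco_cost` is a hypothetical cost built from the three BUSCO counts, `toy_busco_counts` is a made-up stand-in for purging contigs and running BUSCO, and a single threshold replaces the several alignment metrics the real tool tunes.

```python
import random

def busco_cost(missing, duplicated, single):
    """Hypothetical cost: penalise missing and duplicated BUSCO genes
    relative to single-copy ones (not necessarily HapSolo's default)."""
    return (missing + duplicated) / max(single, 1)

def toy_busco_counts(threshold, total=1000):
    """Made-up stand-in for 'purge contigs at this threshold, then run BUSCO'.
    Low thresholds lose genes; high thresholds keep duplicated haplotigs."""
    duplicated = int(400 * threshold ** 2)
    missing = int(400 * (1 - threshold) ** 2)
    single = total - missing - duplicated
    return missing, duplicated, single

def hill_climb(iterations=2000, step=0.05, seed=0):
    """Random-restart-free hill climbing: propose a nearby threshold and
    keep it only if the cost strictly decreases."""
    rng = random.Random(seed)
    best_t = rng.random()
    best_cost = busco_cost(*toy_busco_counts(best_t))
    for _ in range(iterations):
        candidate = min(1.0, max(0.0, best_t + rng.uniform(-step, step)))
        cost = busco_cost(*toy_busco_counts(candidate))
        if cost < best_cost:
            best_t, best_cost = candidate, cost
    return best_t, best_cost

best_t, best_cost = hill_climb()
print(f"threshold={best_t:.3f} cost={best_cost:.3f}")
```

In this toy model the cost is lowest near the middle of the threshold range, where neither missing nor duplicated genes dominate, and the search settles there.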

Address of the bookmark: https://github.com/esolares/HapSolo

Josefa González Lab

Thu, 19 Aug 2021 08:52:56 -0500

The lab focuses on understanding how organisms adapt to their environments. They combine omics approaches with detailed molecular and phenotypic analyses to get a comprehensive picture of adaptation. Their aim is to be internationally recognized as a leading lab in the field of environmental adaptation.
The lab also shares its passion for science with the general public by leading outreach projects aimed at increasing science awareness.

More at https://www.biologiaevolutiva.org/gonzalez_lab/

ncbi-datasets-cli -- Quickstart: command line tools!

Jit — Tue, 07 Dec 2021 02:51:26 -0600

Install and use the NCBI Datasets command line tools

The NCBI Datasets command line tools are datasets and dataformat.

Use datasets to download biological sequence data across all domains of life from NCBI.

Use dataformat to convert metadata from JSON Lines format to other formats.
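
As a rough picture of what converting JSON Lines metadata into a table involves, here is a standard-library Python sketch. It does not call or reproduce the dataformat tool; the function name `jsonl_to_tsv` and the record fields (`accession`, `organism`) are hypothetical, and real NCBI metadata records are nested and use different field names.

```python
import csv
import io
import json

def jsonl_to_tsv(jsonl_text, fields):
    """Convert JSON Lines text (one JSON object per line) to TSV,
    keeping only the requested top-level fields."""
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(fields)  # header row
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        writer.writerow([record.get(f, "") for f in fields])
    return out.getvalue()

# Hypothetical records, for illustration only.
example = (
    '{"accession": "ACC_0001", "organism": "Example species"}\n'
    '{"accession": "ACC_0002", "organism": "Another species"}\n'
)
print(jsonl_to_tsv(example, ["accession", "organism"]))
```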

Conda download:

https://anaconda.org/conda-forge/ncbi-datasets-cli

Bulk download:

https://www.ncbi.nlm.nih.gov/datasets/builder/?tax_id=29979

Address of the bookmark: https://www.ncbi.nlm.nih.gov/datasets/docs/v1/quickstarts/command-line-tools/

PLAR: Pipeline for lncRNA annotation from RNA-seq data

Abhi — Fri, 07 Jan 2022 06:18:01 -0600

Due to several requests, we are releasing an assignment of orthologs, determined using the same methods used in Hezroni et al. (BLAST, Whole Genome Alignment (WGA), and synteny). One assignment compares human GENCODE genes (from GENCODE v30) to lncRNAs from other species identified by PLAR. Available here.

Species	Assembly	Code	Transcriptome	lncRNAs	Protein-coding
Human	hg19	hg19	Download	Download	Download
Rhesus	rheMac3	rm3	Download	Download	Download
Marmoset	calJac3	cj3	Download	Download	Download
Mouse	mm9	mm9	Download	Download	Download
Rabbit	oryCun2	oc2	Download	Download	Download
Dog	canFam3	cf3	Download	Download	Download
Ferret	musFur1	oa3	Download	Download	Download
Opossum	monDom5	md5	Download	Download	Download
Chicken	galGal4	gg4	Download	Download	Download
Lizard	anoCar2	ac2	Download	Download	Download
Coelacanth	latCha1	lc1	Download	Download	Download
Zebrafish	danRer7	dr7	Download	Download	Download
Stickleback	gasAcu1	ga1	Download	Download	Download
Nile tilapia	oreNil2	ot2	Download	Download	Download
Spotted gar	lepOcu1	lo1	Download	Download	Download
Elephant shark	calMil1	cm1	Download	Download	Download
Sea urchin	strPur4	sp4	Download	Download	Download

Address of the bookmark: http://www.weizmann.ac.il/Biological_Regulation/IgorUlitsky/PLAR