BOL: Related items

HASLR: a hybrid assembler which uses both second and third generation sequencing reads

BioStar — Mon, 04 May 2020 02:04:03 -0500

HASLR, a hybrid assembler which uses both second and third generation sequencing reads to efficiently generate accurate genome assemblies. Our experiments show that HASLR is not only the fastest assembler but also the one with the lowest number of misassemblies on all the samples compared to other tested assemblers. Furthermore, the generated assemblies in terms of contiguity and accuracy are on par with the other tools on most of the samples. Availability. HASLR is an open source tool available at https://github.com/vpc-ccg/haslr.

Address of the bookmark: https://github.com/vpc-ccg/haslr

Sequence Ontology Bioinformatics Analysis (SOBA) tool to provide a simple statistical and graphical summary of an annotated genome

BioStar — Wed, 22 Jul 2020 10:11:13 -0500

We have developed the Sequence Ontology Bioinformatics Analysis (SOBA) tool to provide a simple statistical and graphical summary of an annotated genome. We envisage its use during annotation jamborees, genome comparison and for use by developers for rapid feedback during annotation software development and testing. SOBA also provides annotation consistency feedback to ensure correct use of terminology within annotations, and guides users to add new terms to the Sequence Ontology when required. SOBA is available at http://www.sequenceontology.org/cgi-bin/soba.cgi.

More at https://pubmed.ncbi.nlm.nih.gov/20494974/

Address of the bookmark: http://www.sequenceontology.org/cgi-bin/soba.cgi

Svardal lab

Sat, 20 Feb 2021 10:01:19 -0600

In the Svardal lab they are interested how the astonishing natural diversity we see on earth came into being, by which forces it formed and how it is changing today. Hence, they are trying to understand the process of evolution, with mathematical models and through the analysis of genome sequencing data.

Genomes, and in particular differences between them, are a crucial source of information to understand evolution and biology in general. They provide a record of the evolutionary past of populations, their relatedness patterns, their demography, and their adaptations.

More at https://www.uantwerpen.be/en/staff/hannes-svardal/svardal-lab/

HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding

Jit — Sat, 08 May 2021 21:25:00 -0500

HapSolo, that identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. HapSolo evaluates candidate primary assemblies using BUSCO scores and then distinguishes among candidate assemblies using a cost function. The cost function can be defined by the user but by default considers the number of missing, duplicated and single BUSCO genes within the assembly. HapSolo performs hill climbing to minimize cost over thousands of candidate assemblies.

Address of the bookmark: https://github.com/esolares/HapSolo

Josefa González Lab

Thu, 19 Aug 2021 08:52:56 -0500

Lab focus on understanding how organisms adapt to their environments. They combine omics approaches with detailed molecular and phenotypic analyses to get a comprehensive picture of adaptation. Our aim at being internationally recognized as a leading lab in the field of environmental adaptation.
Lab share our passion for science with the general public by leading outreach projects aimed at increasing science awareness.

More at https://www.biologiaevolutiva.org/gonzalez_lab/

ncbi-datasets-cli -- Quickstart: command line tools !

Jit — Tue, 07 Dec 2021 02:51:26 -0600

Install and use the NCBI Datasets command line tools

The NCBI Datasets datasets command line tools are datasets and dataformat .

Use datasets to download biological sequence data across all domains of life from NCBI.

Use dataformat to convert metadata from JSON Lines format to other formats.

Conda download:

https://anaconda.org/conda-forge/ncbi-datasets-cli

Buld Download

https://www.ncbi.nlm.nih.gov/datasets/builder/?tax_id=29979

Address of the bookmark: https://www.ncbi.nlm.nih.gov/datasets/docs/v1/quickstarts/command-line-tools/

PLAR: Pipeline for lncRNA annotation from RNA-seq data

Abhi — Fri, 07 Jan 2022 06:18:01 -0600

Due to several requests, we are releasing an assingment of orthologs, determined using the same methods used in Hezroni et al. (BLAST, Whole Genome Alignment (WGA), and synteny). One is comparing human GENCODE genes (from GENCODE v30) to lncRNAs from other species identified by PLAR. Available here.

Species	Assembly	Code	Transcriptome	lncRNAs	Protein-coding
Human	hg19	hg19	Download	Download	Download
Rhesus	rheMac3	rm3	Download	Download	Download
Marmoset	calJac3	cj3	Download	Download	Download
Mouse	mm9	mm9	Download	Download	Download
Rabbit	oryCun2	oc2	Download	Download	Download
Dog	canFam3	cf3	Download	Download	Download
Ferret	musFur1	oa3	Download	Download	Download
Opossum	monDom5	md5	Download	Download	Download
Chicken	galGal4	gg4	Download	Download	Download
Lizard	anoCar2	ac2	Download	Download	Download
Coelacanth	latCha1	lc1	Download	Download	Download
Zebrafish	danRer7	dr7	Download	Download	Download
Stickleback	gasAcu1	ga1	Download	Download	Download
Nile tilapia	oreNil2	ot2	Download	Download	Download
Spotted gar	lepOcu1	lo1	Download	Download	Download
Elephant shark	calMil1	cm1	Download	Download	Download
Sea urchin	strPur4	sp4	Download	Download	Download

Address of the bookmark: http://www.weizmann.ac.il/Biological_Regulation/IgorUlitsky/PLAR

Bioinfo Lab

Fri, 04 Mar 2022 00:17:00 -0600

The Institute of Bioinformatics conducts internationally renowned research and provides profound education in bioinformatics. Its research focuses on development and application of machine learning and statistical methods in biology and medicine.

Contact:
Computer Science Building (Science Park 3)
Altenberger Str. 69, A-4040 Linz, Austria
Tel. +43 732 2468 4520 / Fax +43 732 2468 4539
E-mail secretary@bioinf.jku.at

http://www.bioinf.jku.at/

genomenotebook

Abhi — Thu, 20 Apr 2023 13:19:01 -0500

https://dbikard.github.io/genomenotebook/

Install

pip install genomenotebook

How to use

Create a simple genome browser with a search bar. The sequence appears when zooming in.

import genomenotebook as gn

g=gn.GenomeBrowser(genome_path, gff_path, init_pos=10000)
g.show()

Tracks can be added to visualize your favorite genomics data. See Examples for more !!!!

Address of the bookmark: https://dbikard.github.io/genomenotebook/

Mitochondrial genome assembly tools !

Abhi — Wed, 06 Sep 2023 00:37:18 -0500

Mitochondrial genome assembly tools are specialized software and algorithms designed to accurately reconstruct the mitochondrial genome (mitogenome) from sequencing data, typically obtained through techniques like next-generation sequencing (NGS). The mitochondrial genome is relatively small compared to the nuclear genome, making it an ideal target for assembly. Here are some commonly used mitochondrial genome assembly tools:

MitoFinder: Mitofinder is a pipeline to assemble mitochondrial genomes and annotate mitochondrial genes from trimmed read sequencing data.

MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads

MITObim: MITObim is a tool specifically developed for the iterative assembly of mitochondrial genomes. It starts with a reference mitogenome and iteratively refines the assembly using the read data.

MITOS: MITOS is a web-based platform that provides a pipeline for annotating mitochondrial genomes. It integrates multiple software tools for assembly, annotation, and visualization of mitogenomes.

MIRA: MIRA (Mimicking Intelligent Read Assembly) is a versatile genome assembly tool that can be used for mitochondrial genome assembly. It supports various sequencing technologies and allows for reference-based or de novo assembly.

NOVOPlasty: NOVOPlasty is a user-friendly tool designed for de novo assembly of organelle genomes, including mitochondria. It utilizes a seed-and-extend algorithm and is suitable for both short-read and long-read data.

MITOS2: MITOS2 is an updated version of the MITOS pipeline, which automates the annotation of mitochondrial genomes. It provides improved accuracy and additional features for mitochondrial genome analysis.

GetOrganelle: While primarily designed for chloroplast genome assembly, GetOrganelle can also be used for mitochondrial genome assembly. It is particularly useful for dealing with high-throughput sequencing data.

SPAdes: SPAdes (St. Petersburg genome assembler) is a versatile genome assembly tool that can be employed for mitochondrial genome assembly, especially when dealing with complex datasets that may contain nuclear mitochondrial DNA sequences (numts).

IDBA-UD: IDBA-UD (Iterative De Bruijn Graph De Novo Assembler) is another de novo assembly tool that can be used for mitochondrial genome assembly, especially in cases with relatively low coverage.

Velvet: Velvet is a de novo assembly tool that can be applied to mitochondrial genome assembly, especially when working with short-read data.

When selecting a mitochondrial genome assembly tool, it's important to consider the specific characteristics of your sequencing data, such as read length and coverage, as well as the complexity of the mitochondrial genome. Additionally, some tools are better suited for specific organisms or research objectives, so choosing the right tool will depend on your particular project requirements.