BOL: Related items

Tools to Predict the Impact of Missense Variants !

Jit — Mon, 23 Apr 2018 12:57:33 -0500

Prioritizing missense variants for further experimental investigation is a key challenge in current sequencing studies for exploring complex and Mendelian diseases. A large number of in silico tools have been employed for the task of pathogenicity prediction, including PolyPhen‐2, SIFT, FatHMM, MutationTaster‐2, MutationAssessor, Combined Annotation Dependent Depletion, LRT, phyloP, and GERP++, as well as optimized methods of combining tool scores, such as Condel and Logit. Due to the wealth of these methods, an important practical question to answer is which of these tools generalize best, that is, correctly predict the pathogenic character of new variants.

Study of 10 tools on five datasets that such a comparative evaluation of these tools is hindered by two types of circularity: they arise due to (1) the same variants or (2) different variants from the same protein occurring both in the datasets used for training and for evaluation of these tools, which may lead to overly optimistic results. Comparative evaluations of predictors that do not address these types of circularity may erroneously conclude that circularity confounded tools are most accurate among all tools, and may even outperform optimized combinations of tools.

Following tools are useful for mis sense muation detection ...

PolyPhen‐2 (PP2)
“Predicts possible impact of an amino acid substitution on the structure and function of a human protein using straightforward physical and comparative considerations”

MutationTaster‐2 (MT2)
“Evaluation of the disease‐causing potential of DNA sequence alterations”

MutationAssessor (MASS)
“Predicts the functional impact of amino acid substitutions in proteins, such as mutations discovered in cancer or missense polymorphisms”

LRT
“Identify a subset of deleterious mutations that disrupt highly conserved amino acids within protein‐coding sequences, which are likely to be unconditionally deleterious”

SIFT
“Predicts whether an amino acid substitution affects protein function”

GERP++
“Identifies constrained elements in multiple alignments by quantifying substitution deficits. These deficits represent substitutions that would have occurred if the element were neutral DNA, but did not occur because the element has been under functional constraint. We refer to these deficits as “rejected substitutions.” Rejected substitutions are a natural measure of constraint that reflects the strength of past purifying selection on the element”

phyloP
“Compute conservation or acceleration P values based on an alignment and a model of neutral evolution”

FatHMM unweighted (FatHMM‐U)
Predicts “functional consequences of both coding variants, that is, nonsynonymous single‐nucleotide variants, and noncoding variants”

FatHMM weighted (FatHMM‐W)
Predicts “functional consequences of both coding variants, that is, nonsynonymous single‐nucleotide variants, and noncoding variants” and its weighting scheme attributes higher tolerance scores to SNVs in proteins, related proteins, or domains that already include a high fraction of pathogenic variantsh

Combined Annotation Dependent Depletion (CADD)
“CADD is a tool for scoring the deleteriousness of single‐nucleotide variants as well as insertion/deletions variants in the human genome”

MIX: Combining multiple assemblies from NGS data

Rahul Nayak — Tue, 08 May 2018 04:58:05 -0500

Mix is a tool that combines two or more draft assemblies, without relying on a reference genome and has the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a path in the extension graph that maximizes the cumulative contig length.

The Mix algorithm, approach and results were published in BMC bioinformatics : http://www.biomedcentral.com/1471-2105/14/S15/S16.

Address of the bookmark: https://github.com/cbib/MIX

vcfR: a package to manipulate and visualize VCF data in R

Jit — Thu, 25 Oct 2018 09:05:59 -0500

VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from the VCF data for use with typical R functions. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file or converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and the R environment connecting familiar software with genomic data.

Address of the bookmark: https://github.com/knausb/vcfR

List of comparative genomics resources !

Shruti Paniwala — Tue, 28 Jun 2022 04:08:06 -0500

3D-GENOMICS -- A Database to Compare Structural and Functional Annotations of Proteins between Sequenced Genomes

Compare structural and functional annotations of proteins between sequenced genomes.

ARED Organism -- expansion of ARED reveals AU-rich element cluster variations between human and mouse

View AREs in the human transcriptome and study the comparative genomics of AREs in model organisms.

ATGC -- Alignable Tight Genomic Clusters Database

Find information about orthologous genes in prokaryotes.

AnimalQTLdb -- a livestock QTL database tool set for positional QTL information mining and beyond

Search for publicly available QTL data on livestocks and animal species.

BGDB -- Bovine Genome Database

Find information about bovine genomics data.

COMPARE -- a multi-organism system for cross-species data comparison and transfer of information

A multi-organism web-based resource system designed to easily retrieve, correlate and interpret data across species.

CONDOR -- COnserved Non-coDing Orthologous Regions

A database resource of developmentally associated conserved non-coding elements.

CORG -- A database for COmparative Regulatory Genomics

Delineate conserved non-coding blocks from upstream regions of putative orthologous gene pairs from man, mouse, rat, fugu, Mus musculus, Danio rerio, and zebrafish.

COXPRESdb -- a database of coexpressed gene networks in mammals

Find coexpressed gene lists and networks in human and mouse.

CVTree -- A Phylogenetic Tree Reconstruction Tool Based on Whole Genomes

Construct phylogenetic tree of microorganisms based on oligopeptide content of their complete proteomes.

CleanEST -- the cleansed EST libraries database

A novel database server that classifies GenBank's dbEST (database of expressed gene sequences) libraries and removes contaminants.

CoCoa -- COefficient of COAncestry software

Find information about the ancestral relationship between genes.

CoGemiR -- a comparative genomics microRNA database

Provides an overview of the genomic organization of microRNAs and extent of conservation during evolution in different metazoan species.

Comparative Genometrics (CG) -- a database dedicated to biometric comparisons of whole genomes

Conduct comparative biometric analysis of chromosomes of different organisms.

DoTS -- Database Of Transcribed Sequences

Search for Indices of gene and transcripts in human and mouse.

DroSpeGe -- rapid access database for new Drosophila species genomes

Search and compare 12 new and old Drosophila genomes.

ECR Browser -- A Tool for Visualizing and Accessing Data from Comparisons of Multiple Vertebrate Genomes

Access to whole genome alignments of human, mouse, rat and fish sequences.

EPGD -- Eukaryotic Paralog Group Database

Find eukaryotic paralog/paralogon information.

EVOG -- evolutionary visualizer for overlapping genes

Analyze the evolutionary process of overlapping genes when comparing different species.

GNAT -- Inter-species gene mention normalization (ISGN)

The first publicly available system reported to handle inter-species gene mention normalization.

GenColors -- annotation and comparative genomics of prokaryotes made easy

A web-based software/database system aimed at an improved and accelerated annotation of prokaryotic genomes.

GeneNest gene indices

Visualize gene indices of human, mouse, Arabidopsis, Zebrafish, Drosophila and Sheep.

GenomeTrafac -- a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs

Use comparative genomics approach to characterize gene models and identify putative cis-regulatory regions of RefSeq Gene Orthologs.

IKMC -- International Knockout Mouse Consortium web portal

Find information about mutated mouse genes.

IMG/M -- Integrated Microbial Genomes/Metagenomes

A data management and analysis system for metagenomes

ISED -- Influenza sequence and epitope database.

Search for influenza sequence, vaccine, and drug resistance information.

LAMDHI: The Search for Animal Models Starts Here

LAMHDI, the initiative to Link Animal Models to Human DIsease, is designed to accelerate the research process by providing biomedical researchers with a simple, comprehensive Web-based resource to find the best animal models for their research.

MANTIS -- a phylogenetic framework for multi-species genome comparisons

The missing link between multi-species full genome comparisons and functional analysis.

MBGD -- Microbial genome database for comparative analysis

Conduct comparative analysis of completely sequenced microbial genomes.

MEGA -- Molecular Evolutionary Genetics Analysis

A biologist-centric software for evolutionary analysis of DNA and protein sequences.

MamPol -- a database of nucleotide polymorphism in the Mammalia class

Conduct single nucleotide polymorphisms diversity measurements among homologous sequences from the Mammalia class.

MicrobesOnline -- Prokaryotic Genome Database

Find information about 1000s of microbial genomes.

Narcisse -- a mirror view of conserved syntenies

A database dedicated to the study of genome conservation.

OMA -- the Orthologous MAtrix project

Explore orthologous relations across 352 complete genomes.

OPTIC -- orthologous and paralogous transcripts in clades

Browse complete genomes in several clades.

OrthoDB -- the hierarchical catalog of eukaryotic orthologs

Find groups of orthologous genes.

OrthoMaM -- orthologous mammalian markers

A database of orthologous genomic markers for placental mammal phylogenetics.

PEDANT -- Protein Extraction, Description and ANalysis Tool

Conduct genome wide functional and structural analysis.

PReMod -- a database of genome-wide mammalian cis-regulatory module predictions

Conduct genome-wide cis-regulatory module (CRM) predictions for both the human and the mouse genomes.

PhenomicDB -- Comparison of phenotypes of orthologous genes in human and model organisms

Compare phenotypes of a given gene or gene set in different model organisms.

Phylemon -- A suite of web tools for molecular evolution, phylogenetics and phylogenomics

Phylemon is a web server that integrates a selected suite of more than 20 different tools from the most popular stand-alone programs of phylogenetic and evolutionary analysis.

PhyloPat -- the phylogenetic pattern database

Use this database to see where in the evolution some phylogenetic lineages were started, and over which species they were contained.

Pristionchus.org -- a genome-centric database of the nematode satellite species Pristionchus pacificus

Search for genomic information on nematode satellite species Pristionchus pacificus.

ProtClustDB -- NCBI Protein Clusters Database

Find information about related protein sequences.

ProtozoaDB -- database of protozoan genomes

Database hosting genomics and post-genomics data from multiple protozoans.

Pseudofam -- the pseudogene families database

A database of pseudogene families based on the protein families from the Pfam database.

RIDM - RIKEN Integrated Database of Mammals

Find genomic information about mammals.

RegPrecise -- Regulon Prediction Database

Find information about predicted regulons in prokaryotic transcription regulation.

SALAD -- Surveyed contained motif ALignment diagram and the Associating Dendrogram

Perform systematic comparison of proteome data among species.

SGN -- SOL Genomics Network

A comparative map viewer dedicated to the biology of the Solanaceae family.

ShotgunFunctionalizeR -- R-package for functional comparison of metagenomes

Analyze data from functional analysis on fragmented microbial genetic material.

SnoopCGH -- Comparative Genomic Hybridization software

Visualize and explore comparative genomic hybridization data sets.

SwissRegulon -- a database of genome-wide annotations of regulatory sites

Search for genome-wide annotations of regulatory sites in yeast and prokaryotes genomes.

TaxonGap -- a visualization tool for intra- and inter-species variation among individual biomarkers

Compare and select individual biomarkers.

The Adaptive Evolution Database (TAED) -- a phylogeny based tool for comparative genomics

Search for information on adaptive evolution in gene families of higher plants and chordate.

The CGView Server -- a comparative genomics tool for circular genomes

Generate graphical maps of circular genomes that show sequence features, base composition plots, analysis results and sequence similarity plots.

The ERGO -- Genome analysis and discovery system

Conduct a comprehensive analysis of genes and genomes.

The Macaque Genome: Interactive Poster and Teaching Resource

An interactive online poster presentation on the Macaque genome, including high-quality images, video clips, and Web resources

The TIGR Gene Indices -- clustering and assembling EST and known genes and integration with eukaryotic genomes

Search for annotated genetic information of expressed sequence tags (ESTs) in different eukaryotic organisms.

UniGene

Find mapping and expression information for a unigene cluster (ESTs and full-length mRNA sequences organized into clusters that each represent a unique known or putative gene)

Uprobe -- universal overgo hybridization-based probe retrieval and design

A public online resource for identifying or designing 'universal' overgo-hybridization probes from conserved sequences that can be used to efficiently screen one or more genomic libraries from a designated group of species.

VISTA -- Computational Tools for Comparative Genomics

Comprehensive suite of programs and databases for comparative analysis of genomic sequences.

cBARBEL -- Catfish Breeder and Researcher Bioinformatics Entry Location

Find information about ictalurid catfish.

eggNOG -- evolutionary genealogy of genes: Non-supervised Orthologous Groups

Discover orthologous groups of genes.

metaTIGER -- a metabolic gene evolution resource

Find metabolic networks and phylogenomic information on a taxonomically diverse range of eukaryotes.

xBASE -- a collection of online databases for bacterial comparative genomics

Conduct bacterial comparative genomics.

Bioinformatics tools to explore SSRs in genomes !

BioStar — Tue, 07 Mar 2023 13:06:15 -0600

There are several bioinformatics tools that can be used to explore Simple Sequence Repeats (SSRs), which are also known as microsatellites. Here are a few examples:

MISA: MISA (MIcroSAtellite) is a web-based tool that can identify SSRs in DNA sequences. It can be used to analyze nucleotide sequences from various organisms and can identify perfect, compound, and imperfect SSRs.
SSR Locator: SSR Locator is a web-based tool that identifies SSRs in both DNA and RNA sequences. It can identify perfect, compound, and imperfect SSRs, and can also filter out low complexity regions.
SciRoKo: SciRoKo is a software tool that can identify SSRs in DNA sequences. It can be used to analyze genomic and transcriptomic sequences from various organisms and can identify perfect, compound, and imperfect SSRs.
Primer3: Primer3 is a web-based tool that designs PCR primers for SSRs. It can design primers for perfect and imperfect SSRs, and can be used to design primers for SSRs in various organisms.
QDD: QDD (Quick Detection of Duplication) is a software tool that can identify SSRs in DNA sequences and can also identify duplicate loci. It can be used to analyze genomic and transcriptomic sequences from various organisms.

These are just a few examples of the many bioinformatics tools available for exploring SSRs. Depending on your specific needs and research questions, you may find that other tools are more appropriate for your analysis.

Exploring Bacterial Comparative Genomics: A Bioinformatics Approach

LEGE — Sat, 14 Dec 2024 12:31:14 -0600

In the world of microbiology, bacteria have long fascinated scientists for their diversity, adaptability, and crucial roles in ecosystems and human health. Comparative genomics—a field that involves analyzing and comparing the genomes of different organisms—has revolutionized our understanding of bacterial evolution, adaptation, and pathogenicity. By leveraging bioinformatics tools and techniques, researchers can uncover genomic insights that were once hidden. This blog delves into the principles, methodologies, and applications of bacterial comparative genomics from a bioinformatics perspective.

What is Bacterial Comparative Genomics?

Comparative genomics involves the systematic comparison of genomes across different bacterial species or strains. This approach allows scientists to:

Identify conserved and unique genes.
Explore genetic determinants of pathogenicity.
Understand bacterial evolution and phylogenetics.
Investigate horizontal gene transfer and its role in antibiotic resistance.

Bioinformatics is central to these analyses, enabling the processing and interpretation of large-scale genomic data.

Key Steps in Bacterial Comparative Genomics

Genome Sequencing and Assembly: The process begins with obtaining high-quality bacterial genome sequences. Advances in next-generation sequencing (NGS) technologies have made it faster and more affordable to sequence bacterial genomes. Tools such as SPAdes and Velvet are commonly used for genome assembly.
Genome Annotation: Annotating a genome involves identifying genes, regulatory elements, and other genomic features. Automated tools like Prokka and RAST provide functional annotations, allowing researchers to predict the roles of genes and proteins.
Genome Alignment: Aligning genomes is crucial for identifying conserved regions, single-nucleotide polymorphisms (SNPs), and structural variations. Tools like Mauve and progressiveMauve are commonly employed for whole-genome alignments.
Comparative Analyses:
- Core and Pan-genome Analysis: The core genome consists of genes shared across all strains of a species, while the pan-genome includes all genes found in any strain. Software like Roary and BPGA can perform core and pan-genome analyses.
- Phylogenetic Analysis: Comparative genomics often involves reconstructing evolutionary relationships. Tools such as MEGA and IQ-TREE facilitate phylogenetic tree construction based on genomic data.
- Functional Enrichment Analysis: To understand the biological significance of unique or shared genes, functional enrichment analysis using databases like GO (Gene Ontology) and KEGG is essential.

Recommended Bioinformatics Tools for Comparative Genomics

Here are some additional bioinformatics tools that can aid bacterial comparative genomics:

OrthoFinder: For accurate ortholog identification across multiple genomes.
PanOCT: Specifically designed for pan-genome clustering and annotation.
FASTANI: A tool for calculating Average Nucleotide Identity (ANI) for microbial genome comparisons.
CIRCOS: For visually comparing genomic data through circular genome plots.
Galaxy Platform: A user-friendly web-based platform offering numerous genomic analysis tools.
BLAST: Essential for sequence alignment and similarity searches.
PhyloSift: Focused on phylogenetic analysis of microbial genomes using marker genes.

These tools, in combination with the methods discussed, provide a robust framework for conducting comprehensive comparative genomic studies.

Applications of Bacterial Comparative Genomics

Understanding Pathogenicity: Comparative genomics helps identify virulence factors that distinguish pathogenic strains from non-pathogenic relatives. For instance, comparing genomes of Escherichia coli strains has revealed key genetic determinants of pathogenicity in enterohemorrhagic strains.
Antibiotic Resistance Research: The spread of antibiotic resistance genes through horizontal gene transfer is a major global concern. Comparative analyses can trace the origins and dissemination of resistance genes, aiding in the development of countermeasures.
Microbial Ecology and Evolution: By studying genomic variations, researchers can understand how bacteria adapt to different environments. This is particularly relevant for extremophiles and symbiotic bacteria.
Vaccine Development: Identifying conserved antigens across pathogenic strains is critical for vaccine design. Comparative genomics has been instrumental in developing vaccines against pathogens like Neisseria meningitidis.
Biotechnology Applications: Comparative studies can uncover unique metabolic pathways in bacteria, paving the way for applications in bioremediation, synthetic biology, and industrial microbiology.

Challenges in Bacterial Comparative Genomics

While the field has made significant strides, several challenges remain:

Data Overload: The rapid growth of sequencing data requires robust computational infrastructure and efficient algorithms.
Genome Plasticity: High rates of horizontal gene transfer and genome rearrangements in bacteria complicate comparative analyses.
Annotation Accuracy: Automated annotation tools are not infallible, and manual curation is often needed for high-confidence results.
Interpreting Non-Coding Regions: Understanding the functional significance of non-coding genomic regions remains a challenge.

Future Directions

The integration of bacterial comparative genomics with other ‘omics’ approaches—such as transcriptomics, proteomics, and metabolomics—promises a more comprehensive understanding of bacterial biology. Additionally, advancements in machine learning and artificial intelligence are likely to further enhance bioinformatics analyses, enabling the prediction of complex phenotypes from genomic data.

Conclusion

Bacterial comparative genomics, driven by bioinformatics, continues to unravel the complexities of bacterial life. From combating antibiotic resistance to uncovering the secrets of microbial evolution, this interdisciplinary field holds immense potential for addressing pressing challenges in microbiology and beyond. As technology advances, so too will our ability to harness the power of comparative genomics for scientific and societal benefit.

List of bioinformatics workflow management tools !

Rahul Nayak — Sat, 20 Mar 2021 00:15:25 -0500

Here are list of Workflow Managers

BigDataScript – A cross-system scripting language for working with big data pipelines in computer systems of different sizes and capabilities. [ paper-2014 | web ]
Bpipe – A small language for defining pipeline stages and linking them together to make pipelines. [ web ]
Common Workflow Language – a specification for describing analysis workflows and tools that are portable and scalable across a variety of software and hardware environments, from workstations to cluster, cloud, and high performance computing (HPC) environments. [ web ]
Cromwell – A Workflow Management System geared towards scientific workflows. [ web ]
Galaxy – a popular open-source, web-based platform for data intensive biomedical research. Has several features, from data analysis to workflow management to visualization tools. [ paper-2018 | web ]
Nextflow (recommended) – A fluent DSL modelled around the UNIX pipe concept, that simplifies writing parallel and scalable pipelines in a portable manner. [ paper-2018 | web ]
Ruffus – Computation Pipeline library for python widely used in science and bioinformatics. [ paper-2010 | web ]
SeqWare – Hadoop Oozie-based workflow system focused on genomics data analysis in cloud environments. [ paper-2010 | web ]
Snakemake – A workflow management system in Python that aims to reduce the complexity of creating workflows by providing a fast and comfortable execution environment. [ paper-2018 | web ]
Workflow Descriptor Language – Workflow standard developed by the Broad. [ web ]

Bioinformatics tools developed for Oxford Nanopore data analysis !

biogeek — Wed, 27 Dec 2017 20:47:30 -0600

MinION is the only portable real-time device for DNA and RNA sequencing. Each consumable flow cell can now generate 10–20 Gb of DNA sequence data. Ultra-long read lengths are possible (hundreds of kb) as you can choose your fragment length. One of the technical advantages of ONT data is the read length, which offers great prospects for genome assembly. Generally, assemblers are based on several different types of algorithms, such as greedy, overlap-layout-consensus (OLC), de Bruijn graph (DBG), and string graph.

List of analysis tools developed for Oxford Nanopore data

BWA
Fast nanopore data tuned alignment tool
https://github.com/lh3/bwa

GraphMap
Mapper for long and error-prone reads
https://github.com/isovic/graphmap

LAST
Nanopore tuned alignment tool
http://last.cbrc.jp/

LINKS
Software tool for long read scaffolding
https://github.com/warrenlr/LINKS/

marginAlign
Tools to align nanopore reads to a reference
https://github.com/benedictpaten/marginAlign

minoTour
Real time analysis tools
http://minotour.nottingham.ac.uk/

nanoCORR
Error-correction tool for nanopore sequence data
https://github.com/jgurtowski/nanocorr

NanoOK
Software for nanopore data, quality and error profiles
https://documentation.tgac.ac.uk/display/NANOOK/NanoOK

Nanopolish
Nanopore analysis and genome assembly software
https://github.com/jts/nanopolish

nanopore
Variant-detection tool for nanopore sequence data
https://github.com/mitenjain/nanopore

Nanocorrect
Error-correction tool for nanopore sequence data
https://github.com/jts/nanocorrect/

npReader
Real-time conversion and analysis of nanopore reads
https://github.com/mdcao/npReader

poRe
Tool for analyzing and visualizing nanopore data
https://sourceforge.net/p/rpore/wiki/Home/

PoreSeq
Error-correction and variant-calling software
https://github.com/tszalay/poreseq

Poretools
Nanopore sequence analysis and visualization software
https://github.com/arq5x/poretools

SSPACE-LongRead
Genome scaffolding tool
http://www.baseclear.com/genomics/bioinformatics/basetools/SSPACE-longread

SMIS
Genome scaffolding tool
https://sourceforge.net/projects/phusion2/files/smis/

List of assemblers for Oxford Nanopore MinION long reads

LQS
DALIGNER, Celera OLC Nanocorrect,
Nanopolish corrector
https://github.com/jts/nanopolish

PBcR
HGAP or BLASR, Celera OLC
PBcR corrector
http://wgs-assembler.sourceforge.net/wiki/index.php/PBcR
–
Canu
MHAP, Celera OLC
Canu corrector
https://github.com/marbl/canu

Falcon
String graph, Celera OLC
Falcon corrector
https://github.com/PacificBiosciences/falcon

Miniasm
OLC
https://github.com/lh3/miniasm

ra-integrate
OLC
https://github.com/mariokostelac/ra-integrate/

ALLPATHS-LG
de Bruijn graph
ALLPATHS-L corrector
https://www.broadinstitute.org/software/allpaths-lg/blog/?page_id=12

SPAdes
de Bruijn graph
SPAdes corrector
http://bioinf.spbau.ru/spades

HISAT2: a fast and sensitive alignment program for mapping next-generation sequencing reads

Rahul Nayak — Tue, 08 May 2018 04:27:22 -0500

HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes (as well as to a single reference genome). Based on an extension of BWT for graphs [Sirén et al. 2014], we designed and implemented a graph FM index (GFM), an original approach and its first implementation to the best of our knowledge. In addition to using one global GFM index that represents a population of human genomes, HISAT2 uses a large set of small GFM indexes that collectively cover the whole genome (each index representing a genomic region of 56 Kbp, with 55,000 indexes needed to cover the human population). These small indexes (called local indexes), combined with several alignment strategies, enable rapid and accurate alignment of sequencing reads. This new indexing scheme is called a Hierarchical Graph FM index (HGFM).

more at https://ccb.jhu.edu/software/hisat2/index.shtml

Address of the bookmark: https://github.com/infphilo/hisat2

DNA Nucleotide Counter

Neel — Fri, 12 Oct 2018 04:37:01 -0500

DNA Nucleotide Counter is delivered in a DNA Baser package together with other free molecular biology tools. Download the package and double click it. The programs inside the package will be extracted to the destination folder (specified by you). Go to the destination folder and double click the program you want to use.

It installs in any computer even if you don't have administrator rights!

Address of the bookmark: http://www.dnabaser.com/download/DNA-Counter/index.html