<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/36395?offset=150</link>
	<atom:link href="https://bioinformaticsonline.com/related/36395?offset=150" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/pages/view/36373/tools-to-predict-the-impact-of-missense-variants</guid>
	<pubDate>Mon, 23 Apr 2018 12:57:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/pages/view/36373/tools-to-predict-the-impact-of-missense-variants</link>
	<title><![CDATA[Tools to Predict the Impact of Missense Variants !]]></title>
	<description><![CDATA[<p><span>Prioritizing missense variants for further experimental investigation is a key challenge in current sequencing studies for exploring complex and Mendelian diseases. A large number of&nbsp;</span><em>in silico</em><span>&nbsp;tools have been employed for the task of pathogenicity prediction, including PolyPhen‐2, SIFT, FatHMM, MutationTaster‐2, MutationAssessor, Combined Annotation Dependent Depletion, LRT, phyloP, and GERP++, as well as optimized methods of combining tool scores, such as Condel and Logit. Due to the wealth of these methods, an important practical question to answer is which of these tools generalize best, that is, correctly predict the pathogenic character of new variants. </span></p><p><span>Study of 10 tools on five datasets that such a comparative evaluation of these tools is hindered by two types of circularity: they arise due to (1) the same variants or (2) different variants from the same protein occurring both in the datasets used for training and for evaluation of these tools, which may lead to overly optimistic results. Comparative evaluations of predictors that do not address these types of circularity may erroneously conclude that circularity confounded tools are most accurate among all tools, and may even outperform optimized combinations of tools.</span></p><p><span>Following tools are useful for mis sense muation detection ...&nbsp;</span></p><p>PolyPhen‐2 (PP2)<br />&ldquo;Predicts possible impact of an amino acid substitution on the structure and function of a human protein using straightforward physical and comparative considerations&rdquo;</p><p>MutationTaster‐2 (MT2)<br />&ldquo;Evaluation of the disease‐causing potential of DNA sequence alterations&rdquo;</p><p>MutationAssessor (MASS)<br />&ldquo;Predicts the functional impact of amino acid substitutions in proteins, such as mutations discovered in cancer or missense polymorphisms&rdquo;</p><p>LRT<br />&ldquo;Identify a subset of deleterious mutations that disrupt highly conserved amino acids within protein‐coding sequences, which are likely to be unconditionally deleterious&rdquo;</p><p>SIFT<br />&ldquo;Predicts whether an amino acid substitution affects protein function&rdquo;</p><p>GERP++<br />&ldquo;Identifies constrained elements in multiple alignments by quantifying substitution deficits. These deficits represent substitutions that would have occurred if the element were neutral DNA, but did not occur because the element has been under functional constraint. We refer to these deficits as &ldquo;rejected substitutions.&rdquo; Rejected substitutions are a natural measure of constraint that reflects the strength of past purifying selection on the element&rdquo;</p><p>phyloP<br />&ldquo;Compute conservation or acceleration P values based on an alignment and a model of neutral evolution&rdquo;</p><p>FatHMM unweighted (FatHMM‐U)<br />Predicts &ldquo;functional consequences of both coding variants, that is, nonsynonymous single‐nucleotide variants, and noncoding variants&rdquo;</p><p>FatHMM weighted (FatHMM‐W)<br />Predicts &ldquo;functional consequences of both coding variants, that is, nonsynonymous single‐nucleotide variants, and noncoding variants&rdquo; and its weighting scheme attributes higher tolerance scores to SNVs in proteins, related proteins, or domains that already include a high fraction of pathogenic variantsh</p><p>Combined Annotation Dependent Depletion (CADD)<br />&ldquo;CADD is a tool for scoring the deleteriousness of single‐nucleotide variants as well as insertion/deletions variants in the human genome&rdquo;</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36518/mix-combining-multiple-assemblies-from-ngs-data</guid>
	<pubDate>Tue, 08 May 2018 04:58:05 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36518/mix-combining-multiple-assemblies-from-ngs-data</link>
	<title><![CDATA[MIX: Combining multiple assemblies from NGS data]]></title>
	<description><![CDATA[<p>Mix is a tool that combines two or more draft assemblies, without relying on a reference genome and has the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a path in the extension graph that maximizes the cumulative contig length.</p>
<p>The Mix algorithm, approach and results were published in BMC bioinformatics :&nbsp;<a href="http://www.biomedcentral.com/1471-2105/14/S15/S16">http://www.biomedcentral.com/1471-2105/14/S15/S16</a>.</p><p>Address of the bookmark: <a href="https://github.com/cbib/MIX" rel="nofollow">https://github.com/cbib/MIX</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38004/vcfr-a-package-to-manipulate-and-visualize-vcf-data-in-r</guid>
	<pubDate>Thu, 25 Oct 2018 09:05:59 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38004/vcfr-a-package-to-manipulate-and-visualize-vcf-data-in-r</link>
	<title><![CDATA[vcfR:  a package to manipulate and visualize VCF data in R]]></title>
	<description><![CDATA[<p><span>VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from the VCF data for use with typical R functions. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file or converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and the R environment connecting familiar software with genomic data.</span></p><p>Address of the bookmark: <a href="https://github.com/knausb/vcfR" rel="nofollow">https://github.com/knausb/vcfR</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43896/list-of-comparative-genomics-resources</guid>
	<pubDate>Tue, 28 Jun 2022 04:08:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43896/list-of-comparative-genomics-resources</link>
	<title><![CDATA[List of comparative genomics resources !]]></title>
	<description><![CDATA[<div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1096638041"><span>3D-GENOMICS -- A Database to Compare Structural and Functional Annotations of Proteins between Sequenced Genomes</span></a></div><p>Compare structural and functional annotations of proteins between sequenced genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1100640374"><span>ARED Organism -- expansion of ARED reveals AU-rich element cluster variations between human and mouse</span></a></div><p>View AREs in the human transcriptome and study the comparative genomics of AREs in model organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234973128"><span>ATGC -- Alignable Tight Genomic Clusters Database</span></a></div><p>Find information about orthologous genes in prokaryotes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174596104"><span>AnimalQTLdb -- a livestock QTL database tool set for positional QTL information mining and beyond</span></a></div><p>Search for publicly available QTL data on livestocks and animal species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518150135"><span>BGDB -- Bovine Genome Database</span></a></div><p>Find information about bovine genomics data.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229012662"><span>COMPARE -- a multi-organism system for cross-species data comparison and transfer of information</span></a></div><p>A multi-organism web-based resource system designed to easily retrieve, correlate and interpret data across species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1218141952"><span>CONDOR -- COnserved Non-coDing Orthologous Regions</span></a></div><p>A database resource of developmentally associated conserved non-coding elements.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099057221"><span>CORG -- A database for COmparative Regulatory Genomics</span></a></div><p>Delineate conserved non-coding blocks from upstream regions of putative orthologous gene pairs from man, mouse, rat, fugu, Mus musculus, Danio rerio, and zebrafish.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1203608896"><span>COXPRESdb -- a database of coexpressed gene networks in mammals</span></a></div><p>Find coexpressed gene lists and networks in human and mouse.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1097763045"><span>CVTree -- A Phylogenetic Tree Reconstruction Tool Based on Whole Genomes</span></a></div><p>Construct phylogenetic tree of microorganisms based on oligopeptide content of their complete proteomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232729680"><span>CleanEST -- the cleansed EST libraries database</span></a></div><p>A novel database server that classifies GenBank's dbEST (database of expressed gene sequences) libraries and removes contaminants.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256926144"><span>CoCoa -- COefficient of COAncestry software</span></a></div><p>Find information about the ancestral relationship between genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1227549154"><span>CoGemiR -- a comparative genomics microRNA database</span></a></div><p>Provides an overview of the genomic organization of microRNAs and extent of conservation during evolution in different metazoan species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1117678221"><span>Comparative Genometrics (CG) -- a database dedicated to biometric comparisons of whole genomes</span></a></div><p>Conduct comparative biometric analysis of chromosomes of different organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151007916"><span>DoTS -- Database Of Transcribed Sequences</span></a></div><p>Search for Indices of gene and transcripts in human and mouse.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174510065"><span>DroSpeGe -- rapid access database for new Drosophila species genomes</span></a></div><p>Search and compare 12 new and old Drosophila genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1098208414"><span>ECR Browser -- A Tool for Visualizing and Accessing Data from Comparisons of Multiple Vertebrate Genomes</span></a></div><p>Access to whole genome alignments of human, mouse, rat and fish sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738459"><span>EPGD -- Eukaryotic Paralog Group Database</span></a></div><p>Find eukaryotic paralog/paralogon information.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232726869"><span>EVOG -- evolutionary visualizer for overlapping genes</span></a></div><p>Analyze the evolutionary process of overlapping genes when comparing different species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1227633714"><span>GNAT -- Inter-species gene mention normalization (ISGN)</span></a></div><p>The first publicly available system reported to handle inter-species gene mention normalization.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229438992"><span>GenColors -- annotation and comparative genomics of prokaryotes made easy</span></a></div><p>A web-based software/database system aimed at an improved and accelerated annotation of prokaryotic genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151086258"><span>GeneNest gene indices</span></a></div><p>Visualize gene indices of human, mouse, Arabidopsis, Zebrafish, Drosophila and Sheep.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489378"><span>GenomeTrafac -- a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs</span></a></div><p>Use comparative genomics approach to characterize gene models and identify putative cis-regulatory regions of RefSeq Gene Orthologs.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518150753"><span>IKMC -- International Knockout Mouse Consortium web portal</span></a></div><p>Find information about mutated mouse genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209411604"><span>IMG/M -- Integrated Microbial Genomes/Metagenomes</span></a></div><p>A data management and analysis system for metagenomes</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234976694"><span>ISED -- Influenza sequence and epitope database.</span></a></div><p>Search for influenza sequence, vaccine, and drug resistance information.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20140710115515"><span>LAMDHI: The Search for Animal Models Starts Here</span></a></div><p>LAMHDI, the initiative to Link Animal Models to Human DIsease, is designed to accelerate the research process by providing biomedical researchers with a simple, comprehensive Web-based resource to find the best animal models for their research.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1228843803"><span>MANTIS -- a phylogenetic framework for multi-species genome comparisons</span></a></div><p>The missing link between multi-species full genome comparisons and functional analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099578148"><span>MBGD -- Microbial genome database for comparative analysis</span></a></div><p>Conduct comparative analysis of completely sequenced microbial genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1221077729"><span>MEGA -- Molecular Evolutionary Genetics Analysis</span></a></div><p>A biologist-centric software for evolutionary analysis of DNA and protein sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174596756"><span>MamPol -- a database of nucleotide polymorphism in the Mammalia class</span></a></div><p>Conduct single nucleotide polymorphisms diversity measurements among homologous sequences from the Mammalia class.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1266437314"><span>MicrobesOnline -- Prokaryotic Genome Database</span></a></div><p>Find information about 1000s of microbial genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1208461006"><span>Narcisse -- a mirror view of conserved syntenies</span></a></div><p>A database dedicated to the study of genome conservation.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1219772764"><span>OMA -- the Orthologous MAtrix project</span></a></div><p>Explore orthologous relations across 352 complete genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738741"><span>OPTIC -- orthologous and paralogous transcripts in clades</span></a></div><p>Browse complete genomes in several clades.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209573208"><span>OrthoDB -- the hierarchical catalog of eukaryotic orthologs</span></a></div><p>Find groups of orthologous genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1221231200"><span>OrthoMaM -- orthologous mammalian markers</span></a></div><p>A database of orthologous genomic markers for placental mammal phylogenetics.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1100009979"><span>PEDANT -- Protein Extraction, Description and ANalysis Tool</span></a></div><p>Conduct genome wide functional and structural analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489475"><span>PReMod -- a database of genome-wide mammalian cis-regulatory module predictions</span></a></div><p>Conduct genome-wide cis-regulatory module (CRM) predictions for both the human and the mouse genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151083092"><span>PhenomicDB -- Comparison of phenotypes of orthologous genes in human and model organisms</span></a></div><p>Compare phenotypes of a given gene or gene set in different model organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1190899370"><span>Phylemon -- A suite of web tools for molecular evolution, phylogenetics and phylogenomics</span></a></div><p>Phylemon is a web server that integrates a selected suite of more than 20 different tools from the most popular stand-alone programs of phylogenetic and evolutionary analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232555615"><span>PhyloPat -- the phylogenetic pattern database</span></a></div><p>Use this database to see where in the evolution some phylogenetic lineages were started, and over which species they were contained.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174510223"><span>Pristionchus.org -- a genome-centric database of the nematode satellite species Pristionchus pacificus</span></a></div><p>Search for genomic information on nematode satellite species Pristionchus pacificus.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1236367352"><span>ProtClustDB -- NCBI Protein Clusters Database</span></a></div><p>Find information about related protein sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209410278"><span>ProtozoaDB -- database of protozoan genomes</span></a></div><p>Database hosting genomics and post-genomics data from multiple protozoans.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232554690"><span>Pseudofam -- the pseudogene families database</span></a></div><p>A database of pseudogene families based on the protein families from the Pfam database.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518151439"><span>RIDM - RIKEN Integrated Database of Mammals</span></a></div><p>Find genomic information about mammals.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1272562567"><span>RegPrecise -- Regulon Prediction Database</span></a></div><p>Find information about predicted regulons in prokaryotic transcription regulation.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1272477473"><span>SALAD -- Surveyed contained motif ALignment diagram and the Associating Dendrogram</span></a></div><p>Perform systematic comparison of proteome data among species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229010765"><span>SGN -- SOL Genomics Network</span></a></div><p>A comparative map viewer dedicated to the biology of the Solanaceae family.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256669040"><span>ShotgunFunctionalizeR -- R-package for functional comparison of metagenomes</span></a></div><p>Analyze data from functional analysis on fragmented microbial genetic material.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256238439"><span>SnoopCGH -- Comparative Genomic Hybridization software</span></a></div><p>Visualize and explore comparative genomic hybridization data sets.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489598"><span>SwissRegulon -- a database of genome-wide annotations of regulatory sites</span></a></div><p>Search for genome-wide annotations of regulatory sites in yeast and prokaryotes genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229013521"><span>TaxonGap -- a visualization tool for intra- and inter-species variation among individual biomarkers</span></a></div><p>Compare and select individual biomarkers.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1106063477"><span>The Adaptive Evolution Database (TAED) -- a phylogeny based tool for comparative genomics</span></a></div><p>Search for information on adaptive evolution in gene families of higher plants and chordate.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1216742716"><span>The CGView Server -- a comparative genomics tool for circular genomes</span></a></div><p>Generate graphical maps of circular genomes that show sequence features, base composition plots, analysis results and sequence similarity plots.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099663588"><span>The ERGO -- Genome analysis and discovery system</span></a></div><p>Conduct a comprehensive analysis of genes and genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1177611772"><span>The Macaque Genome: Interactive Poster and Teaching Resource</span></a></div><p>An interactive online poster presentation on the Macaque genome, including high-quality images, video clips, and Web resources</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1103816940"><span>The TIGR Gene Indices -- clustering and assembling EST and known genes and integration with eukaryotic genomes</span></a></div><p>Search for annotated genetic information of expressed sequence tags (ESTs) in different eukaryotic organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1043767169"><span>UniGene</span></a></div><p>Find mapping and expression information for a unigene cluster (ESTs and full-length mRNA sequences organized into clusters that each represent a unique known or putative gene)</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1216738072"><span>Uprobe -- universal overgo hybridization-based probe retrieval and design</span></a></div><p>A public online resource for identifying or designing 'universal' overgo-hybridization probes from conserved sequences that can be used to efficiently screen one or more genomic libraries from a designated group of species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1098205291"><span>VISTA -- Computational Tools for Comparative Genomics</span></a></div><p>Comprehensive suite of programs and databases for comparative analysis of genomic sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518144404"><span>cBARBEL -- Catfish Breeder and Researcher Bioinformatics Entry Location</span></a></div><p>Find information about ictalurid catfish.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738040"><span>eggNOG -- evolutionary genealogy of genes: Non-supervised Orthologous Groups</span></a></div><p>Discover orthologous groups of genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234370319"><span>metaTIGER -- a metabolic gene evolution resource</span></a></div><p>Find metabolic networks and phylogenomic information on a taxonomically diverse range of eukaryotes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1138901833"><span>xBASE -- a collection of online databases for bacterial comparative genomics</span></a></div><p>Conduct bacterial comparative genomics.</p></div>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44213/bioinformatics-tools-to-explore-ssrs-in-genomes</guid>
	<pubDate>Tue, 07 Mar 2023 13:06:15 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44213/bioinformatics-tools-to-explore-ssrs-in-genomes</link>
	<title><![CDATA[Bioinformatics tools to explore SSRs in genomes !]]></title>
	<description><![CDATA[<p>There are several bioinformatics tools that can be used to explore Simple Sequence Repeats (SSRs), which are also known as microsatellites. Here are a few examples:</p><ol>
<li>
<p>MISA: MISA (MIcroSAtellite) is a web-based tool that can identify SSRs in DNA sequences. It can be used to analyze nucleotide sequences from various organisms and can identify perfect, compound, and imperfect SSRs.</p>
</li>
<li>
<p>SSR Locator: SSR Locator is a web-based tool that identifies SSRs in both DNA and RNA sequences. It can identify perfect, compound, and imperfect SSRs, and can also filter out low complexity regions.</p>
</li>
<li>
<p>SciRoKo: SciRoKo is a software tool that can identify SSRs in DNA sequences. It can be used to analyze genomic and transcriptomic sequences from various organisms and can identify perfect, compound, and imperfect SSRs.</p>
</li>
<li>
<p>Primer3: Primer3 is a web-based tool that designs PCR primers for SSRs. It can design primers for perfect and imperfect SSRs, and can be used to design primers for SSRs in various organisms.</p>
</li>
<li>
<p>QDD: QDD (Quick Detection of Duplication) is a software tool that can identify SSRs in DNA sequences and can also identify duplicate loci. It can be used to analyze genomic and transcriptomic sequences from various organisms.</p>
</li>
</ol><p>These are just a few examples of the many bioinformatics tools available for exploring SSRs. Depending on your specific needs and research questions, you may find that other tools are more appropriate for your analysis.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44731/exploring-bacterial-comparative-genomics-a-bioinformatics-approach</guid>
	<pubDate>Sat, 14 Dec 2024 12:31:14 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44731/exploring-bacterial-comparative-genomics-a-bioinformatics-approach</link>
	<title><![CDATA[Exploring Bacterial Comparative Genomics: A Bioinformatics Approach]]></title>
	<description><![CDATA[<p>In the world of microbiology, bacteria have long fascinated scientists for their diversity, adaptability, and crucial roles in ecosystems and human health. Comparative genomics&mdash;a field that involves analyzing and comparing the genomes of different organisms&mdash;has revolutionized our understanding of bacterial evolution, adaptation, and pathogenicity. By leveraging bioinformatics tools and techniques, researchers can uncover genomic insights that were once hidden. This blog delves into the principles, methodologies, and applications of bacterial comparative genomics from a bioinformatics perspective.</p><h4><strong>What is Bacterial Comparative Genomics?</strong></h4><p>Comparative genomics involves the systematic comparison of genomes across different bacterial species or strains. This approach allows scientists to:</p><ul>
<li>
<p>Identify conserved and unique genes.</p>
</li>
<li>
<p>Explore genetic determinants of pathogenicity.</p>
</li>
<li>
<p>Understand bacterial evolution and phylogenetics.</p>
</li>
<li>
<p>Investigate horizontal gene transfer and its role in antibiotic resistance.</p>
</li>
</ul><p>Bioinformatics is central to these analyses, enabling the processing and interpretation of large-scale genomic data.</p><h4><strong>Key Steps in Bacterial Comparative Genomics</strong></h4><ol>
<li>
<p><strong>Genome Sequencing and Assembly</strong>: The process begins with obtaining high-quality bacterial genome sequences. Advances in next-generation sequencing (NGS) technologies have made it faster and more affordable to sequence bacterial genomes. Tools such as SPAdes and Velvet are commonly used for genome assembly.</p>
</li>
<li>
<p><strong>Genome Annotation</strong>: Annotating a genome involves identifying genes, regulatory elements, and other genomic features. Automated tools like Prokka and RAST provide functional annotations, allowing researchers to predict the roles of genes and proteins.</p>
</li>
<li>
<p><strong>Genome Alignment</strong>: Aligning genomes is crucial for identifying conserved regions, single-nucleotide polymorphisms (SNPs), and structural variations. Tools like Mauve and progressiveMauve are commonly employed for whole-genome alignments.</p>
</li>
<li>
<p><strong>Comparative Analyses</strong>:</p>
<ul>
<li>
<p><strong>Core and Pan-genome Analysis</strong>: The core genome consists of genes shared across all strains of a species, while the pan-genome includes all genes found in any strain. Software like Roary and BPGA can perform core and pan-genome analyses.</p>
</li>
<li>
<p><strong>Phylogenetic Analysis</strong>: Comparative genomics often involves reconstructing evolutionary relationships. Tools such as MEGA and IQ-TREE facilitate phylogenetic tree construction based on genomic data.</p>
</li>
<li>
<p><strong>Functional Enrichment Analysis</strong>: To understand the biological significance of unique or shared genes, functional enrichment analysis using databases like GO (Gene Ontology) and KEGG is essential.</p>
</li>
</ul>
</li>
</ol><div>&nbsp;<strong style="font-size: 1em;">Recommended Bioinformatics Tools for Comparative Genomics</strong></div><p>Here are some additional bioinformatics tools that can aid bacterial comparative genomics:</p><ul>
<li>
<p><strong>OrthoFinder</strong>: For accurate ortholog identification across multiple genomes.</p>
</li>
<li>
<p><strong>PanOCT</strong>: Specifically designed for pan-genome clustering and annotation.</p>
</li>
<li>
<p><strong>FASTANI</strong>: A tool for calculating Average Nucleotide Identity (ANI) for microbial genome comparisons.</p>
</li>
<li>
<p><strong>CIRCOS</strong>: For visually comparing genomic data through circular genome plots.</p>
</li>
<li>
<p><strong>Galaxy Platform</strong>: A user-friendly web-based platform offering numerous genomic analysis tools.</p>
</li>
<li>
<p><strong>BLAST</strong>: Essential for sequence alignment and similarity searches.</p>
</li>
<li>
<p><strong>PhyloSift</strong>: Focused on phylogenetic analysis of microbial genomes using marker genes.</p>
</li>
</ul><p>These tools, in combination with the methods discussed, provide a robust framework for conducting comprehensive comparative genomic studies.</p><h4><strong>Applications of Bacterial Comparative Genomics</strong></h4><ol>
<li>
<p><strong>Understanding Pathogenicity</strong>: Comparative genomics helps identify virulence factors that distinguish pathogenic strains from non-pathogenic relatives. For instance, comparing genomes of <em>Escherichia coli</em> strains has revealed key genetic determinants of pathogenicity in enterohemorrhagic strains.</p>
</li>
<li>
<p><strong>Antibiotic Resistance Research</strong>: The spread of antibiotic resistance genes through horizontal gene transfer is a major global concern. Comparative analyses can trace the origins and dissemination of resistance genes, aiding in the development of countermeasures.</p>
</li>
<li>
<p><strong>Microbial Ecology and Evolution</strong>: By studying genomic variations, researchers can understand how bacteria adapt to different environments. This is particularly relevant for extremophiles and symbiotic bacteria.</p>
</li>
<li>
<p><strong>Vaccine Development</strong>: Identifying conserved antigens across pathogenic strains is critical for vaccine design. Comparative genomics has been instrumental in developing vaccines against pathogens like <em>Neisseria meningitidis</em>.</p>
</li>
<li>
<p><strong>Biotechnology Applications</strong>: Comparative studies can uncover unique metabolic pathways in bacteria, paving the way for applications in bioremediation, synthetic biology, and industrial microbiology.</p>
</li>
</ol><h4><strong>Challenges in Bacterial Comparative Genomics</strong></h4><p>While the field has made significant strides, several challenges remain:</p><ul>
<li>
<p><strong>Data Overload</strong>: The rapid growth of sequencing data requires robust computational infrastructure and efficient algorithms.</p>
</li>
<li>
<p><strong>Genome Plasticity</strong>: High rates of horizontal gene transfer and genome rearrangements in bacteria complicate comparative analyses.</p>
</li>
<li>
<p><strong>Annotation Accuracy</strong>: Automated annotation tools are not infallible, and manual curation is often needed for high-confidence results.</p>
</li>
<li>
<p><strong>Interpreting Non-Coding Regions</strong>: Understanding the functional significance of non-coding genomic regions remains a challenge.</p>
</li>
</ul><h4><strong>Future Directions</strong></h4><p>The integration of bacterial comparative genomics with other &lsquo;omics&rsquo; approaches&mdash;such as transcriptomics, proteomics, and metabolomics&mdash;promises a more comprehensive understanding of bacterial biology. Additionally, advancements in machine learning and artificial intelligence are likely to further enhance bioinformatics analyses, enabling the prediction of complex phenotypes from genomic data.</p><h4><strong>Conclusion</strong></h4><p>Bacterial comparative genomics, driven by bioinformatics, continues to unravel the complexities of bacterial life. From combating antibiotic resistance to uncovering the secrets of microbial evolution, this interdisciplinary field holds immense potential for addressing pressing challenges in microbiology and beyond. As technology advances, so too will our ability to harness the power of comparative genomics for scientific and societal benefit.</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/42972/list-of-bioinformatics-workflow-management-tools</guid>
	<pubDate>Sat, 20 Mar 2021 00:15:25 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/42972/list-of-bioinformatics-workflow-management-tools</link>
	<title><![CDATA[List of bioinformatics workflow management tools !]]></title>
	<description><![CDATA[<h3>Here are list of&nbsp;Workflow Managers</h3><ul>
<li><span><a href="https://github.com/pcingola/BigDataScript">BigDataScript</a></span>&nbsp;&ndash; A cross-system scripting language for working with big data pipelines in computer systems of different sizes and capabilities. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/25189778">paper-2014</a>&nbsp;|&nbsp;<a href="https://pcingola.github.io/BigDataScript">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/ssadedin/bpipe">Bpipe</a></span>&nbsp;&ndash; A small language for defining pipeline stages and linking them together to make pipelines. [&nbsp;<a href="http://docs.bpipe.org/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/common-workflow-language/common-workflow-language">Common Workflow Language</a></span>&nbsp;&ndash; a specification for describing analysis workflows and tools that are portable and scalable across a variety of software and hardware environments, from workstations to cluster, cloud, and high performance computing (HPC) environments. [&nbsp;<a href="http://www.commonwl.org/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/broadinstitute/cromwell">Cromwell</a></span>&nbsp;&ndash; A Workflow Management System geared towards scientific workflows. [&nbsp;<a href="https://cromwell.readthedocs.io/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/galaxyproject">Galaxy</a></span>&nbsp;&ndash; a popular open-source, web-based platform for data intensive biomedical research. Has several features, from data analysis to workflow management to visualization tools. [&nbsp;<a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030816">paper-2018</a>&nbsp;|&nbsp;<a href="https://galaxyproject.org/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/nextflow-io/nextflow">Nextflow</a>&nbsp;(recommended)</span>&nbsp;&ndash; A fluent DSL modelled around the UNIX pipe concept, that simplifies writing parallel and scalable pipelines in a portable manner. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/29412134">paper-2018</a>&nbsp;|&nbsp;<a href="http://nextflow.io/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/cgat-developers/ruffus">Ruffus</a></span>&nbsp;&ndash; Computation Pipeline library for python widely used in science and bioinformatics. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/20847218">paper-2010</a>&nbsp;|&nbsp;<a href="http://www.ruffus.org.uk/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/SeqWare/seqware">SeqWare</a></span>&nbsp;&ndash; Hadoop Oozie-based workflow system focused on genomics data analysis in cloud environments. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/21210981">paper-2010</a>&nbsp;|&nbsp;<a href="https://seqware.github.io/">web</a>&nbsp;]</li>
<li><span><a href="https://bitbucket.org/snakemake">Snakemake</a></span>&nbsp;&ndash; A workflow management system in Python that aims to reduce the complexity of creating workflows by providing a fast and comfortable execution environment. [&nbsp;<a href="https://pubmed.ncbi.nlm.nih.gov/29788404">paper-2018</a>&nbsp;|&nbsp;<a href="https://snakemake.readthedocs.io/">web</a>&nbsp;]</li>
<li><span><a href="https://github.com/broadinstitute/wdl">Workflow Descriptor Language</a></span>&nbsp;&ndash; Workflow standard developed by the Broad. [&nbsp;<a href="https://software.broadinstitute.org/wdl">web</a>&nbsp;]</li>
</ul>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/34916/bioinformatics-tools-developed-for-oxford-nanopore-data-analysis</guid>
	<pubDate>Wed, 27 Dec 2017 20:47:30 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/34916/bioinformatics-tools-developed-for-oxford-nanopore-data-analysis</link>
	<title><![CDATA[Bioinformatics tools developed for Oxford Nanopore data analysis !]]></title>
	<description><![CDATA[<p><span>MinION is the only portable real-time device for DNA and RNA&nbsp;</span><span>sequencing</span><span>. Each consumable flow cell can now generate 10&ndash;20 Gb of DNA&nbsp;</span><span>sequence</span><span>&nbsp;data. Ultra-</span><span>long read lengths are possible (hundreds of kb) as you can choose your fragment length.&nbsp;</span>One of the technical advantages of ONT data is the read length, which offers great prospects for genome assembly. Generally, assemblers are based on several different types of algorithms, such as greedy, overlap-layout-consensus (OLC), de Bruijn graph (DBG), and string graph.</p><p><span>List of analysis tools developed for Oxford Nanopore data</span></p><p>BWA <br />Fast nanopore data tuned alignment tool <br />https://github.com/lh3/bwa</p><p>GraphMap<br />Mapper for long and error-prone reads<br />https://github.com/isovic/graphmap</p><p>LAST<br />Nanopore tuned alignment tool<br />http://last.cbrc.jp/</p><p>LINKS<br />Software tool for long read scaffolding <br />https://github.com/warrenlr/LINKS/</p><p>marginAlign<br />Tools to align nanopore reads to a reference<br />https://github.com/benedictpaten/marginAlign</p><p>minoTour<br />Real time analysis tools<br />http://minotour.nottingham.ac.uk/</p><p>nanoCORR<br />Error-correction tool for nanopore sequence data<br />https://github.com/jgurtowski/nanocorr</p><p>NanoOK<br />Software for nanopore data, quality and error profiles<br />https://documentation.tgac.ac.uk/display/NANOOK/NanoOK</p><p>Nanopolish<br />Nanopore analysis and genome assembly software<br />https://github.com/jts/nanopolish</p><p>nanopore<br />Variant-detection tool for nanopore sequence data<br />https://github.com/mitenjain/nanopore</p><p>Nanocorrect<br />Error-correction tool for nanopore sequence data<br />https://github.com/jts/nanocorrect/</p><p>npReader<br />Real-time conversion and analysis of nanopore reads<br />https://github.com/mdcao/npReader</p><p>poRe<br />Tool for analyzing and visualizing nanopore data<br />https://sourceforge.net/p/rpore/wiki/Home/</p><p>PoreSeq<br />Error-correction and variant-calling software<br />https://github.com/tszalay/poreseq</p><p>Poretools<br />Nanopore sequence analysis and visualization software <br />https://github.com/arq5x/poretools</p><p>SSPACE-LongRead<br />Genome scaffolding tool <br />http://www.baseclear.com/genomics/bioinformatics/basetools/SSPACE-longread</p><p>SMIS<br />Genome scaffolding tool <br />https://sourceforge.net/projects/phusion2/files/smis/</p><p>&nbsp;</p><p>List of assemblers for Oxford Nanopore MinION long reads</p><p>LQS<br />DALIGNER, Celera OLC Nanocorrect, <br />Nanopolish corrector<br />https://github.com/jts/nanopolish</p><p>PBcR<br />HGAP or BLASR, Celera OLC <br />PBcR corrector<br />http://wgs-assembler.sourceforge.net/wiki/index.php/PBcR<br /> &ndash;<br />Canu<br />MHAP, Celera OLC <br />Canu corrector<br />https://github.com/marbl/canu</p><p>Falcon<br />String graph, Celera OLC <br />Falcon corrector<br />https://github.com/PacificBiosciences/falcon</p><p>Miniasm <br />OLC<br />https://github.com/lh3/miniasm</p><p>ra-integrate<br />OLC<br />https://github.com/mariokostelac/ra-integrate/</p><p>ALLPATHS-LG<br />de Bruijn graph <br />ALLPATHS-L corrector<br />https://www.broadinstitute.org/software/allpaths-lg/blog/?page_id=12</p><p>SPAdes <br />de Bruijn graph <br />SPAdes corrector<br />http://bioinf.spbau.ru/spades</p>]]></description>
	<dc:creator>biogeek</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36512/hisat2-a-fast-and-sensitive-alignment-program-for-mapping-next-generation-sequencing-reads</guid>
	<pubDate>Tue, 08 May 2018 04:27:22 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36512/hisat2-a-fast-and-sensitive-alignment-program-for-mapping-next-generation-sequencing-reads</link>
	<title><![CDATA[HISAT2: a fast and sensitive alignment program for mapping next-generation sequencing reads]]></title>
	<description><![CDATA[<p><strong>HISAT2</strong><span>&nbsp;is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes (as well as to a single reference genome). Based on an extension of BWT for graphs&nbsp;</span><a href="http://dl.acm.org/citation.cfm?id=2674828">[Sir&eacute;n et al. 2014]</a><span>, we designed and implemented a graph FM index (GFM), an original approach and its first implementation to the best of our knowledge. In addition to using one global GFM index that represents a population of human genomes, HISAT2 uses a large set of small GFM indexes that collectively cover the whole genome (each index representing a genomic region of 56 Kbp, with 55,000 indexes needed to cover the human population). These small indexes (called local indexes), combined with several alignment strategies, enable rapid and accurate alignment of sequencing reads. This new indexing scheme is called a Hierarchical Graph FM index (HGFM).&nbsp;</span></p>
<p><span>more at&nbsp;https://ccb.jhu.edu/software/hisat2/index.shtml</span></p><p>Address of the bookmark: <a href="https://github.com/infphilo/hisat2" rel="nofollow">https://github.com/infphilo/hisat2</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37915/dna-nucleotide-counter</guid>
	<pubDate>Fri, 12 Oct 2018 04:37:01 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37915/dna-nucleotide-counter</link>
	<title><![CDATA[DNA Nucleotide Counter]]></title>
	<description><![CDATA[<p style="margin: 2px 5px 4px 6px; color: #000011; font-size: 12px; font-style: normal; font-weight: 400; text-align: justify;">DNA Nucleotide Counter is delivered in a DNA Baser package together with other free molecular biology tools.<span>&nbsp;</span><a href="http://www.dnabaser.com/download/biology-tools-package-download-count.html">Download</a><span>&nbsp;</span>the package and double click it. The programs inside the package will be extracted to the destination folder (specified by you). Go to the destination folder&nbsp;and double click the program you want to use.</p>
<p style="margin: 2px 5px 4px 6px; color: #000011; font-size: 12px; font-style: normal; font-weight: 400; text-align: justify;">It<span>&nbsp;</span><a href="http://www.dnabaser.com/download/install-anywhere.html">installs in any computer</a><span>&nbsp;</span>even if you don't have administrator rights!</p><p>Address of the bookmark: <a href="http://www.dnabaser.com/download/DNA-Counter/index.html" rel="nofollow">http://www.dnabaser.com/download/DNA-Counter/index.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>

</channel>
</rss>