<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/43791?offset=40</link>
	<atom:link href="https://bioinformaticsonline.com/related/43791?offset=40" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44799/unlocking-evolutionary-secrets-a-dive-into-comparative-genomics-methods</guid>
	<pubDate>Tue, 20 May 2025 00:25:09 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44799/unlocking-evolutionary-secrets-a-dive-into-comparative-genomics-methods</link>
	<title><![CDATA[Unlocking Evolutionary Secrets: A Dive into Comparative Genomics Methods]]></title>
	<description><![CDATA[<p>Comparative genomics is the art and science of comparing genomes&mdash;across species, within species, or even among individuals&mdash;to unravel evolutionary relationships, functional elements, and genetic adaptations. As sequencing technologies have advanced and genome databases have expanded, comparative genomics has become a cornerstone of modern biology, shedding light on everything from antibiotic resistance in bacteria to human disease genetics.</p><p>In this post, we&rsquo;ll explore the core methods used in comparative genomics, the questions they help answer, and how they&rsquo;re shaping our understanding of life.</p><p><strong>1. Whole-Genome Alignment</strong><br />Whole-genome alignment involves mapping the entire genome of one species to another. Tools like MUMmer, MAUVE, and LASTZ perform large-scale sequence alignments to detect conserved regions, rearrangements, insertions, and deletions.</p><p>Use Case:<br />Comparing human and chimpanzee genomes to identify evolutionary conserved sequences (ECS) and regions of divergence.</p><p>Key Challenges:<br />Handling repetitive sequences and genome rearrangements.</p><p>Computational complexity in large genomes.</p><p><strong>2. Synteny and Collinearity Analysis</strong><br />Synteny refers to conserved blocks of gene order across species. Tools like MCScanX, SynMap, or CHITRA (for visualizing synteny interactively) detect these blocks to understand chromosomal evolution.</p><p>Use Case:<br />Studying ancient genome duplications in plants.</p><p>Investigating chromosomal rearrangements in cancer genomes.</p><p><strong>3. Ortholog and Paralog Detection</strong><br />Orthologs are genes in different species that evolved from a common ancestor, while paralogs are genes duplicated within a genome. Identifying them is crucial for functional annotation and evolutionary studies.</p><p>Popular Tools:<br />OrthoFinder, Orthologous MAtrix (OMA), InParanoid, and EggNOG.</p><p>Use Case:<br />Functional prediction of uncharacterized genes based on orthologs in model organisms.</p><p>Tracing gene family evolution.</p><p><strong>4. Phylogenomic Analysis</strong><br />Phylogenomic methods combine phylogenetics and genomics to infer evolutionary trees based on genome-wide data. These methods can handle dozens to hundreds of genomes, using concatenated alignments or gene trees.</p><p>Tools:<br />RAxML, IQ-TREE, ASTRAL, Phylip, BEAST.</p><p>Use Case:<br />Resolving the evolutionary relationships between microbial species.</p><p>Studying speciation events.</p><p><strong>5. Pan-Genome Analysis</strong><br />The pan-genome consists of the core genome (shared by all strains) and the accessory genome (strain-specific genes). This is especially popular in microbial genomics.</p><p>Tools:<br />Roary, Panaroo, BPGA, PGAP.</p><p>Use Case:<br />Understanding virulence factor diversity in E. coli.</p><p>Designing broad-spectrum vaccines.</p><p><strong>6. Comparative Transcriptomics</strong><br />Comparing transcriptomes across species or conditions reveals conserved and unique expression patterns. RNA-seq data can be mapped to reference genomes to identify orthologous expression profiles.</p><p>Use Case:<br />Comparing stress response in extremophiles and model species.</p><p>Studying conserved regulatory networks.</p><p><strong>7. Functional Element Comparison</strong><br />Beyond genes, comparative genomics also targets non-coding regions&mdash;enhancers, promoters, miRNAs. Conservation across species often implies functional importance.</p><p>Tools:<br />PhastCons, GERP, phyloP (based on multiple alignments).</p><p>Use Case:<br />Detecting conserved non-coding elements in vertebrates.</p><p>Studying regulatory divergence in human evolution.</p><p><strong>8. Horizontal Gene Transfer (HGT) Detection</strong><br />In microbes, genes often jump across species boundaries. Comparative genomics can detect HGT by identifying genes that defy the expected phylogenetic pattern.</p><p>Tools:<br />HGTector, DarkHorse, AlienHunter, SIGI-HMM.</p><p>Use Case:<br />Tracing antibiotic resistance genes.</p><p>Exploring microbial adaptability in extreme environments.</p><p><strong>Final Thoughts</strong><br />Comparative genomics is a powerful lens to observe the diversity and unity of life. With a broad toolkit&mdash;from aligners to orthology pipelines, phylogenetic engines to visualization tools&mdash;it allows scientists to ask big questions: How did genomes evolve? What makes species unique? Where do new genes come from?</p><p>Whether you're studying extremophiles, building better crops, or exploring human ancestry, comparative genomics offers the methods to connect the dots across the tree of life.</p><p>&nbsp;</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36218/g-compass-a-comparative-genome-browser</guid>
	<pubDate>Thu, 12 Apr 2018 10:00:27 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36218/g-compass-a-comparative-genome-browser</link>
	<title><![CDATA[G-compass: a comparative genome browser]]></title>
	<description><![CDATA[<p><span>G-compass (</span><a href="http://www.h-invitational.jp/g-compass/" target="_top">http://www.h-invitational.jp/g-compass/</a><span>) is a comparative genome browser. It visualizes evolutionarily conserved genomic regions between human and other 12 vertebrates based on original genome alignments pursuing higher coverage (1,2). Annotations of human genes/transcripts and their ortholog information were derived from&nbsp;</span><a href="http://www.h-invitational.jp/hinv/ahg-db/index.jsp" target="_top">H-InvDB</a><span>&nbsp;and its subdatabase&nbsp;</span><a href="http://www.h-invitational.jp/evola/" target="_top">Evola</a><span>, respectively. G-compass is available for free of charge. [&nbsp;</span><a href="http://www.h-invitational.jp/g-compass/cgi-bin/gc_main.cgi?species_1=Hg18&amp;species_2=pt2&amp;strand_1=%2B&amp;strand_2=%2B&amp;from_win=main&amp;gen_str=2&amp;chr_1=01&amp;chr_2=01&amp;st_1=103804298&amp;ed_1=104204297&amp;st_2=105235351&amp;ed_2=105635350" target="_top">Sample</a><span>&nbsp;]</span></p><p>Address of the bookmark: <a href="http://www.h-invitational.jp/g-compass/" rel="nofollow">http://www.h-invitational.jp/g-compass/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/43046/postdoctoral-fellow-for-a-large-scale-microbial-comparative-genomics</guid>
  <pubDate>Thu, 29 Apr 2021 08:44:53 -0500</pubDate>
  <link></link>
  <title><![CDATA[postdoctoral fellow for a large-scale microbial comparative genomics !]]></title>
  <description><![CDATA[
<p>Asaf Levy hiring a postdoctoral fellow for a large-scale microbial comparative genomics project at the Hebrew University of Jerusalem (Israel). <br />The project is a continuation of Levy Asaf et al. Nature Genetics 2018 paper.<br />Requirements: <br />1.Experience with programming in at least one programming language, preferably Python.<br />2.A PhD in bioinformatics/computational biology<br />3.At least one first authorship publication in a good journal, preferably more.<br />4.Good communication skills in English <br />5.Ability to enter and study in Israel (not applicable for Pakistani people, for example). <br />6.Ability to work in a team.<br />Please send CV to alevy@mail.huji.ac.il</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34328/dfast-a-flexible-prokaryotic-genome-annotation-pipeline-for-faster-genome-publication</guid>
	<pubDate>Tue, 14 Nov 2017 10:26:16 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34328/dfast-a-flexible-prokaryotic-genome-annotation-pipeline-for-faster-genome-publication</link>
	<title><![CDATA[DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication]]></title>
	<description><![CDATA[<p>We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST was originally started as an on-line annotation server, and to date, over 7,000 jobs have been processed since its first launch in 2016. Here, we present a newly implemented background annotation engine for DFAST, which is also available as a standalone command-line program. The new engine can annotate a typical-sized bacterial genome within 10 minutes, with rich information such as pseudogenes, translation exceptions, and orthologous gene assignment between given reference genomes. In addition, the modular framework of DFAST allows users to customize the annotation workflow easily and will also facilitate extensions for new functions and incorporation of new tools in the future.</p>
<div>Availability and Implementation</div>
<p>The software is implemented in Python 3 and runs in both Python 2.7 and 3.4&ndash; on Macintosh and Linux systems. It is freely available at&nbsp;<a href="https://github.com/nigyta/dfast_core/" target="">https://github.com/nigyta/dfast_core/</a>&nbsp;under the GPLv3 license with external binaries bundled in the software distribution. An on-line version is also available at&nbsp;<a href="https://dfast.nig.ac.jp/" target="">https://dfast.nig.ac.jp/</a>.</p><p>Address of the bookmark: <a href="https://dfast.nig.ac.jp/" rel="nofollow">https://dfast.nig.ac.jp/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34567/jobtree-based-python-wrapper-to-run-the-genome-simulation-tool-suite-evolver</guid>
	<pubDate>Fri, 08 Dec 2017 16:26:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34567/jobtree-based-python-wrapper-to-run-the-genome-simulation-tool-suite-evolver</link>
	<title><![CDATA[jobTree based python wrapper to run the genome simulation tool suite Evolver]]></title>
	<description><![CDATA[<p><span>evolverSimControl</span><span>&nbsp;(</span><span>eSC</span><span>) can be used to simulate multi-chromosome genome evolution on an arbitrary phylogeny (</span><a href="http://evolution.genetics.washington.edu/phylip/newicktree.html">Newick format</a><span>). In addition to simply running evolver,&nbsp;</span><span>eSC</span><span>&nbsp;also automatically creates statistical summaries of the simulation as it runs including text and image files. Also included are convenience scripts to: check on a running simulation and see detailed status and logging information; extract fasta sequence files from the leaf nodes of a completed simulation; extract pairwise multiple alignment files (</span><a href="http://genome.ucsc.edu/FAQ/FAQformat.html#format5">.maf</a><span>) from leaf and branch nodes from a completed simulation and with the help of&nbsp;</span><a href="https://github.com/dentearl/mafTools/">mafJoin</a><span>, join them together into a single maf covering the entire simulation.</span></p><p>Address of the bookmark: <a href="https://github.com/dentearl/evolverSimControl" rel="nofollow">https://github.com/dentearl/evolverSimControl</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36516/metassembler-merging-and-optimizing-de-novo-genome-assemblies</guid>
	<pubDate>Tue, 08 May 2018 04:52:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36516/metassembler-merging-and-optimizing-de-novo-genome-assemblies</link>
	<title><![CDATA[Metassembler: merging and optimizing de novo genome assemblies]]></title>
	<description><![CDATA[<p><span>Metassembler combines multiple whole genome de novo assemblies into a combined consensus assembly using the best segments of the individual assemblies.</span></p>
<p><span><span>Genome assembly projects typically run multiple algorithms in an attempt to find the single best assembly, although those assemblies often have complementary, if untapped, strengths and weaknesses. We present our metassembler algorithm that merges multiple assemblies of a genome into a single superior sequence.&nbsp;</span></span></p><p>Address of the bookmark: <a href="https://sourceforge.net/projects/metassembler/?source=directory" rel="nofollow">https://sourceforge.net/projects/metassembler/?source=directory</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36861/eagler-a-scaffolding-tool-for-long-reads</guid>
	<pubDate>Mon, 04 Jun 2018 05:26:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36861/eagler-a-scaffolding-tool-for-long-reads</link>
	<title><![CDATA[EAGLER: a scaffolding tool for long reads.]]></title>
	<description><![CDATA[<p>EAGLER is a scaffolding tool for long reads. The scaffolder takes as input a draft genome created by any NGS assembler and a set of long reads. The long reads are used to extend the contigs present in the NGS draft and possibly join overlapping contigs. EAGLER supports both PacBio and Oxford Nanopore reads.</p>
<p>The tool should be compatible with most UNIX flavors and has been successfully tested on the following operating systems:</p>
<ul>
<li>Mac OS X 10.11.1</li>
<li>Mac OS X 10.10.3</li>
<li>Ubuntu 14.04 LTS</li>
</ul>

https://bib.irb.hr/datoteka/844447.Diplomski_2015_Luka_terbi.pdf<p>Address of the bookmark: <a href="https://github.com/mculinovic/EAGLER" rel="nofollow">https://github.com/mculinovic/EAGLER</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38443/genoplotr-plot-gene-and-genome-maps-project</guid>
	<pubDate>Wed, 12 Dec 2018 08:33:41 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38443/genoplotr-plot-gene-and-genome-maps-project</link>
	<title><![CDATA[genoPlotR - plot gene and genome maps project!]]></title>
	<description><![CDATA[<p>genoPlotR is a R package to produce reproducible, publication-grade graphics of gene and genome maps. It allows the user to read from usual format such as protein table files and blast results, as well as home-made tabular files.</p>
<h3>Features</h3>
<ul>
<li>Linear representation of several segments of DNA</li>
<li>Comparisons represented by areas between the segments (like Artemis, for example)</li>
<li>Reads from common formats: Genbank, EMBL, blast, Mauve, and from user-generated tab files</li>
<li>Plot several subsegments of the same segment on the same line, separated by a //</li>
<li>Automatic or manual placement of the segments on the plot</li>
<li>Add annotations to all the lines</li>
<li>Create smart, automatic annotations for genomes, based on gene names</li>
<li>Add a user-generated tree</li>
<li>Add a global scale or a scale to each line</li>
<li>Use user-defined graphical functions to represent genes</li>
<li></li>
</ul><p>Address of the bookmark: <a href="http://genoplotr.r-forge.r-project.org/" rel="nofollow">http://genoplotr.r-forge.r-project.org/</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38561/hawkeye-an-interactive-visual-analytics-tool-for-genome-assemblies</guid>
	<pubDate>Tue, 01 Jan 2019 11:56:17 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38561/hawkeye-an-interactive-visual-analytics-tool-for-genome-assemblies</link>
	<title><![CDATA[Hawkeye: an interactive visual analytics tool for genome assemblies]]></title>
	<description><![CDATA[<p><span>Genome sequencing remains an inexact science, and genome sequences can contain significant errors if they are not carefully examined. Hawkeye is our new visual analytics tool for genome assemblies, designed to aid in identifying and correcting assembly errors. Users can analyze all levels of an assembly along with summary statistics and assembly metrics, and are guided by a ranking component towards likely mis-assemblies. Hawkeye is freely available and released as part of the open source AMOS project&nbsp;</span><span><a href="http://amos.sourceforge.net/hawkeye"><span>http://amos.sourceforge.net/hawkeye</span></a></span><span>.</span></p>
<p>https://genomebiology.biomedcentral.com/articles/10.1186/gb-2007-8-3-r34</p><p>Address of the bookmark: <a href="http://amos.sourceforge.net/wiki/index.php?title=Hawkeye" rel="nofollow">http://amos.sourceforge.net/wiki/index.php?title=Hawkeye</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/42936/ancient-whole-genome-duplication-wgd-detection-tools</guid>
	<pubDate>Sun, 07 Mar 2021 00:32:44 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/42936/ancient-whole-genome-duplication-wgd-detection-tools</link>
	<title><![CDATA[Ancient whole genome duplication (WGD) detection tools !]]></title>
	<description><![CDATA[<p>There are two methods for ancient WGD detection, one is collinearity analysis, and the other is based on the Ks distribution map. Among them, Ks is defined as the average number of synonymous substitutions at each synonymous site, and there is also a Ka corresponding to it, which refers to the average number of non-synonymous substitutions at each non-synonymous site.</p><p>At present, some people have posted articles about the analysis process of WGD. I searched for the keyword "wgd pipeline" and found the following:</p><p><strong>GenoDup: https:// github.com/MaoYafei/GenoDup-Pipeline</strong><br /><strong>https://peerj.com/articles/6303/</strong><br /><strong>WGDdetector: https:// github.com/yongzhiyang2 012/WGDdetector</strong><br /><strong>https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-2670-3</strong><br /><strong>wgd: https:// github.com/arzwa/wgd</strong><br /><strong>https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1142-2#Sec1</strong><br /><strong>https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-017-0399-x</strong><br /><strong>GeNoGAP https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1142-2</strong><br /><strong>https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-017-0399-x</strong><br /><strong>https://github.com/dfguan/purge_dups</strong><br /><strong>https://www.biorxiv.org/content/10.1101/2020.01.24.917997v1</strong></p><p>This article introduces the usage of wgd.</p><p>Wgd cannot be installed directly with bioconda at present, so it is a little troublesome to install, because it depends on a lot of software. wgd depends on the following software</p><p><strong>BLAST</strong><br /><strong>MCL</strong><br /><strong>MUSCLE/MAFFT/PRANK</strong><br /><strong>PAML</strong><br /><strong>PhyML/FastTree</strong><br /><strong>i-ADHoRe</strong></p><p>But the good news is that most of the software it depends on can be installed with bioconda</p><blockquote><p>conda create -n wgd python=3.5 blast mcl muscle mafft prank paml fasttree cmake libpng mpi=1.0=mpich<br />conda activate wgd</p></blockquote><p>Here mpi=1.0=mpich is selected, because i-adhore depends on mpich. If openmpi is installed, an error will appear while loading shared libraries: libmpi_cxx.so.40: cannot open shared object file: No such file or directory</p><p>After that, the installation is much simpler</p><blockquote><p>git clone https://github.com/arzwa/wgd.git<br />cd wgd<br />pip install .<br />pip install git+https://github.com/arzwa/wgd.git<br />For i-ADHoRe, you need to register at http:// bioinformatics.psb.ugent.be /webtools/i-adhore/licensing/Agree to the license to download i-ADHoRe-3.0</p></blockquote><p>Since my miniconda3 installed ~/opt/, the installation path is so~/opt/miniconda3/envs/wgd/</p><blockquote><p>tar -zxvf i-adhore-3.0.01.tar.gz<br />cd i-adhore-3.0.01<br />mkdir -p build &amp;&amp; cd build<br />cmake .. -DCMAKE_INSTALL_PREFIX=~/opt/miniconda3/envs/wgd/<br />make -j 4 <br />make insatall</p></blockquote><p>Take the sugarcane genome Saccharum spontaneum L as an example. The genome is 8-ploid with 32 chromosomes (2n = 4x8 = 32)</p><p><strong>Download the tutorial for CDS and GFF annotation files</strong></p><blockquote><p><strong>mkdir -p wgd_tutorial &amp;&amp; cd wgd_tutorial</strong><br /><strong>wget http://www.life.illinois.edu/ming/downloads/Spontaneum_genome/Sspon.v20190103.cds.fasta.gz</strong><br /><strong>wget http://www.life.illinois.edu/ming/downloads/Spontaneum_genome/Sspon.v20190103.gff3.gz</strong><br /><strong>gunzip *.gz</strong></p></blockquote><p>First conda activate wgdstart our analysis environment, and then start the analysis</p><p>Step 1 : Use to wgd mclidentify homologous genes in the genome</p><blockquote><p>wgd mcl -n 20 --cds --mcl -s Sspon.v20190103.cds.fasta -o Sspon_cds.out</p></blockquote><p>Step 2 : Use to wgd ksdbuild Ks distribution</p><blockquote><p>wgd ksd --n_threads 80 Sspon_cds.out/Sspon.v20190103.cds.fasta.blast.tsv.mcl Sspon.v20190103.cds.fasta</p></blockquote><p>Step 3 : If the quality of the genome is good, then wgd syncollinearity analysis can be used . It can help us find the collinearity block in the genome and the corresponding anchor point</p><blockquote><p>wgd syn --feature gene --gene_attribute ID \<br /> -ks wgd_ksd/Sspon.v20190103.cds.fasta.ks.tsv \<br /> Sspon.v20190103.gff3 Sspon_cds.out/Sspon.v20190103.cds.fasta.blast.tsv.mcl</p></blockquote><p>&nbsp;For more reading - There are 9 sub-modules in WGD</p><ul>
<li><span>kde: KDE fitting to the Ks distribution</span></li>
<li><span>ksd: Ks distribution construction</span></li>
<li><span>mcl: BLASP comparison of All-vs-ALl + MCL classification analysis.</span></li>
<li><span><span>mix: Hybrid modeling of Ks distribution.</span></span></li>
<li><span>pre: preprocess the CDS file</span></li>
<li><span>syn: Call I-ADHoRe 3.0 to use GFF files for collinearity analysis</span></li>
<li><span>viz: draw histogram and density plot</span></li>
<li><span>wf1: Ks standard analysis procedure of the whole genome paranome (paranome), call mcl, ksd and syn</span></li>
<li><span>wf2: Ks standard analysis procedure of one-vs-one homologous gene (ortholog), call wcl and kSD</span></li>
</ul>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>