<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/41730?offset=300</link>
	<atom:link href="https://bioinformaticsonline.com/related/41730?offset=300" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43090/loretta-a-user-friendly-tool-for-assembling-viral-genomes-from-pacbio-sequence-data</guid>
	<pubDate>Wed, 23 Jun 2021 07:54:53 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43090/loretta-a-user-friendly-tool-for-assembling-viral-genomes-from-pacbio-sequence-data</link>
	<title><![CDATA[LoReTTA, a user-friendly tool for assembling viral genomes from PacBio sequence data]]></title>
	<description><![CDATA[<p>LoReTTA (Long Read Template-Targeted Assembler) is a tool designed for <em>de novo</em> assembly of long reads generated from viral genomes on the PacBio platform. LoReTTA exploits a reference genome to guide the assembly process, an approach that has been successful with short reads.</p>
<p>https://academic.oup.com/ve/article/7/1/veab042/6248116</p><p>Address of the bookmark: <a href="https://academic.oup.com/ve/article/7/1/veab042/6248116" rel="nofollow">https://academic.oup.com/ve/article/7/1/veab042/6248116</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44313/orthovenn3-an-integrated-platform-for-exploring-and-visualizing-orthologous-data-across-genomes</guid>
	<pubDate>Tue, 02 May 2023 00:48:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44313/orthovenn3-an-integrated-platform-for-exploring-and-visualizing-orthologous-data-across-genomes</link>
	<title><![CDATA[OrthoVenn3: an integrated platform for exploring and visualizing orthologous data across genomes]]></title>
	<description><![CDATA[<p><span>OrthoVenn3 is a web server for comparative genomics, supporting whole-genome comparison, annotation, and evolutionary analysis of orthologous clusters across multiple species. It has already been used by thousands of users from over 60 countries.</span></p><p>Address of the bookmark: <a href="https://orthovenn3.bioinfotoolkits.net/" rel="nofollow">https://orthovenn3.bioinfotoolkits.net/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</guid>
	<pubDate>Tue, 18 Jun 2024 02:04:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</link>
	<title><![CDATA[BioKit: a set of tools dedicated to bioinformatics, data visualisation]]></title>
	<description><![CDATA[<p><span>BioKit is a set of tools dedicated to bioinformatics, data visualisation (</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.viz" title="biokit.viz"><code><span>biokit.viz</span></code></a><span>), and access to online biological data (e.g. UniProt and NCBI, via bioservices). It also contains more advanced tools related to data analysis (e.g.,&nbsp;</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.stats" title="biokit.stats"><code><span>biokit.stats</span></code></a><span>). Since R is quite common in bioinformatics, we also provide a convenient module to run R inside your Python scripts or shell (the </span><code><span>biokit.rtools</span></code><span> module).</span></p><p>Address of the bookmark: <a href="https://biokit.readthedocs.io/en/latest/index.html" rel="nofollow">https://biokit.readthedocs.io/en/latest/index.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44873/bakrep-denglish-blend-of-bakterien-repository-simplifies-access-to-this-data</guid>
	<pubDate>Wed, 13 Aug 2025 02:31:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44873/bakrep-denglish-blend-of-bakterien-repository-simplifies-access-to-this-data</link>
	<title><![CDATA[BakRep (Denglish blend of Bakterien &amp; Repository) simplifies access to this data]]></title>
	<description><![CDATA[<p>2,438,386 bacterial genomes at your fingertips: consistently processed &amp; characterized, enriched with metadata, and accessible via a flexible search engine.</p>
<p>BakRep (Denglish blend of Bakterien &amp; Repository) simplifies access to this data. It integrates enriched genomic information with metadata, all accessible via a flexible search engine.</p>
<h1>Key features</h1>
<ul>
<li>Assembly statistics: ensure data quality with genome-based key metrics</li>
<li>Taxonomic classification: robust, purely genome-based classifications (<a href="https://gtdb.ecogenomic.org/" target="_blank">GTDB</a>)</li>
<li><a href="https://pubmlst.org/">MLST</a>: subtyping for deeper insights into genetic variation</li>
<li>Annotation: comprehensive &amp; taxonomy-independent (<a href="https://bakta.computational.bio/" target="_blank">Bakta</a>)</li>
<li>Metadata: full original submission records</li>
</ul>
<div>&nbsp;</div><p>Address of the bookmark: <a href="https://bakrep.computational.bio/" rel="nofollow">https://bakrep.computational.bio/</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/26729/ga4gh-data-working-group</guid>
	<pubDate>Sun, 20 Mar 2016 23:13:07 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/26729/ga4gh-data-working-group</link>
	<title><![CDATA[GA4GH Data Working Group]]></title>
	<description><![CDATA[<p>GA4GH Data Working Group</p>
<p>Led by David Haussler (UCSC) and Richard Durbin (Sanger Institute), the Data Working Group (DWG) of the Global Alliance brings together the leading genome institutes and centers with IT industry leaders to create global standards and tools for the secure, privacy-respecting, and interoperable sharing of genomic data.</p>
<p>More at&nbsp;http://ga4gh.org/#/</p><p>Address of the bookmark: <a href="http://ga4gh.org/#/" rel="nofollow">http://ga4gh.org/#/</a></p>]]></description>
	<dc:creator>Jitendra Prajapati</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/42324/comparative-genomics-data-set-including-240-mammals-released</guid>
	<pubDate>Thu, 19 Nov 2020 06:45:39 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/42324/comparative-genomics-data-set-including-240-mammals-released</link>
	<title><![CDATA[Comparative Genomics Data Set Including 240 Mammals Released !]]></title>
	<description><![CDATA[<p>The genomes of 130 mammals were sequenced by a large international consortium, and the data were analyzed together with 110 existing genomes to allow scientists to identify the important positions in the DNA. This report, published in Nature today, will help advance research on human disease mutations and inform how best to protect endangered species.</p><p>Alongside the human genome, these genomes, widely sampled across mammals, can be used to study how particular species adapt to different conditions. Some otters, for example, have a thick, water-resistant coat, and some rodents, but not all, have adapted to hibernation. Studying these animal traits will help us to understand human traits, such as metabolic diseases.</p><p><img src="https://media.springernature.com/lw685/springer-static/image/art%3A10.1038%2Fs41586-020-2876-6/MediaObjects/41586_2020_2876_Fig1_HTML.png?as=webp" alt="image" style="border: 0px; border: 0px;"></p><p>With climate change and more animal ecosystems being threatened by human activity, the protection of endangered species is becoming increasingly important. Scientists have historically studied several individuals in various populations of a species to understand the genetic variation that occurs in that species. This is important for understanding how particular species can be protected. In this study, animals on the Red List of Endangered Species of the International Union for Conservation of Nature had fewer differences in their genomes, which is consistent with their endangered status.</p><p>Reference:&nbsp;A comparative genomics multitool for scientific discovery and conservation,&nbsp;https://www.nature.com/articles/s41586-020-2876-6</p><p>Data at&nbsp;http://zoonomiaproject.org/</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44545/amr-database</guid>
	<pubDate>Tue, 04 Jun 2024 13:37:21 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44545/amr-database</link>
	<title><![CDATA[AMR Database !]]></title>
	<description><![CDATA[<ul>
<li><a href="http://en.mediterranee-infection.com/article.php?laref=283%26titre=arg-annot">ARG-ANNOT</a>. PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/24145532">24145532</a></li>
<li><a href="https://card.mcmaster.ca/">CARD</a>. PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/23650175">23650175</a></li>
<li><a href="https://megares.meglab.org/">MEGARes</a>&nbsp;PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/27899569">27899569</a></li>
<li><a href="https://www.ncbi.nlm.nih.gov/pathogens/isolates#/refgene/">NCBI</a>&nbsp;BioProject:&nbsp;<a href="https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA313047">PRJNA313047</a></li>
<li><a href="https://cge.cbs.dtu.dk/services/PlasmidFinder/">plasmidfinder</a>&nbsp;PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/24777092">24777092</a></li>
<li><a href="https://cge.cbs.dtu.dk//services/ResFinder/">resfinder</a>. PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/22782487">22782487</a></li>
<li><a href="http://www.mgc.ac.cn/VFs/">VFDB</a>. PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26578559">26578559</a></li>
<li><a href="https://github.com/katholt/srst2">SRST2</a>'s version of ARG-ANNOT. PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/25422674">25422674</a>.</li>
<li><a href="https://cge.cbs.dtu.dk/services/VirulenceFinder/">VirulenceFinder</a>&nbsp;PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/24574290">24574290</a>.</li>
</ul><p>Address of the bookmark: <a href="https://github.com/sanger-pathogens/ariba/wiki/Task%3A-getref" rel="nofollow">https://github.com/sanger-pathogens/ariba/wiki/Task%3A-getref</a></p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44852/what-is-data-science-%E2%80%94-a-bioinformatics-perspective</guid>
	<pubDate>Mon, 16 Jun 2025 01:44:34 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44852/what-is-data-science-%E2%80%94-a-bioinformatics-perspective</link>
	<title><![CDATA[What is Data Science? — A Bioinformatics Perspective]]></title>
	<description><![CDATA[<p>In today&rsquo;s era of big biology, we&rsquo;re generating more data than ever before&mdash;genomes, transcriptomes, proteomes, metabolomes, microbiomes&hellip; you name it. But raw biological data doesn&rsquo;t speak for itself. Making sense of it requires more than traditional biology. This is where data science steps in.</p><p><strong>So, What Is Data Science?</strong><br />At its core, data science is the interdisciplinary field that extracts knowledge and insights from data using programming, statistics, and domain expertise. In bioinformatics, data science enables us to turn gigabytes of sequence data into biological meaning.</p><p>Imagine trying to understand gene regulation in cancer by analyzing thousands of RNA-seq samples, or predicting antibiotic resistance from bacterial genomes&mdash;these challenges are not solvable through wet lab experiments alone. They require data-driven thinking.</p><p><strong>Data Science Meets Bioinformatics</strong><br />Bioinformatics is inherently a data science domain. 
From genomics to systems biology, every field in modern biology relies on data science techniques to:</p><p>Clean and process massive datasets</p><p>Discover patterns in high-dimensional data</p><p>Build predictive models (e.g., for disease classification)</p><p>Visualize complex biological networks and trends</p><p>Integrate diverse data types (e.g., transcriptomic + epigenomic data)</p><p><strong>The Bioinformatics Toolkit</strong><br />Here&rsquo;s what data science typically looks like in bioinformatics:</p><table><thead><tr><th>Task</th><th>Data Science Role</th></tr></thead><tbody><tr><td>Sequence alignment</td><td>Efficient algorithms, indexing, parallel processing</td></tr><tr><td>Gene expression analysis</td><td>Statistical modeling (e.g., DESeq2, limma)</td></tr><tr><td>Variant calling</td><td>Data filtering, probabilistic models</td></tr><tr><td>Clustering of cells in single-cell data</td><td>Unsupervised learning</td></tr><tr><td>Protein structure prediction</td><td>Deep learning models (e.g., AlphaFold)</td></tr><tr><td>Metagenomics</td><td>Data integration, classification, dimensionality reduction</td></tr></tbody></table><p>Common tools include Python, R, Bioconductor, scikit-learn, Pandas, Seurat, and TensorFlow&mdash;often working together in reproducible workflows.</p><p><strong>It's Not Just About Coding</strong><br />A common misconception is that bioinformatics is just programming or scripting. 
But being a data scientist in bioinformatics also means:</p><p>Understanding experimental design</p><p>Asking biologically meaningful questions</p><p>Choosing the right statistical or machine learning models</p><p>Communicating findings effectively (e.g., plots, dashboards, papers)</p><p>In other words, data science in bioinformatics is where biology, statistics, and computer science converge.</p><p><strong>Why It Matters</strong><br />The real power of data science in bioinformatics is its ability to scale discovery.</p><p>Instead of studying one gene, we can study thousands.</p><p>Instead of analyzing one species, we can explore entire ecosystems.</p><p>Instead of waiting months for lab results, we can generate hypotheses in days.</p><p>From personalized medicine and cancer diagnostics to agricultural genomics and pandemic surveillance, data science is at the heart of the bioinformatics revolution.</p><p><strong>Final Thoughts</strong><br />If you&rsquo;re a biologist who&rsquo;s curious about code, or a data enthusiast fascinated by life sciences, bioinformatics is your playground&mdash;and data science is your toolkit.</p><p>In bioinformatics, data science isn&rsquo;t just useful. It&rsquo;s essential.</p><p>&nbsp;</p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/3918/the-human-genome-project-video-3d-animation-introduction-low</guid>
	<pubDate>Sat, 24 Aug 2013 19:01:19 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/3918/the-human-genome-project-video-3d-animation-introduction-low</link>
	<title><![CDATA[The Human Genome Project Video - 3D Animation Introduction (Low)]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/YxoQFSBwyms" frameborder="0" allowfullscreen></iframe>]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/7913/the-genome-factory</guid>
	<pubDate>Thu, 16 Jan 2014 02:09:31 -0600</pubDate>
	<link>https://bioinformaticsonline.com/news/view/7913/the-genome-factory</link>
	<title><![CDATA[The genome factory !!!]]></title>
	<description><![CDATA[<p>Illumina, Inc. announced Tuesday that its new HiSeq X Ten Sequencing System has broken the &ldquo;sound barrier&rdquo; of human genomics by enabling the $1,000 genome. &ldquo;This platform includes dramatic technology breakthroughs that enable researchers to undertake studies of unprecedented scale by providing the throughput to sequence tens of thousands of human whole genomes in a single year in a single lab,&rdquo; Illumina stated.</p><p>Initial customers for the HiSeq X Ten System, which will ship in Q1 2014, include Macrogen, based in Seoul, South Korea, and its CLIA laboratory in Rockville, Maryland; the Broad Institute in Cambridge, Massachusetts; and the Garvan Institute of Medical Research in Sydney, Australia.</p><p>&ldquo;For the first time, it looks like it will be possible to deliver the $1,000 genome, which is tremendously exciting,&rdquo; said Eric Lander, founding director of the Broad Institute and a professor of biology at MIT. &ldquo;The HiSeq X Ten should give us the ability to analyze complete genomic information from huge sample populations. Over the next few years, we have an opportunity to learn as much about the genetics of human disease as we have learned in the history of medicine.&rdquo;</p><p>&ldquo;The HiSeq X Ten is an ideal platform for scientists and institutions focused on the discovery of genotypic variation to enable a deeper understanding of human biology and genetic disease,&rdquo; Illumina stated. &ldquo;It can sequence tens of thousands of samples annually with high-quality, high-coverage sequencing, delivering a comprehensive catalog of human variation within and outside coding regions.&rdquo;</p><p>HiSeq X Ten utilizes a number of advanced design features to generate massive throughput. Patterned flow cells, which contain billions of nanowells at fixed locations, combined with a new clustering chemistry deliver a significant increase in data density (6 billion clusters per run). 
Using state-of-the-art optics and faster chemistry, HiSeq X Ten can process sequencing flow cells more quickly than ever before &mdash; generating a 10x increase in daily throughput when compared to current HiSeq 2500 performance.</p><p>The HiSeq X Ten is sold as a set of 10 or more ultra-high-throughput sequencing systems, each generating up to 1.8 terabases (Tb) of sequencing data in less than three days, or up to 600 gigabases (Gb) per day per system, providing the throughput to sequence tens of thousands of high-quality, high-coverage genomes per year. Illumina says the $1,000 figure includes typical instrument depreciation, DNA extraction, library preparation, and estimated labor.</p>]]></description>
	<dc:creator>Madhvan Reddy</dc:creator>
</item>

</channel>
</rss>