<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/38226?</link>
	<atom:link href="https://bioinformaticsonline.com/related/38226?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/41562/submit-your-sars-cov-2-sequence-data-to-genbank</guid>
	<pubDate>Thu, 09 Apr 2020 18:28:25 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/41562/submit-your-sars-cov-2-sequence-data-to-genbank</link>
	<title><![CDATA[Submit your SARS-CoV-2 sequence data to GenBank]]></title>
	<description><![CDATA[<div dir="auto">Submit your SARS-CoV-2 sequence data to GenBank and SRA with our new submission landing page. Submission is simple and streamlined *and* there&rsquo;s a rapid turnaround. <span><a href="https://l.facebook.com/l.php?u=https%3A%2F%2Fsubmit.ncbi.nlm.nih.gov%2Fsarscov2%2F%3Ffbclid%3DIwAR3p-OzZPe2yx4CZMoZxiWMF3kUQjXyVVduNQhBdehWmFTJ3cPBstsOLypI&amp;h=AT2d-umit7ciXRW-nrRYVL3gJSLKY4Hte8W8cXw8Wl94n6PGmoHmVqvvhgQj-mTo6A5lpMP9JDV_lRSq9RRLT5KeVVAAfcuRgJOeA6QhApIB2B9nFxUfDCD3sio4HYidpRwpmng&amp;__tn__=-UK-R&amp;c[0]=AT2zWGa1K5EvV4UcnB0b7HHvkBtX-wAyh7AF8_fZ9uI2y-02nOHQHT_Um3xgnto5KEZ26wRG0xNgUWTA1W-7HF0E25E23XtIL5XGOhloBXaDIcHw30AVjTCkQi7aFk4dN7aBCmVJeSbH37urtbM2kmMfyTCbdTvMU8FGlnX-DNVuCaZr4XfXnf_jvPNdxe9sBH84oXJ-uJz5kbqlHGAHDoqK" target="_blank">https://submit.ncbi.nlm.nih.gov/sarscov2/</a></span></div><div dir="auto">&nbsp;</div><div dir="auto"><span><span>Quickly and easily add your SARS-CoV-2 sequence data to the growing public archive with new, special features and support from NCBI. </span><a href="https://submit.ncbi.nlm.nih.gov/sarscov2/">new SARS-CoV-2 sequence submission landing page</a><span>&nbsp;will help you get started. GenBank submissions are accessioned and released in approximately 1-2 working days, and&nbsp;</span><a href="https://www.ncbi.nlm.nih.gov/sra" target="_blank">Sequence Read Archive</a><span>&nbsp;(SRA) submissions typically processed and released within hours. Submission is simple!</span></span></div><div><div dir="auto">&nbsp;</div><div dir="auto">More information is available on NCBI Insights. <span><a href="https://l.facebook.com/l.php?u=https%3A%2F%2Fncbiinsights.ncbi.nlm.nih.gov%2F2020%2F04%2F09%2Fsars-cov2-data-streamlined-submission-rapid-turnaround%2F%3Ffbclid%3DIwAR1OuLu3oDjz3VX4fDq5Jg316td9foTOUGNqnoN1eI2nFXTf4EBv28JiXD4&amp;h=AT0ah_epxwAc-nM6QiPBYvKSQ-kWmiPgHKO1w7SnxnnRiTI4etJJfNAWyzcR7snIdtxtcErAFRdHPBH2j0EY77gUPDdnBVnAsxnVbSgZnrrOPfnni331A37Xvytgnye0ArnUuWk&amp;__tn__=-UK-R&amp;c[0]=AT2zWGa1K5EvV4UcnB0b7HHvkBtX-wAyh7AF8_fZ9uI2y-02nOHQHT_Um3xgnto5KEZ26wRG0xNgUWTA1W-7HF0E25E23XtIL5XGOhloBXaDIcHw30AVjTCkQi7aFk4dN7aBCmVJeSbH37urtbM2kmMfyTCbdTvMU8FGlnX-DNVuCaZr4XfXnf_jvPNdxe9sBH84oXJ-uJz5kbqlHGAHDoqK" target="_blank">https://ncbiinsights.ncbi.nlm.nih.gov/2020/04/09/sars-cov2-data-streamlined-submission-rapid-turnaround/</a></span></div></div>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44852/what-is-data-science-%E2%80%94-a-bioinformatics-perspective</guid>
	<pubDate>Mon, 16 Jun 2025 01:44:34 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44852/what-is-data-science-%E2%80%94-a-bioinformatics-perspective</link>
	<title><![CDATA[What is Data Science? — A Bioinformatics Perspective]]></title>
	<description><![CDATA[<p>In today&rsquo;s era of big biology, we&rsquo;re generating more data than ever before&mdash;genomes, transcriptomes, proteomes, metabolomes, microbiomes&hellip; you name it. But raw biological data doesn&rsquo;t speak for itself. Making sense of it requires more than traditional biology. This is where data science steps in.</p><p><strong>So, What Is Data Science?</strong><br />At its core, data science is the interdisciplinary field that extracts knowledge and insights from data using programming, statistics, and domain expertise. In bioinformatics, data science enables us to turn gigabytes of sequence data into biological meaning.</p><p>Imagine trying to understand gene regulation in cancer by analyzing thousands of RNA-seq samples, or predicting antibiotic resistance from bacterial genomes&mdash;these challenges are not solvable through wet lab experiments alone. They require data-driven thinking.</p><p><strong>Data Science Meets Bioinformatics</strong><br />Bioinformatics is inherently a data science domain. From genomics to systems biology, every field in modern biology relies on data science techniques to:</p><p>Clean and process massive datasets</p><p>Discover patterns in high-dimensional data</p><p>Build predictive models (e.g., for disease classification)</p><p>Visualize complex biological networks and trends</p><p>Integrate diverse data types (e.g., transcriptomic + epigenomic data)</p><p><strong>The Bioinformatics Toolkit</strong><br />Here&rsquo;s what data science typically looks like in bioinformatics:</p><p>Task Data Science Role<br />Sequence alignment Efficient algorithms, indexing, parallel processing<br />Gene expression analysis Statistical modeling (e.g., DESeq2, limma)<br />Variant calling Data filtering, probabilistic models<br />Clustering of cells in single-cell data Unsupervised learning<br />Protein structure prediction Deep learning models (e.g., AlphaFold)<br />Metagenomics Data integration, classification, dimensionality reduction</p><p>Common tools include Python, R, Bioconductor, scikit-learn, Pandas, Seurat, and TensorFlow&mdash;often working together in reproducible workflows.</p><p><strong>It's Not Just About Coding</strong><br />A common misconception is that bioinformatics is just programming or scripting. But being a data scientist in bioinformatics also means:</p><p>Understanding experimental design</p><p>Asking biologically meaningful questions</p><p>Choosing the right statistical or machine learning models</p><p>Communicating findings effectively (e.g., plots, dashboards, papers)</p><p>In other words, data science in bioinformatics is where biology, statistics, and computer science converge.</p><p><strong>Why It Matters</strong><br />The real power of data science in bioinformatics is its ability to scale discovery.</p><p>Instead of studying one gene, we can study thousands.</p><p>Instead of analyzing one species, we can explore entire ecosystems.</p><p>Instead of waiting months for lab results, we can generate hypotheses in days.</p><p>From personalized medicine and cancer diagnostics to agricultural genomics and pandemic surveillance, data science is at the heart of the bioinformatics revolution.</p><p><strong>Final Thoughts</strong><br />If you&rsquo;re a biologist who&rsquo;s curious about code, or a data enthusiast fascinated by life sciences, bioinformatics is your playground&mdash;and data science is your toolkit.</p><p>In bioinformatics, data science isn&rsquo;t just useful. It&rsquo;s essential.</p><p>&nbsp;</p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/41578/samhar-covid19-hackathon</guid>
	<pubDate>Fri, 17 Apr 2020 06:47:10 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/41578/samhar-covid19-hackathon</link>
	<title><![CDATA[SAMHAR-COVID19 Hackathon]]></title>
	<description><![CDATA[<p>Centre for Development of Advanced Computing (C-DAC) under the aegis of the National Supercomputing Mission (NSM), a Ministry of Electronics &amp; Information Technology (MeitY) and Department of Science &amp; Technology (DST) initiative, in association with NVIDIA &amp; OpenACC, announces the&nbsp;SAMHAR-COVID19 Hackathon.</p><p>Pandemic outbreak such as Coronavirus outbreak can create huge challenges for the Government and Public Health Officials to gather information quickly and coordinate a response. In such a situation, Artificial Intelligence (AI) can play a huge role in predicting, minimizing and stalling its spread of the virus.</p><p>C-DAC has embarked on a program&nbsp;SAMHAR-COVID19&nbsp;(Supercomputing using&nbsp;AI,&nbsp;ML,&nbsp;Healthcare&nbsp;Analytics based&nbsp;Research for combating COVID19). This opportunity will provide researchers to find solutions for Identifying, Tracking and Forecasting outbreaks of COVID19 and Facilitating Drug Discovery as well.</p><div><div><div><ul>
<li><span>Participants can update submissions multiple times till the Registration End date.</span></li>
<li><span>Each entry can be submitted by a Team comprising of minimum 3 and maximum 5 members (Including the Team Lead).</span></li>
<li><span>Participants will have to share the complete work activities with C-DAC. And C-DAC will have right to use the submitted application/solution for SAMHAR-COVID19 programs.</span></li>
<li><span>The Award will be given to the&nbsp;<span>Selected/Winning Entry</span>&nbsp;irrespective of the number of members in the Team (members may choose to distribute the amount among themselves).</span></li>
<li><span>The decision of the&nbsp;<span>Eminent Jury</span>&nbsp;on the&nbsp;<span>I<span>3</span>&nbsp;Award</span>&nbsp;will be final and binding.</span></li>
<li><span>Award can be for the Team/Company/Institution, as submitted in the Application and cannot be changed later.</span></li>
<li><span>Submissions will be considered void if they are in whole or part ill-eligible, incomplete, damaged, altered, counterfeit, obtained through fraud or late submission.</span></li>
<li></li>
</ul></div></div></div><p>More at&nbsp;<a href="https://samhar-covid19hackathon.cdac.in/">https://samhar-covid19hackathon.cdac.in/</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41464/phytozome-v121-plant-science-community-hub-for-accessing-palnts-genomic-data</guid>
	<pubDate>Tue, 17 Mar 2020 07:30:17 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41464/phytozome-v121-plant-science-community-hub-for-accessing-palnts-genomic-data</link>
	<title><![CDATA[Phytozome  v12.1: plant science community hub for accessing palnts genomic data]]></title>
	<description><![CDATA[<p>Phytozome, the Plant Comparative Genomics portal of the Department of Energy's Joint Genome Institute, provides JGI users and the broader plant science community a hub for accessing, visualizing and analyzing JGI-sequenced plant genomes, as well as selected genomes and datasets that have been sequenced elsewhere. As of release v12.1.6, Phytozome hosts 93 assembled and annotated genomes, from 82 Viridiplantae species. More than half of these genomes have been sequenced, assembled and/or annotated with JGI Plant Science program resources. By integrating this large collection of plant genomes into a single resource and performing comprehensive and uniform annotation and analyses, Phytozome facilitates accurate and insightful comparative genomics studies.</p><p>Address of the bookmark: <a href="https://phytozome.jgi.doe.gov/pz/portal.html" rel="nofollow">https://phytozome.jgi.doe.gov/pz/portal.html</a></p>]]></description>
	<dc:creator>Surabhi Chaudhary</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44742/nasa-open-science-data-repository</guid>
	<pubDate>Wed, 18 Dec 2024 11:54:47 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44742/nasa-open-science-data-repository</link>
	<title><![CDATA[NASA Open Science Data Repository]]></title>
	<description><![CDATA[<p><span>The NASA Open Science Data Repository (OSDR) enables access to space-related data from experiments and missions that investigate biological and health responses of terrestrial life to spaceflight. The goal of OSDR is to enable multi-modal and multi-hierarchical fundamental space life science data be reused toward basic science, applied science, and operational outcomes for space exploration and knowledge discovery. These data include &lsquo;omics, phenotypic, physiological, behavioral, hardware, environmental telemetry; raw, processed; tabular, text, code, bioimaging, and video.</span></p>
<p><span>https://www.nasa.gov/reference/osdr-data-processing/</span></p><p>Address of the bookmark: <a href="https://www.nasa.gov/osdr/" rel="nofollow">https://www.nasa.gov/osdr/</a></p>]]></description>
	<dc:creator>Abhi</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/35418/karyoploter-plot-whole-genomes-with-arbitrary-data</guid>
	<pubDate>Fri, 02 Feb 2018 03:24:28 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/35418/karyoploter-plot-whole-genomes-with-arbitrary-data</link>
	<title><![CDATA[karyoploteR: plot whole genomes with arbitrary data]]></title>
	<description><![CDATA[<p><span><a href="http://bioconductor.org/packages/karyoploteR">karyoploteR</a></span><span>&nbsp;is an R package to create karyoplots, that is, representations of whole genomes with arbitrary data plotted on them. It is inspired by the R base graphics system and does not depend on other graphics packages. The aim of karyoploteR is to offer the user an easy way to plot data along the genome to get broad genome-wide view to facilitate the identification of genome wide relations and distributions.</span></p><p>Address of the bookmark: <a href="https://bernatgel.github.io/karyoploter_tutorial/" rel="nofollow">https://bernatgel.github.io/karyoploter_tutorial/</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34488/scripts-for-the-analysis-of-hgt-in-genome-sequence-data</guid>
	<pubDate>Wed, 29 Nov 2017 16:44:10 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34488/scripts-for-the-analysis-of-hgt-in-genome-sequence-data</link>
	<title><![CDATA[Scripts for the analysis of HGT in genome sequence data.]]></title>
	<description><![CDATA[<p><span>Scripts for the analysis of HGT in genome sequence data</span></p><p>Address of the bookmark: <a href="https://github.com/reubwn/hgt" rel="nofollow">https://github.com/reubwn/hgt</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36271/heap-a-highly-sensitive-and-accurate-snp-detection-tool-for-low-coverage-high-throughput-sequencing-data</guid>
	<pubDate>Thu, 19 Apr 2018 08:06:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36271/heap-a-highly-sensitive-and-accurate-snp-detection-tool-for-low-coverage-high-throughput-sequencing-data</link>
	<title><![CDATA[Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data]]></title>
	<description><![CDATA[<p><span>Heap, that enables robustly sensitive and accurate calling of SNPs, particularly with a low coverage NGS data, which must be aligned to the reference genome sequences in advance. To reduce false positive SNPs, Heap determines genotypes and calls SNPs at each site except for sites at the both end of reads or containing a minor allele supported by only one read. Performance comparison with existing tools showed that Heap achieved the highest F-scores with low coverage (7X) restriction-site associated DNA sequencing reads of sorghum and rice individuals. This will facilitate cost-effective GWAS and GP studies in this NGS era. Code and documentation of Heap are freely available from&nbsp;</span><a href="https://github.com/meiji-bioinf/heap">https://github.com/meiji-bioinf/heap</a><span>&nbsp;and our web site (</span><a href="http://bioinf.mind.meiji.ac.jp/lab/en/tools.html">http://bioinf.mind.meiji.ac.jp/lab/en/tools.html</a><span>).</span></p><p>Address of the bookmark: <a href="https://github.com/meiji-bioinf/heap" rel="nofollow">https://github.com/meiji-bioinf/heap</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37259/epiviz-an-interactive-visualization-tool-for-functional-genomics-data</guid>
	<pubDate>Mon, 09 Jul 2018 05:27:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37259/epiviz-an-interactive-visualization-tool-for-functional-genomics-data</link>
	<title><![CDATA[Epiviz: an interactive visualization tool for functional genomics data.]]></title>
	<description><![CDATA[<p><span>Epiviz is an interactive visualization tool for functional genomics data. It supports genome navigation like other genome browsers, but allows multiple visualizations of data within genomic regions using scatterplots, heatmaps and other user-supplied visualizations. It also includes data from the&nbsp;</span><a href="http://barcode.luhs.org/" target="_blank">Gene Expression Barcode project</a><span>&nbsp;for transcriptome visualization. It has a flexible plugin framework so users can add</span><a href="http://d3js.org/" target="_blank">d3</a><span>&nbsp;visualizations. You can see a video tour&nbsp;</span><a href="http://youtu.be/099c4wUxozA" target="_blank">here</a><span>.</span></p>
<p><span>https://bioconductor.org/packages/release/bioc/html/epivizr.html</span></p>
<p><span>https://github.com/epiviz</span></p>
<p><span>https://github.com/epiviz/epiviz</span></p><p>Address of the bookmark: <a href="https://epiviz.github.io/" rel="nofollow">https://epiviz.github.io/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37835/variantbam-filtering-and-profiling-of-next-generational-sequencing-data-using-region-specific-rules</guid>
	<pubDate>Thu, 04 Oct 2018 16:30:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37835/variantbam-filtering-and-profiling-of-next-generational-sequencing-data-using-region-specific-rules</link>
	<title><![CDATA[VariantBam: Filtering and profiling of next-generational sequencing data using region-specific rules]]></title>
	<description><![CDATA[<p>VariantBam is a tool to extract/count specific sets of sequencing reads from next-generational sequencing files. To save money, disk space and I/O, one may not want to store an entire BAM on disk. In many cases, it would be more efficient to store only those read-pairs or reads who intersect some region around the variant locations. Alternatively, if your scientific question is focused on only one aspect of the data (e.g. breakpoints), many reads can be removed without losing the information relevant to the problem.</p>
<h5>&nbsp;</h5><p>Address of the bookmark: <a href="https://github.com/broadinstitute/VariantBam" rel="nofollow">https://github.com/broadinstitute/VariantBam</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>

</channel>
</rss>