<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/31382?offset=970</link>
	<atom:link href="https://bioinformaticsonline.com/related/31382?offset=970" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	
<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/43044/kanthida-lab</guid>
  <pubDate>Wed, 28 Apr 2021 02:27:22 -0500</pubDate>
  <link></link>
  <title><![CDATA[Kanthida Lab !]]></title>
  <description><![CDATA[
<p>Research Interest: </p>

<p>Bioinformatics </p>

<p>High-throughput and high-dimensional data analysis</p>

<p>Microbiome data analysis (Main focus)</p>

<p>Next-generation and third-generation sequencing data analysis for genomics</p>

<p>Gene expression data analysis</p>

<p>Machine learning for biological data</p>

<p>Biomarkers identification </p>

<p>Database and web-application for biological data</p>

<p>More at <br />https://sites.google.com/mail.kmutt.ac.th/kanthida-k/home?authuser=0</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/2699/translational-bioinformatics-transforming-300-billion-points-of-data</guid>
	<pubDate>Tue, 20 Aug 2013 19:03:47 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/2699/translational-bioinformatics-transforming-300-billion-points-of-data</link>
	<title><![CDATA[Translational Bioinformatics: Transforming 300 Billion Points of Data]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/o4KNG7nd938" frameborder="0" allowfullscreen></iframe>Translational Bioinformatics: Transforming 300 Billion Points of Data into Diagnostics, Therapeutics, and New Insights into Disease      
      
Air date:  Wednesday, June 20, 2012, 3:00:00 PM
Time displayed is Eastern Time, Washington DC Local  
 
Description:  There is an urgent need to translate genome-era discoveries into clinical utility, but the difficulties in making bench-to-bedside translations haven't been well described. The nascent field of translational bioinformatics may help. Dr. Butte's lab at Stanford University builds and applies tools that convert more than 300 billion points of molecular, clinical, and epidemiological data (measured by researchers and clinicians over the past decade) into diagnostics, therapeutics, and new insights into disease. Dr. Butte, a bioinformatician and pediatric endocrinologist, will highlight his lab's work on using publicly available molecular measurements to find new uses for drugs, discovering new treatable mechanisms of disease in type 2 diabetes, and evaluating patients presenting with whole genomes sequenced. 

The NIH Wednesday Afternoon Lecture Series includes weekly scientific talks by some of the top researchers in the biomedical sciences worldwide. 

For more information, visit: 
The NIH Director's Wednesday Afternoon Lecture Series  
Author:  Atul Butte, M.D., Ph.D., Stanford University  
Runtime:  01:07:42  
Permanent link:  http://videocast.nih.gov/launch.asp?17321]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43323/biostarhandbook</guid>
	<pubDate>Fri, 27 Aug 2021 01:31:01 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43323/biostarhandbook</link>
	<title><![CDATA[biostarhandbook]]></title>
	<description><![CDATA[<p>Nice book collection for bioinformatician ... highly recommended.</p><p>Address of the bookmark: <a href="https://www.biostarhandbook.com/" rel="nofollow">https://www.biostarhandbook.com/</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/2741/bioinformatician-dreams</guid>
	<pubDate>Wed, 21 Aug 2013 10:50:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/2741/bioinformatician-dreams</link>
	<title><![CDATA[Bioinformatician Dreams]]></title>
	<description><![CDATA[<p>Bioinformatician life is interconnected, they always dream for a powerful server, little more space on server as they are generating lots of data per run, dream to publish results in good impact journals, meetings reminders :) and research analysis off course!!!&nbsp;</p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/2741" length="557537" type="image/png" />
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44516/16srna-database-download</guid>
	<pubDate>Wed, 24 Apr 2024 04:33:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44516/16srna-database-download</link>
	<title><![CDATA[16sRNA Database Download]]></title>
	<description><![CDATA[<p>Downloading 16S rRNA databases can be crucial for various bioinformatics analyses, especially in microbiome research. However, it's important to note that databases can vary based on your specific needs, such as the taxonomic coverage you require or the type of analysis you're performing. Here's a general guideline on how you can obtain 16S rRNA databases:</p><ol>
<li>
<p><span>NCBI (National Center for Biotechnology Information)</span>:</p>
<ul>
<li>NCBI provides various databases related to genetic information, including 16S rRNA sequences.</li>
<li>You can access the 16S ribosomal RNA sequences from NCBI's Nucleotide database (<a href="https://www.ncbi.nlm.nih.gov/nucleotide/" target="_new">https://www.ncbi.nlm.nih.gov/nucleotide/</a>).</li>
<li>Perform a search using keywords like "16S rRNA" or specific bacterial names to find relevant sequences.</li>
<li>You can download sequences individually or in batches using the provided tools.</li>
</ul>
</li>
<li>
<p><span>GreenGenes</span>:</p>
<ul>
<li>GreenGenes is a widely used 16S rRNA gene sequence database.</li>
<li>You can access it at <a target="_new">http://greengenes.secondgenome.com/</a>.</li>
<li>GreenGenes provides precompiled databases for various purposes, including classification, alignment, and phylogenetic analysis.</li>
</ul>
</li>
<li>
<p><span>SILVA</span>:</p>
<ul>
<li>SILVA (<a href="https://www.arb-silva.de/" target="_new">https://www.arb-silva.de/</a>) is another comprehensive database for ribosomal RNA (rRNA) sequences.</li>
<li>It covers not only 16S rRNA but also other ribosomal RNA sequences.</li>
<li>SILVA provides precompiled databases for various purposes, including taxonomic classification and alignment.</li>
</ul>
</li>
<li>
<p><span>Ribosomal Database Project (RDP)</span>:</p>
<ul>
<li>RDP (<a target="_new">http://rdp.cme.msu.edu/</a>) is a curated database that offers 16S rRNA sequences.</li>
<li>It provides tools for sequence analysis and classification.</li>
<li>You can download sequences and taxonomy information from their website.</li>
</ul>
</li>
<li>
<p><span>QIIME (Quantitative Insights Into Microbial Ecology)</span>:</p>
<ul>
<li>QIIME (<a href="https://qiime2.org/" target="_new">https://qiime2.org/</a>) is a widely used bioinformatics platform for microbiome analysis.</li>
<li>It provides tools for analyzing microbial communities, including processing 16S rRNA sequences.</li>
<li>QIIME often includes its own preprocessed 16S rRNA databases that can be used for analysis within the platform.</li>
</ul>
</li>
</ol><p>Before downloading any database, make sure to read the terms of use and citation requirements, as some databases may have specific usage policies. Additionally, consider the compatibility of the database with your analysis pipeline and software tools.</p><p>&nbsp;</p><p>NCBI 16s RNA database location&nbsp;ftp://ftp.ncbi.nih.gov/blast/db/16SMicrobial.tar.gz</p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/4072/bioinformatics</guid>
	<pubDate>Wed, 28 Aug 2013 19:16:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/4072/bioinformatics</link>
	<title><![CDATA[BIOINFORMATICS]]></title>
	<description><![CDATA[<iframe src="http://player.vimeo.com/video/52455340?byline=0" width="" height="" frameborder="0" webkitAllowFullScreen allowFullScreen></iframe>This is a promo video for the brand new cross-boarder branch of study - BIOINFORMATICS. It´s a co-operation between Johannes Kepler University in Linz (Austria) and University of South Bohemia in České Budějovice (Czech Republic).  Written, Edited and Directed by, DOP, VFX: Jan Míka  Sound by: Mirek Šmilauer  Narrator: Jack Bright  Produced by: FILMOFON (http://www.filmofon.cz)  Released: Nov 2012]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40251/mosdepth-fast-bamcram-depth-calculation-for-wgs-exome-or-targeted-sequencing</guid>
	<pubDate>Wed, 13 Nov 2019 22:20:19 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40251/mosdepth-fast-bamcram-depth-calculation-for-wgs-exome-or-targeted-sequencing</link>
	<title><![CDATA[mosdepth: fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing]]></title>
	<description><![CDATA[<p>mosdepth can output:</p>
<p>per-base depth about 2x as fast samtools depth--about 25 minutes of CPU time for a 30X genome.<br>mean per-window depth given a window size--as would be used for CNV calling.<br>the mean per-region given a BED file of regions.<br>a distribution of proportion of bases covered at or above a given threshold for each chromosome and genome-wide.<br>quantized output that merges adjacent bases as long as they fall in the same coverage bins e.g. (10-20)<br>threshold output to indicate how many bases in each region are covered at the given thresholds.<br>A summary of mean depths per chromosome and within specified regions per chromosome.</p><p>Address of the bookmark: <a href="https://github.com/brentp/mosdepth" rel="nofollow">https://github.com/brentp/mosdepth</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/3046/r-and-bioconductor-tutorial</guid>
	<pubDate>Fri, 23 Aug 2013 08:23:59 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/3046/r-and-bioconductor-tutorial</link>
	<title><![CDATA[R and Bioconductor Tutorial]]></title>
	<description><![CDATA[<p>This tutorial is intended to introduce users quickly to the basics of R, focusing on a few common tasks that &nbsp;biologists need to perform &nbsp;some basic analysis: &nbsp;load a table, plot some graphs, and perform some basic statistics. More extensive tutorials can be found on the project website and via bioconductor (not covered here).</p>
<p>You can add more tutorial links in comments if found new pages.</p><p>Address of the bookmark: <a href="http://manuals.bioinformatics.ucr.edu/home/R_BioCondManual" rel="nofollow">http://manuals.bioinformatics.ucr.edu/home/R_BioCondManual</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</guid>
	<pubDate>Tue, 23 Mar 2021 05:32:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</link>
	<title><![CDATA[Public Databases for Bioinformatics !]]></title>
	<description><![CDATA[<pre>https://www.nature.com/articles/s41467-020-17155-y<br><br>Server Infrastructure:

File Server:

dhara: Synology 3614 Storage Appliance
4 Core Xeon
108TB disk storage
10Gb ethernet to SCG3
Access atx: dhara:5000
Has btsync server (try it - its much better than dropbox)

Compute Servers:

nandi: Kundaje and Phi Server
24 intel cores
256GB RAM
500GB of SSD storage 
36TB RAID6 local storage
4 Intel Phi's (space for 4 more GPU's)


durga: Montgomery and sensitive data
24 intel cores
256GB RAM
500GB of SSD RAID0 storage 
60TB RAID6 local storage

mitra: Bassik and Web/DB Server
24 core
256GB RAM 
500GB of SSD RAID0 storage 
36TB RAID6 local storage

vayu: Kundaje GPU server
4 core
64GB RAM 
200GB of SSD storage 
8TB RAID10 local storage
4 Nvidia GTX 970 4GB GPUs

amold: Bickel and SGE server
32 AMD core
128GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

wotan: Bickel and SGE server
64 AMD core
256GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

Filesystem:

/users/$USER
default home directory
full backups nightly 
nfs mount to dhara
should store code, papers, and other highly processed data here

/mnt/data/
globally accessible data
should store common data here
e.g. genomes and indexes, annotations, ENCODE data  
if you dont want this to count towards your quote you must chown

/mnt/lab_data/$LAB/
lab accessible data
should store lab project data here 
e.g. ATAC-seq prediction data, enhancer prediction, motif calls

/srv/scratch/$USER
fast local storage
not backed up, but on raid and data will never be deleted
most analysis should be performed here

/srv/persistent/$USER
fast local storage
synced nightly, but not backed up
       ie if the hard drives fail or you delete something and notice 
       within 24 hours we can recover. Otherwise not. (vs home which is 
       properly backed up )  
intermediate analysis products that would be hard to recover should be stored here 
       e.g. stochastic analysis results that need to be kept so that paper 
       results can be reproduced

/srv/www/$LABNAME/
web accessible from mitra.stanford.edu
*NOT BACKED UP*

Some parallel programming patterns:

# gzip a bunch of files
parallel gzip -- *.FILESTOGZIP

# fork example in python:
(for more detailed examples look at 
 https://github.com/nboley/grit/ grit/lib/multiprocessing_utils.py)

import os
import time
import random

import multiprocessing

class ProcessSafeOPStream( object ):
    def __init__( self, writeable_obj ):
        self.writeable_obj = writeable_obj
        self.lock = multiprocessing.Lock()
        self.name = self.writeable_obj.name
        return
    
    def write( self, data ):
        self.lock.acquire()
        self.writeable_obj.write( data )
        self.writeable_obj.flush()
        self.lock.release()
        return
    
    def close( self ):
        self.writeable_obj.close()

def worker(queue, ofp):
    # Try without this
    random.seed()
    while True:
        i = queue.get()
        if i == 'FINISHED': return
        # simulate an expensive function
        x = random.random()
        time.sleep(x/10)
        print i, x
        ofp.write("%i\t%s\n" % (i, x))

NSIMS = 10000
NPROC = 25

# populate queue
todo = multiprocessing.Queue()
for i in xrange(NSIMS): todo.put(i)
for i in xrange(NPROC): todo.put('FINISHED')

ofp = ProcessSafeOPStream( open("output.txt", "w") )

pids = []
for i in xrange(NPROC):
    pid = os.fork()
    if pid == 0:
       worker(todo, ofp)
       os._exit(0)
    else:
       pids.append(pid)  

for pid in pids:
    os.waitpid(pid, 0)

ofp.close()

print "FINISHED"<br><br></pre>
<p>For use case 1 we obtained the following ENCODE and ROADMAP datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz">https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam">https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam">https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam</a>. Blacklisted regions were obtained from&nbsp;<a href="http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz">http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz</a>. The human genome version hg38 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz</a>.</p>
<p>For use case 2 we used the set of narrowPeak files summarized in&nbsp;<a href="https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt">https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt</a>&nbsp;(archived version v1.0.1). The human genome version hg19 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz</a></p>
<p>For use case 3 we used the ENCODE datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam">https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig">https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam">https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam</a>&nbsp;as we as the GENCODE annotation v29 from&nbsp;<a href="ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz">ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz</a>.</p><p>Address of the bookmark: <a href="http://mitra.stanford.edu/" rel="nofollow">http://mitra.stanford.edu/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/4043/what-is-bioinformatics</guid>
	<pubDate>Wed, 28 Aug 2013 06:53:05 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/4043/what-is-bioinformatics</link>
	<title><![CDATA[What is Bioinformatics?]]></title>
	<description><![CDATA[<iframe src="http://player.vimeo.com/video/71581534?byline=0" width="" height="" frameborder="0" webkitAllowFullScreen allowFullScreen></iframe>Illustration and Animation: Rachel Robinson Script: Tiffany Trent Voice-over: Kris Monger Sound: Glisten Carefully by Guennadi Malyshevski]]></description>
	
</item>

</channel>
</rss>