<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: All site bookmarks]]></title>
	<link>https://bioinformaticsonline.com/bookmarks/all?offset=220</link>
	<atom:link href="https://bioinformaticsonline.com/bookmarks/all?offset=220" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43042/bioinformatics-in-thailand</guid>
	<pubDate>Wed, 28 Apr 2021 02:04:56 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43042/bioinformatics-in-thailand</link>
	<title><![CDATA[Bioinformatics in Thailand !]]></title>
	<description><![CDATA[<p>Our international PhD and master programs are designed for students who desire focused training in the elements of biology, computer science, and information technology needed for a successful career in the exciting new discipline of Bioinformatics &amp; Systems Biology. Students in our program will receive comprehensive training in omics analysis, database design and management, software engineering and programming (including web-based development), simulation techniques and modeling, and data integration. Each student will apply their skills to a practical project, where they will design and implement a solution to a real-world problem under the guidance of an experienced mentor in industry or academia.</p>
<p><strong>https://bioinformatics.kmutt.ac.th/about.html</strong></p>
<p>Duangrudee Tanramluk (Ajarn Wi) uses computational biology and machine learning to tackle the key to drug design problems via MANORAA webserver.</p>
<p><strong>https://mb.mahidol.ac.th/en/bioinformatics/</strong></p>
<p><strong>https://graduate.mahidol.ac.th/inter/</strong></p>
<p>This&nbsp;international&nbsp;Doctorate programme is designed to further broaden students&rsquo; knowledge in Bioinformatics and Molecular Biology to their maximum capability.&nbsp;</p>
<p><strong>http://www.mbb.psu.ac.th/programmes/phd</strong></p>
<p>Ph.D. program in Bioinformatics and Computational Biology is a joint effort of the Faculty of Science and Faculty of Medicine, Chulalongkorn University. The program has study plans for both applicants who hold a bachelor&rsquo;s degree and applicants who hold a master&rsquo;s degree in any related fields of study.</p>
<p><strong>http://www.bioinfo.sc.chula.ac.th/ph-d-program-specialization/</strong></p>
<p>Additional detail&nbsp;</p>
<p><strong>https://www.biotec.or.th/en/index.php/research/research-units/genome-technology-research-unit</strong></p>
<p><strong>https://tbrcnetwork.org/labtbrc/index.php/bioinformatics-and-chemoinformatics/</strong></p>
<p><strong>https://genomicsthailand.com/Genomic/home</strong></p><p>Address of the bookmark: <a href="https://bioinformatics.kmutt.ac.th/" rel="nofollow">https://bioinformatics.kmutt.ac.th/</a></p>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43040/coronavir-computational-resources-on-novel-coronavirus-sars-cov-2-or-covid-19</guid>
	<pubDate>Tue, 27 Apr 2021 01:58:36 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43040/coronavir-computational-resources-on-novel-coronavirus-sars-cov-2-or-covid-19</link>
	<title><![CDATA[CoronaVIR: Computational Resources on Novel Coronavirus (SARS-CoV-2 or COVID-19)]]></title>
	<description><![CDATA[<div>
<p style="text-align: justify;">Aim of this web site is to facilitate the scientific community to fight against severe pandemic disease COVID-19 caused by SARS-CoV-2. Here, We have collected and organized information related to novel strain of coronavirus, i.e. SARS-CoV-2.and its resulting disease COVID-19 from the literature and other resources from the Internet. We are providing links to appropriate literature. Moreover, we are Bioinformatics Group, based on our knowledge and expertise, we are also proposing potential diagnostics primers, peptide and RNA based vaccine candidates and potential drug molecules. These are predicted candidates, need to be validated by experimental Researchers, who have appropriate infrastructure. It is an integrated multi-omics repository dedicated to current genomic, proteomic, diagnostic and therapeutic knowledge about coronaviruses particularly the recent strain, i.e. SARS-CoV-2 or 2019-nCoV. This web resource will be helpful for the researchers engaged in the development of therapies and drugs for the COVID-19. The information is collected from various available resources.<br><strong>Cite:&nbsp;</strong><a href="https://www.liebertpub.com/doi/10.1089/mab.2020.0035">Patiyal, Sumeet, et al. &ldquo;A Web-based Platform on COVID-19 to Maintain Predicted Diagnostic, Drug<br>and Vaccine Candidates.&rdquo; Monoclon Antib Immunodiagn Immunother. doi.org/10.1089/mab.2020.0035</a></p>
<div>
<p>&nbsp;</p>
</div>
</div><p>Address of the bookmark: <a href="https://webs.iiitd.edu.in/raghava/coronavir/" rel="nofollow">https://webs.iiitd.edu.in/raghava/coronavir/</a></p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43025/modular-efficient-and-constant-memory-single-cell-rna-seq-preprocessing</guid>
	<pubDate>Mon, 05 Apr 2021 11:19:43 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43025/modular-efficient-and-constant-memory-single-cell-rna-seq-preprocessing</link>
	<title><![CDATA[Modular, efficient and constant-memory single-cell RNA-seq preprocessing]]></title>
	<description><![CDATA[<p>With&nbsp;<strong>kallisto | bustools</strong>&nbsp;you can</p>
<ul>
<li>Generate a&nbsp;<em>cell x gene</em>&nbsp;or&nbsp;<em>cell x transcript equivalence class</em>&nbsp;count matrix</li>
<li>Perform RNA velocity and single-nuclei RNA-seq analsis</li>
<li>Quantify data from numerous technologies such as 10x, inDrops, and Dropseq.</li>
<li>Customize workflows for new technologies and protocols.</li>
<li>Process feature barcoding data such as CITE-seq, REAP-seq, MULTI-seq, Clicktags, and Perturb-seq.</li>
<li>Obtain QC reports from single-cell RNA-seq data</li>
</ul>
<p>The&nbsp;<strong>kallisto | bustools</strong>&nbsp;workflow is described in:</p>
<p>P&aacute;ll Melsted*, A. Sina Booeshaghi*, Lauren Liu, Fan Gao, Lambda Lu, Kyung Hoi (Joseph) Min, Eduardo da Veiga Beltrame, Kristj&aacute;n Eldj&aacute;rn Hj&ouml;rleifsson, Jase Gehring &amp; Lior Pachter&dagger;&nbsp;<a href="https://doi.org/10.1038/s41587-021-00870-2" target="_blank">Modular and efficient pre-processing of single-cell RNA-seq</a>, Nature Biotechnology (2021).</p>
<p>&nbsp;</p>
<p><span>Documentation and tutorials for the kallisto bustools workflow are available at&nbsp;</span><a href="http://pachterlab.github.io/kallistobustools">http://pachterlab.github.io/kallistobustools</a><span>.&nbsp;</span></p>
<p>https://www.nature.com/articles/s41587-021-00870-2</p><p>Address of the bookmark: <a href="https://pachterlab.github.io/kallistobustools/" rel="nofollow">https://pachterlab.github.io/kallistobustools/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43022/a-simple-tutorial-for-a-complex-complexheatmap</guid>
	<pubDate>Fri, 02 Apr 2021 06:18:32 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43022/a-simple-tutorial-for-a-complex-complexheatmap</link>
	<title><![CDATA[A simple tutorial for a complex ComplexHeatmap]]></title>
	<description><![CDATA[<p><em>ComplexHeatmap</em>&nbsp;(Gu, Eils, and Schlesner (2016)) is an R Programming Language (R Core Team (2020)) package that is currently listed in the&nbsp;<a href="https://bioconductor.org/">Bioconductor</a>&nbsp;package repository.</p>
<p><a href="https://github.com/kevinblighe/E-MTAB-6141#2-install-and-load-required-packages">install and load required packages</a></p>
<div>
<pre>  require(<span>RColorBrewer</span>)
  require(<span>ComplexHeatmap</span>)
  require(<span>circlize</span>)
  require(<span>digest</span>)
  require(<span>cluster</span>)</pre>
</div>
<p>If all load successfully, proceed to&nbsp;<span>Part 3</span>. Otherwise, go through the following code chunks in order to ensure that each package is installed and loaded properly.</p>
<p><em>BiocManager</em>&nbsp;(Morgan (2019))</p><p>Address of the bookmark: <a href="https://github.com/kevinblighe/E-MTAB-6141" rel="nofollow">https://github.com/kevinblighe/E-MTAB-6141</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43013/deg-50-a-database-of-essential-genes-in-both-prokaryotes-and-eukaryotes</guid>
	<pubDate>Tue, 30 Mar 2021 11:47:29 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43013/deg-50-a-database-of-essential-genes-in-both-prokaryotes-and-eukaryotes</link>
	<title><![CDATA[DEG 5.0: a database of essential genes in both prokaryotes and eukaryotes]]></title>
	<description><![CDATA[<p><span>Essential genes are those indispensable for the survival of an organism, and their functions are therefore considered a foundation of life. Determination of a minimal gene set needed to sustain a life form, a fundamental question in biology, plays a key role in the emerging field, synthetic biology. </span></p>
<p><span></span><span>DEG is freely available at the website&nbsp;</span><a href="http://tubic.tju.edu.cn/deg" target="_blank">http://tubic.tju.edu.cn/deg</a><span>&nbsp;or&nbsp;</span><a href="http://www.essentialgene.org/" target="_blank">http://www.essentialgene.org</a><span>.</span></p><p>Address of the bookmark: <a href="http://www.essentialgene.org/" rel="nofollow">http://www.essentialgene.org/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43011/deg-50-a-database-of-essential-genes-in-both-prokaryotes-and-eukaryotes</guid>
	<pubDate>Tue, 30 Mar 2021 11:47:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43011/deg-50-a-database-of-essential-genes-in-both-prokaryotes-and-eukaryotes</link>
	<title><![CDATA[DEG 5.0: a database of essential genes in both prokaryotes and eukaryotes]]></title>
	<description><![CDATA[<p><span>Essential genes are those indispensable for the survival of an organism, and their functions are therefore considered a foundation of life. Determination of a minimal gene set needed to sustain a life form, a fundamental question in biology, plays a key role in the emerging field, synthetic biology. </span></p>
<p><span></span><span>DEG is freely available at the website&nbsp;</span><a href="http://tubic.tju.edu.cn/deg" target="_blank">http://tubic.tju.edu.cn/deg</a><span>&nbsp;or&nbsp;</span><a href="http://www.essentialgene.org/" target="_blank">http://www.essentialgene.org</a><span>.</span></p><p>Address of the bookmark: <a href="http://www.essentialgene.org/" rel="nofollow">http://www.essentialgene.org/</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</guid>
	<pubDate>Tue, 23 Mar 2021 05:32:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</link>
	<title><![CDATA[Public Databases for Bioinformatics !]]></title>
	<description><![CDATA[<pre>https://www.nature.com/articles/s41467-020-17155-y<br><br>Server Infrastructure:

File Server:

dhara: Synology 3614 Storage Appliance
4 Core Xeon
108TB disk storage
10Gb ethernet to SCG3
Access atx: dhara:5000
Has btsync server (try it - its much better than dropbox)

Compute Servers:

nandi: Kundaje and Phi Server
24 intel cores
256GB RAM
500GB of SSD storage 
36TB RAID6 local storage
4 Intel Phi's (space for 4 more GPU's)


durga: Montgomery and sensitive data
24 intel cores
256GB RAM
500GB of SSD RAID0 storage 
60TB RAID6 local storage

mitra: Bassik and Web/DB Server
24 core
256GB RAM 
500GB of SSD RAID0 storage 
36TB RAID6 local storage

vayu: Kundaje GPU server
4 core
64GB RAM 
200GB of SSD storage 
8TB RAID10 local storage
4 Nvidia GTX 970 4GB GPUs

amold: Bickel and SGE server
32 AMD core
128GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

wotan: Bickel and SGE server
64 AMD core
256GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

Filesystem:

/users/$USER
default home directory
full backups nightly 
nfs mount to dhara
should store code, papers, and other highly processed data here

/mnt/data/
globally accessible data
should store common data here
e.g. genomes and indexes, annotations, ENCODE data  
if you dont want this to count towards your quote you must chown

/mnt/lab_data/$LAB/
lab accessible data
should store lab project data here 
e.g. ATAC-seq prediction data, enhancer prediction, motif calls

/srv/scratch/$USER
fast local storage
not backed up, but on raid and data will never be deleted
most analysis should be performed here

/srv/persistent/$USER
fast local storage
synced nightly, but not backed up
       ie if the hard drives fail or you delete something and notice 
       within 24 hours we can recover. Otherwise not. (vs home which is 
       properly backed up )  
intermediate analysis products that would be hard to recover should be stored here 
       e.g. stochastic analysis results that need to be kept so that paper 
       results can be reproduced

/srv/www/$LABNAME/
web accessible from mitra.stanford.edu
*NOT BACKED UP*

Some parallel programming patterns:

# gzip a bunch of files
parallel gzip -- *.FILESTOGZIP

# fork example in python:
(for more detailed examples look at 
 https://github.com/nboley/grit/ grit/lib/multiprocessing_utils.py)

import os
import time
import random

import multiprocessing

class ProcessSafeOPStream( object ):
    def __init__( self, writeable_obj ):
        self.writeable_obj = writeable_obj
        self.lock = multiprocessing.Lock()
        self.name = self.writeable_obj.name
        return
    
    def write( self, data ):
        self.lock.acquire()
        self.writeable_obj.write( data )
        self.writeable_obj.flush()
        self.lock.release()
        return
    
    def close( self ):
        self.writeable_obj.close()

def worker(queue, ofp):
    # Try without this
    random.seed()
    while True:
        i = queue.get()
        if i == 'FINISHED': return
        # simulate an expensive function
        x = random.random()
        time.sleep(x/10)
        print i, x
        ofp.write("%i\t%s\n" % (i, x))

NSIMS = 10000
NPROC = 25

# populate queue
todo = multiprocessing.Queue()
for i in xrange(NSIMS): todo.put(i)
for i in xrange(NPROC): todo.put('FINISHED')

ofp = ProcessSafeOPStream( open("output.txt", "w") )

pids = []
for i in xrange(NPROC):
    pid = os.fork()
    if pid == 0:
       worker(todo, ofp)
       os._exit(0)
    else:
       pids.append(pid)  

for pid in pids:
    os.waitpid(pid, 0)

ofp.close()

print "FINISHED"<br><br></pre>
<p>For use case 1 we obtained the following ENCODE and ROADMAP datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz">https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam">https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam">https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam</a>. Blacklisted regions were obtained from&nbsp;<a href="http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz">http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz</a>. The human genome version hg38 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz</a>.</p>
<p>For use case 2 we used the set of narrowPeak files summarized in&nbsp;<a href="https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt">https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt</a>&nbsp;(archived version v1.0.1). The human genome version hg19 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz</a></p>
<p>For use case 3 we used the ENCODE datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam">https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig">https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam">https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam</a>&nbsp;as we as the GENCODE annotation v29 from&nbsp;<a href="ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz">ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz</a>.</p><p>Address of the bookmark: <a href="http://mitra.stanford.edu/" rel="nofollow">http://mitra.stanford.edu/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42985/janggu-deep-learning-for-genomics</guid>
	<pubDate>Tue, 23 Mar 2021 05:14:43 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42985/janggu-deep-learning-for-genomics</link>
	<title><![CDATA[Janggu - Deep learning for Genomics]]></title>
	<description><![CDATA[<p><span>Janggu is a python package that facilitates deep learning in the context of genomics. The package is freely available under a GPL-3.0 license.</span></p>
<p><span>Detail tutorial at&nbsp;https://janggu.readthedocs.io/en/latest/</span></p>
<p><span>USE cases</span></p>
<p><span>https://github.com/wkopp/janggu_usecases</span></p><p>Address of the bookmark: <a href="https://github.com/BIMSBbioinfo/janggu" rel="nofollow">https://github.com/BIMSBbioinfo/janggu</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42965/nucl2vec-local-alignment-of-dna-sequences-using-distributed-vector-representation</guid>
	<pubDate>Tue, 16 Mar 2021 05:45:44 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42965/nucl2vec-local-alignment-of-dna-sequences-using-distributed-vector-representation</link>
	<title><![CDATA[Nucl2Vec: Local alignment of DNA sequences using Distributed Vector Representation]]></title>
	<description><![CDATA[<p><span>We demonstrate a novel approach for</span><span>local alignment of DNA reads with respect to reference genome.</span><span>For this process we have used Skip-gram model for creating</span><span>encoding(Nucl2Vec) and k-nearest neighbor for the alignment.</span><span>With our new approach we have reduced computation cost for</span><span>local alignment , while achieving accuracy comparable to existing</span><span>defacto standard BWA-MEM tool.</span> </p>
<p><em>https://prakharg24.github.io/papers/401851.full.pdf</em></p><p>Address of the bookmark: <a href="https://prakharg24.github.io/papers/401851.full.pdf" rel="nofollow">https://prakharg24.github.io/papers/401851.full.pdf</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42963/davi-deep-learning-based-tool-for-alignment-and-single-nucleotide-variant-identification</guid>
	<pubDate>Tue, 16 Mar 2021 05:41:33 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42963/davi-deep-learning-based-tool-for-alignment-and-single-nucleotide-variant-identification</link>
	<title><![CDATA[DAVI: Deep learning-based tool for alignment and single nucleotide variant identification]]></title>
	<description><![CDATA[<p>DAVI consists of models for both global and local alignment and for variant calling. We have evaluated the performance of DAVI against existing state-of-the-art tool sets and found that its accuracy and performance is comparable to existing tools used for bench-marking. We further demonstrate that while existing tools are based on data generated from a specific sequencing technology, the models proposed in DAVI are generic and can be used across different NGS technologies as well as across different species</p>
<p>https://iopscience.iop.org/article/10.1088/2632-2153/ab7e19/pdf</p><p>Address of the bookmark: <a href="https://github.com/gguptaiitd/NEAT" rel="nofollow">https://github.com/gguptaiitd/NEAT</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>