<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/30214?offset=1140</link>
	<atom:link href="https://bioinformaticsonline.com/related/30214?offset=1140" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	
<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/opportunity/view/43329/postdoc-position-at-kiel-university-germany</guid>
  <pubDate>Sat, 28 Aug 2021 01:16:55 -0500</pubDate>
  <link></link>
  <title><![CDATA[Postdoc position at Kiel University, Germany]]></title>
  <description><![CDATA[
<p>In the Genomic Microbiology Group of Prof. Tal Dagan at the Institute<br />of Microbiology at Kiel University, Germany, a</p>

<p>Postdoc position (m/w/d)</p>

<p>in the field of computational evolutionary microbiology is available<br />for an initially limited period of 36 months at the earliest possible<br />date. The weekly working time corresponds to 100% of full employment<br />(If the legal requirements under collective bargaining law are met, the<br />tariff grouping is carried out up to pay scale 13 TV-L. The obligation<br />to teach amounts to 4 hours.</p>

<p>The Genomic Microbiology Group research interests are focused on<br />microbial genome evolution with an emphasis on the study of lateral gene<br />transfer. In our research we use both computational and experimental<br />approaches (see www.uni-kiel.de/genomik). The position offers the<br />opportunity to develop an independent research profile within the group<br />research focus. The successful applicant is expected to be involved<br />in teaching of bioinformatics and molecular evolution, including the<br />development of teaching materials (lectures/exercises/short videos).</p>

<p>Your profile:<br />· Doctoral or PhD degree in Molecular Evolution, Bioinformatics or<br />related fields.<br />· Knowledge and experience in programming (e.g., Python) and<br />biostatistical analysis (e.g., with R or MatLab).<br />· Any of the following expertise is an advantage: the analysis of<br />genomic or transcriptomic data, phylogenetic reconstruction,<br />comparative genomics.<br />· Good oral and written communication skills in English.<br />· Ability to teach in German is an advantage (alternatively, an<br />indication to do so from the 2nd year on).<br />· Skills and motivation to communicate and interact with other<br />scientists.<br /> <br />The Christian-Albrechts-University sees itself as a modern and<br />cosmopolitan employer. We welcome your application regardless of your age,<br />gender, cultural and social background, religion, ideology, disability<br />or sexual identity. We promote equality of the sexes.</p>

<p>The Christian-Albrechts-University is committed to the employment of<br />people with disabilities. Preference will be given to applications from<br />severely handicapped persons and persons of equal standing, provided<br />they are suitable.</p>

<p>We expressly welcome applications from people with a migration background.</p>

<p>For enquiries regarding the position, teaching obligations and research<br />topic please contact Prof. Tal Dagan: tdagan@ifam.uni-kiel.de.</p>

<p>Applications should be submitted by email to Mrs. Haacks<br />(dhaacks@ifam.uni-kiel.de) as a single PDF and include: (1) a letter of<br />motivation (max 1 page, Arial 11, line spacing 1.15), (2) CV, (3) PhD<br />certificate. Please use 'GMG postdoc application - [your name]'<br />as a subject.</p>

<p>Please, refrain from sending us application photos.</p>

<p>Application deadline:  August 31 2021 or until the position is<br />filled. Interviews will take place during September/October 2021. The<br />planned starting date for the position is flexible (but in 2021).</p>
]]></description>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/2742/baumbach-lab</guid>
  <pubDate>Wed, 21 Aug 2013 10:56:35 -0500</pubDate>
  <link></link>
  <title><![CDATA[Baumbach Lab]]></title>
  <description><![CDATA[
<p>The Computational Biology research group was established in October 2012 at the Department of Mathematics and Computer Science (IMADA) at the University of Southern Denmark (SDU). It emerged from the Computational Systems Biology group, founded in March 2010 at the Max Planck Institute for Informatics (MPII) and the Cluster of Excellence for Multimodel Computing and Interaction (MMCI) at Saarland University, Saarbrücken, Germany.<br />​<br />The group is headed by Prof. Dr. Jan Baumbach and currently hosts nine PhD students and one postdoctoral fellow at both, IMADA/SDU and MMCI/MPII.</p>

<p>More at &gt;&gt; http://www.baumbachlab.net/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</guid>
	<pubDate>Tue, 18 Jun 2024 02:04:39 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44581/biokit-a-set-of-tools-dedicated-to-bioinformatics-data-visualisation</link>
	<title><![CDATA[BioKit: a set of tools dedicated to bioinformatics, data visualisation]]></title>
	<description><![CDATA[<p><span>BioKit is a set of tools dedicated to bioinformatics, data visualisation (</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.viz" title="biokit.viz"><code><span>biokit.viz</span></code></a><span>), access to online biological data (e.g. UniProt, NCBI thanks to bioservices). It also contains more advanced tools related to data analysis (e.g.,&nbsp;</span><a href="https://biokit.readthedocs.io/en/latest/references.html#module-biokit.stats" title="biokit.stats"><code><span>biokit.stats</span></code></a><span>). Since R is quite common in bioinformatics, we also provide a convenient module to run R inside your Python scripts or shell (:mod:biokit.rtools module).</span></p><p>Address of the bookmark: <a href="https://biokit.readthedocs.io/en/latest/index.html" rel="nofollow">https://biokit.readthedocs.io/en/latest/index.html</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/2839/look-up-a-biological-numbers</guid>
	<pubDate>Fri, 23 Aug 2013 03:27:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/2839/look-up-a-biological-numbers</link>
	<title><![CDATA[Look up a biological numbers]]></title>
	<description><![CDATA[<p><strong>Did you ever need to look up a number</strong><span>&nbsp;like the volume of a cell or the cellular concentration of ATP, only to find yourself spending much more time than you wanted on the Internet or flipping through textbooks - all without much success?&nbsp;</span><br><br><span>Well, it didn&rsquo;t happen only to you. It is often surprising how difficult it can be to find concrete biological numbers, even for properties that have been measured numerous times. To help solve this for one and all, BioNumbers (</span><strong>the database of key numbers in molecular biology</strong><span>) was created. Along with the numbers, you'll find the relevant&nbsp;</span><strong>references to the original literature</strong><span>, useful comments, and related numbers.&nbsp;</span></p>
<p><span><span>To cite BioNumbers please refer to: Milo et al. Nucl. Acids Res. (2010) 38: D750-D753. When using a specific entry from the database it is highly recommended that you also specify the BioNumbers 6 digit ID, e.g. "BNID 100986, Milo et al 2010".&nbsp;</span></span></p><p>Address of the bookmark: <a href="http://bionumbers.hms.harvard.edu/" rel="nofollow">http://bionumbers.hms.harvard.edu/</a></p>]]></description>
	<dc:creator>Jitendra Narayan</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</guid>
	<pubDate>Tue, 23 Mar 2021 05:32:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</link>
	<title><![CDATA[Public Databases for Bioinformatics !]]></title>
	<description><![CDATA[<pre>https://www.nature.com/articles/s41467-020-17155-y<br><br>Server Infrastructure:

File Server:

dhara: Synology 3614 Storage Appliance
4 Core Xeon
108TB disk storage
10Gb ethernet to SCG3
Access atx: dhara:5000
Has btsync server (try it - its much better than dropbox)

Compute Servers:

nandi: Kundaje and Phi Server
24 intel cores
256GB RAM
500GB of SSD storage 
36TB RAID6 local storage
4 Intel Phi's (space for 4 more GPU's)


durga: Montgomery and sensitive data
24 intel cores
256GB RAM
500GB of SSD RAID0 storage 
60TB RAID6 local storage

mitra: Bassik and Web/DB Server
24 core
256GB RAM 
500GB of SSD RAID0 storage 
36TB RAID6 local storage

vayu: Kundaje GPU server
4 core
64GB RAM 
200GB of SSD storage 
8TB RAID10 local storage
4 Nvidia GTX 970 4GB GPUs

amold: Bickel and SGE server
32 AMD core
128GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

wotan: Bickel and SGE server
64 AMD core
256GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

Filesystem:

/users/$USER
default home directory
full backups nightly 
nfs mount to dhara
should store code, papers, and other highly processed data here

/mnt/data/
globally accessible data
should store common data here
e.g. genomes and indexes, annotations, ENCODE data  
if you dont want this to count towards your quote you must chown

/mnt/lab_data/$LAB/
lab accessible data
should store lab project data here 
e.g. ATAC-seq prediction data, enhancer prediction, motif calls

/srv/scratch/$USER
fast local storage
not backed up, but on raid and data will never be deleted
most analysis should be performed here

/srv/persistent/$USER
fast local storage
synced nightly, but not backed up
       ie if the hard drives fail or you delete something and notice 
       within 24 hours we can recover. Otherwise not. (vs home which is 
       properly backed up )  
intermediate analysis products that would be hard to recover should be stored here 
       e.g. stochastic analysis results that need to be kept so that paper 
       results can be reproduced

/srv/www/$LABNAME/
web accessible from mitra.stanford.edu
*NOT BACKED UP*

Some parallel programming patterns:

# gzip a bunch of files
parallel gzip -- *.FILESTOGZIP

# fork example in python:
(for more detailed examples look at 
 https://github.com/nboley/grit/ grit/lib/multiprocessing_utils.py)

import os
import time
import random

import multiprocessing

class ProcessSafeOPStream( object ):
    def __init__( self, writeable_obj ):
        self.writeable_obj = writeable_obj
        self.lock = multiprocessing.Lock()
        self.name = self.writeable_obj.name
        return
    
    def write( self, data ):
        self.lock.acquire()
        self.writeable_obj.write( data )
        self.writeable_obj.flush()
        self.lock.release()
        return
    
    def close( self ):
        self.writeable_obj.close()

def worker(queue, ofp):
    # Try without this
    random.seed()
    while True:
        i = queue.get()
        if i == 'FINISHED': return
        # simulate an expensive function
        x = random.random()
        time.sleep(x/10)
        print i, x
        ofp.write("%i\t%s\n" % (i, x))

NSIMS = 10000
NPROC = 25

# populate queue
todo = multiprocessing.Queue()
for i in xrange(NSIMS): todo.put(i)
for i in xrange(NPROC): todo.put('FINISHED')

ofp = ProcessSafeOPStream( open("output.txt", "w") )

pids = []
for i in xrange(NPROC):
    pid = os.fork()
    if pid == 0:
       worker(todo, ofp)
       os._exit(0)
    else:
       pids.append(pid)  

for pid in pids:
    os.waitpid(pid, 0)

ofp.close()

print "FINISHED"<br><br></pre>
<p>For use case 1 we obtained the following ENCODE and ROADMAP datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz">https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam">https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam">https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam</a>. Blacklisted regions were obtained from&nbsp;<a href="http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz">http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz</a>. The human genome version hg38 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz</a>.</p>
<p>For use case 2 we used the set of narrowPeak files summarized in&nbsp;<a href="https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt">https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt</a>&nbsp;(archived version v1.0.1). The human genome version hg19 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz</a></p>
<p>For use case 3 we used the ENCODE datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam">https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig">https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam">https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam</a>&nbsp;as we as the GENCODE annotation v29 from&nbsp;<a href="ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz">ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz</a>.</p><p>Address of the bookmark: <a href="http://mitra.stanford.edu/" rel="nofollow">http://mitra.stanford.edu/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/3952/ancestor-at-work</guid>
	<pubDate>Sun, 25 Aug 2013 19:45:28 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/3952/ancestor-at-work</link>
	<title><![CDATA[Ancestor at work !!!]]></title>
	<description><![CDATA[<p>When they will learn Bioinformatics :)</p>]]></description>
	<dc:creator>Jit</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/3952" length="10064" type="image/gif" />
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/4835/chang-lab</guid>
  <pubDate>Tue, 24 Sep 2013 17:25:49 -0500</pubDate>
  <link></link>
  <title><![CDATA[Chang lab]]></title>
  <description><![CDATA[
<p>The Chang lab is focused on how the activities of hundreds or even thousands of genes (gene parties) are coordinated to achieve biological meaning. We have pioneered methods to predict, dissect, and control large-scale gene regulatory programs; these methods have provided insights into human development, cancer, and aging. A particular interest is how cells know and remember their locations in the body, particularly with the help of long noncoding RNAs.</p>

<p>More at http://changlab.stanford.edu/index.html</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34567/jobtree-based-python-wrapper-to-run-the-genome-simulation-tool-suite-evolver</guid>
	<pubDate>Fri, 08 Dec 2017 16:26:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34567/jobtree-based-python-wrapper-to-run-the-genome-simulation-tool-suite-evolver</link>
	<title><![CDATA[jobTree based python wrapper to run the genome simulation tool suite Evolver]]></title>
	<description><![CDATA[<p><span>evolverSimControl</span><span>&nbsp;(</span><span>eSC</span><span>) can be used to simulate multi-chromosome genome evolution on an arbitrary phylogeny (</span><a href="http://evolution.genetics.washington.edu/phylip/newicktree.html">Newick format</a><span>). In addition to simply running evolver,&nbsp;</span><span>eSC</span><span>&nbsp;also automatically creates statistical summaries of the simulation as it runs including text and image files. Also included are convenience scripts to: check on a running simulation and see detailed status and logging information; extract fasta sequence files from the leaf nodes of a completed simulation; extract pairwise multiple alignment files (</span><a href="http://genome.ucsc.edu/FAQ/FAQformat.html#format5">.maf</a><span>) from leaf and branch nodes from a completed simulation and with the help of&nbsp;</span><a href="https://github.com/dentearl/mafTools/">mafJoin</a><span>, join them together into a single maf covering the entire simulation.</span></p><p>Address of the bookmark: <a href="https://github.com/dentearl/evolverSimControl" rel="nofollow">https://github.com/dentearl/evolverSimControl</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/videolist/watch/4155/genetics-for-fun-and-profit-andrew-hessel-at-tedxvilnius</guid>
	<pubDate>Sun, 01 Sep 2013 19:15:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/videolist/watch/4155/genetics-for-fun-and-profit-andrew-hessel-at-tedxvilnius</link>
	<title><![CDATA[Genetics for Fun and Profit: Andrew Hessel at TEDxVilnius]]></title>
	<description><![CDATA[<iframe width="" height="" src="https://www.youtube-nocookie.com/embed/A2h_JW7X_HE" frameborder="0" allowfullscreen></iframe>Andrew Hessel co-chairs the Bioinformatics and Biotechnology track at the Singularity University, an institution founded by futurist Ray Kurzweil and X Prize Foundation CEO Peter Diamandis, with sponsorship from world-leading organizations that include Google, Autodesk, and NASA. He is also the founder of the Pink Army Cooperative, a venture aiming to make open source personalized cancer therapies. His work has been featured in The New York Times, Futurist Magazine, H+, and Wired News.

In the spirit of ideas worth spreading, TEDx is a program of local, self-organized events that bring people together to share a TED-like experience. At a TEDx event, TEDTalks video and live speakers combine to spark deep discussion and connection in a small group. These local, self-organized events are branded TEDx, where x = independently organized TED event. The TED Conference provides general guidance for the TEDx program, but individual TEDx events are self-organized.* (*Subject to certain rules and regulations)]]></description>
	
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</guid>
	<pubDate>Tue, 26 Dec 2017 22:23:39 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</link>
	<title><![CDATA[Magic-BLAST: a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome.]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p>
<p>Magic-BLAST incorporates within the NCBI BLAST code framework ideas developed in the NCBI Magic pipeline, in particular hit extensions by local walk and jump&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26109056">(http://www.ncbi.nlm.nih.gov/pubmed/26109056)</a>, and recursive clipping of mismatches near the edges of the reads, which avoids accumulating artefactual mismatches near splice sites and is needed to distinguish short indels from substitutions near the edges.</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>