<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/38039?offset=380</link>
	<atom:link href="https://bioinformaticsonline.com/related/38039?offset=380" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34594/synima-synteny-imaging-tool</guid>
	<pubDate>Sun, 10 Dec 2017 17:03:48 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34594/synima-synteny-imaging-tool</link>
	<title><![CDATA[Synima: Synteny Imaging tool]]></title>
	<description><![CDATA[<p><span>Synteny Imaging tool (Synima) written in Perl, which uses the graphical features of R. Synima takes orthologues computed from reciprocal best BLAST hits or OrthoMCL, and DAGchainer, and outputs an overview of genome-wide synteny in PDF. Each of these programs are included with the Synima package, and a pipeline for their use. Synima has a range of graphical parameters including size, colours, order, and labels, which are specified in a config file generated by the first run of Synima &ndash; and can be subsequently edited. Synima runs quickly on a command line to generate informative and publication quality figures. Synima is open source and freely available from&nbsp;</span><span><a href="https://github.com/rhysf/Synima"><span>https://github.com/rhysf/Synima</span></a></span><span>&nbsp;under the MIT License.</span></p><p>Address of the bookmark: <a href="https://github.com/rhysf/Synima" rel="nofollow">https://github.com/rhysf/Synima</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34920/xmatchview-smith-waterman-alignment-visualization</guid>
	<pubDate>Thu, 28 Dec 2017 09:00:58 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34920/xmatchview-smith-waterman-alignment-visualization</link>
	<title><![CDATA[xmatchview: smith-waterman alignment visualization]]></title>
	<description><![CDATA[<p><span>xmatchview and xmatchview-conifer are imaging tools for comparing the synteny between DNA sequences. It allows users to align 2 DNA sequences in fasta format using cross_match and displays the alignment in a variety of image formats. xmatchview and xmatchview-conifer are written in python and run on linux and windows. They serve as visual tools for analyzing cross_match alignments. Cross_match (Green, P. (1994)&nbsp;</span><a href="http://www.phrap.org/">http://www.phrap.org</a><span>) uses an implementation of the Smith-Waterman algorithm for comparing DNA sequences that is sensitive.</span></p>
<p><span>http://www.bcgsc.ca/platform/bioinfo/software/xmatchview</span></p><p>Address of the bookmark: <a href="https://github.com/warrenlr/xmatchview" rel="nofollow">https://github.com/warrenlr/xmatchview</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42143/sibelia-a-comparative-genomics-tool</guid>
	<pubDate>Sat, 22 Aug 2020 02:49:00 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42143/sibelia-a-comparative-genomics-tool</link>
	<title><![CDATA[Sibelia: A comparative genomics tool]]></title>
	<description><![CDATA[<p><strong>Sibelia</strong>: A comparative genomics tool: It assists biologists in analysing the genomic variations that correlate with pathogens, or the genomic changes that help microorganisms adapt in different environments. Sibelia will also be helpful for the evolutionary and genome rearrangement studies for multiple strains of microorganisms.&nbsp;</p>
<p><strong>Sibelia</strong>&nbsp;is useful in finding: (1) shared regions, (2) regions that present in one group of genomes but not in others, (3) rearrangements that transform one genome to other genomes.</p>
<p>More at&nbsp;<a href="http://bioinf.spbau.ru/sibelia">http://bioinf.spbau.ru/sibelia</a></p>
<p>Sibelia docs&nbsp;<a href="http://gensoft.pasteur.fr/docs/Sibelia/3.0.7/SIBELIA.md">http://gensoft.pasteur.fr/docs/Sibelia/3.0.7/SIBELIA.md</a></p><p>Address of the bookmark: <a href="https://github.com/bioinf/Sibelia" rel="nofollow">https://github.com/bioinf/Sibelia</a></p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/44525/synorth-exploring-the-evolution-of-synteny-and-long-range-regulatory-interactions-in-vertebrate-genomes</guid>
	<pubDate>Mon, 06 May 2024 06:21:10 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/44525/synorth-exploring-the-evolution-of-synteny-and-long-range-regulatory-interactions-in-vertebrate-genomes</link>
	<title><![CDATA[Synorth: exploring the evolution of synteny and long-range regulatory interactions in vertebrate genomes]]></title>
	<description><![CDATA[<p><span>Genomic regulatory blocks are chromosomal regions spanned by long clusters of highly conserved noncoding elements devoted to long-range regulation of developmental genes, often immobilizing other, unrelated genes into long-lasting syntenic arrangements. Synorth&nbsp;</span><a href="http://synorth.genereg.net/" target="_blank">http://synorth.genereg.net/</a><span>&nbsp;is a web resource for exploring and categorizing the syntenic relationships in genomic regulatory blocks across multiple genomes, tracing their evolutionary fate after teleost whole genome duplication at the level of genomic regulatory block loci, individual genes, and their phylogenetic context.</span></p>
<p><span>More at&nbsp;https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745767/</span></p><p>Address of the bookmark: <a href="http://synorth.genereg.net/" rel="nofollow">http://synorth.genereg.net/</a></p>]]></description>
	<dc:creator>LEGE</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/37572/gtdb-tk-a-toolkit-for-assigning-objective-taxonomic-classifications-to-bacterial-and-archaeal-genomes</guid>
	<pubDate>Wed, 22 Aug 2018 03:21:01 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/37572/gtdb-tk-a-toolkit-for-assigning-objective-taxonomic-classifications-to-bacterial-and-archaeal-genomes</link>
	<title><![CDATA[GTDB-Tk: A toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.]]></title>
	<description><![CDATA[<p>GTDB-Tk is a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes. It is computationally efficient and designed to work with recent advances that allow hundreds or thousands of metagenome-assembled genomes (MAGs) to be obtained directly from environmental samples. It can also be applied to isolate and single-cell genomes. The GTDB-Tk is open source and released under the GNU General Public License (Version 3).</p>
<p>GTDB-Tk is&nbsp;<span>under active development and validation</span>. Please independently confirm the GTDB-Tk predictions by manually inspecting the tree and bringing any discrepencies to our attention. Notifications about GTDB-Tk releases will be available through the ACE Twitter account (<a href="https://twitter.com/ace_uq">https://twitter.com/ace_uq</a>).</p><p>Address of the bookmark: <a href="https://github.com/Ecogenomics/GTDBTk" rel="nofollow">https://github.com/Ecogenomics/GTDBTk</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/40463/%E2%80%98dockr%E2%80%99-the-r-container</guid>
	<pubDate>Mon, 23 Dec 2019 09:56:49 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/40463/%E2%80%98dockr%E2%80%99-the-r-container</link>
	<title><![CDATA[‘dockr’: the R container]]></title>
	<description><![CDATA[<p><code>dockr</code> 0.8.6 is now available on CRAN. <code>dockr</code> is a minimal toolkit to build a lightweight Docker container image for your R package, in which the package itself is available. The Docker image seeks to mirror your R session as close as possible with respect to R specific dependencies. Both dependencies on CRAN R packages as well as local non-CRAN R packages will be included in the Docker container image.</p>
<p>If you want to know, how Docker works, and why you should consider using Docker, please take a look at the <a href="https://www.docker.com/why-docker" target="_blank">Docker website</a>.</p><p>Address of the bookmark: <a href="https://www.docker.com/why-docker" rel="nofollow">https://www.docker.com/why-docker</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34049/libsvm-a-library-for-support-vector-machines</guid>
	<pubDate>Wed, 02 Aug 2017 06:49:05 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34049/libsvm-a-library-for-support-vector-machines</link>
	<title><![CDATA[LIBSVM -- A Library for Support Vector Machines]]></title>
	<description><![CDATA[<p><strong>LIBSVM&nbsp;</strong>is an integrated software for support vector classification, (C-SVC,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#nuandone">nu-SVC</a>), regression (epsilon-SVR,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#nuandone">nu-SVR</a>) and distribution estimation (<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#nuandone">one-class SVM</a>). It supports multi-class classification.</p>
<p>Since version 2.8, it implements an SMO-type algorithm proposed in this paper:<br>R.-E. Fan, P.-H. Chen, and C.-J. Lin.&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/papers/quadworkset.pdf">Working set selection using second order information for training SVM</a>. Journal of Machine Learning Research 6, 1889-1918, 2005. You can also find a pseudo code there. (<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/faq.html#f203">how to cite LIBSVM</a>)</p>
<p><span style="color: #ff0000;">Our goal is to help users from other fields to easily use SVM as a tool.&nbsp;</span><strong>LIBSVM&nbsp;</strong>provides a simple interface where users can easily link it with their own programs. Main features of&nbsp;<strong>LIBSVM</strong>&nbsp;include</p>
<ul>
<li>Different SVM formulations</li>
<li>Efficient multi-class classification</li>
<li>Cross validation for model selection</li>
<li>Probability estimates</li>
<li>Various kernels (including precomputed kernel matrix)</li>
<li>Weighted SVM for unbalanced data</li>
<li>Both C++ and&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#java">Java</a>&nbsp;sources</li>
<li><a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#GUI">GUI</a>&nbsp;demonstrating SVM classification and regression</li>
<li><a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#python">Python</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#R">R</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#matlab">MATLAB</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#perl">Perl</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#ruby">Ruby</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#weka">Weka</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#lisp">Common LISP</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#clisp">CLISP</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#haskell">Haskell</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#ocaml">OCaml</a>,&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#labview">LabVIEW</a>, and&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#PHP">PHP</a>&nbsp;interfaces.&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#csharp">C# .NET</a>&nbsp;code and&nbsp;<a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/#cuda">CUDA</a>&nbsp;extension is available.&nbsp;<br>It's also included in some data mining environments:&nbsp;<a href="http://rapid-i.com/">RapidMiner</a>,&nbsp;<a href="http://pcp.sourceforge.net/">PCP</a>, and&nbsp;<a href="http://lionoso.org/">LIONsolver</a>.</li>
<li>Automatic model selection which can generate contour of cross validation accuracy.</li>
<li></li>
</ul>
<p>https://www.csie.ntu.edu.tw/~cjlin/libsvm/</p><p>Address of the bookmark: <a href="https://www.csie.ntu.edu.tw/~cjlin/libsvm/" rel="nofollow">https://www.csie.ntu.edu.tw/~cjlin/libsvm/</a></p>]]></description>
	<dc:creator>Neel</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43902/interactivenn-a-web-based-tool-for-the-analysis-of-sets-through-venn-diagrams</guid>
	<pubDate>Wed, 29 Jun 2022 03:22:26 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43902/interactivenn-a-web-based-tool-for-the-analysis-of-sets-through-venn-diagrams</link>
	<title><![CDATA[InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams]]></title>
	<description><![CDATA[<p><span>InteractiVenn, a more flexible tool for interacting with Venn diagrams including up to six sets. It offers a clean interface for Venn diagram construction and enables analysis of set unions while preserving the shape of the diagram. Set unions are useful to reveal differences and similarities among sets and may be guided in our tool by a tree or by a list of set unions. The tool also allows obtaining subsets&rsquo; elements, saving and loading sets for further analyses, and exporting the diagram in vector and image formats. InteractiVenn has been used to analyze two biological datasets, but it may serve set analysis in a broad range of domains.</span></p>
<p><span>More at&nbsp;https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-015-0611-3</span></p>
<p><span><img src="https://media.springernature.com/lw685/springer-static/image/art%3A10.1186%2Fs12859-015-0611-3/MediaObjects/12859_2015_611_Fig1_HTML.gif?as=webp" alt="image" style="border: 0px;"></span></p><p>Address of the bookmark: <a href="http://www.interactivenn.net/" rel="nofollow">http://www.interactivenn.net/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</guid>
	<pubDate>Tue, 23 Mar 2021 05:32:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</link>
	<title><![CDATA[Public Databases for Bioinformatics !]]></title>
	<description><![CDATA[<pre>https://www.nature.com/articles/s41467-020-17155-y<br><br>Server Infrastructure:

File Server:

dhara: Synology 3614 Storage Appliance
4 Core Xeon
108TB disk storage
10Gb ethernet to SCG3
Access atx: dhara:5000
Has btsync server (try it - its much better than dropbox)

Compute Servers:

nandi: Kundaje and Phi Server
24 intel cores
256GB RAM
500GB of SSD storage 
36TB RAID6 local storage
4 Intel Phi's (space for 4 more GPU's)


durga: Montgomery and sensitive data
24 intel cores
256GB RAM
500GB of SSD RAID0 storage 
60TB RAID6 local storage

mitra: Bassik and Web/DB Server
24 core
256GB RAM 
500GB of SSD RAID0 storage 
36TB RAID6 local storage

vayu: Kundaje GPU server
4 core
64GB RAM 
200GB of SSD storage 
8TB RAID10 local storage
4 Nvidia GTX 970 4GB GPUs

amold: Bickel and SGE server
32 AMD core
128GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

wotan: Bickel and SGE server
64 AMD core
256GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

Filesystem:

/users/$USER
default home directory
full backups nightly 
nfs mount to dhara
should store code, papers, and other highly processed data here

/mnt/data/
globally accessible data
should store common data here
e.g. genomes and indexes, annotations, ENCODE data  
if you dont want this to count towards your quote you must chown

/mnt/lab_data/$LAB/
lab accessible data
should store lab project data here 
e.g. ATAC-seq prediction data, enhancer prediction, motif calls

/srv/scratch/$USER
fast local storage
not backed up, but on raid and data will never be deleted
most analysis should be performed here

/srv/persistent/$USER
fast local storage
synced nightly, but not backed up
       ie if the hard drives fail or you delete something and notice 
       within 24 hours we can recover. Otherwise not. (vs home which is 
       properly backed up )  
intermediate analysis products that would be hard to recover should be stored here 
       e.g. stochastic analysis results that need to be kept so that paper 
       results can be reproduced

/srv/www/$LABNAME/
web accessible from mitra.stanford.edu
*NOT BACKED UP*

Some parallel programming patterns:

# gzip a bunch of files
parallel gzip -- *.FILESTOGZIP

# fork example in python:
(for more detailed examples look at 
 https://github.com/nboley/grit/ grit/lib/multiprocessing_utils.py)

import os
import time
import random

import multiprocessing

class ProcessSafeOPStream( object ):
    def __init__( self, writeable_obj ):
        self.writeable_obj = writeable_obj
        self.lock = multiprocessing.Lock()
        self.name = self.writeable_obj.name
        return
    
    def write( self, data ):
        self.lock.acquire()
        self.writeable_obj.write( data )
        self.writeable_obj.flush()
        self.lock.release()
        return
    
    def close( self ):
        self.writeable_obj.close()

def worker(queue, ofp):
    # Try without this
    random.seed()
    while True:
        i = queue.get()
        if i == 'FINISHED': return
        # simulate an expensive function
        x = random.random()
        time.sleep(x/10)
        print i, x
        ofp.write("%i\t%s\n" % (i, x))

NSIMS = 10000
NPROC = 25

# populate queue
todo = multiprocessing.Queue()
for i in xrange(NSIMS): todo.put(i)
for i in xrange(NPROC): todo.put('FINISHED')

ofp = ProcessSafeOPStream( open("output.txt", "w") )

pids = []
for i in xrange(NPROC):
    pid = os.fork()
    if pid == 0:
       worker(todo, ofp)
       os._exit(0)
    else:
       pids.append(pid)  

for pid in pids:
    os.waitpid(pid, 0)

ofp.close()

print "FINISHED"<br><br></pre>
<p>For use case 1 we obtained the following ENCODE and ROADMAP datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz">https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam">https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam">https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam</a>. Blacklisted regions were obtained from&nbsp;<a href="http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz">http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz</a>. The human genome version hg38 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz</a>.</p>
<p>For use case 2 we used the set of narrowPeak files summarized in&nbsp;<a href="https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt">https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt</a>&nbsp;(archived version v1.0.1). The human genome version hg19 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz</a></p>
<p>For use case 3 we used the ENCODE datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam">https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig">https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam">https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam</a>&nbsp;as we as the GENCODE annotation v29 from&nbsp;<a href="ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz">ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz</a>.</p><p>Address of the bookmark: <a href="http://mitra.stanford.edu/" rel="nofollow">http://mitra.stanford.edu/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34482/ribbon-visualizing-complex-genome-alignments-and-structural-variation</guid>
	<pubDate>Wed, 29 Nov 2017 07:40:22 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34482/ribbon-visualizing-complex-genome-alignments-and-structural-variation</link>
	<title><![CDATA[Ribbon: Visualizing complex genome alignments and structural variation:]]></title>
	<description><![CDATA[<p>Ribbon can be used for long reads, short reads, paired-end reads, and assembly/genome alignments. Instructions for each data format are available by clicking on "instructions" in each tab on the right.</p>
<p>Local installation:</p>
<p>You can install Ribbon locally from Github by following the instructions here:&nbsp;<a href="https://github.com/MariaNattestad/ribbon" target="_blank">https://github.com/MariaNattestad/Ribbon</a></p><p>Address of the bookmark: <a href="http://genomeribbon.com/" rel="nofollow">http://genomeribbon.com/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>