<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/35384?offset=390</link>
	<atom:link href="https://bioinformaticsonline.com/related/35384?offset=390" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/43896/list-of-comparative-genomics-resources</guid>
	<pubDate>Tue, 28 Jun 2022 04:08:06 -0500</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/43896/list-of-comparative-genomics-resources</link>
	<title><![CDATA[List of comparative genomics resources !]]></title>
	<description><![CDATA[<div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1096638041"><span>3D-GENOMICS -- A Database to Compare Structural and Functional Annotations of Proteins between Sequenced Genomes</span></a></div><p>Compare structural and functional annotations of proteins between sequenced genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1100640374"><span>ARED Organism -- expansion of ARED reveals AU-rich element cluster variations between human and mouse</span></a></div><p>View AREs in the human transcriptome and study the comparative genomics of AREs in model organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234973128"><span>ATGC -- Alignable Tight Genomic Clusters Database</span></a></div><p>Find information about orthologous genes in prokaryotes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174596104"><span>AnimalQTLdb -- a livestock QTL database tool set for positional QTL information mining and beyond</span></a></div><p>Search for publicly available QTL data on livestocks and animal species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518150135"><span>BGDB -- Bovine Genome Database</span></a></div><p>Find information about bovine genomics data.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229012662"><span>COMPARE -- a multi-organism system for cross-species data comparison and transfer of information</span></a></div><p>A multi-organism web-based resource system designed to easily retrieve, correlate and interpret data across species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1218141952"><span>CONDOR -- COnserved Non-coDing Orthologous Regions</span></a></div><p>A database resource of developmentally associated conserved non-coding elements.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099057221"><span>CORG -- A database for COmparative Regulatory Genomics</span></a></div><p>Delineate conserved non-coding blocks from upstream regions of putative orthologous gene pairs from man, mouse, rat, fugu, Mus musculus, Danio rerio, and zebrafish.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1203608896"><span>COXPRESdb -- a database of coexpressed gene networks in mammals</span></a></div><p>Find coexpressed gene lists and networks in human and mouse.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1097763045"><span>CVTree -- A Phylogenetic Tree Reconstruction Tool Based on Whole Genomes</span></a></div><p>Construct phylogenetic tree of microorganisms based on oligopeptide content of their complete proteomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232729680"><span>CleanEST -- the cleansed EST libraries database</span></a></div><p>A novel database server that classifies GenBank's dbEST (database of expressed gene sequences) libraries and removes contaminants.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256926144"><span>CoCoa -- COefficient of COAncestry software</span></a></div><p>Find information about the ancestral relationship between genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1227549154"><span>CoGemiR -- a comparative genomics microRNA database</span></a></div><p>Provides an overview of the genomic organization of microRNAs and extent of conservation during evolution in different metazoan species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1117678221"><span>Comparative Genometrics (CG) -- a database dedicated to biometric comparisons of whole genomes</span></a></div><p>Conduct comparative biometric analysis of chromosomes of different organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151007916"><span>DoTS -- Database Of Transcribed Sequences</span></a></div><p>Search for Indices of gene and transcripts in human and mouse.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174510065"><span>DroSpeGe -- rapid access database for new Drosophila species genomes</span></a></div><p>Search and compare 12 new and old Drosophila genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1098208414"><span>ECR Browser -- A Tool for Visualizing and Accessing Data from Comparisons of Multiple Vertebrate Genomes</span></a></div><p>Access to whole genome alignments of human, mouse, rat and fish sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738459"><span>EPGD -- Eukaryotic Paralog Group Database</span></a></div><p>Find eukaryotic paralog/paralogon information.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232726869"><span>EVOG -- evolutionary visualizer for overlapping genes</span></a></div><p>Analyze the evolutionary process of overlapping genes when comparing different species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1227633714"><span>GNAT -- Inter-species gene mention normalization (ISGN)</span></a></div><p>The first publicly available system reported to handle inter-species gene mention normalization.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229438992"><span>GenColors -- annotation and comparative genomics of prokaryotes made easy</span></a></div><p>A web-based software/database system aimed at an improved and accelerated annotation of prokaryotic genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151086258"><span>GeneNest gene indices</span></a></div><p>Visualize gene indices of human, mouse, Arabidopsis, Zebrafish, Drosophila and Sheep.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489378"><span>GenomeTrafac -- a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs</span></a></div><p>Use comparative genomics approach to characterize gene models and identify putative cis-regulatory regions of RefSeq Gene Orthologs.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518150753"><span>IKMC -- International Knockout Mouse Consortium web portal</span></a></div><p>Find information about mutated mouse genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209411604"><span>IMG/M -- Integrated Microbial Genomes/Metagenomes</span></a></div><p>A data management and analysis system for metagenomes</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234976694"><span>ISED -- Influenza sequence and epitope database.</span></a></div><p>Search for influenza sequence, vaccine, and drug resistance information.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20140710115515"><span>LAMDHI: The Search for Animal Models Starts Here</span></a></div><p>LAMHDI, the initiative to Link Animal Models to Human DIsease, is designed to accelerate the research process by providing biomedical researchers with a simple, comprehensive Web-based resource to find the best animal models for their research.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1228843803"><span>MANTIS -- a phylogenetic framework for multi-species genome comparisons</span></a></div><p>The missing link between multi-species full genome comparisons and functional analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099578148"><span>MBGD -- Microbial genome database for comparative analysis</span></a></div><p>Conduct comparative analysis of completely sequenced microbial genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1221077729"><span>MEGA -- Molecular Evolutionary Genetics Analysis</span></a></div><p>A biologist-centric software for evolutionary analysis of DNA and protein sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174596756"><span>MamPol -- a database of nucleotide polymorphism in the Mammalia class</span></a></div><p>Conduct single nucleotide polymorphisms diversity measurements among homologous sequences from the Mammalia class.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1266437314"><span>MicrobesOnline -- Prokaryotic Genome Database</span></a></div><p>Find information about 1000s of microbial genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1208461006"><span>Narcisse -- a mirror view of conserved syntenies</span></a></div><p>A database dedicated to the study of genome conservation.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1219772764"><span>OMA -- the Orthologous MAtrix project</span></a></div><p>Explore orthologous relations across 352 complete genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738741"><span>OPTIC -- orthologous and paralogous transcripts in clades</span></a></div><p>Browse complete genomes in several clades.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209573208"><span>OrthoDB -- the hierarchical catalog of eukaryotic orthologs</span></a></div><p>Find groups of orthologous genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1221231200"><span>OrthoMaM -- orthologous mammalian markers</span></a></div><p>A database of orthologous genomic markers for placental mammal phylogenetics.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1100009979"><span>PEDANT -- Protein Extraction, Description and ANalysis Tool</span></a></div><p>Conduct genome wide functional and structural analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489475"><span>PReMod -- a database of genome-wide mammalian cis-regulatory module predictions</span></a></div><p>Conduct genome-wide cis-regulatory module (CRM) predictions for both the human and the mouse genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1151083092"><span>PhenomicDB -- Comparison of phenotypes of orthologous genes in human and model organisms</span></a></div><p>Compare phenotypes of a given gene or gene set in different model organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1190899370"><span>Phylemon -- A suite of web tools for molecular evolution, phylogenetics and phylogenomics</span></a></div><p>Phylemon is a web server that integrates a selected suite of more than 20 different tools from the most popular stand-alone programs of phylogenetic and evolutionary analysis.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232555615"><span>PhyloPat -- the phylogenetic pattern database</span></a></div><p>Use this database to see where in the evolution some phylogenetic lineages were started, and over which species they were contained.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174510223"><span>Pristionchus.org -- a genome-centric database of the nematode satellite species Pristionchus pacificus</span></a></div><p>Search for genomic information on nematode satellite species Pristionchus pacificus.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1236367352"><span>ProtClustDB -- NCBI Protein Clusters Database</span></a></div><p>Find information about related protein sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209410278"><span>ProtozoaDB -- database of protozoan genomes</span></a></div><p>Database hosting genomics and post-genomics data from multiple protozoans.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1232554690"><span>Pseudofam -- the pseudogene families database</span></a></div><p>A database of pseudogene families based on the protein families from the Pfam database.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518151439"><span>RIDM - RIKEN Integrated Database of Mammals</span></a></div><p>Find genomic information about mammals.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1272562567"><span>RegPrecise -- Regulon Prediction Database</span></a></div><p>Find information about predicted regulons in prokaryotic transcription regulation.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1272477473"><span>SALAD -- Surveyed contained motif ALignment diagram and the Associating Dendrogram</span></a></div><p>Perform systematic comparison of proteome data among species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229010765"><span>SGN -- SOL Genomics Network</span></a></div><p>A comparative map viewer dedicated to the biology of the Solanaceae family.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256669040"><span>ShotgunFunctionalizeR -- R-package for functional comparison of metagenomes</span></a></div><p>Analyze data from functional analysis on fragmented microbial genetic material.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1256238439"><span>SnoopCGH -- Comparative Genomic Hybridization software</span></a></div><p>Visualize and explore comparative genomic hybridization data sets.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1174489598"><span>SwissRegulon -- a database of genome-wide annotations of regulatory sites</span></a></div><p>Search for genome-wide annotations of regulatory sites in yeast and prokaryotes genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1229013521"><span>TaxonGap -- a visualization tool for intra- and inter-species variation among individual biomarkers</span></a></div><p>Compare and select individual biomarkers.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1106063477"><span>The Adaptive Evolution Database (TAED) -- a phylogeny based tool for comparative genomics</span></a></div><p>Search for information on adaptive evolution in gene families of higher plants and chordate.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1216742716"><span>The CGView Server -- a comparative genomics tool for circular genomes</span></a></div><p>Generate graphical maps of circular genomes that show sequence features, base composition plots, analysis results and sequence similarity plots.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1099663588"><span>The ERGO -- Genome analysis and discovery system</span></a></div><p>Conduct a comprehensive analysis of genes and genomes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1177611772"><span>The Macaque Genome: Interactive Poster and Teaching Resource</span></a></div><p>An interactive online poster presentation on the Macaque genome, including high-quality images, video clips, and Web resources</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1103816940"><span>The TIGR Gene Indices -- clustering and assembling EST and known genes and integration with eukaryotic genomes</span></a></div><p>Search for annotated genetic information of expressed sequence tags (ESTs) in different eukaryotic organisms.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1043767169"><span>UniGene</span></a></div><p>Find mapping and expression information for a unigene cluster (ESTs and full-length mRNA sequences organized into clusters that each represent a unique known or putative gene)</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1216738072"><span>Uprobe -- universal overgo hybridization-based probe retrieval and design</span></a></div><p>A public online resource for identifying or designing 'universal' overgo-hybridization probes from conserved sequences that can be used to efficiently screen one or more genomic libraries from a designated group of species.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1098205291"><span>VISTA -- Computational Tools for Comparative Genomics</span></a></div><p>Comprehensive suite of programs and databases for comparative analysis of genomic sequences.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL20110518144404"><span>cBARBEL -- Catfish Breeder and Researcher Bioinformatics Entry Location</span></a></div><p>Find information about ictalurid catfish.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1209738040"><span>eggNOG -- evolutionary genealogy of genes: Non-supervised Orthologous Groups</span></a></div><p>Discover orthologous groups of genes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1234370319"><span>metaTIGER -- a metabolic gene evolution resource</span></a></div><p>Find metabolic networks and phylogenomic information on a taxonomically diverse range of eukaryotes.</p></div><div><div><a href="https://www.hsls.pitt.edu/obrc/index.php?page=URL1138901833"><span>xBASE -- a collection of online databases for bacterial comparative genomics</span></a></div><p>Conduct bacterial comparative genomics.</p></div>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/33887/gview-a-java-application-for-viewing-and-examining-prokaryotic-genomes-in-a-circular-or-linear-context</guid>
	<pubDate>Fri, 14 Jul 2017 07:47:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/33887/gview-a-java-application-for-viewing-and-examining-prokaryotic-genomes-in-a-circular-or-linear-context</link>
	<title><![CDATA[GView: A Java application for viewing and examining prokaryotic genomes in a circular or linear context]]></title>
	<description><![CDATA[<p>GView is a Java application for viewing and examining prokaryotic genomes in a circular or linear context. It accepts standard sequence file formats and an optional style specification file to generate customizable, publication quality genome maps in bitmap and scalable vector graphics formats. GView features an interactive pan-and-zoom interface, a command-line interface for incorporation in genome analysis pipelines, and a public Application Programming Interface for incorporation in other Java applications.</p>
<p><strong>Availability:</strong>&nbsp;GView is a freely available application licensed under the GNU Public License. The application, source code, documentation, file specifications, tutorials and image galleries are available at&nbsp;<a href="http://gview.ca/" target="pmc_ext">http://gview.ca</a></p>
<p><strong>Contact:</strong>&nbsp;<a href="mailto:dev@null">ac.cg.cpsa-cahp@raalesmod.nav.yrag</a></p>
<p>https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995121/</p><p>Address of the bookmark: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995121/" rel="nofollow">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995121/</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</guid>
	<pubDate>Tue, 23 Mar 2021 05:32:15 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/42987/public-databases-for-bioinformatics</link>
	<title><![CDATA[Public Databases for Bioinformatics !]]></title>
	<description><![CDATA[<pre>https://www.nature.com/articles/s41467-020-17155-y<br><br>Server Infrastructure:

File Server:

dhara: Synology 3614 Storage Appliance
4 Core Xeon
108TB disk storage
10Gb ethernet to SCG3
Access atx: dhara:5000
Has btsync server (try it - its much better than dropbox)

Compute Servers:

nandi: Kundaje and Phi Server
24 intel cores
256GB RAM
500GB of SSD storage 
36TB RAID6 local storage
4 Intel Phi's (space for 4 more GPU's)


durga: Montgomery and sensitive data
24 intel cores
256GB RAM
500GB of SSD RAID0 storage 
60TB RAID6 local storage

mitra: Bassik and Web/DB Server
24 core
256GB RAM 
500GB of SSD RAID0 storage 
36TB RAID6 local storage

vayu: Kundaje GPU server
4 core
64GB RAM 
200GB of SSD storage 
8TB RAID10 local storage
4 Nvidia GTX 970 4GB GPUs

amold: Bickel and SGE server
32 AMD core
128GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

wotan: Bickel and SGE server
64 AMD core
256GB RAM 
200GB of SSD storage 
12TB RAID5 local storage

Filesystem:

/users/$USER
default home directory
full backups nightly 
nfs mount to dhara
should store code, papers, and other highly processed data here

/mnt/data/
globally accessible data
should store common data here
e.g. genomes and indexes, annotations, ENCODE data  
if you dont want this to count towards your quote you must chown

/mnt/lab_data/$LAB/
lab accessible data
should store lab project data here 
e.g. ATAC-seq prediction data, enhancer prediction, motif calls

/srv/scratch/$USER
fast local storage
not backed up, but on raid and data will never be deleted
most analysis should be performed here

/srv/persistent/$USER
fast local storage
synced nightly, but not backed up
       ie if the hard drives fail or you delete something and notice 
       within 24 hours we can recover. Otherwise not. (vs home which is 
       properly backed up )  
intermediate analysis products that would be hard to recover should be stored here 
       e.g. stochastic analysis results that need to be kept so that paper 
       results can be reproduced

/srv/www/$LABNAME/
web accessible from mitra.stanford.edu
*NOT BACKED UP*

Some parallel programming patterns:

# gzip a bunch of files
parallel gzip -- *.FILESTOGZIP

# fork example in python:
(for more detailed examples look at 
 https://github.com/nboley/grit/ grit/lib/multiprocessing_utils.py)

import os
import time
import random

import multiprocessing

class ProcessSafeOPStream( object ):
    def __init__( self, writeable_obj ):
        self.writeable_obj = writeable_obj
        self.lock = multiprocessing.Lock()
        self.name = self.writeable_obj.name
        return
    
    def write( self, data ):
        self.lock.acquire()
        self.writeable_obj.write( data )
        self.writeable_obj.flush()
        self.lock.release()
        return
    
    def close( self ):
        self.writeable_obj.close()

def worker(queue, ofp):
    # Try without this
    random.seed()
    while True:
        i = queue.get()
        if i == 'FINISHED': return
        # simulate an expensive function
        x = random.random()
        time.sleep(x/10)
        print i, x
        ofp.write("%i\t%s\n" % (i, x))

NSIMS = 10000
NPROC = 25

# populate queue
todo = multiprocessing.Queue()
for i in xrange(NSIMS): todo.put(i)
for i in xrange(NPROC): todo.put('FINISHED')

ofp = ProcessSafeOPStream( open("output.txt", "w") )

pids = []
for i in xrange(NPROC):
    pid = os.fork()
    if pid == 0:
       worker(todo, ofp)
       os._exit(0)
    else:
       pids.append(pid)  

for pid in pids:
    os.waitpid(pid, 0)

ofp.close()

print "FINISHED"<br><br></pre>
<p>For use case 1 we obtained the following ENCODE and ROADMAP datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz">https://www.encodeproject.org/files/ENCFF446WOD/@@download/ENCFF446WOD.bed.gz</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam">https://www.encodeproject.org/files/ENCFF546PJU/@@download/ENCFF546PJU.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam">https://www.encodeproject.org/files/ENCFF059BEU/@@download/ENCFF059BEU.bam</a>. Blacklisted regions were obtained from&nbsp;<a href="http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz">http://mitra.stanford.edu/kundaje/akundaje/release/blacklists/hg38-human/hg38.blacklist.bed.gz</a>. The human genome version hg38 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz</a>.</p>
<p>For use case 2 we used the set of narrowPeak files summarized in&nbsp;<a href="https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt">https://github.com/wkopp/janggu_usecases/tree/master/extra/urls.txt</a>&nbsp;(archived version v1.0.1). The human genome version hg19 was obtained from&nbsp;<a href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz">http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/hg19.fa.gz</a></p>
<p>For use case 3 we used the ENCODE datasets&nbsp;<a href="https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam">https://www.encodeproject.org/files/ENCFF591XCX/@@download/ENCFF591XCX.bam</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig">https://www.encodeproject.org/files/ENCFF736LHE/@@download/ENCFF736LHE.bigWig</a>,&nbsp;<a href="https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam">https://www.encodeproject.org/files/ENCFF177HHM/@@download/ENCFF177HHM.bam</a>&nbsp;as we as the GENCODE annotation v29 from&nbsp;<a href="ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz">ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_29/gencode.v29.annotation.gtf.gz</a>.</p><p>Address of the bookmark: <a href="http://mitra.stanford.edu/" rel="nofollow">http://mitra.stanford.edu/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34482/ribbon-visualizing-complex-genome-alignments-and-structural-variation</guid>
	<pubDate>Wed, 29 Nov 2017 07:40:22 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34482/ribbon-visualizing-complex-genome-alignments-and-structural-variation</link>
	<title><![CDATA[Ribbon: Visualizing complex genome alignments and structural variation:]]></title>
	<description><![CDATA[<p>Ribbon can be used for long reads, short reads, paired-end reads, and assembly/genome alignments. Instructions for each data format are available by clicking on "instructions" in each tab on the right.</p>
<p>Local installation:</p>
<p>You can install Ribbon locally from Github by following the instructions here:&nbsp;<a href="https://github.com/MariaNattestad/ribbon" target="_blank">https://github.com/MariaNattestad/Ribbon</a></p><p>Address of the bookmark: <a href="http://genomeribbon.com/" rel="nofollow">http://genomeribbon.com/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34571/mugsy-multiple-whole-genome-alignment-tool</guid>
	<pubDate>Fri, 08 Dec 2017 17:41:14 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34571/mugsy-multiple-whole-genome-alignment-tool</link>
	<title><![CDATA[Mugsy: multiple whole genome alignment tool]]></title>
	<description><![CDATA[<p><span>Mugsy is a multiple whole genome aligner. Mugsy uses Nucmer for pairwise alignment, a custom graph based segmentation procedure for identifying collinear regions, and the segment-based progressive multiple alignment strategy from Seqan::TCoffee. Mugsy accepts draft genomes in the form of multi-FASTA files and does not require a reference genome.</span></p>
<p>To cite Mugsy, use:</p>
<p>Angiuoli SV and Salzberg SL.&nbsp;<a href="http://bioinformatics.oxfordjournals.org/content/27/3/334">Mugsy: Fast multiple alignment of closely related whole genomes.</a><em>Bioinformatics</em>&nbsp;2011 27(3):334-4</p><p>Address of the bookmark: <a href="http://mugsy.sourceforge.net/" rel="nofollow">http://mugsy.sourceforge.net/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</guid>
	<pubDate>Tue, 26 Dec 2017 22:23:39 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34867/magic-blast-a-tool-for-mapping-large-next-generation-rna-or-dna-sequencing-runs-against-a-whole-genome-or-transcriptome</link>
	<title><![CDATA[Magic-BLAST: a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome.]]></title>
	<description><![CDATA[<p>Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. Each alignment optimizes a composite score, taking into account simultaneously the two reads of a pair, and in case of RNA-seq, locating the candidate introns and adding up the score of all exons. This is very different from other versions of BLAST, where each exon is scored as a separate hit and read-pairing is ignored.</p>
<p>Magic-BLAST incorporates within the NCBI BLAST code framework ideas developed in the NCBI Magic pipeline, in particular hit extensions by local walk and jump&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/26109056">(http://www.ncbi.nlm.nih.gov/pubmed/26109056)</a>, and recursive clipping of mismatches near the edges of the reads, which avoids accumulating artefactual mismatches near splice sites and is needed to distinguish short indels from substitutions near the edges.</p><p>Address of the bookmark: <a href="https://ncbi.github.io/magicblast/" rel="nofollow">https://ncbi.github.io/magicblast/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41158/carefully-opt-for-human-reference-genome</guid>
	<pubDate>Tue, 18 Feb 2020 07:43:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41158/carefully-opt-for-human-reference-genome</link>
	<title><![CDATA[Carefully opt for human reference genome]]></title>
	<description><![CDATA[<p><a href="http://lh3.github.io/2017/11/13/which-human-reference-genome-to-use" target="_blank">Heng Li posted several issues with the human reference genomes given in these resources</a> and suggests the following compressed FASTA file to be used as hg38/GRCh38 human reference genome.</p>
<p>if you map reads to GRCh38 or hg38, use the following:</p>
<div>
<div>
<pre><code>ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.15_GRCh38/seqs_for_alignment_pipelines.ucsc_ids/GCA_000001405.15_GRCh38_no_alt_analysis_set.fna.gz
</code></pre>
</div>
</div>
<p>There are several other versions of GRCh37/GRCh38. What&rsquo;s wrong with them? Here are a collection of potential issues:</p>
<p>More at http://lh3.github.io/2017/11/13/which-human-reference-genome-to-use</p><p>Address of the bookmark: <a href="http://lh3.github.io/2017/11/13/which-human-reference-genome-to-use" rel="nofollow">http://lh3.github.io/2017/11/13/which-human-reference-genome-to-use</a></p>]]></description>
	<dc:creator>biogeek</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36533/mecat-fast-mapping-error-correction-and-de-novo-assembly-for-single-molecule-sequencing-reads</guid>
	<pubDate>Fri, 11 May 2018 05:07:45 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36533/mecat-fast-mapping-error-correction-and-de-novo-assembly-for-single-molecule-sequencing-reads</link>
	<title><![CDATA[MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads]]></title>
	<description><![CDATA[<p>MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. MECAT employs novel alignment and error correction algorithms that are much more efficient than the state of art of aligners and error correction tools. MECAT can be used for effectively de novo assemblying large genomes. For example, on a 32-thread computer with 2.0 GHz CPU , MECAT takes 9.5 days to assemble a human genome based on 54x SMRT data, which is 40 times faster than the current&nbsp;<a href="http://cbcb.umd.edu/software/pbcr/mhap/">PBcR-Mhap pipeline</a>. MECAT performance were compared with&nbsp;<a href="http://cbcb.umd.edu/software/pbcr/mhap/">PBcR-Mhap pipeline</a>,&nbsp;<a href="https://github.com/PacificBiosciences/falcon">FALCON</a>&nbsp;and&nbsp;<a href="http://canu.readthedocs.io/en/latest/">Canu(v1.3)</a>&nbsp;in five real datasets. The quality of assembled contigs produced by MECAT is the same or better than that of the&nbsp;<a href="http://cbcb.umd.edu/software/pbcr/mhap/">PBcR-Mhap pipeline</a>&nbsp;and&nbsp;<a href="https://github.com/PacificBiosciences/falcon">FALCON</a>.&nbsp;</p>
<p>https://www.nature.com/articles/nmeth.4432</p><p>Address of the bookmark: <a href="https://github.com/xiaochuanle/MECAT" rel="nofollow">https://github.com/xiaochuanle/MECAT</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36918/p-rna-scaffolder-a-fast-and-accurate-genome-scaffolder-using-paired-end-rna-sequencing-reads</guid>
	<pubDate>Tue, 12 Jun 2018 08:14:41 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36918/p-rna-scaffolder-a-fast-and-accurate-genome-scaffolder-using-paired-end-rna-sequencing-reads</link>
	<title><![CDATA[P_RNA_scaffolder: a fast and accurate genome scaffolder using paired-end RNA-sequencing reads]]></title>
	<description><![CDATA[P_RNA_scaffolder, a fast and accurate tool using paired-end RNA-sequencing reads to scaffold genomes. This tool aims to improve the completeness of both protein-coding and non-coding genes. After this tool was applied to scaffolding human contigs, the structures of both protein-coding genes and circular RNAs were almost completely recovered and equivalent to those in a complete genome, especially for long proteins and long circular RNAs.<p>Address of the bookmark: <a href="http://www.fishbrowser.org/software/P_RNA_scaffolder/" rel="nofollow">http://www.fishbrowser.org/software/P_RNA_scaffolder/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/file/view/36952/getoptspl-file</guid>
	<pubDate>Fri, 15 Jun 2018 04:43:03 -0500</pubDate>
	<link>https://bioinformaticsonline.com/file/view/36952/getoptspl-file</link>
	<title><![CDATA[getopts.pl file]]></title>
	<description><![CDATA[
<p>SSPACE_longread complain for getopts.pl file. </p>

<p>To resolve this, download and have in SSPACED-Longreads folder. </p>

<p>Cheers :)</p>
]]></description>
	<dc:creator>Jit</dc:creator>
	<enclosure url="https://bioinformaticsonline.com/file/download/36952" length="942" type="text/plain" />
</item>

</channel>
</rss>