<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: Related items]]></title>
	<link>https://bioinformaticsonline.com/related/37257?offset=210</link>
	<atom:link href="https://bioinformaticsonline.com/related/37257?offset=210" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/30236/pyscaf</guid>
	<pubDate>Mon, 19 Dec 2016 14:20:33 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/30236/pyscaf</link>
	<title><![CDATA[pyScaf]]></title>
	<description><![CDATA[<p>pyScaf orders contigs from genome assemblies utilising several types of information:</p>
<ul>
<li>paired-end (PE) and/or mate-pair libraries (<a href="https://github.com/lpryszcz/pyScaf#ngs-based-scaffolding">NGS-based mode</a>)</li>
<li>long reads (<a href="https://github.com/lpryszcz/pyScaf#scaffolding-based-on-long-reads">NGS-based mode</a>)</li>
<li>synteny to the genome of some related species (<a href="https://github.com/lpryszcz/pyScaf#reference-based-scaffolding">reference-based mode</a>)</li>
</ul>
<p>Scaffolding&nbsp;</p>
<p>In reference-based mode, pyScaf uses synteny to the genome of closely related species in order to order contigs and estimate distances between adjacent contigs.</p>
<p>Contigs are aligned globally (end-to-end) onto reference chromosomes, ignoring:</p>
<ul>
<li>matches not satisfying cut-offs (<code>--identity</code>&nbsp;and&nbsp;<code>--overlap</code>)</li>
<li>suboptimal matches (only best match of each query to reference is kept)</li>
<li>and removing overlapping matches on reference.</li>
</ul>
<p>In preliminary tests, pyScaf performed superbly on simulated heterozygous genomes based on&nbsp;<em>C. parapsilosis</em>&nbsp;(13 Mb; CANPA) and&nbsp;<em>A. thaliana</em>&nbsp;(119 Mb; ARATH) chromosomes, reconstructing correctly all chromosomes always for CANPA and nearly always for ARATH (<a href="https://www.dropbox.com/sh/bb7lwggo40xrwtc/AAAZ7pByVQQQ-WhUXZVeJaZVa/pyScaf?dl=0">Figures in dropbox</a>,&nbsp;<a href="https://docs.google.com/spreadsheets/d/1InBExy-qKDLj-upd8tlPItVSKc4mLepZjZxB31ii9OY/edit#gid=2036953672">CANPA table</a>,&nbsp;<a href="https://docs.google.com/spreadsheets/d/1InBExy-qKDLj-upd8tlPItVSKc4mLepZjZxB31ii9OY/edit#gid=1920757821">ARATH table</a>).<br>Runs took ~0.5 min for CANPA on&nbsp;<code>4 CPUs</code>&nbsp;and ~2 min for ARATH on&nbsp;<code>16 CPUs</code>.</p>
<p><span>Important remarks:</span></p>
<ul>
<li>Reduce your assembly before (fasta2homozygous.py) as any redundancy will likely break the synteny.</li>
<li>pyScaf works better with contigs than scaffolds, as scaffolds are often affected by mis-assemblies (no&nbsp;<em>de novo assembler</em>&nbsp;/ scaffolder is perfect...), which breaks synteny.</li>
<li>pyScaf works very well if divergence between reference genome and assembled contigs is below 20% at nucleotide level.</li>
<li>pyScaf deals with large rearrangements ie. deletions, insertion, inversions, translocations.&nbsp;<span>Note however, this is experimental implementation!</span></li>
<li>Consider closing gaps after scaffolding.</li>
</ul><p>Address of the bookmark: <a href="https://github.com/lpryszcz/pyScaf" rel="nofollow">https://github.com/lpryszcz/pyScaf</a></p>]]></description>
	<dc:creator>Bulbul</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/30625/pandaseq</guid>
	<pubDate>Mon, 23 Jan 2017 04:54:32 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/30625/pandaseq</link>
	<title><![CDATA[PANDASEQ]]></title>
	<description><![CDATA[<p>PANDASEQ assembles paired-end Illumina reads into sequences, trying to correct for errors and uncalled bases. The assembler reads two files in FASTQ format with quality information. If amplification primers were used (e.g., to isolate a variable region of the 16S gene, or the constant regions around zinc finger binding residues), they can be removed from the sequence during assembly. The final sequence will correct any uncalled bases in the overlapping region using the complementary strand. When mismatches occur in the overlapping region, the base with the better quality score is chosen.<br>The algorithm is as follows:<br><br>1.Find the positions where the forward and reverse primers match best above the threshold and discard the ends of the sequence, including the primer.<br>2.Pick and overlap to maximise the probability of the forward and reverse reads having come from a single piece of DNA.<br>3.Identify the masking of the end of the read with the quality score B or # as done by CASAVA and adjust the probabilities in this region.<br>4.Construct an assembled sequence between the primers and calculate the quality.<br>5.Check for various constraints, including quality, length, uncalled bases, and user-supplied modules.</p>
<p>http://neufeldserver.uwaterloo.ca/~apmasell/pandaseq_man1.html</p><p>Address of the bookmark: <a href="http://neufeldserver.uwaterloo.ca/~apmasell/pandaseq_man1.html" rel="nofollow">http://neufeldserver.uwaterloo.ca/~apmasell/pandaseq_man1.html</a></p>]]></description>
	<dc:creator>Shruti Paniwala</dc:creator>
</item>

<item>
  <guid isPermaLink='true'>https://bioinformaticsonline.com/researchlabs/view/32713/salzberg-lab</guid>
  <pubDate>Mon, 15 May 2017 05:14:01 -0500</pubDate>
  <link></link>
  <title><![CDATA[Salzberg lab]]></title>
  <description><![CDATA[
<p>We are a computational biology lab that develops novel methods for analysis of DNA and RNA sequences. Our research includes software for aligning and assembling RNA-seq data, whole-genome assembly, and microbiome analysis. We work closely with biomedical scientists to apply these methods to current problems arising in a broad spectrum of biological and medical research areas. We’re also part of the Center for Computational Biology, a group of 20+ faculty members and their labs at Johns Hopkins working on computational, statistical, and mathematical methods that can turn massive genomic data sets into biologically and clinically useful information.</p>

<p>https://salzberg-lab.org/</p>
]]></description>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/36594/fragscaff-genome-assembly-with-contiguity-preserving-transposition</guid>
	<pubDate>Mon, 14 May 2018 04:28:14 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/36594/fragscaff-genome-assembly-with-contiguity-preserving-transposition</link>
	<title><![CDATA[fragScaff: Genome Assembly with Contiguity Preserving Transposition]]></title>
	<description><![CDATA[<p>Contiguity preserving transposition and sequencing (CPT-seq) is an entirely in vitro means of generating libraries comprised of 9216 indexed pools, each of which contains thousands of sparsely sequenced long fragments ranging from 5 kilobases to &gt;1 megabase. This software, fragScaff, leverages coincidences between the content of different pools as a source of contiguity information for scaffolding de novo genome assemblies. FragScaff is complementary to Lachesis, providing midrange contiguity to support robust, accurate chromosome-scale de novo genome assemblies without the need for laborious in vivo cloning steps.</p>
<p>Further information about fragScaff, including source code, is available at:<a href="https://sourceforge.net/projects/fragscaff/files/">https://sourceforge.net/projects/fragscaff/files</a>.</p>
<p>Manuscript describing fragScaff was published as: Adey A, Kitzman JO, Burton JN, Daza R, Kumar A, Christiansen L, Ronaghi M, Amini S, L Gunderson K, Steemers FJ, Shendure J#.&nbsp;<em>In vitro, long-range sequence information for de novo genome assembly via transposase contiguity.</em>&nbsp;Genome Research 2014 Dec;24(12):2041-9. doi:&nbsp;<a href="http://dx.doi.org/10.1101/gr.178319.114">10.1101/gr.178319.114</a>. PubMed PMID:&nbsp;<a href="http://www.ncbi.nlm.nih.gov/pubmed/25327137">25327137</a>.</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="https://sourceforge.net/projects/fragscaff/files/" rel="nofollow">https://sourceforge.net/projects/fragscaff/files/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/news/view/37905/phased-human-genome-assembly</guid>
	<pubDate>Mon, 08 Oct 2018 09:10:54 -0500</pubDate>
	<link>https://bioinformaticsonline.com/news/view/37905/phased-human-genome-assembly</link>
	<title><![CDATA[Phased Human Genome Assembly !]]></title>
	<description><![CDATA[<p>The new publicly available assembly (PacBio&nbsp;<a href="https://www.globenewswire.com/Tracker?data=IM2cKfZgtHafORdb9VSstujBjyW-aIzFILCtXNAkcY_yqVmxdjvG01R_FZQC7zLxs-alqquXwsW6MG98G9-g-ym8Nue2pmUZMtkIg3FIat2mYbJ-z2Ra367GlinbO13x" target="_blank" title=""><span style="text-decoration: underline;">HG00733</span></a>) has the fewest gaps of any human genome assembly, with more than half of the genome contained in gapless sequence at least 27 Mb long. The primary contig assembly is 2.89 Gb long and consists of 865 contigs that were assembled with PacBio data generated with the company&rsquo;s Sequel<span>&reg;</span>&nbsp;System. Using the&nbsp;<a href="https://www.globenewswire.com/Tracker?data=jOa6mE1Y5r8VbU1CaCgx1A0HsoVzJ7waxOiDKgvmKL6cwJq_eH4nWrGj2vLkNpxHl1-5CH4htDB4113PXT8WU60hvHQ-KKpvAwQwveEGvz3N4d0q7QHSa_X97LW8_9xEiYqfsc4d24ca-IpVYZsf7Ue-XL7fSIIZw_EHK-F96t1aaQNRcD-z1PP5qvlZbVwX" target="_blank" title=""><span style="text-decoration: underline;">FALCON-Unzip assembler</span></a>, maternal and paternal haplotypes were resolved over more than 80% of the genome. Maternal and paternal haplotype blocks were then further phased using Hi-C technology and the&nbsp;<a href="https://www.globenewswire.com/Tracker?data=jOa6mE1Y5r8VbU1CaCgx1IrQmRcKvNQm83FLTqQE6OGzutM-fEggnm4Z-nsniK0D_YmDKS_UKWE0NHtHbgvbL973Y2-9NhrWhYKizXQ4lpiTvlqPf1UZdjqVs7BDjISgDnovv8foYw8es8jQzAg5Xfq1CH36NOnWQgA_X04XSvyEEEj0q801Im6cV5M5K4eL15vb_ZgUayccOvDY_fc6lxxPAAAyA4h16-zUN44Y81KdujciCrJrv5xynMIXEjRsaIKCf6eCX_Q1j_uZlN5TD0MVr6HulTYG8lGgyL0x-eQ=" target="_blank" title=""><span style="text-decoration: underline;">FALCON-Phase method</span></a>developed in collaboration with Phase Genomics. The genome was then&nbsp;<em>de novo</em>&nbsp;scaffolded using Phase Genomics&rsquo;&nbsp;<a href="https://www.globenewswire.com/Tracker?data=4wcqEWHJpCHRJARQkC0oVkYT9htT14iVebujxcW1nMpAjmigHGQ46ObCGetRfyaZm1ADIHaV1-30B9izTAhjJ-efhFlxorUxs08kdV-9AAzQyuHJ9S7wxnRRnyegsTZd" target="_blank" title=""><span style="text-decoration: underline;">Proximo Hi-C platform</span></a>, resulting in the first chromosome-scale diploid assembly of a single individual accomplished with only two technologies. More specific details about the assembly are included on the PacBio blog.</p><p>The data are available using NCBI accession IDs: BioProject: (<a href="https://www.globenewswire.com/Tracker?data=YZtCuhY2wu5H0yIso9jtUufPXbwyHh1QOZ1jBggGpK5NtXaU_JGC9X39F3uHZ96uVmu6hW5OB2Qq805hUEW2OhSNCm630yFiEF6_nsAwYB0=" target="_blank" title=""><span style="text-decoration: underline;">PRJNA483067</span></a>), assembly: [<a href="https://www.globenewswire.com/Tracker?data=CEXZ7E56JOsRgfH4Wq3r5LVbv4QH_UIekV9idYBys9l8K7pFft824jmYWNzJqK7lQ9fMbaAtbURpm8gM7zqUbpPUrydFwrkJGGtG-NBHctjyjddiFY-p06xZPm2mHXE2" target="_blank" title=""><span style="text-decoration: underline;">RBJD00000000</span></a>] and sequence data (<a href="https://www.globenewswire.com/Tracker?data=pELP2RpqTqTRaPF9yN1N7GZYlQmTxpY0aW-B8xaNw6iyD-Lylw7X3UzMDK3YS4AIYgLtD13em2XsbzOwKhXuNbI4Ks6-LSyXl1_yVdFoB0U=" target="_blank" title=""><span style="text-decoration: underline;">SRP155659</span></a>).</p><p><span>Additional Resources</span></p><ul>
<li><a href="http://globenewswire.com/Tracker?data=zXpdadphSgIAIEWeq46yRPm5-TU0H7wTkL48ue4I9GsaHd5mJyMb9PgXgAsElREkLOCOdWdJ8uW9DHB-LyQ7xhzbd97Qis6CuAlqD0ubGgY%3D" target="_blank" title=""><span style="text-decoration: underline;">Interactive map</span></a>&nbsp;showcasing global initiatives underway to generate reference-quality human genome assemblies for diverse populations</li>
<li><a href="http://globenewswire.com/Tracker?data=EQ8NIaaa8k1Nw1MPRJYIHYrqgsDy92kU8W0siJdGQhq5IJ0dcb890PFFm-C1SrAlFf0xkxUVRxZefFK5ebhoIzmS-6OjR1G9sTxOkCOwRHCAZWmHL-e7uGSuZYcw1VsDp8AeDWO0RwcepMMB6hAoR6BBCJDiJVVZtdFlWBn2uxs%3D" target="_blank" title=""><span style="text-decoration: underline;">BioReport Podcast</span></a>&nbsp;on the value of ethnic-specific reference genomes</li>
<li><em>Nature Reviews Genetics</em>&nbsp;paper from NHGRI:&nbsp;<a href="http://globenewswire.com/Tracker?data=dffu-wPD_JX1_KVeCA6VFy-kP1tlAUbn7d85saXD59dnnJfT2BE3N_Rbm6kT4BvifA_XEs49ioa75cy4HyFi90RA_LRa2QFF6Y4mr-dcoMucljZw0K4JNDZuwWkWPE51cVC2Lqq3E3C1aZ8un6Bq3i-OO_NiVH0hh23hUw4wC84%3D" target="_blank" title=""><span style="text-decoration: underline;">Prioritizing&nbsp;diversity&nbsp;in human genomics research</span></a></li>
<li>Article in&nbsp;<em>The Journal of Precision Medicine</em>: &ldquo;<a href="http://globenewswire.com/Tracker?data=yokLqO2TCBLCdj6uZl-GYbqcGMWBerBYjSPrLMumNrWF2p5XlXq9yl5p-1b5xx3Ckfn5ZjQWkdhxLttbiNae5gccUCP-9RWPUqvTu9MuU9zgJ1c8e14lAladCuEOiVZ2oVRiqssPtLu9hgQWw4ad5EUxZemevsHE4BHC6IiFmMZ6DS6ApwZu-IonFgCFBIcjWOpitQthDASosfaqkMi9LsKgLU9F0WGVJDDOzHXpddhjfCUdEEJ7xC1p8uh9TSiCZgZV6XPlUJSe8n0C_9TtOw%3D%3D" target="_blank" title=""><span style="text-decoration: underline;">Minority Report &ndash; Ethnic Diversity and the Real Promise for Precision Medicine</span></a>&rdquo;</li>
<li>Article&nbsp;in&nbsp;<em>Bio-IT World</em>: &ldquo;<a href="http://globenewswire.com/Tracker?data=rLp1pKetctTPitNEnRjOVDZ3Cvw3FUdL6_ybXncvhjR4ksOrX3y6HUK8WtLlKHT7XZzq_woUjZ-uw20YNvsP0GZAmy5lVqETt27oBLi02wFtTH_6ubELIHtBu8vfVyKnqKp-YhosFG5K7y0RUtzmNjOAlCYPAeVXabn2a2AiSePxUXA_tSy_g79hjYm63x9dPN9oFQGYedOsyHD_ls8DKw%3D%3D" target="_blank" title=""><span style="text-decoration: underline;">Genomic Data Standards Are a Necessity</span></a>&rdquo;</li>
<li>NHGRI Project Award:&nbsp;<a href="http://globenewswire.com/Tracker?data=FbqTEeRffJ88lFryYX6MiOefXvIXFdZDAyW4nrFoYNHaJyMEYIcb7I4BIcEQmxzsKOjrlf9F8irfRJeJLOqG8KFsl-kvkhakUkg3BfYdKGnpLzKYyWbUFR0aKMeEXirHBi7oDLEUSDO45qxANwxyee-pqZXfzAIwF1Wcuaf7EIzNqRqmBUJ3TyNyI05lwAo9gDKmApMnJo5VxPj5P_6rY8lisuv1PNSAh_kJPOuhVBk%3D" target="_blank" title=""><span style="text-decoration: underline;">High Quality Human and Non-Human Primate Genome Assemblies</span></a></li>
</ul><p>More details are available on the PacBio website:</p><ul>
<li>Blog post:&nbsp;<a href="http://globenewswire.com/Tracker?data=ycj-ujgsKzVyljNa11buVmIS5tk9B733VsFZEw77nBXo-IkBvcoG16dN9vuTiY3nm2G5dJZS5Iva3w_znrEtJVDuU8cVlFpozY2ibinKwrMGxkXZVSqW8_uD8fbySRjM5Q_cjuPU22ARFSSLCc9vHJx9WHnb9Rza-qPbuWgewa0rWWStq2fQY5mLpeaQf5fcDJnyQkvDAMI3fauXdzyThg%3D%3D" target="_blank" title=""><span style="text-decoration: underline;">Data Release: Highest-Quality, Most Contiguous Individual Human Genome Assembly to Date</span></a></li>
<li>Blog post:&nbsp;<a href="http://globenewswire.com/Tracker?data=GlZZ9nyp5mDSjJPPfhVD1-dZ_W2l8s0eAUox3TQs949zyGjzO7dx9xodyvyqerdqPC-G3ZhdPEs9xNhJwflrwgHPYQL3kTofprKHBBq3O4gn9E75YUBweJw9b6tTE89sMLUQzF-vRNNDjero3mibm_uG-fSHoYBTm2ZlyEmwzZ5E9tXVd5_RjG0Xnej2E0scA0SncEItAF6Q7vdOydTV_Yr9yYT2TmKY5jtyAt6ZrNGn3McqfV9mMRkR-8dYJLqrQln9JiEkWTwUae6Blj56HyjyXKl6Dfa_CyNuy4r-EWU%3D" target="_blank" title=""><span style="text-decoration: underline;">For Reference-Grade Human Genome Assemblies, SMRT Sequencing Yields Optimal Results</span></a></li>
<li>Webinar: &nbsp;<a href="http://globenewswire.com/Tracker?data=xlnfDwMNLGZZvtexJYsUgMe-DV8HNrYx2QqjwIjfj40dToVtqrBi-gvhknHZmIe8GV_3WU3_9LIlP6GzG3ZoajnDIpwECzdMV5Vyy8Ast4Y2AiHJckf7rBhZVEU4_mV4JB0k3I9XjN2jHK8Cp5uBxyIWWqPdI6qBBdCYYhYLXUTkKpaZEV98oCfC5ET2Q7OSwUM7NieKa75yzMHwaPEYwg%3D%3D" target="_blank" title=""><span style="text-decoration: underline;">Assembling High-Quality Human Reference Genomes for Global Populations</span></a></li>
<li>FALCON-Phase&nbsp;<a href="http://globenewswire.com/Tracker?data=4Z9LDdRq3w2zYFQXEFGmz6u-Vrbfh96syfzrQMKhegLRo2PUvk7s3Xz_y1o--NuTLoCQMrHsqOEBUHIL1IPeOmhyf6Eqwdp8dv8xYo9gSVI%3D" target="_blank" title=""><span style="text-decoration: underline;">press release</span></a>&nbsp;and article&nbsp;<a href="http://globenewswire.com/Tracker?data=4Z9LDdRq3w2zYFQXEFGmz9Ts_IJqHWWrKd33x_ldJEU9mSKXpcVTTi9ioY0kVqrbrXHeCKDf4TdPnAoPJaGBK3YeZtYp-nXZacgyPESZ1XboSUZEJ9rIhDyW7bTLL5HN" target="_blank" title=""><span style="text-decoration: underline;">preprint</span></a></li>
<li>PacBio research focus webpage about&nbsp;<a href="http://globenewswire.com/Tracker?data=E-zzUkw4N01KR4muPun47qg4HX8ToDvLS4sX953hLM2wRyQZ2upkLR4WidyXTFDRLWQORpqxnkbD-CNzsOJyIfH8mJPbrLwRf04J4yjuNdem-Fulc8QIT3OCi4wx5LpqgC2ymLE0rYX5UOpbFPBgvA%3D%3D" target="_blank" title=""><span style="text-decoration: underline;">Human Population Genetics</span></a></li>
</ul><p>&nbsp;Ref:&nbsp;https://stockguru.com/2018/10/08/pacific-biosciences-releases-highest-quality-most-contiguous-individual-human-genome-assembly-to-date/</p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38063/referee-genome-assembly-quality-scores</guid>
	<pubDate>Sun, 04 Nov 2018 16:44:30 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38063/referee-genome-assembly-quality-scores</link>
	<title><![CDATA[Referee: Genome assembly quality scores]]></title>
	<description><![CDATA[<p>Modern genome sequencing technologies provide a succint measure of quality at each position in every read, however all of this information is lost in the assembly process. Referee summarizes the quality information from the reads that map to a site in an assembled genome to calculate a quality score for each position in the genome assembly.</p>
<p>We accomplish this by first calculating genotype likelihoods for every site. For a given site in a diploid genome, there are 10 possible genotypes (AA, AC, AG, AT, CC, CG, CT, GG, GT, TT). Referee takes as input the genotype likelihoods calculated for all 10 genotypes given the called reference base at each position.</p>
<h3>Referee is a program to calculate a quality score for every position in a genome assembly. This allows for easy filtering of low quality sites for any downstream analysis.</h3>
<p>https://github.com/gwct/referee</p><p>Address of the bookmark: <a href="https://gwct.github.io/referee/#" rel="nofollow">https://gwct.github.io/referee/#</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/38441/genome-sequence-based-sub-species-delineation</guid>
	<pubDate>Wed, 12 Dec 2018 08:31:14 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/38441/genome-sequence-based-sub-species-delineation</link>
	<title><![CDATA[Genome sequence-based (sub-)species delineation.]]></title>
	<description><![CDATA[<p>The GGDC web service reports digital DDH for a universal and accurate delineation of prokaryotic (sub-)species without inheriting the pitfalls of classic DDH, and also calculates differences in genomic G+C content.</p>
<p>http://ggdc.dsmz.de/ggdc_background.php#</p>
<p><small>Genome-to-Genome Distance Calculator 2.1</small></p>
<p>http://ggdc.dsmz.de/ggdc.php</p>
<p>&nbsp;</p><p>Address of the bookmark: <a href="http://ggdc.dsmz.de/" rel="nofollow">http://ggdc.dsmz.de/</a></p>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/41604/synteny-and-rearrangement-identifier-syri</guid>
	<pubDate>Tue, 05 May 2020 10:37:10 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/41604/synteny-and-rearrangement-identifier-syri</link>
	<title><![CDATA[Synteny and Rearrangement Identifier (SyRI)]]></title>
	<description><![CDATA[<p>SyRI is a comprehensive tool for predicting genomic differences between related genomes using whole-genome assemblies (WGA). The assemblies are aligned using whole-genome alignment tools, and these alignments are then used as input to SyRI. SyRI identifies syntenic path (longest set of co-linear regions), structural rearrangements (inversions, translocations, and duplications), local variations (SNPs, indels, CNVs etc) within syntenic and structural rearrangements, and un-aligned regions.</p><p>Address of the bookmark: <a href="https://schneebergerlab.github.io/syri/" rel="nofollow">https://schneebergerlab.github.io/syri/</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/43725/comparative-genomics-workshops</guid>
	<pubDate>Tue, 25 Jan 2022 20:39:58 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/43725/comparative-genomics-workshops</link>
	<title><![CDATA[Comparative Genomics Workshops !]]></title>
	<description><![CDATA[<p><span>This meeting's objective was to obtain a big picture look at the current state of the field of comparative&nbsp;genomics with a focus on commonalities across genomic investigations into humans, model organisms&nbsp;(both traditional and non-traditional), agricultural species, wildlife species and microbes.</span></p>
<p>https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution</p><p>Address of the bookmark: <a href="https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution" rel="nofollow">https://www.genome.gov/event-calendar/perspectives-in-comparative-genomics-and-evolution</a></p>]]></description>
	<dc:creator>Rahul Nayak</dc:creator>
</item>
<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/blog/view/44770/nvidia-and-arc-institute-unveil-evo-2-a-breakthrough-ai-for-dna-design</guid>
	<pubDate>Fri, 21 Feb 2025 10:39:47 -0600</pubDate>
	<link>https://bioinformaticsonline.com/blog/view/44770/nvidia-and-arc-institute-unveil-evo-2-a-breakthrough-ai-for-dna-design</link>
	<title><![CDATA[NVIDIA and Arc Institute Unveil Evo 2: A Breakthrough AI for DNA Design]]></title>
	<description><![CDATA[<p>NVIDIA and the Arc Institute have introduced <strong style="font-size: 12.8px;">Evo 2</strong>, a groundbreaking AI model designed to <strong style="font-size: 12.8px;">understand, predict, and generate DNA sequences</strong>. This marks a major advancement in computational biology, offering scientists an unprecedented tool to decode the genetic blueprint of life and even design entirely new biological systems.</p><h3><strong>The Power of Evo 2: AI Meets DNA</strong></h3><p>Evo 2 is <strong>the largest AI model for biology ever created</strong>, trained on an astonishing <strong>9.3 trillion DNA "letters"</strong> (nucleotides) carefully selected from genomes spanning the entire tree of life. This massive dataset ensures that Evo 2 can recognize patterns and relationships in genetic sequences at an unparalleled scale.</p><p>For the first time, scientists can <strong>design DNA with AI</strong>, moving beyond simple sequence analysis to active DNA generation. Evo 2 enables researchers to <strong>predict, modify, and even create entire genetic sequences</strong>, opening new possibilities in medicine, agriculture, and synthetic biology.</p><h3><strong>Decoding the Dark Genome</strong></h3><p>One of the biggest challenges in genetics is understanding the <strong>non-coding regions</strong> of DNA&mdash;vast stretches of the genome that do not code for proteins but play crucial roles in regulating gene expression. These regions control when and how genes are activated, influencing everything from development to disease.</p><p>Evo 2 is designed to <strong>decode these non-coding elements</strong>, helping researchers uncover their functions and use this knowledge to develop gene-based therapies, synthetic life forms, and precision agriculture solutions.</p><h3><strong>From Reading DNA to Writing It</strong></h3><p>To put Evo 2&rsquo;s impact into perspective:</p><ul>
<li><strong>Previous AI models could "read" DNA</strong> like a book, analyzing genetic sequences and identifying patterns.</li>
<li><strong>Evo 2 can "write" entirely new DNA</strong>, designing functional genes, chromosomes, and even full genomes from scratch.</li>
</ul><p>This means scientists can now <strong>engineer biological systems with AI</strong>, designing new proteins, metabolic pathways, and genetic circuits to address real-world challenges.</p><h3><strong>A Step Toward Generative Biology</strong></h3><p>The Arc Institute describes Evo 2 as a major step toward <strong>"generative biology"</strong>&mdash;a revolutionary approach where AI is used to create <strong>novel biological structures</strong> rather than just analyzing existing ones. This could lead to breakthroughs such as:</p><ul>
<li><strong>New medicines</strong>: AI-generated enzymes and proteins tailored for targeted therapies.</li>
<li><strong>Disease-resistant crops</strong>: Genetically optimized plants for higher yield and climate resilience.</li>
<li><strong>Synthetic organisms</strong>: Custom-designed microbes for bioremediation, biofuel production, and industrial applications.</li>
</ul><h3><strong>An Open-Source Revolution</strong></h3><p>Unlike many proprietary AI models, <strong>Evo 2 is open source</strong>, making its capabilities accessible to researchers worldwide. This democratization of AI-driven biology means that scientists from different disciplines can <strong>collaborate, experiment, and innovate</strong>, accelerating discoveries in genetic engineering and synthetic biology.</p><p>With Evo 2, the boundaries of what&rsquo;s possible in <strong>DNA design, genetic engineering, and biological innovation</strong> are being redrawn. The future of life sciences is no longer just about understanding life&rsquo;s code&mdash;it&rsquo;s about writing it.</p>]]></description>
	<dc:creator>BioStar</dc:creator>
</item>

</channel>
</rss>