<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: MashMap: a fast and approximate software for mapping long reads (PacBio/ONT) or assembly to reference genome(s)]]></title>
	<link>https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes?</link>
	<atom:link href="https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes</guid>
	<pubDate>Tue, 12 Dec 2017 17:23:31 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes</link>
	<title><![CDATA[MashMap: a fast and approximate software for mapping long reads (PacBio/ONT) or assembly to reference genome(s)]]></title>
	<description><![CDATA[<p><span>MashMap is a fast and approximate software for mapping long reads (PacBio/ONT) or assembly to reference genome(s). It maps a query sequence against a reference region if and only if its estimated alignment identity is above a specified threshold. It does not compute the alignments explicitly, but rather estimates a&nbsp;</span><em>k</em><span>-mer based&nbsp;</span><a href="https://en.wikipedia.org/wiki/Jaccard_index">Jaccard similarity</a><span>&nbsp;using a combination of&nbsp;</span><a href="http://www.cs.princeton.edu/courses/archive/spr05/cos598E/bib/p76-schleimer.pdf">Winnowing</a><span>&nbsp;and&nbsp;</span><a href="https://en.wikipedia.org/wiki/MinHash">MinHash</a><span>. This is then converted to an estimate of sequence identity using the&nbsp;</span><a href="http://mash.readthedocs.org/">Mash</a><span>&nbsp;distance. An appropriate&nbsp;</span><em>k</em><span>-mer sampling rate is automatically determined given minimum local alignment length and identity thresholds. The efficiency of the algorithm improves as both of these thresholds are increased.</span></p><p>Address of the bookmark: <a href="https://github.com/marbl/MashMap" rel="nofollow">https://github.com/marbl/MashMap</a></p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>
<item>
	<guid isPermaLink='true'>https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes#item-annotation-3432</guid>
	<pubDate>Fri, 13 Jul 2018 17:21:42 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/34618/mashmap-a-fast-and-approximate-software-for-mapping-long-reads-pacbioont-or-assembly-to-reference-genomes#item-annotation-3432</link>
	<title><![CDATA[Comment by Jit]]></title>
	<description><![CDATA[<p>➜ mashmap-Linux64-v2.0 git:(master) ✗ ./mashmap -h<br>-----------------<br>Mashmap is an approximate long read or contig mapper based on Jaccard<br>similarity<br>-----------------<br>Example usage:<br>$ mashmap -r ref.fa -q seq.fq [OPTIONS]<br>$ mashmap --rl reference_files_list.txt -q seq.fq [OPTIONS]</p>
<p>Available options<br>-----------------<br>-h, --help<br> Print this help page</p>
<p>-r &lt;value&gt;, --ref &lt;value&gt;<br> an input reference file (fasta/fastq)[.gz]</p>
<p>--refList &lt;value&gt;, --rl &lt;value&gt;<br> a file containing list of reference files, one per line</p>
<p>-q &lt;value&gt;, --query &lt;value&gt;<br> an input query file (fasta/fastq)[.gz]</p>
<p>--ql &lt;value&gt;, --queryList &lt;value&gt;<br> a file containing list of query files, one per line</p>
<p>-s &lt;value&gt;, --segLength &lt;value&gt;<br> mapping segment length [default : 5,000]<br> sequences shorter than segment length will be ignored</p>
<p>--noSplit<br> disable splitting of input sequences during mapping [enabled by default]</p>
<p>--perc_identity &lt;value&gt;, --pi &lt;value&gt;<br> threshold for identity [default : 85]</p>
<p>-t &lt;value&gt;, --threads &lt;value&gt;<br> count of threads for parallel execution [default : 1]</p>
<p>-o &lt;value&gt;, --output &lt;value&gt;<br> output file name [default : mashmap.out]</p>
<p>-k &lt;value&gt;, --kmer &lt;value&gt;<br> kmer size &lt;= 16 [default : 16]</p>
<p>-f &lt;value&gt;, --filter_mode &lt;value&gt;<br> filter modes in mashmap: 'map', 'one-to-one' or 'none' [default: map]<br> 'map' computes best mappings for each query sequence<br> 'one-to-one' computes best mappings for query as well as reference sequence<br> 'none' disables filtering</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>