<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: GraphMap - A highly sensitive and accurate mapper for long, error-prone reads]]></title>
	<link>https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads?</link>
	<atom:link href="https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads</guid>
	<pubDate>Mon, 06 Jul 2015 08:46:53 -0500</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads</link>
	<title><![CDATA[GraphMap - A highly sensitive and accurate mapper for long, error-prone reads]]></title>
	<description><![CDATA[<p>GraphMap is a novel mapper targeted at aligning long, error-prone third-generation sequencing data.<br>It is&nbsp;<strong>designed to handle Oxford Nanopore MinION 1d and 2d reads</strong>&nbsp;with very high sensitivity and accuracy, and also presents a significant improvement over the state-of-the-art for PacBio read mappers.</p>
<p>GraphMap was also designed for ease-of-use: the&nbsp;<strong>default parameters</strong>&nbsp;can handle a wide range of read lengths and error profiles, including:&nbsp;<em>Illumina</em>,&nbsp;<em>PacBio</em>&nbsp;and&nbsp;<em>Oxford Nanopore</em>.<br>This is an especially important feature for technologies where the error rates and error profiles can vary widely across, or even within, sequencing runs.</p>
<p><a href="http://biorxiv.org/content/early/2015/06/10/020719">http://biorxiv.org/content/early/2015/06/10/020719</a></p><p>Address of the bookmark: <a href="https://github.com/isovic/graphmap" rel="nofollow">https://github.com/isovic/graphmap</a></p>]]></description>
	<dc:creator>Rahul Agarwal</dc:creator>
</item>
<item>
	<guid isPermaLink='true'>https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads#item-annotation-3227</guid>
	<pubDate>Thu, 25 Jan 2018 05:30:04 -0600</pubDate>
	<link>https://bioinformaticsonline.com/bookmarks/view/23167/graphmap-a-highly-sensitive-and-accurate-mapper-for-long-error-prone-reads#item-annotation-3227</link>
	<title><![CDATA[Comment by Jit]]></title>
	<description><![CDATA[<p>urbe@urbo214b[genome] graphmap align -h []<br>GraphMap - A very accurate and sensitive long-read, high error-rate sequence mapper<br>GraphMap Version: v0.5.2<br>Build date: Jun 6 2017 at 15:45:57</p>
<p>GraphMap (c) by Ivan Sovic, Mile Sikic and Niranjan Nagarajan<br>GraphMap is licensed under The MIT License.</p>
<p>Affiliations: Ivan Sovic (1, 3), Mile Sikic (2), Niranjan Nagarajan (3)<br> (1) Ruder Boskovic Institute, Zagreb, Croatia<br> (2) University of Zagreb, Faculty of Electrical Engineering and Computing<br> (3) Genome Institute of Singapore, A*STAR, Singapore</p>
<p><br>Usage:<br> graphmap [options] -r &lt;reference_file&gt; -d &lt;reads_file&gt; -o &lt;output_sam_path&gt;</p>
<p><br>Usage:<br> graphmap [options]</p>
<p>Options<br> Input/Output options:<br> -r, --ref STR Path to the reference sequence (fastq or fasta).<br> -i, --index STR Path to the index of the reference sequence. If not specified, index is generated in<br> the same folder as the reference file, with .gmidx extension. For non-parsimonious<br> mode, secondary index .gmidxsec is also generated.<br> -d, --reads STR Path to the reads file.<br> -o, --out STR Path to the output file that will be generated.<br> --gtf STR Path to a General Transfer Format file. If specified, a transcriptome will be built<br> from the reference sequence and used for mapping. Output SAM alignments will be in<br> genome space (not transcriptome).<br> -K, --in-fmt STR Format in which to input reads. Options are:<br> auto - Determines the format automatically from file extension.<br> fastq - Loads FASTQ or FASTA files.<br> fasta - Loads FASTQ or FASTA files.<br> gfa - Graphical Fragment Assembly format.<br> sam - Sequence Alignment/Mapping format. [auto]<br> -L, --out-fmt STR Format in which to output results. Options are:<br> sam - Standard SAM output (in normal and '-w overlap' modes).<br> m5 - BLASR M5 format. [sam]<br> -I, --index-only - Build only the index from the given reference and exit. If not specified, index will<br> automatically be built if it does not exist, or loaded from file otherwise. [false]<br> --rebuild-index - Always rebuild index even if it already exists in given path. [false]<br> --auto-rebuild-index - Rebuild index only if an existing index is of an older version or corrupt. [false]<br> -u, --ordered - SAM alignments will be output after the processing has finished, in the order of<br> input reads. [false]<br> -B, --batch-mb INT Reads will be loaded in batches of the size specified in megabytes. Value &lt;= 0 loads<br> the entire file. [1024]</p>
<p>General-purpose pre-set options:<br> -x, --preset STR Pre-set parameters to increase sensitivity for different sequencing technologies.<br> Valid options are:<br> illumina - Equivalent to: '-a gotoh -w normal -M 5 -X 4 -G 8 -E 6'<br> overlap - Equivalent to: '-a anchor -w normal --overlapper --evalue 1e0<br> --ambiguity 0.50 --secondary'<br> sensitive - Equivalent to: '--freq-percentile 1.0 --minimizer-window 1'</p>
<p>Alignment options:<br> -a, --alg STR Specifies which algorithm should be used for alignment. Options are:<br> sg - Myers' bit-vector approach. Semiglobal. Edit dist. alignment.<br> sggotoh - Gotoh alignment with affine gaps. Semiglobal.<br> anchor - anchored alignment with end-to-end extension.<br> Uses Myers' global alignment to align between anchors.<br> anchorgotoh - anchored alignment with Gotoh.<br> Uses Gotoh global alignment to align between anchors. [anchor]<br> -w, --approach STR Additional alignment approaches. Changes the way alignment algorithm is applied.<br> Options are:<br> normal - Normal alignment of reads to the reference.<br> (Currently no other options are provided. This is a placeholder for future features,<br> such as cDNA mapping) [normal]<br> --overlapper - Perform overlapping instead of mapping. Skips self-hits if reads and reference files<br> contain same sequences, and outputs lenient secondary alignments. [false]<br> --no-self-hits - Similar to overlapper, but skips mapping of sequences with same headers. Same<br> sequences can be located on different paths, and their overlap still skipped. [false]<br> -M, --match INT Match score for the DP alignment. Ignored for Myers alignment. [5]<br> -X, --mismatch INT Mismatch penalty for the DP alignment. Ignored for Myers alignment. [4]<br> -G, --gapopen INT Gap open penalty for the DP alignment. Ignored for Myers alignment. [8]<br> -E, --gapext INT Gap extend penalty for the DP alignment. Ignored for Myers alignment. [6]<br> -z, --evalue FLT Threshold for E-value. If E-value &gt; FLT, read will be called unmapped. If FLT &lt; 0.0,<br> thredhold not applied. [1e0]<br> -c, --mapq INT Threshold for mapping quality. If mapq &lt; INT, read will be called unmapped. [1]<br> --extcigar - Use the extended CIGAR format for output alignments. [false]<br> --no-end2end - Disables extending of the alignments to the ends of the read. Works only for<br> anchored modes. [false]<br> --max-error FLT If an alignment has error rate (X+I+D) larger than this, it won't be taken into<br> account. If &gt;= 1.0, this filter is disabled. [1.0]<br> --max-indel-error FLT If an alignment has indel error rate (I+D) larger than this, it won't be taken into<br> account. If &gt;= 1.0, this filter is disabled. [1.0]</p>
<p>Algorithmic options:<br> -k INT Graph construction kmer size. [6]<br> -l INT Number of edges per vertex. [9]<br> -A, --minbases INT Minimum number of match bases in an anchor. [12]<br> -e, --error-rate FLT Approximate error rate of the input read sequences. [0.45]<br> -g, --max-regions INT If the final number of regions exceeds this amount, the read will be called<br> unmapped. If 0, value will be dynamically determined. If &lt; 0, no limit is set. [0]<br> -q, --reg-reduce INT Attempt to heuristically reduce the number of regions if it exceeds this amount.<br> Value &lt;= 0 disables reduction but only if param -g is not 0. If -g is 0, the value of<br> this parameter is set to 1/5 of maximum number of regions. [0]<br> -C, --circular - Reference sequence is a circular genome. [false]<br> -F, --ambiguity FLT All mapping positions within the given fraction of the top score will be counted for<br> ambiguity (mapping quality). Value of 0.0 counts only identical mappings. [0.02]<br> -Z, --secondary - If specified, all (secondary) alignments within (-F FLT) will be output to a file.<br> Otherwise, only one alignment will be output. [false]<br> -P, --double-index - If false, only one gapped spaced index will be used in region selection. If true,<br> two such indexes (with different shapes) will be used (2x memory-hungry but more<br> powerful for very high error rates). [false]<br> --min-bin-perc FLT Consider only bins with counts above FLT * max_bin, where max_bin is the count of<br> the top scoring bin. [0.75]<br> --bin-step FLT After a chunk of bins with values above FLT * max_bin is processed, check if there<br> is one extremely dominant region, and stop the search. [0.25]<br> --min-read-len INT If a read is shorter than this, it will be marked as unmapped. This value can be<br> lowered if the reads are known to be accurate. [80]<br> --minimizer-window INT Length of the window to select a minimizer from. If equal to 1, minimizers will be<br> turned off. [5]<br> --freq-percentile FLT Filer the (1.0 - value) percent of most frequent seeds in the lookup process. [0.99]<br> --fly-index - Index will be constructed on the fly, without storing it to disk. If it already<br> exists on disk, it will be loaded unless --rebuild-index is specified. [false]</p>
<p>Other options:<br> -t, --threads INT Number of threads to use. If '-1', number of threads will be equal to min(24, num_cores/2). [-1]<br> -v, --verbose INT Verbose level. If equal to 0 nothing except strict output will be placed on stdout. [5]<br> -s, --start INT Ordinal number of the read from which to start processing data. [0]<br> -n, --numreads INT Number of reads to process per batch. Value of '-1' processes all reads. [-1]<br> -h, --help - View this help. [false]</p>
<p>Debug options:<br> -y, --debug-read INT ID of the read to give the detailed verbose output. [-1]<br> -Y, --debug-qname STR QNAME of the read to give the detailed verbose output. Has precedence over -y. Use<br> quotes to specify.<br> -b, --verbose-sam INT Helpful debug comments can be placed in SAM output lines (at the end). Comments can<br> be turned off by setting this parameter to 0. Different values increase/decrease<br> verbosity level.<br> 0 - verbose off<br> 1 - server mode, command line will be omitted to obfuscate paths.<br> 2 - umm this one was skipped by accident. The same as 0.<br> &gt;=3 - detailed verbose is added for each alignment, including timing measurements and<br> other.<br> 4 - qnames and rnames will not be trimmed to the first space.<br> 5 - QVs will be omitted (if available). [0]</p>]]></description>
	<dc:creator>Jit</dc:creator>
</item>

</channel>
</rss>