<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:atom="http://www.w3.org/2005/Atom" >
<channel>
	<title><![CDATA[BOL: How to remove duplicates reads Ids ?]]></title>
	<link>https://bioinformaticsonline.com/answers/view/32425/how-to-remove-duplicates-reads-ids?</link>
	<atom:link href="https://bioinformaticsonline.com/answers/view/32425/how-to-remove-duplicates-reads-ids?" rel="self" type="application/rss+xml" />
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">https://bioinformaticsonline.com/answers/view/32425/how-to-remove-duplicates-reads-ids</guid>
	<pubDate>Wed, 03 May 2017 08:41:11 -0500</pubDate>
	<link>https://bioinformaticsonline.com/answers/view/32425/how-to-remove-duplicates-reads-ids</link>
	<title><![CDATA[How to remove duplicates reads Ids ?]]></title>
	<description><![CDATA[<div><div>I mapped reads with</div></div><div><div><pre><code>bwa mem -M -t 40 allCombinedFinalSet.fa Seq.R1.fastq Seq.R2.fastq &gt; aln.sam
</code></pre><p>Extracted the mapped reads</p><pre><code>samtools view -f 0x2 -b aln.bam &gt; output.bam
</code></pre><p>Extracted the fastq</p><pre><code>bamToFastq -i output.bam -fq R1.fq -fq2 R2.fq 

grep @HISEQ578:1035:HJ2KCBCXX:1:1104:14672:39678/1 R1.fq             []
@HISEQ578:1035:HJ2KCBCXX:1:1104:14672:39678/1
@HISEQ578:1035:HJ2KCBCXX:1:1104:14672:39678/1
@HISEQ578:1035:HJ2KCBCXX:1:1104:14672:39678/1
</code></pre><p>I notice it has duplicated ....</p><p>I think this because read was mapped twice (i.e. BWAmem).</p><p>I tried fastuniq but it does not remove the duplicated reads.</p><p>Can you please help me to remove duplicated reads from fastq files.</p></div></div>]]></description>
	<dc:creator>Abhimanyu Singh</dc:creator>
</item>

</channel>
</rss>