Thanks for developing geneSCF, I tried to install it but got the following error
bio@bio214b[geneSCF-master-source-v1.1-p2] ./prepare_database []
prepare_database USAGE:
./prepare_database...
A tetra-nucleotide is a fragment of DNA sequence with 4 bases (e.g. AGTC or TTGG). Pride et al. (2003) showed that the frequency of tetra-nucleotides in bacterial genomes contain useful, albeit weak, phylogenetic signals. Even though...
Jit commented on an answer to a question 2811 days ago
You can follow following steps to get rid of duplicates:
a. Extract all the reads Ids for indivisual pair and make it uniq.
b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
Jit commented on an answer to a question 2811 days ago
You can follow following steps to get rid of duplicates:
a. Extract all the reads Ids for indivisual pair and make it uniq.
b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
You can follow following steps to get rid of duplicates:
a. Extract all the reads Ids for indivisual pair and make it uniq.
b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
FASTQ format is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores. Both the sequence letter and quality score are each encoded with a...
Ok !
fastuniq accept the name of fastq as a listfile. Create a listfile.txt and write both of your PE file name there, and call it with -i listfile.txt