BOL: All Site Activity

All Site Activity

- Jit@jit.aber
Jit bookmarked Tetra-Nucleotide Analysis 3329 days ago

A tetra-nucleotide is a fragment of DNA sequence with 4 bases (e.g. AGTC or TTGG). Pride et al. (2003) showed that the frequency of tetra-nucleotides in bacterial genomes contain useful, albeit weak, phylogenetic signals. Even though...

https://chunlab.wordpress.com/tetra-nucleotide-analysis/
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh answered the question How to remove duplicates reads Ids ? 3329 days ago

Thanks everyone, it is done :) bio@bio214b[bio] fastq-stats out.R1.fq []reads 39376969len 251len mean 251.0000len stdev 0.0000len min 251phred 33window-size 2000000cycle-max 35dups 1609486%dup 4.0874unique-dup seq 23609min dup count 2dup seq 1...
- Sanjay@sanjaydeshpande
Sanjay is now a friend with Jitendra Narayan 3329 days ago
Sanjay@sanjaydeshpande
Jitendra Narayan@admin
- Jitendra Narayan@admin
Jitendra Narayan is now a friend with Sanjay 3329 days ago
Jitendra Narayan@admin
Sanjay@sanjaydeshpande
- Neel@neelam
Neel answered the question How to remove duplicates reads Ids ? 3330 days ago

I recomment reformat.sh dedupe.sh from BBmap suits (https://sourceforge.net/projects/bbmap/)
- Jit@jit.aber
Jit commented on an answer to a question 3330 days ago

You can follow following steps to get rid of duplicates: a. Extract all the reads Ids for indivisual pair and make it uniq. b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
- Jit@jit.aber
Jit commented on an answer to a question 3330 days ago

You can follow following steps to get rid of duplicates: a. Extract all the reads Ids for indivisual pair and make it uniq. b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
- Jit@jit.aber
Jit answered the question How to remove duplicates reads Ids ? 3330 days ago

You can follow following steps to get rid of duplicates: a. Extract all the reads Ids for indivisual pair and make it uniq. b. Use uniq Ids to extract the original reads from fastq files (Seq.R1.fastq/Seq.R2.fastq in your case).
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh asked How to remove duplicates reads Ids ? 3330 days ago

I mapped reads with bwa mem -M -t 40 allCombinedFinalSet.fa Seq.R1.fastq Seq.R2.fastq > aln.sam Extracted the mapped reads samtools view -f 0x2 -b aln.bam > output.bam Extracted the fastq bamToFastq -i output.bam -fq R1.fq -fq2...
- Jit@jit.aber
Jit bookmarked Fastq format 3330 days ago

FASTQ format is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores. Both the sequence letter and quality score are each encoded with a...

https://en.wikipedia.org/wiki/FASTQ_format
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh posted to the wire 3330 days ago

Show before(B) and after(A) the matching region $ grep -B 3 -A 2 foo README.txt #Linux #Tricks #View #Grep
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh is now a friend with Jit 3330 days ago
Abhimanyu Singh@abhimanyu
Jit@jit.aber
- Jit@jit.aber
Jit is now a friend with Abhimanyu Singh 3330 days ago
Jit@jit.aber
Abhimanyu Singh@abhimanyu
- Jit@jit.aber
Jit voted on the poll How long have you been a bioinformatics scientist for? 3330 days ago
- Jit@jit.aber
Jit posted to the wire 3331 days ago

Perl OneLiner http://userweb.eng.gla.ac.uk/umer.ijaz/bioinformatics/oneliners.html #Perl #Oneliner #NGS #Script
- Jit@jit.aber
Jit posted to the wire 3331 days ago

NGS onliner http://userweb.eng.gla.ac.uk/umer.ijaz/bioinformatics/subsetFASTAFASTAQ.html #OneLiner #NGS
- Jit@jit.aber
Jit answered the question How to run fastuniq? 3331 days ago

Ok ! fastuniq accept the name of fastq as a listfile. Create a listfile.txt and write both of your PE file name there, and call it with -i listfile.txt
- Bulbul@bulbul
Bulbul answered the question How to run fastuniq? 3331 days ago

Thanks but it does not accept both fastq files
- Abhimanyu Singh@abhimanyu
Abhimanyu Singh bookmarked Mapping NGS 3331 days ago

NGS data are just a bunch of sequences, you have no idea which region in the genome each sequences comes from, which gene it represents...To know that you have to align the sequences to the reference sequence. The reference sequence is in most cases...

http://wiki.bits.vib.be/index.php/Mapping_of_NGS_data
- Jit@jit.aber
Jit answered the question How to run fastuniq? 3331 days ago

Did you followed fastuniq help $ fastuniq-i : The input file list of paired FSATQ sequence files [FILE IN]Maximum 1000 pairs This parameter is used to specify a list of paired sequence files inFASTQ format as input, in which two adjacent files...

BOL

Our Sponsors

All Site Activity