From a Broad Institute site:
"N50 is a statistical measure of average length of a set of sequences. It is used widely in genomics, especially in reference to contig or supercontig lengths within a draft assembly.
Given a set of sequences of...
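The quoted definition is cut off above, so here is a minimal sketch of one commonly used N50 convention: sort the lengths from longest to shortest and report the length at which the running total first reaches half of all bases. This may differ in detail from the definition on the quoted site, and the function name `n50` and the example lengths are made up for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Convention assumed here: the largest length L such that sequences
    of length >= L together contain at least half of all bases.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths, for illustration only.
contig_lengths = [80, 70, 50, 40, 30, 20]
print(n50(contig_lengths))  # 70: 80 + 70 = 150 reaches half of 290
```

Note that N50 is a length-weighted statistic, so a few long contigs can raise it considerably even when most contigs are short.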
Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules:...
GA4GH Data Working Group
Led by David Haussler (UCSC) and Richard Durbin (Sanger Institute), the Data Working Group (DWG) of the Global Alliance brings together the leading Genome Institutes and Centers with IT industry leaders to create global...
To see the read group information for a BAM file, use the following command.
samtools view -H sample.bam | grep '@RG'
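If the read group fields are needed programmatically, the header text printed by the command above can be parsed directly. The following is a rough sketch that assumes the standard tab-separated `TAG:value` layout of SAM header lines; the function name `parse_read_groups` and the example header values are hypothetical.

```python
def parse_read_groups(header_text):
    """Collect the TAG:value fields of every @RG line in a SAM header."""
    read_groups = []
    for line in header_text.splitlines():
        if not line.startswith("@RG\t"):
            continue
        fields = {}
        for field in line.split("\t")[1:]:
            # Split on the first colon only, since values may contain colons.
            tag, _, value = field.partition(":")
            fields[tag] = value
        read_groups.append(fields)
    return read_groups


# Made-up header text, shaped like the output of `samtools view -H`.
example_header = (
    "@HD\tVN:1.6\tSO:coordinate\n"
    "@RG\tID:rg1\tSM:sample1\tPL:ILLUMINA\tLB:lib1\n"
)
for read_group in parse_read_groups(example_header):
    print(read_group["ID"], read_group.get("SM"))
```

A BAM file with no `@RG` lines yields an empty list, which is usually the case that calls for AddOrReplaceReadGroups.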
Options of AddOrReplaceReadGroups (option, description):
INPUT (String): Input file (BAM or SAM or a GA4GH url). Required.
OUTPUT (File)...
I have downloaded an SRA file and converted it into paired-end FASTQ files with the following headers:
@HWUSI-EAS754_0001:4:1:5605:1034#GCCAT
The head and tail of the files are as follows:
Head:
==> ERR042057_1.fastq...
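The header quoted in this question looks like the older Illumina layout (before CASAVA 1.8), `@instrument:lane:tile:x:y#index`, optionally followed by `/1` or `/2` for the mate. Assuming that layout, a small sketch for splitting such a header into its parts could look like this; the names `HEADER_RE` and `parse_fastq_header` are made up for illustration.

```python
import re

# Assumed layout: @instrument:lane:tile:x:y#index/pair
# The index and pair parts are treated as optional.
HEADER_RE = re.compile(
    r"^@(?P<instrument>[^:]+):(?P<lane>\d+):(?P<tile>\d+)"
    r":(?P<x>\d+):(?P<y>\d+)"
    r"(?:#(?P<index>[^/\s]+))?(?:/(?P<pair>[12]))?$"
)


def parse_fastq_header(header):
    """Return the header fields as a dict, or None if the layout differs."""
    match = HEADER_RE.match(header.strip())
    return match.groupdict() if match else None


fields = parse_fastq_header("@HWUSI-EAS754_0001:4:1:5605:1034#GCCAT")
print(fields["instrument"], fields["lane"], fields["index"], fields["pair"])
```

For the header above, `pair` comes back as `None` because there is no `/1` or `/2` suffix, which matters for tools that rely on that suffix to match mates.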