BOL: Question: Is it right to report genome size as the contigs file size?

Marysia
2888 days ago

Tags: Genome, Size, Estimate

I usually estimate and report genome size by looking at the file size of the contigs file. Is this the right approach?

Answers

Well, that is the wrong way to estimate genome size. The file contains extra characters that do not represent nucleotides in the genome, such as FASTA header lines and newline characters.

Instead, you can estimate it by counting the length of each sequence in the FASTA file and summing the lengths.
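As a rough illustration of the counting approach, here is a minimal Python sketch. It assumes a plain, uncompressed FASTA file in which header lines start with ">". The function name `fasta_total_length` and the example records are made up for this illustration and are not from any particular library.

```python
def fasta_total_length(lines):
    """Sum the lengths of all sequence lines, skipping headers and blank lines."""
    total = 0
    for line in lines:
        line = line.strip()  # drop the newline and surrounding whitespace
        if not line or line.startswith(">"):
            continue  # header or empty line: not part of the sequence
        total += len(line)
    return total


# Hypothetical two-contig example, given as the lines of a FASTA file.
example = [
    ">contig_1 hypothetical header\n",
    "ACGTACGT\n",
    "ACG\n",
    ">contig_2\n",
    "GGCC\n",
]

total_bases = fasta_total_length(example)
file_chars = sum(len(line) for line in example)  # what the file size would count

print("bases:", total_bases)
print("characters in file:", file_chars)
```

For a real file, the same function can be given an open file handle, for example `fasta_total_length(open("contigs.fasta"))`, where `contigs.fasta` is a placeholder name. The character count comes out larger than the base count because it includes the headers and newlines.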

Jit 2888 days ago

I agree with Jit, but you also need to remember ... when working with large FASTA files, it is common to talk in terms of megabytes or gigabytes rather than millions or billions of bytes. This is another complication, since 1 megabyte = 1024 kilobytes = 1024 * 1024 bytes, whereas 1 megabase = 1000 kilobases = 1000 * 1000 base pairs.
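To make the unit mismatch concrete, here is a small Python sketch using the conversion factors stated above. The helper names and the count of 5,000,000 are hypothetical and chosen only for illustration.

```python
BYTES_PER_MEGABYTE = 1024 * 1024  # binary convention: 1 megabyte = 1024 * 1024 bytes
BASES_PER_MEGABASE = 1000 * 1000  # 1 megabase = 1000 * 1000 base pairs


def bytes_to_megabytes(n_bytes):
    """Convert a byte count to megabytes under the binary convention."""
    return n_bytes / BYTES_PER_MEGABYTE


def bases_to_megabases(n_bases):
    """Convert a base-pair count to megabases."""
    return n_bases / BASES_PER_MEGABASE


count = 5_000_000  # hypothetical: the same number read as bytes or as bases

print("as megabytes:", bytes_to_megabytes(count))
print("as megabases:", bases_to_megabases(count))
```

The same count of 5,000,000 comes out as about 4.77 megabytes but exactly 5 megabases, so a file size in megabytes would understate the length in megabases even if the file held nothing but nucleotides.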

Abhimanyu Singh 2887 days ago
