X BOL wishing you a very and Happy New year

Alternative content

Our Sponsors



Download BioinformaticsOnline(BOL) Apps in your chrome browser.




Question: Question: Is this right to say genome size with their contigs file size?

Marysia
2803 days ago

Question: Is this right to say genome size with their contigs file size?

I usually report and estimate the genome size by looking into file size of the contigs. Is this right way? 

Answers
1

Well, that is wrong way to estimate the genome size. There are extra characters in the file that do not represent nucleotides in the genome. Such as fasta header, newline char etc. 

You can estimate it by counting fasta size of all sequences and sum them up. 

1

I agree with Jit, but you also need to remember ... when working with large Fasta files, it’s common to discuss things in terms of megabytes or gigabytes, not millions or billions of bytes. This is another complication, since 1 megabyte = 1024 kilobytes = 1024 * 1024 bytes, whereas 1 megabase = 1000 kilobases = 1000 * 1000 base pairs.