The enormous size of the loblolly pine genome having 22 billion base pairs compared to only 3 billion in the human genome. In other words, it is seven times larger than a human&...
...iption, very fast. Available as part of the open source Carrot2 framework
k-means: base line clustering algorithm, produces bag-of-words style cluster descriptions. A...