Heaps’ Law

\(M = kT^b\)

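Heaps’ law estimates the vocabulary size \(M\) (the number of distinct terms) from the collection’s size \(T\) (the total number of tokens). The parameters \(k\) and \(b\) are constants fitted empirically to the collection.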
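As a rough sketch of how the formula is applied, the function below evaluates \(M = kT^b\) directly. The function name `heaps_estimate` and the parameter values `k = 44` and `b = 0.49` are assumptions chosen for illustration only; they do not come from this note, and in practice both parameters would be fitted to the collection at hand.

```python
def heaps_estimate(T, k, b):
    """Estimate vocabulary size M from collection size T via M = k * T**b."""
    if T < 0:
        raise ValueError("collection size T must be non-negative")
    return k * T ** b


# Illustrative parameters (assumed, not taken from the note).
K_EXAMPLE = 44
B_EXAMPLE = 0.49

# Estimated number of distinct terms in a collection of one million tokens.
estimate = heaps_estimate(1_000_000, K_EXAMPLE, B_EXAMPLE)
print(round(estimate))
```

Because \(b\) is less than 1 in this sketch, the estimate grows more slowly than the collection itself: doubling `T` increases the estimate by a factor of \(2^b\), not 2.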

For instance, if we had the following text:

Hello Jason,

Hello World!

This text contains \(T = 4\) tokens, and we have a vocabulary size of \(M = 2\) (if our vocabulary excludes names):

Hello, World
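The count in this example can be reproduced with a short sketch. The tokenization rule (runs of letters), the helper name `vocabulary`, and the explicit set of excluded names are all assumptions made for illustration; a real system would likely use a proper tokenizer and a more principled way of identifying names.

```python
import re


def vocabulary(text, excluded=frozenset()):
    """Return (tokens, vocab): all tokens in order, and the set of
    distinct tokens that are not in the excluded set."""
    # Assumed tokenization: every maximal run of ASCII letters is one token.
    tokens = re.findall(r"[A-Za-z]+", text)
    vocab = {t for t in tokens if t not in excluded}
    return tokens, vocab


text = "Hello Jason,\n\nHello World!"
tokens, vocab = vocabulary(text, excluded={"Jason"})

print(len(tokens))    # collection size T
print(sorted(vocab))  # the distinct terms; its length is M
```

Here `len(tokens)` plays the role of \(T\) and `len(vocab)` the role of \(M\); repeating this count over growing prefixes of a larger collection gives the data points to which \(k\) and \(b\) would be fitted.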