What are the bottlenecks for index construction?
We have to parse the collection and build posting entries one document at a time.
Then we sort the postings by term, and by document ID within each term.
The sort is too slow because it has to order 100M posting records.
We cannot finalize the postings list for any term until we have parsed the last document, because any later document may still contain that term.
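
The steps above (parse one document at a time, sort the postings by term and then by document, group them into lists) can be sketched in a few lines of Python. This is a minimal in-memory illustration, not a reference implementation: the function name `build_index`, the `(doc_id, text)` input shape, and the lowercase whitespace tokenization are all assumptions made for the example.

```python
from itertools import groupby


def build_index(docs):
    """Build a toy inverted index from an iterable of (doc_id, text) pairs.

    Hypothetical helper for illustration; tokenization is a plain
    lowercase whitespace split.
    """
    postings = []
    for doc_id, text in docs:                    # parse one document at a time
        for term in set(text.lower().split()):   # one posting per term per doc
            postings.append((term, doc_id))

    # Tuples compare element by element, so this sorts by term first
    # and by doc ID within each term. On a large collection this is
    # the expensive step.
    postings.sort()

    # A postings list can only be completed here, after every
    # document has been parsed.
    index = {}
    for term, group in groupby(postings, key=lambda p: p[0]):
        index[term] = [doc_id for _, doc_id in group]
    return index


docs = [(1, "new home sales"), (2, "home sales rise"), (3, "new sales")]
index = build_index(docs)
```

Note that the whole `postings` list is held in memory until the final grouping step, which is exactly why this approach strains on a collection with 100M postings.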