What are the bottlenecks for index construction?

Have to parse and build posting entries one document at a time.

Sort postings by term, followed by doc.

Too slow because we are sorting 100M records.

We cannot store any term, until we parse the last document.