Why is the Jaccard coefficient insufficient for ranking?
It cannot represent term frequency:
Document A: Caesar Caesar
Document B: Caesar
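Document A uses the term more often than Document B, but Jaccard compares sets of terms, so both documents score identically for any query.
It cannot represent document frequency (how rare a term is across the collection):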
Query: The emperor
Document A: emperor
Document B: the
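A should rank above B because "the" is common and carries little information, but Jaccard gives both documents the same score.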
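
Both limitations can be checked with a minimal sketch. The `jaccard` helper, the whitespace tokenization, the lowercasing, and the query "Caesar" used for the first example are assumptions made for illustration; the notes do not specify them.

```python
def jaccard(query: str, document: str) -> float:
    """Jaccard coefficient |A ∩ B| / |A ∪ B| over lowercased word sets."""
    a = set(query.lower().split())
    b = set(document.lower().split())
    if not a and not b:
        return 0.0  # convention chosen here for two empty inputs
    return len(a & b) / len(a | b)


# Term frequency: the repeated "Caesar" collapses into a single set element.
# The query "Caesar" is assumed; the notes give no query for this example.
tf_a = jaccard("Caesar", "Caesar Caesar")
tf_b = jaccard("Caesar", "Caesar")

# Document frequency: matching the rare "emperor" counts no more
# than matching the common "the".
df_a = jaccard("The emperor", "emperor")
df_b = jaccard("The emperor", "the")

print(tf_a, tf_b)  # 1.0 1.0
print(df_a, df_b)  # 0.5 0.5
```

In both cases the two documents tie, so Jaccard alone gives no basis for ranking A above B.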