How can we reduce the time space complexity of our index?
Use stop word / stop list, remove “the”, “a”, “and”, etc..
Use case folding
Lemmatizer
Stemming useful for spanish, german, finnish 30% gains for finnish
Use stop word / stop list, remove “the”, “a”, “and”, etc..
Use case folding
Lemmatizer
Stemming useful for spanish, german, finnish 30% gains for finnish