Activity timeline
December 12, 2008
Matt added LaTeX Graphics Companion, The to his library.
Matt added Foundations of Statistical Natural Language Processing to his library.
October 27, 2008
I love that lemmatization is presented as a far superior technique to savage tools such as stemming though lemmatization doesn't actually appear to help any over stemming or doing nothing at all.
October 22, 2008
Early on it's a little difficult to determine the differences between core concepts and implementation details. For example if I were to create a simple inverted index in Python, I would use sets not lists and sets would do all of the containment work faster than a list-based algorithm of my own design would.
















