Textbook Chapter 8

In Chapter 8, there is a very good summary of research work on stemming. The main point is that a great deal of space can be saved, which makes it worthwhile. Some studies have found stemming to improve retrieval effectiveness, or to behave about the same as truncation. In addition, a good implementation of the Porter stemming algorithm is given, along with code. Note, however, that studying the code is not necessary - only the major phases of the algorithm and key data structures should be studied.


fox@cs.vt.edu
Thu Oct 27 02:57:58 EDT 1994