Study Questions

You should be able to answer each of the following questions.

  1. Why should we seriously consider stopword removal as part of lexical analysis, rather than as a later step using hashing?
  2. What are the main stages of the lexical analysis and stopword removal routine that make up its pre-processing and lookup phases?
  3. In general, what happens during each phase of stemming with Porter's algorithm?
  4. What is the empirical evidence regarding the effect of stemming on space, recall, and precision, as compared to other related schemes?


fox@cs.vt.edu
Thu Oct 27 02:57:58 EDT 1994