You should be able to define and explain the following terms and
phrases. You may wish to write in the definitions, either by
annotating this document in Mosaic or by working with KMS, for
ease of reference.
- Boolean IR system
- clustering
- controlled language indexing
- digital tree
- distance function
- document (surrogate)
- E measure
- exhaustivity
- faceted classification
- filtering
- finite (state) automata
- flat file
- hashing
- indexing
- indexing language
- inverse document frequency (idf)
- inverted file
- natural language text-search system
- Patricia and PAT trees
- precision
- query
- ranking operation
- recall
- regular expression
- relevance judgments
- search tree
- signature file
- similarity measure
- specificity
- stemming (suffix stripping)
- stop list
- term broadening
- term discrimination model
- term frequency (tf)
- term narrowing
- term weighting
- thesaurus
- trie
- truncation
- volatility