Introduction
- queries
- documents, document surrogates
- operations
Domain Analysis
- conceptual model: (ext) Boolean, vector, prob., string search
- file structures (inverted, PAT tree)
- query operations (parse, Boolean, feedback)
- term operations (stem, stoplist, weight)
- document operations (index, cluster, rank, display
- hardware (magnetic/optical disk, parallel)
IR vs. Other Systems
- Types: AI, DBMS
- Data Objects: logical statement, relation
- Primary Operation: inference, deterministic retrieval
- Size: small, VLDB
- IR: document, prob. retrieval, VL
Evaluation
- collections of documents and queries
- relevance judgments
- recall, precision
- E measure