System Summary and Timing Organization Name: Department of Computing Science, University of Glasgow List of Run ID's: GLAIR1 Construction of Indices, Knowledge Bases, and other Data Structures Methods Used to build Data Structures - Length (in words) of the stopword list: 320 - Controlled Vocabulary? : No - Stemming Algorithm: Porter - Morphological Analysis: None - Term Weighting: tf*idf - Phrase Discovery? : - Tokenizer? : Simple word boundary tokeniser Statistics on Data Structures built from TREC Text - Inverted index - Run ID : GLAIR1 - Total Storage (in MB): ~100 - Total Computer Time to Build (in hours): 30 - Automatic Process? (If not, number of manual hours): Yes - Use of Term Positions? : No - Only Single Terms Used? : Yes - Clusters - N-grams, Suffix arrays, Signature Files - Knowledge Bases - Use of Manual Labor - Special Routing Structures - Other Data Structures built from TREC text Query construction Automatically Built Queries (Ad-Hoc) - Topic Fields Used: description - Average Computer Time to Build Query (in cpu seconds): very short time - Method used in Query Construction - Tokenizer? Simple word boundary tokeniser: - Expansion of Queries using Previously-Constructed Data Structure? : Searching Search Times - Run ID : GLAIR1 - Computer Time to Search (Average per Query, in CPU seconds): <1 minute? Machine Searching Methods - Probabilistic Model? : Yes Factors in Ranking - Term Frequency? : Yes - Inverse Document Frequency? : Yes - Document Length? : Yes Machine Information - Machine Type for TREC Experiment: SPARC 10 - Was the Machine Dedicated or Shared: Shared - Amount of Hard Disk Storage (in MB): 9Gb - Amount of RAM (in MB): 32Mb - Clock Rate of CPU (in MHz): Don't know System Comparisons - Amount of "Software Engineering" which went into the Development of the System: 2 furlongs per fortnight