| Text REtrieval Conference (TREC) | 
| Organization Name: University of Waterloo (MultiText) | Run ID: uwmt6v1 | 
| Section 1.0 System Summary and Timing | 
|---|
| Section 1.1 System Information | 
| Hardware Model Used for TREC Experiment: Cyrix P200+; System Use: DEDICATED; Total Amount of Hard Disk Storage: 48 Gb; Total Amount of RAM: 256 MB; Clock Rate of CPU: 150 MHz | 
| Section 1.2 System Comparisons | 
| Amount of developmental "Software Engineering": NONE; List of features that are not present in the system, but would have been beneficial to have: (blank); List of features that are present in the system, and impacted its performance, but are not detailed within this form: (blank) | 
| Section 2.0 Construction of Indices, Knowledge Bases, and Other Data Structures | 
|---|
| Length of the stopword list: 0 words; Type of Stemming: NONE; Controlled Vocabulary: NO; Term weighting: NO; Phrase discovery: NO; Type of Spelling Correction: NONE; Manually-Indexed Terms: NO; Proper Noun Identification: NO; Syntactic Parsing: NO; Tokenizer: YES; Word Sense Disambiguation: NO; Other technique: NO; Additional comments: (blank) | 
| Section 3.0 Statistics on Data Structures Built from TREC Text | 
|---|
| Section 3.1 First Data Structure | 
| Structure Type: INVERTED INDEX; Type of other data structure used: (blank); Brief description of method using other data structure: (blank); Total storage used: 30.8 Gb; Total computer time to build: 4.48 hours; Automatic process: YES; Manual hours required: (blank); Type of manual labor: NONE; Term positions used: YES; Only single terms used: YES; Concepts (vs. single terms) represented: NO; Type of representation: (blank); Auxiliary files used: NO; Additional comments: (blank) | 
| Section 3.2 Second Data Structure | 
| Structure Type: OTHER DATA STRUCTURE; Type of other data structure used: text database; Brief description of method using other data structure: store document identifiers; Total storage used: 0.1 Gb; Total computer time to build: n/a; Automatic process: YES; Manual hours required: (blank); Type of manual labor: NONE; Term positions used: YES; Only single terms used: YES; Concepts (vs. single terms) represented: NO; Type of representation: (blank); Auxiliary files used: NO; Additional comments: The build time in Section 3.1 was measured while building both data structures in parallel. | 
| Section 3.3 Third Data Structure | 
| Structure Type: NONE; Type of other data structure used: (blank); Brief description of method using other data structure: (blank); Total storage used: (blank); Total computer time to build: (blank); Automatic process: (blank); Manual hours required: (blank); Type of manual labor: NONE; Term positions used: (blank); Only single terms used: (blank); Concepts (vs. single terms) represented: (blank); Type of representation: (blank); Auxiliary files used: (blank); Additional comments: (blank) | 
| Section 4.0 Data Built from Sources Other than the Input Text | 
|---|
| File type: NONE; Domain type: DOMAIN INDEPENDENT; Total Storage: (blank); Number of Concepts Represented: (blank); Type of representation: NONE; Automatic or Manual: (blank); Type of Manual Labor used: NONE; Additional comments: (blank) | 
| File is: NONE; Total Storage: (blank); Number of Concepts Represented: (blank); Type of representation: NONE; Additional comments: (blank) | 
| Section 5.0 Computer Searching | 
|---|
| Average computer time to search (per query): 1 CPU second | 
| Times broken down by component(s): (blank) | 
| Section 5.1 Searching Methods | 
| Vector space model: NO; Probabilistic model: NO; Cluster searching: NO; N-gram matching: NO; Boolean matching: YES; Fuzzy logic: NO; Free text scanning: NO; Neural networks: NO; Conceptual graphic matching: NO; Other: NO; Additional comments: Wall-clock average of 1.35 s per query. | 
| Section 5.2 Factors in Ranking | 
| Term frequency: NO; Inverse document frequency: NO; Other term weights: NO; Semantic closeness: NO; Position in document: YES; Syntactic clues: NO; Proximity of terms: YES; Information theoretic weights: NO; Document length: NO; Percentage of query terms which match: NO; N-gram frequency: NO; Word specificity: NO; Word sense frequency: NO; Cluster distance: NO; Other: YES; Additional comments: GCL queries with tiered cover density ranking. | 
| Disclaimer: Contents of this online document are not necessarily the official views of, nor endorsed by the U.S. Government, the Department of Commerce, or NIST. |
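
Illustrative note on Sections 3.1 and 5.2: the form records an inverted index with term positions, no stopword list, no stemming, and ranking by proximity through "tiered cover density ranking". The Python sketch below shows one way such a ranking could work and is not the MultiText implementation. Several things in it are assumptions made for illustration: GCL query expressions are replaced by plain conjunctions of terms, the function names and the cutoff `K` are hypothetical, and "tiered" is read as "documents matching an earlier, stricter query tier rank ahead of documents matching only a later tier".

```python
import re
from collections import defaultdict

K = 4  # hypothetical cutoff: covers no longer than K tokens get the full score of 1


def tokenize(text):
    """Lowercased alphanumeric runs; no stopword removal and no stemming."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_positions(text):
    """Map each term to the sorted list of its token positions in one document."""
    positions = defaultdict(list)
    for pos, term in enumerate(tokenize(text)):
        positions[term].append(pos)
    return positions


def find_covers(positions, terms):
    """Return the minimal intervals (p, q) that contain every term in `terms`."""
    terms = set(terms)
    events = sorted((pos, t) for t in terms for pos in positions.get(t, []))
    last_seen, covers, prev_start = {}, [], -1
    for q, term in events:
        last_seen[term] = q
        if len(last_seen) < len(terms):
            continue  # some term has not occurred yet
        p = min(last_seen.values())  # latest start that still spans all terms
        if p > prev_start:           # first, hence shortest, interval for this start
            covers.append((p, q))
            prev_start = p
    return covers


def cover_density(covers, k=K):
    """Short covers score 1 each; longer covers decay as k / length."""
    total = 0.0
    for p, q in covers:
        length = q - p + 1
        total += 1.0 if length <= k else k / length
    return total


def rank(docs, tiers):
    """Order doc ids by first matching tier, then by descending cover density."""
    scored = []
    for doc_id, text in docs.items():
        positions = build_positions(text)
        for tier_no, terms in enumerate(tiers):
            covers = find_covers(positions, terms)
            if covers:
                scored.append((tier_no, -cover_density(covers), doc_id))
                break  # a document is scored only in its first matching tier
    return [doc_id for _, _, doc_id in sorted(scored)]
```

A call such as `rank({"d1": "...", "d2": "..."}, [["cover", "density"], ["density"]])` would place documents containing both terms close together first, followed by documents that match only the looser second tier. Documents matching no tier are left out of the result, which mirrors the Boolean matching recorded in Section 5.1.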