System Summary and Timing Organization Name: Rutgers Interactive Track List of Run ID's: ruint Construction of Indices, Knowledge Bases, and other Data Structures Methods Used to build Data Structures - Length (in words) of the stopword list: 419 - Controlled Vocabulary?: NO - Stemming Algorithm: STANDARD INQUERY 2.1P3 - Morphological Analysis: NO - Term Weighting: STANDARD INQUERY 2.1P3 - Phrase Discovery?: NO - Syntactic Parsing?: NO - Word Sense Disambiguation?: NO - Heuristic Associations (including short definition)?: NO - Spelling Checking (with manual correction)?: NO - Spelling Correction?: NO - Proper Noun Identification Algorithm?: NO - Tokenizer?: NO - Manually-Indexed Terms?: NO Statistics on Data Structures built from TREC Text - Inverted index - Run ID: ruint - Total Storage (in MB): 559.683 - Total Computer Time to Build (in hours): 3 - Automatic Process? (If not, number of manual hours): YES - Use of Term Positions?: YES - Only Single Terms Used?: YES - Clusters - N-grams, Suffix arrays, Signature Files - Knowledge Bases - Use of Manual Labor - Special Routing Structures - Other Data Structures built from TREC text Query construction Interactive Queries - Initial Query Built Automatically or Manually: MANUALLY - Type of Person doing Interaction - Domain Expert: NO - System Expert: NO - Average Time to do Complete Interaction - Clock Time from Initial Construction of Query to Completion of Final Query (in minutes): 19.11 - Average Number of Iterations: 7.9 - Average Number of Documents Examined per Iteration: 2.77, WHERE EXAMINED MEANS FULL TEXT OF DOCUMENT VIEWED - Minimum Number of Iterations: 2 - Maximum Number of Iterations: 21 - What Determines the End of an Iteration: INVOKE "RUN QUERY" - Methods used in Interaction - Automatic Term Reweighting from Relevant Documents?: YES - Automatic Query Expansion from Relevant Documents?: - Only Top X Terms Added (what is X): X= 2N+3, WHERE N=NUMBER OF RELEVANT DOCUMENTS - User Selected Terms Added: YES - Manual Methods - Using Individual Judgment (No Set Algorithm)?: YES Searching Machine Searching Methods - Probabilistic Model?: YES, INQUERY 2.1P3 Factors in Ranking - Term Frequency?: YES - Inverse Document Frequency?: YES Machine Information - Machine Type for TREC Experiment: SUN SPARCSTATION 5 - Was the Machine Dedicated or Shared: DEDICATED - Amount of Hard Disk Storage (in MB): 10000 - Amount of RAM (in MB): 64 - Clock Rate of CPU (in MHz): 110 System Comparisons - Amount of "Software Engineering" which went into the Development of the System: USED INQUERY 2.1P3 - Given appropriate resources - Features the System is Missing that would be beneficial: NEGATIVE RELEVANCE FEEDBACK