Filtering Track
- set evaluation
- scaled utility
T9U = 2R+ - N+
then scaled between [-100, 2*num-rels]
- F with ?=.5
- set precision, recall also reported
- routing runs produced ranked list so evaluated using mean average precision
- but note some topics have > 1000 relevant docs