Unknown Story Boundary Condition
- Retrieval using continuous speech stream
- systems process entire broadcasts for ASR and retrieval with no provided segmentation
- systems output a single time marker for each relevant excerpt to indicate topical passages
- this task does NOT attempt to determine topic boundaries
- time-based scoring:
- map to a story ID (“dummy” ID for retrieved non-stories and duplicates)
- score as usual using TREC_EVAL
- penalizes for duplicate retrieved stories
- story-based scoring somewhat artificial but expedient