Definition Questions
Represented about 25% of test set
Examples: What is an atom? What is epilepsy? What are invertebrates?
Are also heavily represented in filtered logs
Are hard for systems to answer and for assessors to judge
- NIST to control mix in future test sets?
- although these are real user questions, there are better ways of finding definitions than searching a large corpus