Judging Answer Strings
Formed pools of answer strings
- 169-207 strings per pool; mean of 191.6
- 28-93 docs per pool; mean of 55.3
- very little overlap in strings across runs
Assessors given 2-hour training session
- reviewed purpose of task
- given guidelines for how to judge
- judged 4 training questions with concocted answer sets