QA List Evaluation
Each list judged as a unit
instances marked right/unsupported/wrong
subset of right & unsupported instances marked distinct
Accuracy used as evaluation metric
Accuracy = # distinct instances / target # of instances
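As a rough illustration of the metric, the sketch below scores one list as the number of distinct instances divided by the target number of instances. The `Instance` record, its field names, and the `list_accuracy` function are hypothetical names chosen for this example, not part of any official scoring tool. It assumes that only instances judged right or unsupported can count as distinct.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    """One judged item in a list response (hypothetical representation)."""
    label: str              # "right", "unsupported", or "wrong"
    distinct: bool = False  # set by the assessor for right/unsupported items


def list_accuracy(instances, target):
    """Return (# distinct instances) / (target # of instances) for one list."""
    if target <= 0:
        raise ValueError("target number of instances must be positive")
    # Assumption: a distinct flag on a "wrong" instance is ignored.
    n_distinct = sum(
        1
        for inst in instances
        if inst.label in ("right", "unsupported") and inst.distinct
    )
    return n_distinct / target


# Example list: two distinct instances against a target of five.
example = [
    Instance("right", distinct=True),
    Instance("right", distinct=False),       # duplicate of an earlier answer
    Instance("unsupported", distinct=True),
    Instance("wrong"),
]
print(list_accuracy(example, target=5))
```

Because the whole list is judged as a unit, duplicates and wrong instances add nothing to the numerator in this sketch; the score for a list depends only on how many distinct instances it contains relative to the target.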