TREC 2002 Video Track Runs (cont.)
ID                Priority  Affiliation
CMU.r1            1         Carnegie Mellon Univ.
CLIPS-LIT-GEOD    1         CLIPS_IMAG Lab
DCUFE2002         1         Dublin City Univ.
Eurecom1          1         Eurecom
Fudan_FE_Sys1     1         Fudan University
IBM-1             1         IBM Research
MSRA              1         Microsoft Research Asia
TZI_univ_bremen   1         Univ. of Bremen
UMD1              1         Univ. of Maryland
UnivO_MT1         1         Univ. of Oulu
If the feature is true for some frame (sequence) within the shot, then
it is true for the shot.
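The shot-level rule above amounts to a logical OR over per-frame decisions. A minimal sketch follows, assuming a detector that has already produced one boolean decision per frame; the function name and the sample data are hypothetical and not part of the track definition.

```python
from typing import Iterable


def shot_has_feature(frame_decisions: Iterable[bool]) -> bool:
    """Return True if the feature holds for at least one frame of the shot."""
    # Logical OR over frames: a single positive frame makes the shot positive.
    return any(frame_decisions)


# Hypothetical per-frame decisions for three shots (illustrative data only).
shots = {
    "shot_1": [False, False, True, False],  # one positive frame
    "shot_2": [False, False, False],        # no positive frames
    "shot_3": [],                           # no frames examined
}

shot_labels = {
    shot_id: shot_has_feature(frames) for shot_id, frames in shots.items()
}
```

Under this reading, a shot with no examined frames is labeled false, since there is no frame for which the feature is true.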
Outdoors: Segment contains a recognizably outdoor
location, i.e., one outside of buildings. Should exclude all scenes that
are indoors or are close-ups of objects (even if the objects are outdoor).
Indoors: Segment contains a recognizably indoor location,
i.e., inside a building. Should exclude all scenes that are outdoors or
are close-ups of objects (even if the objects are indoor).
Face: Segment contains at least one human face with
the nose, mouth, and both eyes visible. Pictures of a face meeting the
above conditions count.
People: Segment contains a group of two or more humans,
each of whom is at least partially visible and is recognizable as a human.
Cityscape: Segment contains a recognizably city/urban/suburban
Landscape: Segment contains a predominantly natural
inland setting, i.e., one with little or no evidence of development by
humans. For example, scenes consisting mostly of plowed/planted fields,
pastures, orchards would be excluded. Some buildings, if small features
on the overall landscape, should be OK. Scenes with bodies of water that
are clearly inland may be included.
Text Overlay: Segment contains superimposed text large
enough to be read.
Speech: A human voice uttering words is recognizable
as such in this segment.
Instrumental Sound: Sound produced by one or more
musical instruments is recognizable as such in this segment. Included are
Monologue: Segment contains an event in which a single
person is at least partially visible and speaks for a long time without
interruption by another speaker. Short pauses are OK.