TY - GEN
T1 - Not everybody's special
T2 - 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2013
AU - Sadovnik, Amir
AU - Gallagher, Andrew
AU - Chen, Tsuhan
PY - 2013
Y1 - 2013
N2 - Referring expression generation is widely considered a basic building block of any natural language generation system. Generating these phrases, which can point out a single object from a group of objects, has been studied extensively in that community. However, to build systems which can discuss images in an intelligent way, it is necessary to consider additional factors unique to the visual domain. In this paper we consider the use of neighbors as anchors to create a referring expression for a person in a group image. We describe a target person using the people around him, when we cannot find a reliable set of attributes to describe the target himself. We first present a method for including neighbors in a referring expression, and discuss several ways of presenting this data to a user. We show through experiments that using descriptions with neighbors can significantly improve the probability of conveying the correct information to a user.
KW - Attributes
KW - Image Description
KW - Referring Expression
UR - http://www.scopus.com/inward/record.url?scp=84884950205&partnerID=8YFLogxK
U2 - 10.1109/CVPRW.2013.47
DO - 10.1109/CVPRW.2013.47
M3 - Conference contribution
AN - SCOPUS:84884950205
SN - 9780769549903
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
SP - 269
EP - 276
BT - Proceedings - 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2013
Y2 - 23 June 2013 through 28 June 2013
ER -