TY - GEN
T1 - Multi-criterion active learning in conditional random fields
AU - Symons, Christopher T.
AU - Samatova, Nagiza F.
AU - Krishnamurthy, Ramya
AU - Park, Byung H.
AU - Umar, Tarik
AU - Buttler, David
AU - Critchlow, Terence
AU - Hysom, David
PY - 2006
Y1 - 2006
N2 - Conditional Random Fields (CRFs), which are popular supervised learning models for many Natural Language Processing (NLP) tasks, typically require a large collection of labeled data for training. In practice, however, manual annotation of text documents is quite costly. Furthermore, even large labeled training sets can have arbitrarily limited performance peaks if they are not chosen with care. This paper considers the use of multi-criterion active learning for identification of a small but sufficient set of text samples for training CRFs. Our empirical results demonstrate that our method is capable of reducing the manual annotation costs, while also limiting the retraining costs that are often associated with active learning. In addition, we show that the generalization performance of CRFs can be enhanced through judicious selection of training examples.
AB - Conditional Random Fields (CRFs), which are popular supervised learning models for many Natural Language Processing (NLP) tasks, typically require a large collection of labeled data for training. In practice, however, manual annotation of text documents is quite costly. Furthermore, even large labeled training sets can have arbitrarily limited performance peaks if they are not chosen with care. This paper considers the use of multi-criterion active learning for identification of a small but sufficient set of text samples for training CRFs. Our empirical results demonstrate that our method is capable of reducing the manual annotation costs, while also limiting the retraining costs that are often associated with active learning. In addition, we show that the generalization performance of CRFs can be enhanced through judicious selection of training examples.
UR - http://www.scopus.com/inward/record.url?scp=38949157155&partnerID=8YFLogxK
U2 - 10.1109/ICTAI.2006.90
DO - 10.1109/ICTAI.2006.90
M3 - Conference contribution
AN - SCOPUS:38949157155
SN - 0769527280
SN - 9780769527284
T3 - Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
SP - 323
EP - 331
BT - Procedings - 18th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2006
T2 - 18th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2006
Y2 - 13 October 2006 through 15 October 2006
ER -