A study on lexical chain identification and word sense disambiguation

Ştefan Daniel Dumitrescu, Ana Gǎinaru, Ştefan Trǎuşan-Matu

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The present paper investigates the issues of lexical chains and word sense disambiguation and the strong connection between them. We propose a system that extracts words from unstructured text and provides sets of lexical chains and also words and their disambiguation based on WordNet's synsets. We test three unsupervised algorithms, each with three similarity measures based on the concept of Information Content. To evaluate the system we compare the results against manually annotated files containing disambiguated words.

Original languageEnglish
Pages (from-to)197-212
Number of pages16
JournalUPB Scientific Bulletin, Series C: Electrical Engineering
Volume73
Issue number4
StatePublished - 2011
Externally publishedYes

Keywords

  • Clustering algorithms
  • Lexical chains
  • Semantic distance
  • Word sense disambiguation

Fingerprint

Dive into the research topics of 'A study on lexical chain identification and word sense disambiguation'. Together they form a unique fingerprint.

Cite this