Visual Context Learning with Big Data Analytics

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Understanding contextual information composed of both text and images is very useful for multimedia information processing. However, capturing such contexts is not trivial, especially while dealing with real datasets. Existing solutions such as using ontologies (e.g., WordNet) are mainly interested in individual terms, but they do not support identifying a group of terms that describe a specific context available at runtime. Within our knowledge, there are very limited solutions regarding the integration of contextual information from both images and text. Furthermore, existing solutions are not scalable due to the computationally intensive tasks and are prone to data sparsity. In this paper, we propose a semantic framework, called VisContextthat is based on a contextual model combined with images and text. The VisContext framework is based on the scalable pipeline that is composed of the primary components as follows: (i)Natural Language Processing (NLP), (ii) Feature extraction usingTerm Frequency-Inverse Document Frequency (TF-IDF), (iii)Feature association using unsupervised learning algorithms including K-Means clustering (KM) and Expectation-Maximization(EM) algorithms, iv) Validation of visual context models using supervised learning algorithms (Naïve Bayes, Decision Trees, Random Forests). The proposed VisContext framework has been implemented with the Spark MLlib and CoreNLP. We have evaluated the effectiveness of the framework in visual understanding with three large datasets (IAPR, Flick3k, SBU) containing more than one million images and their annotations. The results are reported in the discovery of the contextual association of terms and images, image context visualization, and image classification based on contexts.

Original languageEnglish
Title of host publicationProceedings - 16th IEEE International Conference on Data Mining Workshops, ICDMW 2016
EditorsCarlotta Domeniconi, Francesco Gullo, Francesco Bonchi, Francesco Bonchi, Josep Domingo-Ferrer, Ricardo Baeza-Yates, Ricardo Baeza-Yates, Ricardo Baeza-Yates, Zhi-Hua Zhou, Xindong Wu
PublisherIEEE Computer Society
Pages600-607
Number of pages8
ISBN (Electronic)9781509054725
DOIs
StatePublished - Jul 2 2016
Event16th IEEE International Conference on Data Mining Workshops, ICDMW 2016 - Barcelona, Spain
Duration: Dec 12 2016Dec 15 2016

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume0
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference16th IEEE International Conference on Data Mining Workshops, ICDMW 2016
Country/TerritorySpain
CityBarcelona
Period12/12/1612/15/16

Keywords

  • Big Data Analytics
  • Expectation Maximization
  • K-Means
  • TF-IDF
  • Visual Vontext

Fingerprint

Dive into the research topics of 'Visual Context Learning with Big Data Analytics'. Together they form a unique fingerprint.

Cite this