Facility detection and popularity assessment from text classification of social media and crowdsourced data

Kevin A. Sparks, Roger G. Li, Gautam S. Thakur, Robert N. Stewart, Marie L. Urban

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Advances in technology have continually progressed our understanding of where people are, how they use the environment around them, and why they are at their current location. Having a better knowledge of when various locations become popular through space and time could have large impacts on research fields like urban dynamics and energy consumption. In this paper, we discuss the ability to identify and locate various facility types (e.g. restaurant, airport, stadiums) using social media, and assess methods in determining when these facilities become popular over time. We use standard natural language processing tools and machine learning classifiers to interpret geotagged Twitter text and determine if a user is seemingly at a location of interest when the tweet was sent. On average our classifiers are approximately 85% accurate varying across multiple facility types, with a peak precision of 98%. By using these standard methods to classify unstructured text, geotagged social media data can be an extremely useful tool to better understanding the composition of places and how and when people use them.

Original languageEnglish
Title of host publicationProceedings of the 10th Workshop on Geographic Information Retrieval, GIR 2016
EditorsRoss S. Purves, Christopher B. Jones
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450345880
DOIs
StatePublished - Oct 31 2016
Event10th Workshop on Geographic Information Retrieval, GIR 2016 - San Francisco Bay Area, United States
Duration: Oct 31 2016 → …

Publication series

NameProceedings of the 10th Workshop on Geographic Information Retrieval, GIR 2016

Conference

Conference10th Workshop on Geographic Information Retrieval, GIR 2016
Country/TerritoryUnited States
CitySan Francisco Bay Area
Period10/31/16 → …

Funding

This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05- 00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes.

FundersFunder number
United States Government
U.S. Department of Energy

    Keywords

    • Crowd-sourced data
    • Geographic information systems
    • Machine learning
    • Occupancy analysis
    • Participatory sensing
    • Social media
    • Text classification

    Fingerprint

    Dive into the research topics of 'Facility detection and popularity assessment from text classification of social media and crowdsourced data'. Together they form a unique fingerprint.

    Cite this