Abstract
Advances in technology have continually progressed our understanding of where people are, how they use the environment around them, and why they are at their current location. Having a better knowledge of when various locations become popular through space and time could have large impacts on research fields like urban dynamics and energy consumption. In this paper, we discuss the ability to identify and locate various facility types (e.g. restaurant, airport, stadiums) using social media, and assess methods in determining when these facilities become popular over time. We use standard natural language processing tools and machine learning classifiers to interpret geotagged Twitter text and determine if a user is seemingly at a location of interest when the tweet was sent. On average our classifiers are approximately 85% accurate varying across multiple facility types, with a peak precision of 98%. By using these standard methods to classify unstructured text, geotagged social media data can be an extremely useful tool to better understanding the composition of places and how and when people use them.
Original language | English |
---|---|
Title of host publication | Proceedings of the 10th Workshop on Geographic Information Retrieval, GIR 2016 |
Editors | Ross S. Purves, Christopher B. Jones |
Publisher | Association for Computing Machinery, Inc |
ISBN (Electronic) | 9781450345880 |
DOIs | |
State | Published - Oct 31 2016 |
Event | 10th Workshop on Geographic Information Retrieval, GIR 2016 - San Francisco Bay Area, United States Duration: Oct 31 2016 → … |
Publication series
Name | Proceedings of the 10th Workshop on Geographic Information Retrieval, GIR 2016 |
---|
Conference
Conference | 10th Workshop on Geographic Information Retrieval, GIR 2016 |
---|---|
Country/Territory | United States |
City | San Francisco Bay Area |
Period | 10/31/16 → … |
Funding
This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05- 00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes.
Keywords
- Crowd-sourced data
- Geographic information systems
- Machine learning
- Occupancy analysis
- Participatory sensing
- Social media
- Text classification