Abstract
Social media, including Twitter, has become an important source for disaster response. Yet most studies focus on a very limited amount of geotagged data (approximately 1% of all tweets) while discarding a rich body of data that contains location expressions in text. Location information is crucial to understanding the impact of disasters, including where damage has occurred and where the people who need help are situated. In this paper, we propose a novel two-stage machine learning- and deep learning-based framework for power outage detection from Twitter. First, we apply a probabilistic classification model using bag-of-ngrams features to find true power outage tweets. Second, we implement a new deep learning method–bidirectional long short-term memory networks–to extract outage locations from text. Results show a promising classification accuracy (86%) in identifying true power outage tweets, and approximately 20 times more usable tweets can be located compared with simply relying on geotagged tweets. The method of identifying location names used in this paper does not require language- or domain-specific external resources such as gazetteers or handcrafted features, so it can be extended to other situational awareness analyzes and new applications.
Original language | English |
---|---|
Pages (from-to) | 1285-1299 |
Number of pages | 15 |
Journal | International Journal of Digital Earth |
Volume | 12 |
Issue number | 11 |
DOIs | |
State | Published - Nov 2 2019 |
Funding
The authors would like to acknowledge the financial support received from Oak Ridge National Laboratory (ORNL)'s Liane Russell Distinguished Early Career Fellowship and grant no. TG0100000. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the US Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). Also, we would like to thank to Laurie Varma from ORNL for her diligent technical editing and proofreading of the manuscript.
Funders | Funder number |
---|---|
DOE Public Access Plan | |
United States Government | |
U.S. Department of Energy | |
Oak Ridge National Laboratory | DE-AC05-00OR22725, TG0100000 |
Keywords
- Power outage mapping
- deep learning
- location detection
- named entity recognition
- natural language processing
- social media mining