Abstract
During natural emergencies (e.g., hurricanes, tornadoes, storms), individuals can choose to avoid or leave areas of risk. Yet, often people choose to stay or travel to danger areas. Some may underestimate the danger; others may want to protect their property or families. Widespread social media use by these individuals can help us understand their motives and quantify their likelihood to engage in risky travel or decisions to stay. Social media data in such situations is not unlike sensor data; by tracking where individuals go and what they tweet about we can discover both temporal and spatial trends in human emotion and behavior during weather events. In this paper, we describe our extensible, distributed, real-time data collection and analysis pipeline that combines public streaming data from the National Weather Service and Twitter for subsequent exploration and analysis, including risk behavior modeling. Our pipeline leverages the open-source Apache Storm framework and the ELK (Elasticsearch, Logstash, Kibana) stack to process, filter, augment and index this streaming data for subsequent efficient retrieval. This work, which can be expanded to other social media (Facebook, Flickr, Instagram) is pathbreaking in several respects; first, it represents a novel integration of weather and social media data; second, our pipeline can be easily adapted to other analyzes by adding or removing processing components; and finally, this work represents the first (to our knowledge) quantification of human risk behavior using social media data in the form of average vectors and individual risk behavior indicators.
| Original language | English |
|---|---|
| Title of host publication | Practice and Experience in Advanced Research Computing 2018 |
| Subtitle of host publication | Seamless Creativity, PEARC 2018 |
| Publisher | Association for Computing Machinery |
| ISBN (Print) | 9781450364461 |
| DOIs | |
| State | Published - Jul 22 2018 |
| Externally published | Yes |
| Event | 2018 Practice and Experience in Advanced Research Computing Conference: Seamless Creativity, PEARC 2018 - Pittsburgh, United States Duration: Jul 22 2017 → Jul 26 2017 |
Publication series
| Name | ACM International Conference Proceeding Series |
|---|
Conference
| Conference | 2018 Practice and Experience in Advanced Research Computing Conference: Seamless Creativity, PEARC 2018 |
|---|---|
| Country/Territory | United States |
| City | Pittsburgh |
| Period | 07/22/17 → 07/26/17 |
Funding
We would like to thank the VACCINE (Visual Analytics for Command, Control, and Interoperability Environments Center) center for providing the Hurricane Sandy tweets dataset. The early conceptual work for the RBQ calculation was developed with a grant from the VACCINE center; we thank the director, David Ebert, and his graduate students for their help with this work.
Keywords
- Data analytics
- Distributed data pipeline
- Social media
Fingerprint
Dive into the research topics of 'Social media modeling of human behavior in natural emergencies'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver