TY - GEN
T1 - STExNMF
T2 - 17th IEEE International Conference on Data Mining, ICDM 2017
AU - Shin, Dear Sungbok
AU - Choi, Minsuk
AU - Choi, Jinho
AU - Langevin, Scott
AU - Bethune, Christopher
AU - Horne, Philippe
AU - Kronenfeld, Nathan
AU - Kannan, Ramakrishnan
AU - Drake, Barry
AU - Park, Haesun
AU - Choo, Jaegul
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/12/15
Y1 - 2017/12/15
N2 - Understanding newly emerging events or topics associated with a particular region of a given day can provide deep insight on the critical events occurring in highly evolving metropolitan cities. We propose herein a novel topic modeling approach on text documents with spatio-temporal information (e.g., when and where a document was published) such as location-based social media data to discover prevalent topics or newly emerging events with respect to an area and a time point. We consider a map view composed of regular grids or tiles with each showing topic keywords from documents of the corresponding region. To this end, we present a tilebased spatio-temporally exclusive topic modeling approach called STExNMF, based on a novel nonnegative matrix factorization (NMF) technique. STExNMF mainly works based on the two following stages: (1) first running a standard NMF of each tile to obtain general topics of the tile and (2) running a spatiotemporally exclusive NMF on a weighted residual matrix. These topics likely reveal information on newly emerging events or topics of interest within a region. We demonstrate the advantages of our approach using the geo-tagged Twitter data of New York City. We also provide quantitative comparisons in terms of the topic quality, spatio-temporal exclusiveness, topic variation, and qualitative evaluations of our method using several usage scenarios. In addition, we present a fast topic modeling technique of our model by leveraging parallel computing.
AB - Understanding newly emerging events or topics associated with a particular region of a given day can provide deep insight on the critical events occurring in highly evolving metropolitan cities. We propose herein a novel topic modeling approach on text documents with spatio-temporal information (e.g., when and where a document was published) such as location-based social media data to discover prevalent topics or newly emerging events with respect to an area and a time point. We consider a map view composed of regular grids or tiles with each showing topic keywords from documents of the corresponding region. To this end, we present a tilebased spatio-temporally exclusive topic modeling approach called STExNMF, based on a novel nonnegative matrix factorization (NMF) technique. STExNMF mainly works based on the two following stages: (1) first running a standard NMF of each tile to obtain general topics of the tile and (2) running a spatiotemporally exclusive NMF on a weighted residual matrix. These topics likely reveal information on newly emerging events or topics of interest within a region. We demonstrate the advantages of our approach using the geo-tagged Twitter data of New York City. We also provide quantitative comparisons in terms of the topic quality, spatio-temporal exclusiveness, topic variation, and qualitative evaluations of our method using several usage scenarios. In addition, we present a fast topic modeling technique of our model by leveraging parallel computing.
KW - Anomaly detection
KW - Event detection
KW - Matrix factorization
KW - Social network analysis
KW - Topic modeling
UR - http://www.scopus.com/inward/record.url?scp=85044007831&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2017.53
DO - 10.1109/ICDM.2017.53
M3 - Conference contribution
AN - SCOPUS:85044007831
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 435
EP - 444
BT - Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017
A2 - Karypis, George
A2 - Alu, Srinivas
A2 - Raghavan, Vijay
A2 - Wu, Xindong
A2 - Miele, Lucio
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 November 2017 through 21 November 2017
ER -