Abstract
Given the sheer volume of scientific data archived within the data-intensive projects at the US Department of Energy's Oak Ridge National Laboratory, finding precisely what data we are looking for may not be a trivial task; conversely, we may also miss a more prominent data product. To address such issues, we propose improving the data discovery system and using data analytics methods to comprehend what specific users might be interested in based on their physiological state, search patterns, and past data usage history. This work's primary goal is to prune the complexity, increase the visibility of popular data products, and direct users toward the data that best meet their needs. The proposed algorithm constructs a user profile based on the user's explicit or implicit interactions with the system, such as items they are currently looking at on-site and the key metadata mappings related to the data set. The pattern is then used to build a training data set, which will help find relevant data to recommend to the user.
Original language | English |
---|---|
Title of host publication | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
Editors | Xintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 5658-5661 |
Number of pages | 4 |
ISBN (Electronic) | 9781728162515 |
DOIs | |
State | Published - Dec 10 2020 |
Event | 8th IEEE International Conference on Big Data, Big Data 2020 - Virtual, Atlanta, United States Duration: Dec 10 2020 → Dec 13 2020 |
Publication series
Name | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
---|
Conference
Conference | 8th IEEE International Conference on Big Data, Big Data 2020 |
---|---|
Country/Territory | United States |
City | Virtual, Atlanta |
Period | 12/10/20 → 12/13/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
Keywords
- clustering
- collaborative filtering
- content-based filtering
- data discovery
- data recommended system