Abstract
Scientific datasets keep growing as more raw data is collected worldwide, and this volume makes efficient retrieval a major challenge for web search engines. This paper discusses how major scientific data centers use popular open-source search platforms such as Solr [1] to retrieve structured data stored in sources such as relational database management systems through Solr's import handler mechanisms [2]. We also describe how to configure Solr to provide advanced full-text and faceted search, along with the key features that simplify building scientific search interfaces and improve their performance.
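As a rough illustration of the faceted search mentioned in the abstract, the sketch below builds a request URL for Solr's standard `/select` handler using the common query parameters `q`, `rows`, `wt`, `facet`, `facet.field`, and `facet.mincount`. The host, the collection name `datasets`, and the field names `keyword` and `project` are hypothetical placeholders, not values taken from the paper, and the code only constructs the URL without contacting a server.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_facet_query_url(base_url, collection, text, facet_fields, rows=10):
    """Return a Solr /select URL for a full-text query with field facets."""
    params = [
        ("q", text),            # full-text query string
        ("rows", rows),         # number of documents to return
        ("wt", "json"),         # response format
        ("facet", "true"),      # enable faceting
        ("facet.mincount", 1),  # omit facet values with zero matches
    ]
    # facet.field may be repeated, once per field to facet on
    params.extend(("facet.field", field) for field in facet_fields)
    return f"{base_url.rstrip('/')}/{collection}/select?{urlencode(params)}"


# Hypothetical host, collection, and field names, for illustration only.
url = build_facet_query_url(
    "http://localhost:8983/solr",
    "datasets",
    "soil moisture",
    ["keyword", "project"],
)
print(url)

# Parse the query string back to inspect the parameters that would be sent.
parsed = parse_qs(urlsplit(url).query)
print(parsed["facet.field"])
```

Against a running Solr instance with matching fields, the JSON response to such a request would typically carry the per-field counts under `facet_counts`, which a search interface can render as filter options.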
Original language | English |
---|---|
Title of host publication | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
Editors | Xintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 5685-5687 |
Number of pages | 3 |
ISBN (Electronic) | 9781728162515 |
State | Published - Dec 10 2020 |
Event | 8th IEEE International Conference on Big Data, Big Data 2020 - Virtual, Atlanta, United States; Duration: Dec 10 2020 → Dec 13 2020 |
Publication series
Name | Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020 |
---|---|
Conference
Conference | 8th IEEE International Conference on Big Data, Big Data 2020 |
---|---|
Country/Territory | United States |
City | Virtual, Atlanta |
Period | 12/10/20 → 12/13/20 |
Funding
Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR22725.
Keywords
- Apache Lucene
- Apache Solr
- Data Import Handler
- data indexing
- faceting search