Abstract
Remote sensing lidar point cloud dataset embeds inherent 3D topological, topographical and complex geometrical information which possess immense potential in applications involving machine-understandable 3D perception. The lidar point clouds are unstructured, unlike images, and hence are challenging to process. In our work, we are exploring the possibility of deep learning-based question-answering on the lidar 3D point clouds. We are proposing a deep CNN-RNN parallel architecture to learn lidar point cloud features and word embedding from the questions and fuse them to form a feature mapping for generating answers. We have restricted our experiments for the urban domain and present preliminary results of binary question-answering (yes/no) using the urban lidar point clouds based on the perplexity, edit distance, evaluation loss, and sequence accuracy as the performance metrics. Our proposed hypothesis of lidar question-answering is the first attempt, to the best of our knowledge, and we envisage that our novel work could be a foundation in using lidar point clouds for enhanced 3D perception in an urban environment. We envisage that our proposed lidar question-answering could be extended for machine comprehension-based applications such as rendering lidar scene descriptions and content-based 3D scene retrieval.
Original language | English |
---|---|
Title of host publication | IGARSS 2021 - 2021 IEEE International Geoscience and Remote Sensing Symposium, Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 6936-6939 |
Number of pages | 4 |
ISBN (Electronic) | 9781665403696 |
DOIs | |
State | Published - 2021 |
Externally published | Yes |
Event | 2021 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2021 - Brussels, Belgium Duration: Jul 12 2021 → Jul 16 2021 |
Publication series
Name | International Geoscience and Remote Sensing Symposium (IGARSS) |
---|---|
Volume | 2021-July |
Conference
Conference | 2021 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2021 |
---|---|
Country/Territory | Belgium |
City | Brussels |
Period | 07/12/21 → 07/16/21 |
Funding
The authors express their gratitude towards the PHIL-Lidar Program of the Philippines for publishing the open lidar data. The authors thank the Google Cloud Team for providing the Google Cloud Platform's GPU-enabled computing facility for implementing the architectures under the Google Cloud Platform Research Credits Program.
Keywords
- 3D urban perception
- Deep learning
- Lidar question-answering
- Towards scene retrieval