Abstract
In this paper, we propose IRTR-DETR, an Interactive and Real-Time Rotated DEtection TRansformer that extends IRT-DETR to predict rotated bounding boxes. IRTR-DETR maintains the Human-In-The-Loop (HIL) workflow of IRTDETR but introduces rotation-aware heads for improved detection of objects with arbitrary orientations. Similarly to IRTDETR, IRTR-DETR can be trained with a small labeled sample set in an interactive setting, but we show that it can also be pretrained on related but not identical data-such as a building damage dataset-before being applied to tasks like identifying buildings under construction. We demonstrate the efficacy of our approach on the publicly available Tiny-DOTA and xBD dataset, as well as two study-cases on proprietary datasets of greenhouses and houses under construction ('waffle homes'). Detecting greenhouses is highly relevant in the context of damage assessment, while 'waffle homes' aid understanding typical floorplans and building codes in different areas, both thereby supporting population modeling, emergency response, and policy planning. Our method outperforms the state of the art in interactive rotated object detection on the Tiny-DOTA dataset by 5.7 percent, and improves upon the non interactive RTDETR by 7.85 to 19.39 percent (depending on the number of provided samples) while maintaining its real-time efficiency.
Original language | English |
---|---|
Title of host publication | Proceedings - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1129-1137 |
Number of pages | 9 |
ISBN (Electronic) | 9798331536626 |
DOIs | |
State | Published - 2025 |
Event | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025 - Tucson, United States Duration: Feb 28 2025 → Mar 4 2025 |
Publication series
Name | Proceedings - 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025 |
---|
Conference
Conference | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2025 |
---|---|
Country/Territory | United States |
City | Tucson |
Period | 02/28/25 → 03/4/25 |
Funding
This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doepublic-access-plan). This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725.
Keywords
- applications
- interactive
- object detection
- remote sensing