TY - GEN
T1 - Information Extraction for Semantic Enrichment of BIM for Bridges
AU - Li, Hang
AU - Zhang, Jiansong
N1 - Publisher Copyright:
© 2024 ASCE.
PY - 2024
Y1 - 2024
N2 - Current bridge projects mainly rely on PDF plans as the official deliverables and documents to be stored, communicated, and transferred among different stakeholders. With the Industry Foundation Classes' (IFC) Building Information Modeling (BIM) standard adopted by the American Association of State Highway and Transportation Officials (AASHTO) as the national standard for modeling bridge and road infrastructure projects, upgrading the documentation of bridge projects to 3D BIM in compliance with the national standard has become an urgent need. In this research, the state-of-the-art PDF2BIM algorithms were leveraged to semi-automatically create 3D geometric models of bridges based on PDF drawings. To enrich the 3D geometric model with semantic information of bridges' components (e.g., bridge name, structure type, concrete strength), information extraction algorithms based on optical character recognition (OCR) and natural language processing (NLP) were developed to extract data from the bridge plans automatically. It significantly increases the efficiency and productivity of information extraction and enrichment of IFC-based BIM instance models for bridges by leveraging the rich information that already resides in the PDF plans. The results show that it achieved 97.6% accuracy in the information extraction task and reduced the overall time consumption on processing bridge data by 96.3% compared to the manual approach.
AB - Current bridge projects mainly rely on PDF plans as the official deliverables and documents to be stored, communicated, and transferred among different stakeholders. With the Industry Foundation Classes' (IFC) Building Information Modeling (BIM) standard adopted by the American Association of State Highway and Transportation Officials (AASHTO) as the national standard for modeling bridge and road infrastructure projects, upgrading the documentation of bridge projects to 3D BIM in compliance with the national standard has become an urgent need. In this research, the state-of-the-art PDF2BIM algorithms were leveraged to semi-automatically create 3D geometric models of bridges based on PDF drawings. To enrich the 3D geometric model with semantic information of bridges' components (e.g., bridge name, structure type, concrete strength), information extraction algorithms based on optical character recognition (OCR) and natural language processing (NLP) were developed to extract data from the bridge plans automatically. It significantly increases the efficiency and productivity of information extraction and enrichment of IFC-based BIM instance models for bridges by leveraging the rich information that already resides in the PDF plans. The results show that it achieved 97.6% accuracy in the information extraction task and reduced the overall time consumption on processing bridge data by 96.3% compared to the manual approach.
UR - https://www.scopus.com/pages/publications/85188666330
U2 - 10.1061/9780784485262.064
DO - 10.1061/9780784485262.064
M3 - Conference contribution
AN - SCOPUS:85188666330
T3 - Construction Research Congress 2024, CRC 2024
SP - 629
EP - 638
BT - Advanced Technologies, Automation, and Computer Applications in Construction
A2 - Shane, Jennifer S.
A2 - Madson, Katherine M.
A2 - Mo, Yunjeong
A2 - Poleacovschi, Cristina
A2 - Sturgill, Roy E.
PB - American Society of Civil Engineers (ASCE)
T2 - Construction Research Congress 2024, CRC 2024
Y2 - 20 March 2024 through 23 March 2024
ER -