Reading Industrial Inspection Sheets by Inferring Visual Relations

Rohit Rahul, Arindam Chowdhury, Animesh, Samarth Mittal, Lovekesh Vig

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The traditional mode of recording faults in heavy factory equipment has been via handmarked inspection sheets, wherein a machine engineer manually marks the faulty machine regions on a paper outline of the machine. Over the years, millions of such inspection sheets have been recorded and the data within these sheets has remained inaccessible. However, with industries going digital and waking up to the potential value of fault data for machine health monitoring, there is an increased impetus towards digitization of these handmarked inspection records. To target this digitization, we propose a novel visual pipeline combining state of the art deep learning models, with domain knowledge and low level vision techniques, followed by inference of visual relationships. Our framework is robust to the presence of both static and non-static background in the document, variability in the machine template diagrams, unstructured shape of graphical objects to be identified and variability in the strokes of handwritten text. The proposed pipeline incorporates a capsule and spatial transformer network based classifier for accurate text reading, and a customized CTPN [15] network for text detection in addition to hybrid techniques for arrow detection and dialogue cloud removal. We have tested our approach on a real world dataset of 50 inspection sheets for large containers and boilers. The results are visually appealing and the pipeline achieved an accuracy of 87.1% for text detection and 94.6% for text reading.

Original languageEnglish
Title of host publicationComputer Vision – ACCV 2018 Workshops - 14th Asian Conference on Computer Vision, 2018, Revised Selected Papers
EditorsGustavo Carneiro, Shaodi You
PublisherSpringer Verlag
Pages159-173
Number of pages15
ISBN (Print)9783030210731
DOIs
StatePublished - 2019
Externally publishedYes
Event14th Asian Conference on Computer Vision, ACCV 2018 - Perth, Australia
Duration: Dec 2 2018Dec 6 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11367 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th Asian Conference on Computer Vision, ACCV 2018
Country/TerritoryAustralia
CityPerth
Period12/2/1812/6/18

Fingerprint

Dive into the research topics of 'Reading Industrial Inspection Sheets by Inferring Visual Relations'. Together they form a unique fingerprint.

Cite this