Removal and Recovery of the Human Invisible Region

Qian Zhang, Qiyao Liang, Hong Liang, Ying Yang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The occlusion problem is one of the fundamental problems of computer vision, especially in the case of non‐rigid objects with variable shapes and complex backgrounds, such as humans. With the rise of computer vision in recent years, the problem of occlusion has also become increasingly visible in branches such as human pose estimation, where the object of study is a human being. In this paper, we propose a two‐stage framework that solves the human de‐occlusion problem. The first stage is the amodal completion stage, where a new network structure is designed based on the hourglass network, and a large amount of prior information is obtained from the training set to constrain the model to predict in the correct direction. The second phase is the content recovery phase, where visible guided attention (VGA) is added to the U‐Net with a symmetric U‐ shaped network structure to derive relationships between visible and invisible regions and to capture information between contexts across scales. As a whole, the first stage is the encoding stage, and the second stage is the decoding stage, and the network structure of each stage also consists of encoding and decoding, which is symmetrical overall and locally. To evaluate the proposed approach, we provided a dataset, the human occlusion dataset, which has occluded objects from drilling scenes and synthetic images that are close to reality. Experiments show that the method has high performance in terms of quality and diversity compared to existing methods. It is able to remove occlusions in complex scenes and can be extended to human pose estimation.

Original languageEnglish
Article number531
JournalSymmetry
Volume14
Issue number3
DOIs
StatePublished - Mar 2022
Externally publishedYes

Funding

Funding: This research was funded by The Science Foundation of Shandong Province, grant number ZR2020MF005.

FundersFunder number
The Science Foundation of Shandong ProvinceZR2020MF005

    Keywords

    • Amodal completion
    • Content recovery
    • Human de‐occlusion
    • Human occlusion dataset

    Fingerprint

    Dive into the research topics of 'Removal and Recovery of the Human Invisible Region'. Together they form a unique fingerprint.

    Cite this