Image Provenance Analysis at Scale

Daniel Moreira, Aparna Bharati, Joel Brogan, Allan Pinto, Michael Parowski, Kevin W. Bowyer, Patrick J. Flynn, Anderson Rocha, Walter J. Scheirer

Research output: Contribution to journalArticlepeer-review

46 Scopus citations

Abstract

Prior art has shown it is possible to estimate, through image processing and computer vision techniques, the types and parameters of transformations that have been applied to the content of individual images to obtain new images. Given a large corpus of images and a query image, an interesting further step is to retrieve the set of original images whose content is present in the query image, as well as the detailed sequences of transformations that yield the query image, given the original images. This is a problem that recently has received the name of image provenance analysis. In these times of public media manipulation (e.g., fake news and meme sharing), obtaining the history of image transformations is relevant for fact checking and authorship verification, among many other applications. This paper presents an end-to-end processing pipeline for image provenance analysis which works at real-world scale. It employs a cutting-edge image filtering solution that is custom-tailored for the problem at hand, as well as novel techniques for obtaining the provenance graph that expresses how the images, as nodes, are ancestrally connected. A comprehensive set of experiments for each stage of the pipeline is provided, comparing the proposed solution with the state-of-the-art results, employing previously published data sets. In addition, this paper introduces a new data set of real-world provenance cases from the social media site Reddit, along with baseline results.

Original languageEnglish
Article number8438504
Pages (from-to)6109-6123
Number of pages15
JournalIEEE Transactions on Image Processing
Volume27
Issue number12
DOIs
StatePublished - Dec 2018
Externally publishedYes

Funding

Manuscript received December 28, 2017; revised June 14, 2018; accepted July 9, 2018. Date of publication August 16, 2018; date of current version September 17, 2018. This work was supported in part by DARPA and the Air Force Research Laboratory under Grant FA8750-16-2-0173, in part by FAPESP through the DéjàVu Project under Grant 2017/12646-3, in part by CAPES through the DeepEyes Grant, and in part by CNPq under Grant 304472/2015-8. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Tolga Tasdizen. (Corresponding author: Walter J. Scheirer.) D. Moreira, A. Bharati, J. Brogan, M. Parowski, K. W. Bowyer, P. J. Flynn, and W. J. Scheirer are with the Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556 USA (e-mail: [email protected]) A. Pinto and A. Rocha are with the Institute of Computing, University of Campinas, Campinas 13083-970, Brazil.

FundersFunder number
Defense Advanced Research Projects Agency
Air Force Research LaboratoryFA8750-16-2-0173
Fundação de Amparo à Pesquisa do Estado de São Paulo2017/12646-3
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Conselho Nacional de Desenvolvimento Científico e Tecnológico304472/2015-8

    Keywords

    • Digital image forensics
    • digital humanities
    • graphs
    • image phylogeny
    • image provenance
    • image retrieval

    Fingerprint

    Dive into the research topics of 'Image Provenance Analysis at Scale'. Together they form a unique fingerprint.

    Cite this