Abstract
Prior art has shown it is possible to estimate, through image processing and computer vision techniques, the types and parameters of transformations that have been applied to the content of individual images to obtain new images. Given a large corpus of images and a query image, an interesting further step is to retrieve the set of original images whose content is present in the query image, as well as the detailed sequences of transformations that yield the query image, given the original images. This is a problem that recently has received the name of image provenance analysis. In these times of public media manipulation (e.g., fake news and meme sharing), obtaining the history of image transformations is relevant for fact checking and authorship verification, among many other applications. This paper presents an end-to-end processing pipeline for image provenance analysis which works at real-world scale. It employs a cutting-edge image filtering solution that is custom-tailored for the problem at hand, as well as novel techniques for obtaining the provenance graph that expresses how the images, as nodes, are ancestrally connected. A comprehensive set of experiments for each stage of the pipeline is provided, comparing the proposed solution with the state-of-the-art results, employing previously published data sets. In addition, this paper introduces a new data set of real-world provenance cases from the social media site Reddit, along with baseline results.
Original language | English |
---|---|
Article number | 8438504 |
Pages (from-to) | 6109-6123 |
Number of pages | 15 |
Journal | IEEE Transactions on Image Processing |
Volume | 27 |
Issue number | 12 |
DOIs | |
State | Published - Dec 2018 |
Externally published | Yes |
Funding
Manuscript received December 28, 2017; revised June 14, 2018; accepted July 9, 2018. Date of publication August 16, 2018; date of current version September 17, 2018. This work was supported in part by DARPA and the Air Force Research Laboratory under Grant FA8750-16-2-0173, in part by FAPESP through the DéjàVu Project under Grant 2017/12646-3, in part by CAPES through the DeepEyes Grant, and in part by CNPq under Grant 304472/2015-8. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Tolga Tasdizen. (Corresponding author: Walter J. Scheirer.) D. Moreira, A. Bharati, J. Brogan, M. Parowski, K. W. Bowyer, P. J. Flynn, and W. J. Scheirer are with the Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556 USA (e-mail: [email protected]) A. Pinto and A. Rocha are with the Institute of Computing, University of Campinas, Campinas 13083-970, Brazil.
Funders | Funder number |
---|---|
Defense Advanced Research Projects Agency | |
Air Force Research Laboratory | FA8750-16-2-0173 |
Fundação de Amparo à Pesquisa do Estado de São Paulo | 2017/12646-3 |
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior | |
Conselho Nacional de Desenvolvimento Científico e Tecnológico | 304472/2015-8 |
Keywords
- Digital image forensics
- digital humanities
- graphs
- image phylogeny
- image provenance
- image retrieval