Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction

Aditya Sundararajan, Arif I. Sarwat

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

14 Scopus citations

Abstract

To effectively predict generation of distributed photovoltaic (PV) systems, three parameters are critical: irradiance, ambient temperature, and module temperature. However, their completeness cannot be guaranteed because of issues in data acquisition. Many methods in literature address missingness, but their applicability varies with missingness mechanism. Exploration of methods to impute missing data in PV systems is lacking. This paper conducts statistical analyses to understand missingness mechanism in data of a real grid-tied 1.4MW PV system at Miami, and compares the imputation performance of different methods: random imputation, multiple imputation using expectation-maximization, kNN, and random forests, using error metrics and size effect measures. Imputed values are used in a multilayer perceptron to predict and compare PV generation with observed values. Results show that values imputed using kNN and random forests have the least differences in proportions and help utilities make more accurate prediction of generation for distribution planning.

Original languageEnglish
Title of host publicationProceedings of the Future Technologies Conference, FTC 2019 Volume 1
EditorsKohei Arai, Rahul Bhatia, Supriya Kapoor
PublisherSpringer
Pages590-609
Number of pages20
ISBN (Print)9783030325190
DOIs
StatePublished - 2020
Externally publishedYes
Event4th Future Technologies Conference, FTC 2019 - San Francisco, United States
Duration: Oct 24 2019Oct 25 2019

Publication series

NameAdvances in Intelligent Systems and Computing
Volume1069
ISSN (Print)2194-5357
ISSN (Electronic)2194-5365

Conference

Conference4th Future Technologies Conference, FTC 2019
Country/TerritoryUnited States
CitySan Francisco
Period10/24/1910/25/19

Funding

Acknowledgments. The work published is a result of the research sponsored by the National Science Foundation (NSF) CNS division under the award 1553494.

FundersFunder number
National Science Foundation1553494

    Keywords

    • Data processing
    • Distributed PV
    • Imputation methods
    • Missing data
    • PV Generation Prediction

    Fingerprint

    Dive into the research topics of 'Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction'. Together they form a unique fingerprint.

    Cite this