Abstract
To effectively predict generation of distributed photovoltaic (PV) systems, three parameters are critical: irradiance, ambient temperature, and module temperature. However, their completeness cannot be guaranteed because of issues in data acquisition. Many methods in literature address missingness, but their applicability varies with missingness mechanism. Exploration of methods to impute missing data in PV systems is lacking. This paper conducts statistical analyses to understand missingness mechanism in data of a real grid-tied 1.4MW PV system at Miami, and compares the imputation performance of different methods: random imputation, multiple imputation using expectation-maximization, kNN, and random forests, using error metrics and size effect measures. Imputed values are used in a multilayer perceptron to predict and compare PV generation with observed values. Results show that values imputed using kNN and random forests have the least differences in proportions and help utilities make more accurate prediction of generation for distribution planning.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Future Technologies Conference, FTC 2019 Volume 1 |
| Editors | Kohei Arai, Rahul Bhatia, Supriya Kapoor |
| Publisher | Springer |
| Pages | 590-609 |
| Number of pages | 20 |
| ISBN (Print) | 9783030325190 |
| DOIs | |
| State | Published - 2020 |
| Externally published | Yes |
| Event | 4th Future Technologies Conference, FTC 2019 - San Francisco, United States Duration: Oct 24 2019 → Oct 25 2019 |
Publication series
| Name | Advances in Intelligent Systems and Computing |
|---|---|
| Volume | 1069 |
| ISSN (Print) | 2194-5357 |
| ISSN (Electronic) | 2194-5365 |
Conference
| Conference | 4th Future Technologies Conference, FTC 2019 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 10/24/19 → 10/25/19 |
Funding
Acknowledgments. The work published is a result of the research sponsored by the National Science Foundation (NSF) CNS division under the award 1553494.
Keywords
- Data processing
- Distributed PV
- Imputation methods
- Missing data
- PV Generation Prediction