TY - JOUR
T1 - Effects of error covariance structure on estimation of model averaging weights and predictive performance
AU - Lu, Dan
AU - Ye, Ming
AU - Meyer, Philip D.
AU - Curtis, Gary P.
AU - Shi, Xiaoqing
AU - Niu, Xu Feng
AU - Yabusaki, Steve B.
PY - 2013/9
Y1 - 2013/9
N2 - When conducting model averaging for assessing groundwater conceptual model uncertainty, the averaging weights are often evaluated using model selection criteria such as AIC, AICc, BIC, and KIC (Akaike Information Criterion, Corrected Akaike Information Criterion, Bayesian Information Criterion, and Kashyap Information Criterion, respectively). However, this method often leads to an unrealistic situation in which the best model receives overwhelmingly large averaging weight (close to 100%), which cannot be justified by available data and knowledge. It was found in this study that this problem was caused by using the covariance matrix, CÉ, of measurement errors for estimating the negative log likelihood function common to all the model selection criteria. This problem can be resolved by using the covariance matrix, Cek, of total errors (including model errors and measurement errors) to account for the correlation between the total errors. An iterative two-stage method was developed in the context of maximum likelihood inverse modeling to iteratively infer the unknown Cek from the residuals during model calibration. The inferred Cek was then used in the evaluation of model selection criteria and model averaging weights. While this method was limited to serial data using time series techniques in this study, it can be extended to spatial data using geostatistical techniques. The method was first evaluated in a synthetic study and then applied to an experimental study, in which alternative surface complexation models were developed to simulate column experiments of uranium reactive transport. It was found that the total errors of the alternative models were temporally correlated due to the model errors. The iterative two-stage method using Cek resolved the problem that the best model receives 100% model averaging weight, and the resulting model averaging weights were supported by the calibration results and physical understanding of the alternative models. Using Cek obtained from the iterative two-stage method also improved predictive performance of the individual models and model averaging in both synthetic and experimental studies. Key Points Diagonal weight matrix of measurement errors gives wrong model averaging weights Model errors of simulating breakthrough data are correlated in time Full weigh matrix of correlated total errors gives right model averaging weights
AB - When conducting model averaging for assessing groundwater conceptual model uncertainty, the averaging weights are often evaluated using model selection criteria such as AIC, AICc, BIC, and KIC (Akaike Information Criterion, Corrected Akaike Information Criterion, Bayesian Information Criterion, and Kashyap Information Criterion, respectively). However, this method often leads to an unrealistic situation in which the best model receives overwhelmingly large averaging weight (close to 100%), which cannot be justified by available data and knowledge. It was found in this study that this problem was caused by using the covariance matrix, CÉ, of measurement errors for estimating the negative log likelihood function common to all the model selection criteria. This problem can be resolved by using the covariance matrix, Cek, of total errors (including model errors and measurement errors) to account for the correlation between the total errors. An iterative two-stage method was developed in the context of maximum likelihood inverse modeling to iteratively infer the unknown Cek from the residuals during model calibration. The inferred Cek was then used in the evaluation of model selection criteria and model averaging weights. While this method was limited to serial data using time series techniques in this study, it can be extended to spatial data using geostatistical techniques. The method was first evaluated in a synthetic study and then applied to an experimental study, in which alternative surface complexation models were developed to simulate column experiments of uranium reactive transport. It was found that the total errors of the alternative models were temporally correlated due to the model errors. The iterative two-stage method using Cek resolved the problem that the best model receives 100% model averaging weight, and the resulting model averaging weights were supported by the calibration results and physical understanding of the alternative models. Using Cek obtained from the iterative two-stage method also improved predictive performance of the individual models and model averaging in both synthetic and experimental studies. Key Points Diagonal weight matrix of measurement errors gives wrong model averaging weights Model errors of simulating breakthrough data are correlated in time Full weigh matrix of correlated total errors gives right model averaging weights
KW - logscore
KW - measurement error
KW - model structure error
KW - serial correlation
KW - surface complexation model
KW - time series analysis
UR - http://www.scopus.com/inward/record.url?scp=84884493601&partnerID=8YFLogxK
U2 - 10.1002/wrcr.20441
DO - 10.1002/wrcr.20441
M3 - Article
AN - SCOPUS:84884493601
SN - 0043-1397
VL - 49
SP - 6029
EP - 6047
JO - Water Resources Research
JF - Water Resources Research
IS - 9
ER -