TY - JOUR
T1 - Dependence-preserving approach to synthesizing household characteristics
AU - Kao, Shih Chieh
AU - Kim, Hoe
AU - Liu, Cheng
AU - Cui, Xiaohui
AU - Bhaduri, Budhendra
PY - 2012/12/1
Y1 - 2012/12/1
N2 - One effective approach to study day-to-day traveler behavior is through the activity-based traffic demand model, in which all travelers are treated as individual agents and interact under a computation-intensive framework. Nevertheless, because of high survey costs, low response rate, and privacy concerns, detailed household and personal characteristics are usually unavailable. Various population synthesizers were therefore proposed to reconstruct a methodologically rigorous estimate of household characteristics from different surveys. For instance, the iterative proportional fitting (IPF) algorithm is used to synthesize the full population from the public use microdata sample (PUMS) and Census Summary File 3 (SF3) in the popular activity-based traffic demand model, TRANSIMS. However, some fundamental limitations of IPF (e.g., zero cells in the contingency table as a result of small sample size) have drawn sufficient attention and resulted in the development of enhanced IPF algorithms and other strategies. This paper proposes a copula-based method to synthesize household characteristics that preserves marginal distributions and dependence structure between variables. The proposed method is tested for the state of Iowa, and the results are compared with the IPF approach of TRANSIMS. The synthesized households resulted in the same local SF3 statistics at each block group. But having similar intervariable correlations as described in the PUMS suggest the applicability of the copula-based approach. Because marginal distributions and dependence structure can be faithfully preserved, the proposed method could be a suitable alternative to synthesize realistic agent characteristics for further activity-based traffic demand modeling.
AB - One effective approach to study day-to-day traveler behavior is through the activity-based traffic demand model, in which all travelers are treated as individual agents and interact under a computation-intensive framework. Nevertheless, because of high survey costs, low response rate, and privacy concerns, detailed household and personal characteristics are usually unavailable. Various population synthesizers were therefore proposed to reconstruct a methodologically rigorous estimate of household characteristics from different surveys. For instance, the iterative proportional fitting (IPF) algorithm is used to synthesize the full population from the public use microdata sample (PUMS) and Census Summary File 3 (SF3) in the popular activity-based traffic demand model, TRANSIMS. However, some fundamental limitations of IPF (e.g., zero cells in the contingency table as a result of small sample size) have drawn sufficient attention and resulted in the development of enhanced IPF algorithms and other strategies. This paper proposes a copula-based method to synthesize household characteristics that preserves marginal distributions and dependence structure between variables. The proposed method is tested for the state of Iowa, and the results are compared with the IPF approach of TRANSIMS. The synthesized households resulted in the same local SF3 statistics at each block group. But having similar intervariable correlations as described in the PUMS suggest the applicability of the copula-based approach. Because marginal distributions and dependence structure can be faithfully preserved, the proposed method could be a suitable alternative to synthesize realistic agent characteristics for further activity-based traffic demand modeling.
UR - http://www.scopus.com/inward/record.url?scp=84872708172&partnerID=8YFLogxK
U2 - 10.3141/2302-21
DO - 10.3141/2302-21
M3 - Article
AN - SCOPUS:84872708172
SN - 0361-1981
SP - 192
EP - 200
JO - Transportation Research Record
JF - Transportation Research Record
IS - 2302
ER -