TY - GEN
T1 - Construction of synthetic populations with key attributes
T2 - 2nd IEEE International Conference on Social Computing, SocialCom 2010, 2nd IEEE International Conference on Privacy, Security, Risk and Trust, PASSAT 2010
AU - Fernandez, Steven J.
AU - Rose, Amy N.
AU - Bright, Edward A.
AU - Beaver, Justin M.
AU - Symons, Christopher T.
AU - Omitaomu, Olufemi A.
AU - Jiao, Cathy
PY - 2010
Y1 - 2010
N2 - In this paper, we describe our concept for overcoming the data barriers of building credible synthetic populations to assist the transformation between social theories and mathematical models. We specifically developed a 31-million-agent model of Afghanistan's population to demonstrate the ability to computationally control and analytically manipulate a system with the large number of agents (i.e., 108) necessary to model regions at the individual level using the LandScan Global population database. Afghanistan was selected for this case study because gathering data for Afghanistan was thought to be especially challenging. The LandScan Global population database is used by a majority of key U.S. and foreign agencies as their database system for worldwide geospatial distribution of populations. Assigning attributes to disaggregated population was achieved by fusing appropriate indicator databases using two forms of aggregation techniques - geographical and categorical. A new approach of matching attributes to theoretical constructs was illustrated. The other data sources used include data on military and peacekeeper forces' loyalties, readiness, and deployment collected through a combination of UN and classified force projections; economic data collected at the national level and disaggregated using data fusion techniques; data on social attitudes, beliefs, and social cleavages through anthropological studies, worldwide polling, and classified sources; and data on infrastructure and information systems and networks.
AB - In this paper, we describe our concept for overcoming the data barriers of building credible synthetic populations to assist the transformation between social theories and mathematical models. We specifically developed a 31-million-agent model of Afghanistan's population to demonstrate the ability to computationally control and analytically manipulate a system with the large number of agents (i.e., 108) necessary to model regions at the individual level using the LandScan Global population database. Afghanistan was selected for this case study because gathering data for Afghanistan was thought to be especially challenging. The LandScan Global population database is used by a majority of key U.S. and foreign agencies as their database system for worldwide geospatial distribution of populations. Assigning attributes to disaggregated population was achieved by fusing appropriate indicator databases using two forms of aggregation techniques - geographical and categorical. A new approach of matching attributes to theoretical constructs was illustrated. The other data sources used include data on military and peacekeeper forces' loyalties, readiness, and deployment collected through a combination of UN and classified force projections; economic data collected at the national level and disaggregated using data fusion techniques; data on social attitudes, beliefs, and social cleavages through anthropological studies, worldwide polling, and classified sources; and data on infrastructure and information systems and networks.
KW - Agent-based models
KW - Data fusion
KW - High-resolution data
KW - Population database
KW - Social modeling
UR - http://www.scopus.com/inward/record.url?scp=78649289278&partnerID=8YFLogxK
U2 - 10.1109/SocialCom.2010.109
DO - 10.1109/SocialCom.2010.109
M3 - Conference contribution
AN - SCOPUS:78649289278
SN - 9780769542119
T3 - Proceedings - SocialCom 2010: 2nd IEEE International Conference on Social Computing, PASSAT 2010: 2nd IEEE International Conference on Privacy, Security, Risk and Trust
SP - 701
EP - 706
BT - Proceedings - SocialCom 2010
Y2 - 20 August 2010 through 22 August 2010
ER -