GenSynthPop: generating a spatially explicit synthetic population of individuals and households from aggregated data

Jan de Mooij*, Tabea Sonnenschein, Marco Pellegrino, Mehdi Dastani, Dick Ettema, Brian Logan, Judith A. Verstegen

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

2 Downloads (Pure)

Abstract

Synthetic populations are representations of actual individuals living in a specific area. They play an increasingly important role in studying and modeling individuals and are often used to build agent-based social simulations. Traditional approaches for synthesizing populations use a detailed sample of the population (which may not be available) or combine data into a single joint distribution, and draw individuals or households from these. The latter group of existing sample-free methods fail to integrate (1) the best available data on spatial granular distributions, (2) multi-variable joint distributions, and (3) household level distributions. In this paper, we propose a sample-free approach where synthetic individuals and households directly represent the estimated joint distribution to which attributes are iteratively added, conditioned on previous attributes such that the relative frequencies within each joint group of attributes are maintained and fit granular spatial marginal distributions. In this paper we present our method and test it for the Zuid-West district of The Hague, the Netherlands, showing that spatial, multi-variable and household distributions are accurately reflected in the resulting synthetic population.

Original languageEnglish
Article number48
Pages (from-to)1-28
Number of pages28
JournalAutonomous Agents and Multi-Agent Systems
Volume38
Issue number2
DOIs
Publication statusPublished - 3 Oct 2024

Keywords

  • Data disaggregation
  • Iterative proportional fitting
  • Sample-free data synthesis
  • Spatial heterogeneity
  • Synthetic households
  • Synthetic population
  • Synthetic reconstruction

Fingerprint

Dive into the research topics of 'GenSynthPop: generating a spatially explicit synthetic population of individuals and households from aggregated data'. Together they form a unique fingerprint.

Cite this