Estimating Classification Errors under Edit Restrictions in Composite Survey-Register Data Using Multiple Imputation Latent Class Modelling (MILC)

Laura Boeschoten, Daniel Oberski, Ton De Waal

Research output: Contribution to journalArticleAcademicpeer-review

12 Citations (Scopus)

Abstract

Both registers and surveys can contain classification errors. These errors can be estimated by making use of a composite data set. We propose a new method based on latent class modelling to estimate the number of classification errors across several sources while taking into account impossible combinations with scores on other variables. Furthermore, the latent class model, by multiply imputing a new variable, enhances the quality of statistics based on the composite data set. The performance of this method is investigated by a simulation study, which shows that whether or not the method can be applied depends on the entropy R2 of the latent class model and the type of analysis a researcher is planning to do. Finally, the method is applied to public data from Statistics Netherlands.

Original languageEnglish
Pages (from-to)921-962
Number of pages42
JournalJournal of Official Statistics
Volume33
Issue number4
DOIs
Publication statusPublished - 1 Dec 2017
Externally publishedYes

Fingerprint

Dive into the research topics of 'Estimating Classification Errors under Edit Restrictions in Composite Survey-Register Data Using Multiple Imputation Latent Class Modelling (MILC)'. Together they form a unique fingerprint.

Cite this