New data sources and inference methods for statistics
This paper reviews methodological issues if non-probability data are used to compile official statistics.
Two different research lines can be identified to use non-probabilty data sources in the production of official statistics. The first approach is to combine big data sources with sample data in a model-based inference approach. This implies that big-data sources are used as covariates in models used for small area estimation and time series models to improve precision and timeliness of sample statistics. The second approach is to use big data sources as a primary data source for the compilations of official statistics. This requires adjustments for selection bias.