Abstract
Data-driven segmentation is an important tool for analyzing patterns of associations in social survey data; however, it remains a challenge to compare the quality of segmentations obtained by different methods. We present a statistical framework for quantifying the quality of segmentations of human values, by evaluating their ability to predict held-out data. By comparing clusterings of human values survey data from the forth round of European Social Study (ESS-4), we show that demographic markers such as age or country predict better than random, yet are outperformed by data-driven segmentation methods. We show that a Bayesian version of Latent Class Analysis (LCA) outperforms the standard maximum likelihood LCA in predictive performance and is more robust for different number of clusters.
Original language | English |
---|---|
Journal | Journal of Mathematical Sociology |
Volume | 46 |
Issue number | 1 |
Pages (from-to) | 28-55 |
Number of pages | 28 |
ISSN | 0022-250X |
DOIs | |
Publication status | Published - 2022 |
Bibliographical note
Published online: 17. September 2020.Keywords
- Human value segmentation
- Bayesian latent class analysis
- Predictive evaluation