A cluster differences unfolding method for large datasets of preference ratings on an interval scale: Minimizing the mean squared centred residuals

Rodrigo Macías; J Fernando Vera; Willem J Heiser

doi:10.1111/bmsp.12332

Br J Math Stat Psychol. 2024 May;77(2):356-374. doi: 10.1111/bmsp.12332. Epub 2024 Jan 11.

Authors

Rodrigo Macías¹, J Fernando Vera², Willem J Heiser³

Affiliations

¹ Centro de Investigación en Matemáticas, Unidad Monterrey, Monterrey, México.
² University of Granada, Granada, Spain.
³ Leiden University, Leiden, The Netherlands.

PMID: 38213088
DOI: 10.1111/bmsp.12332

Abstract

Clustering and spatial representation methods are often used in combination to analyse preference ratings when a large number of individuals and/or objects is involved. When analysed under an unfolding model, row-conditional linear transformations are usually most appropriate when the goal is to determine clusters of individuals with similar preferences. However, a significant problem with transformations that include both slope and intercept is the occurrence of degenerate solutions. In this paper, we propose a least squares unfolding method that performs clustering of individuals while simultaneously estimating the location of cluster centres and object locations in low-dimensional space. The method is based on minimising the mean squared centred residuals of the preference ratings with respect to the distances between cluster centres and object locations. At the same time, the distances are row-conditionally transformed with optimally estimated slope parameters. The method is computationally efficient for large datasets and does not suffer from the appearance of degenerate solutions. Its performance is analysed in an extensive Monte Carlo experiment. It is illustrated on a real dataset, and the results are compared with those obtained using a two-step clustering and unfolding procedure.

Keywords: cluster analysis; correlation; interval level data; least squares; unfolding model; vector model.
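
The criterion described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm. It only evaluates one plausible reading of the loss for a fixed configuration, under these assumptions: each individual is already assigned to a cluster, distances are Euclidean, centring each row of ratings and of distances removes the intercept, and the slope for each row is the ordinary least squares slope. All function and variable names (`centred_residual_loss`, `assignment`, `centres`, `objects`) are hypothetical.

```python
import math


def centred(values):
    """Subtract the mean, so any additive intercept drops out."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def centred_residual_loss(ratings, assignment, centres, objects):
    """Mean squared centred residual for a fixed configuration (sketch).

    ratings[i][j] : rating of object j by individual i (assumed layout)
    assignment[i] : index of the cluster that individual i belongs to
    centres[k]    : coordinates of cluster centre k
    objects[j]    : coordinates of object j
    Returns the loss and the per-row least squares slopes.
    """
    total = 0.0
    slopes = []
    for row, k in zip(ratings, assignment):
        # Centre both the ratings and the centre-to-object distances.
        d_c = centred([euclidean(centres[k], o) for o in objects])
        p_c = centred(row)
        # Row-conditional slope: least squares fit of p_c on d_c.
        denom = sum(d * d for d in d_c)
        slope = sum(p * d for p, d in zip(p_c, d_c)) / denom if denom > 0 else 0.0
        slopes.append(slope)
        total += sum((p - slope * d) ** 2 for p, d in zip(p_c, d_c))
    return total / (len(ratings) * len(objects)), slopes


# Toy data (made up): ratings are exact linear functions of the distances.
objects = [(0.0, 0.0), (3.0, 0.0), (0.0, 4.0)]
centres = [(0.0, 0.0), (3.0, 4.0)]
ratings = [
    [10.0, 4.0, 2.0],  # 10 - 2 * distance to centre 0
    [2.0, 3.0, 4.0],   # 7 - 1 * distance to centre 1
]
assignment = [0, 1]
loss, slopes = centred_residual_loss(ratings, assignment, centres, objects)
```

In this toy case the ratings are perfectly linear in the distances, so the loss is zero up to rounding and the recovered slopes are negative, meaning closer objects receive higher ratings. A full method would also have to update the assignment, the centres, and the object locations, which this sketch does not attempt.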
