A flexible Bayesian nonparametric model for preference-based clustering: toward a Netflix-like collaborative filtering algorithm to improve healthcare decisions (original) (raw)

Accurate prediction of future claims is a fundamentally important problem in insurance. The Bayesian approach is natural in this context, as it provides a complete predictive distribution for future claims. The classical credibility theory provides a simple approximation to the mean of that predictive distribution as a point predictor, but this approach ignores other features of the predictive distribution, such as spread, that would be useful for decision making. In this article, we propose a Dirichlet process mixture of log-normals model and discuss the theoretical properties and computation of the corresponding predictive distribution. Numerical examples demonstrate the benefit of our model compared to some existing insurance loss models, and an R code implementation of the proposed method is also provided.