Statistical Modelling 10 (2010), 133158
Latent regression analysis
Thaddeus Tarpey
Department of Mathematics and Statistics
Wright State University
Dayton, Ohio
U.S.A.
eMail: thaddeus.tarpey@wright.edu
Eva Petkova
Department of Child and Adolescent Psychiatry
New York University
New York
and
Nathan S. Kline Institute for Psychiatric Research
New York
U.S.A.
Abstract:
Finite mixture models have come to play a very prominent role in modelling
data. The finite mixture model is predicated on the assumption that distinct
latent groups exist in the population. The finite mixture model therefore is
based on a categorical latent variable that distinguishes the different
groups. Often in practice, distinct sub-populations do not actually exist.
For example, disease severity (e.g., depression) may vary continuously and
therefore, a distinction of diseased and non-diseased may not be based on
the existence of distinct sub-populations. Thus, what is needed is a
generalization of the finite mixture’s discrete latent predictor to a
continuous latent predictor. We cast the finite mixture model as a regression
model with a latent Bernoulli predictor. A latent regression model is proposed
by replacing the discrete Bernoulli predictor by a continuous latent predictor
with a beta distribution. Motivation for the latent regression model arises
from applications where distinct latent classes do not exist, but instead
individuals vary according to a continuous latent variable. The shapes of
the beta density are very flexible and can approximate the discrete Bernoulli
distribution. Examples and a simulation are provided to illustrate the latent
regression model. In particular, the latent regression model is used to model
placebo effect among drug-treated subjects in a depression study.
Keywords:
beta distribution; EM algorithm; finite and infinite mixtures;
placebo effect; quasi-Newton algorithms; skew-normal distribution
Downloads:
Example data and R-code in
zipped archive
back