Statistical Modelling 11 (2011), 489505
Hierarchical mixture models for biclustering in microarray data
F Martella
Dipertimenti di Scienze Statistiche,
Facoltà di Ingegneria dell' Informazione, Informaticae Statistica,
Sapienza Università di Roma
P.le Also Moro, 5
I00185 Rome
Italy
eMail: francesca.martella@uniroma.it
M Alfò and M Vichi
Dipartimento di Scienze Statistiche,
Facoltà di Ingegneria dell’ Informazione, Informaticae Statistica,
Sapienza Università di Roma
Rome
Italy
Abstract:
In the last few years, model-based clustering techniques have become widely used in the context of microarray data analysis. In this empirical context, a potential purpose for statistical approaches is the identification of clusters of genes that are co-expressed under subsets of experimental conditions. We discuss a hierarchical mixture model to combine advantages of allowing for dependence within gene clusters and for simultaneous clustering of genes and experimental conditions. Thanks to the adopted hierarchical structure, we may distinguish gene clusters from mixture components, where the latter may represent intra-cluster gene-specific extra-Gaussian departures. To cluster experimental conditions, instead, we suggest a suitable parameterization of component-specific means by using a binary row stochastic matrix representing condition membership. The performance of the proposed approach is discussed on both simulated and real datasets.
Keywords:
Hierarchical mixture model; biclustering; microarray data
Downloads:
Example data and Matlab code in
zipped archive
back