Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Simultaneous clustering with mixtures of factor analysers

This work details the method of Simultaneous Model-based Clustering. It also presents an extension to this method by reformulating it as a model with a mixture of factor analysers. This allows for the technique, known as Simultaneous Model-Based Clustering with a Mixture of Factor Analysers, to be a...

Full description

Saved in:
Bibliographic Details
Main Author: O'Donnell, Warwick
Other Authors: Lesosky, Maia
Format: Thesis
Language:English
Published: Department of Medicine 2015
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This work details the method of Simultaneous Model-based Clustering. It also presents an extension to this method by reformulating it as a model with a mixture of factor analysers. This allows for the technique, known as Simultaneous Model-Based Clustering with a Mixture of Factor Analysers, to be able to cluster high dimensional gene-expression data. A new table of allowable and non-allowable models is formulated, along with a parameter estimation scheme for one such allowable model. Several numerical procedures are tested and various datasets, both real and generated, are clustered. The results of clustering the Iris data find a 3 component VEV model to have the lowest misclassification rate with comparable BIC values to the best scoring model. The clustering of Genetic data was less successful, where the 2-component model could successfully uncover the healthy tissue, but partitioned the cancerous tissue in half.