Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Statistical analysis of grouped data

Thesis (PhD (Mathematical Statistics))--University of Pretoria, 2007.

Saved in:
Bibliographic Details
Other Authors: Smit, Chris F.
Format: Thesis
Published: University of Pretoria 2013
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613520102686720
access_status_str Open Access
author2 Smit, Chris F.
author_browse Smit, Chris F.
author_facet Smit, Chris F.
collection Thesis
dc_rights_str_mv ©2007, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
description Thesis (PhD (Mathematical Statistics))--University of Pretoria, 2007.
format Thesis
id oai:repository.up.ac.za:2263/25968
institution University of Pretoria (South Africa)
last_indexed 2026-06-10T12:37:27.084Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from UPSpace — University of Pretoria Institutional Repository
publishDate 2013
publishDateRange 2013
publishDateSort 2013
publisher University of Pretoria
publisherStr University of Pretoria
record_format dspace
source_str UPSpace — University of Pretoria Institutional Repository
spelling oai:repository.up.ac.za:2263/25968 Statistical analysis of grouped data Smit, Chris F. gretel.crafford@up.ac.za Crafford, Gretel Grouped data set UCTD Thesis (PhD (Mathematical Statistics))--University of Pretoria, 2007. The maximum likelihood (ML) estimation procedure of Matthews and Crowther (1995: A maximum likelihood estimation procedure when modelling in terms of constraints. South African Statistical Journal, 29, 29-51) is utilized to fit a continuous distribution to a grouped data set. This grouped data set may be a single frequency distribution or various frequency distributions that arise from a cross classification of several factors in a multifactor design. It will also be shown how to fit a bivariate normal distribution to a two-way contingency table where the two underlying continuous variables are jointly normally distributed. This thesis is organized in three different parts, each playing a vital role in the explanation of analysing grouped data with the ML estimation of Matthews and Crowther. In Part I the ML estimation procedure of Matthews and Crowther is formulated. This procedure plays an integral role and is implemented in all three parts of the thesis. In Part I the exponential distribution is fitted to a grouped data set to explain the technique. Two different formulations of the constraints are employed in the ML estimation procedure and provide identical results. The justification of the method is further motivated by a simulation study. Similar to the exponential distribution, the estimation of the normal distribution is also explained in detail. Part I is summarized in Chapter 5 where a general method is outlined to fit continuous distributions to a grouped data set. Distributions such as the Weibull, the log-logistic and the Pareto distributions can be fitted very effectively by formulating the vector of constraints in terms of a linear model. In Part II it is explained how to model a grouped response variable in a multifactor design. This multifactor design arise from a cross classification of the various factors or independent variables to be analysed. The cross classification of the factors results in a total of T cells, each containing a frequency distribution. Distribution fitting is done simultaneously to each of the T cells of the multifactor design. Distribution fitting is also done under the additional constraints that the parameters of the underlying continuous distributions satisfy a certain structure or design. The effect of the factors on the grouped response variable may be evaluated from this fitted design. Applications of a single-factor and a two-factor model are considered to demonstrate the versatility of the technique. A two-way contingency table where the two variables have an underlying bivariate normal distribution is considered in Part III. The estimation of the bivariate normal distribution reveals the complete underlying continuous structure between the two variables. The ML estimate of the correlation coefficient ρ is used to great effect to describe the relationship between the two variables. Apart from an application a simulation study is also provided to support the method proposed. Statistics unrestricted 2013-09-07T01:44:26Z 2008-07-10 2013-09-07T01:44:26Z 2007-04-13 2007 2008-07-01 Thesis Crafford, G 2007, Statistical analysis of grouped data, PhD thesis, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/25968 > D377/gm http://hdl.handle.net/2263/25968 http://upetd.up.ac.za/thesis/available/etd-07012008-072755/ ©2007, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf University of Pretoria
spellingShingle Grouped data set
UCTD
Statistical analysis of grouped data
title Statistical analysis of grouped data
title_full Statistical analysis of grouped data
title_fullStr Statistical analysis of grouped data
title_full_unstemmed Statistical analysis of grouped data
title_short Statistical analysis of grouped data
title_sort statistical analysis of grouped data
topic Grouped data set
UCTD
url http://hdl.handle.net/2263/25968
http://upetd.up.ac.za/thesis/available/etd-07012008-072755/