2006, Number 5
<< Back Next >>
salud publica mex 2006; 48 (5)
Regression models for variables expressed as a continuous proportion
Salinas-Rodríguez A, Pérez-Núñez R, Ávila-Burgos L
Language: Spanish
References: 21
Page: 395-404
PDF size: 144.14 Kb.
ABSTRACT
Objective. To describe some of the statistical alternatives available for studying continuous proportions and to compare them in order to show their advantages and disadvantages by means of their application in a practical example of the Public Health field.
Materials and Methods. From the National Reproductive Health Survey performed in 2003, the proportion of individual coverage in the family planning program –proposed in one study carried out in the National Institute of Public Health in Cuernavaca, Morelos, Mexico (2005)– was modeled using the Normal, Gamma, Beta and quasi-likelihood regression models. The Akaike Information Criterion (AIC) proposed by McQuarrie and Tsai was used to define the best model. Then, using a simulation (Monte Carlo/Markov Chains approach) a variable with a Beta distribution was generated to evaluate the behavior of the 4 models while varying the sample size from 100 to 18 000 observations.
Results. Results showed that the best statistical option for the analysis of continuous proportions was the Beta regression model, since its assumptions are easily accomplished and because it had the lowest AIC value. Simulation evidenced that while the sample size increases the Gamma, and even more so the quasi-likelihood, models come significantly close to the Beta regression model.
Conclusions. The use of parametric Beta regression is highly recommended to model continuous proportions and the normal model should be avoided. If the sample size is large enough, the use of quasi-likelihood model represents a good alternative.
REFERENCES
Johnson NL, Kotz S, Balakrishnan N. Continuous univariate distributions, vol. 2. New York: Wiley, 1995.
Hviid M, Villadsen B. Beta distributed market shares in a spatial model with an application to the market for audit services. Review of Industrial Organization 1995;10:737-747.
Bury K. Statistical distributions in engineering. New York: Cambridge University Press, 1999.
Godfrey L. Misspecification tests in econometrics: the Lagrange multiplier principle and other approaches. New York: Cambridge University Press, 1988.
Kieschnick R, McCullough BD. Regression analysis of variates observed on (0, 1): percentages, proportions and fractions. Stat Model 2003;3(3):193-213.
Cox C. Nonlinear quasi-likelihood models: applications to continuous proportions. Comput StatDat Anal 1996;21:449-461.
Ferrari S, Cribari-Nieto F. Beta regression for modelling rates and proportions. J Appl Stat 2004;31(7):799-815.
Atkinson A. Plots, transformations and regression: an introduction to graphical methods of diagnostic regression analysis. New York: Oxford University Press, 1985.
Aitchison J. The statistical analysis of compositional data. New York: Chapman & Hall, 1986.
Hardin J, Hilbe J. Generalized linear models and extensions. Texas: Stata Press, 2001
McCullagh P, Nelder JA. Generalized linear models. New York: Chapman & Hall, 1989.
Papke LE, Wooldridge JM. Econometric methods for fractional response variables with an application to 401(k) plan participation rates. J Appl Econom 1996;11(6):619-632.
McCulloch CH, Searle SR. Generalized, linear, and mixed models. New York: Wiley, 2001.
Barclay MJ, Smith CW. The determinants of corporate leverage and dividend policies. Journal of Applied Corporate Finance 1995;7:4-19.
Congdon P. Bayesian statistical modelling. Chichester, UK: John Wiley & Sons, 2001.
Programa Nacional de Población 2001-2006. Informe de ejecución 2003-2004 del Programa Nacional de Población 2001-2006:321-365.
Lindsey JK. Applying generalized linear models. New York: Springer-Verlag, 1997.
McQuarrie A, Tsai C. Regression and time series model selection. New Jersey: World Scientific Publishing Company, 1998.
Gilks WR, Richardson S, Spiegelhalter DJ eds. Markov chain Monte Carlo in practice. London, UK: Chapman and Hall, 1996.
Jorgensen B. The theory of dispersion models. New York: Chapman & Hall, 1997.
McDonald JB, Xu YJ. A generalization of the beta distribution with applications. J Econom 1995;66:133-52.