Impact of Non-Weighting in the Analysis of Data Obtained from Complex Samples
Keywords:
DMF Index, Statistics, Health Surveys
Abstract
Objective: To compare estimates obtained with and without sampling weights. Material and Methods: Secondary data from the Oral Health Survey of the State of São Paulo (SBSP2015) were used to calculate means, standard errors of the mean and confidence intervals (CI) for the DMFT index and its components (decayed, missing and filled) in the 35-44-year age group. Multiple logistic regression models were also estimated with and without the weights from the sampling plan (p<0.05). Results: The estimates of the DMFT index and of the decayed component varied little whether or not the design was considered (1.1% and 2.0%, respectively). The missing and filled components, however, showed larger differences between means. Means differed, upward or downward, by up to 6.7% between the weighted and unweighted analyses. The standard error was underestimated in the unweighted analysis, and the confidence intervals differed. The regression models obtained from the weighted and unweighted analyses also differed. Conclusion: Although the weighted and unweighted analyses differed by less than 10% in the estimated means, the confidence intervals and the statistical inferences were different. Weighting should therefore be applied when analyzing population-based data collected with complex sampling designs.
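The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis: the DMFT scores and sampling weights below are invented for illustration, the function names are hypothetical, and the weighted standard error uses a simplified Taylor-linearization formula that treats each respondent as an independent unit drawn with replacement, ignoring the strata and clusters a real complex design would have.

```python
import math


def unweighted_mean_se(y):
    """Simple mean and its standard error, ignoring the sampling design."""
    n = len(y)
    mean = sum(y) / n
    var = sum((v - mean) ** 2 for v in y) / (n - 1)
    return mean, math.sqrt(var / n)


def weighted_mean_se(y, w):
    """Weighted (ratio) mean with a linearization standard error.

    Simplifying assumption: every respondent is its own sampling unit,
    drawn with replacement; strata and clusters are not modeled.
    """
    n = len(y)
    total_w = sum(w)
    mean = sum(wi * yi for wi, yi in zip(w, y)) / total_w
    # Linearized scores; they sum to zero by construction.
    z = [wi * (yi - mean) / total_w for wi, yi in zip(w, y)]
    var = n / (n - 1) * sum(zi ** 2 for zi in z)
    return mean, math.sqrt(var)


def normal_ci(mean, se, z_crit=1.96):
    """Approximate 95% confidence interval (normal approximation)."""
    return mean - z_crit * se, mean + z_crit * se


# Hypothetical DMFT scores and sampling weights (illustrative only).
dmft = [12, 18, 9, 22, 15, 20, 7, 16]
weights = [150.0, 80.0, 300.0, 60.0, 120.0, 70.0, 350.0, 110.0]

m_u, se_u = unweighted_mean_se(dmft)
m_w, se_w = weighted_mean_se(dmft, weights)
rel_diff = 100 * (m_u - m_w) / m_w  # percent difference relative to weighted

print(f"unweighted: mean={m_u:.2f} se={se_u:.2f} ci={normal_ci(m_u, se_u)}")
print(f"weighted:   mean={m_w:.2f} se={se_w:.2f} ci={normal_ci(m_w, se_w)}")
print(f"relative difference in means: {rel_diff:.1f}%")
```

With equal weights the two functions return the same mean and standard error, so any gap between them comes from the weights alone. A full design-based analysis would also declare strata and primary sampling units, typically with dedicated survey software.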
License
Copyright (c) 2021 Pesquisa Brasileira em Odontopediatria e Clínica Integrada
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.