Impact of Non-Weighting in the Analysis of Data Obtained from Complex Samples


  • Denise de Fátima Barros Cavalcante
  • Antonio Carlos Pereira
  • Elza Cristina Farias de Araújo
  • Yuri Wanderley Cavalcanti
  • Marcelo de Castro Meneghim
  • Gláucia Maria Bovi Ambrosano


DMF Index, Statistics, Health Surveys


Objective: To compare estimates obtained with and without sampling weights. Material and Methods: Secondary data from the Oral Health Survey of the State of São Paulo (SBSP2015) were used to calculate means, standard errors of the mean, and confidence intervals (CI) for the DMFT index and its components (decayed, missing, and filled) in the 35-44-year age group. Multiple logistic regression models were estimated with and without the weights from the sampling plan (p<0.05). Results: Estimates of the DMFT index and of the decayed component varied little whether or not the design was considered (1.1% and 2.0%, respectively). However, the missing and filled components showed larger differences between means. The means differed by up to 6.7%, in either direction, between the weighted and unweighted analyses. The standard error was underestimated in the unweighted analysis, and the confidence intervals differed. The regression models obtained from the weighted and unweighted analyses also differed. Conclusion: Although the weighted and unweighted analyses differed by less than 10% in the estimated means, the confidence intervals and the statistical inferences were different. Thus, weighting should be applied when analyzing population-based data collected through complex sampling designs.
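To make the comparison in the abstract concrete, the following is a minimal sketch of how a weighted and an unweighted mean, with their standard errors, can be computed side by side. It is not the authors' analysis: the DMFT values and sampling weights are hypothetical, the function names are invented for illustration, and the weighted standard error uses a simple Taylor-linearization formula that treats every respondent as an independent sampling unit. A full design-based analysis would also account for strata and clusters, which this sketch ignores.

```python
import math


def unweighted_mean_se(values):
    """Simple mean and its standard error, ignoring the sampling design."""
    n = len(values)
    mean = sum(values) / n
    var = sum((y - mean) ** 2 for y in values) / (n - 1)
    return mean, math.sqrt(var / n)


def weighted_mean_se(values, weights):
    """Weighted mean with a linearization standard error.

    Assumption: each observation is its own sampling unit
    (no stratification or clustering is modeled here).
    """
    n = len(values)
    total_w = sum(weights)
    mean = sum(w * y for y, w in zip(values, weights)) / total_w
    # Linearized contributions of each unit to the ratio estimator.
    z = [w * (y - mean) / total_w for y, w in zip(values, weights)]
    var = n / (n - 1) * sum(zi ** 2 for zi in z)
    return mean, math.sqrt(var)


# Hypothetical DMFT scores and sampling weights (not SBSP2015 data).
dmft = [12, 18, 9, 22, 15, 7, 20, 11]
weights = [150.0, 80.0, 300.0, 60.0, 120.0, 400.0, 70.0, 220.0]

uw_mean, uw_se = unweighted_mean_se(dmft)
w_mean, w_se = weighted_mean_se(dmft, weights)

# Relative difference of the unweighted mean with respect to the weighted one.
rel_diff = 100 * (uw_mean - w_mean) / w_mean

print(f"unweighted: mean={uw_mean:.2f}, SE={uw_se:.2f}")
print(f"weighted:   mean={w_mean:.2f}, SE={w_se:.2f}")
print(f"relative difference in means: {rel_diff:.1f}%")
```

When all weights are equal, both functions return the same mean and standard error, so any gap between the two results comes from the unequal weights alone.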






How to Cite

Cavalcante, D. de F. B., Pereira, A. C., de Araújo, E. C. F., Cavalcanti, Y. W., Meneghim, M. de C., & Ambrosano, G. M. B. (2021). Impact of Non-Weighting in the Analysis of Data Obtained from Complex Samples. Pesquisa Brasileira em Odontopediatria e Clínica Integrada, 21, e0188. Retrieved from



Original Articles