| Sign In to gain access to subscriptions and/or personal tools. |
Bayesian Analysis of Clustered Interval-censored Data
1 Dental Public Health, Faculty of Dentistry, and Correspondence: * corresponding author, mcmwong{at}hkucc.hku.hk
The recording of multiple interval-censored failure times is common in dental research. Modeling multilevel data has been a difficult task. This paper aims to use the Bayesian approach to analyze a set of multilevel clustered interval-censored data from a clinical study to investigate the effectiveness of silver diamine fluoride and sodium fluoride varnish in arresting active dentin caries in Chinese pre-school children. The time to arrest dentin caries on a surface was measured. A three-level random-effects Weibull regression model was used. Analysis was performed with WinBUGS. Results revealed a strong positive correlation (0.596) among the caries lesions arrest times on different surfaces from the same child. The software WinBUGS made the above complicated estimation simple. In conclusion, the annual application of silver diamine fluoride on caries lesions, and caries removal before the application, were found to shorten the arrest time.
Key Words: Bayesian approach biostatistics multilevel modeling WinBUGS survival data
Survival analysis encompasses a variety of statistical techniques for analyzing failure time data. When independent exact failure times are recorded with right-censored failure times (e.g., unobserved failure times due to subject drop-outs), various parametric, semi-parametric, and non-parametric methods are available in standard statistical software packages to estimate the survival curves and to investigate the effects of the covariates on survival (Allison, 1995; Venables and Ripley, 1999; Fleming and Lin, 2002; SPSS Inc., 2002). However, in practice, subjects are usually not monitored continuously but are examined periodically at pre-scheduled time points, e.g., every 6 mos. When a failure is observed, the event actually occurred between the current and the previous examination times (interval-censored data). Few methods are available to analyze interval-censored failure time data (Lindsey and Ryan, 1998). A common approach to handling interval-censored data is to assign a particular value to the failure time (e.g., midpoint of the time interval) and then proceed as if the data are being collected on a continuous scale. However, this can lead to biased and misleading results (Prentice and Gloeckler, 1978). Recording multiple failure times from the same subject is a common practice in dental research. It is obvious that data from the same subject are not independent. Thus, when one is analyzing clustered failure time data, it is important to estimate the intra-cluster association. Multilevel modeling (Gilthorpe et al., 2000b; Leyland and Goldstein, 2001) or hierarchical linear modeling (Bryk and Raudenbush, 1992) is a class of statistical techniques developed to take into account the intra-cluster dependence in the analysis of clustered data. Analysis of clustered multilevel interval-censored data using the frequentist approach for parameter estimation requires tailor-made computer programs. It would be desirable for dental researchers if the clustered multilevel interval-censored data could be analyzed by some software.
Bayesian analysis by the Monte Carlo Markov Chain (MCMC) has been a popular tool for analyzing complex data recently, and it has made its way into the medical and dental arena due to advances in computational and modeling techniques. Basically, Bayesian analysis generates conclusions based on the synthesis of new information from a study (the observed data) and previous knowledge or external evidence from independent sources (priors). By specifying a probability model for the observed data, D, given a set of unknown parameters, This paper aims to use the Bayesian approach to analyze a set of multilevel clustered interval-censored data from a clinical study to investigate the effectiveness of silver diamine fluoride and sodium fluoride varnish in arresting active dentin caries in Chinese pre-school children.
Dataset The data were from a prospective controlled clinical trial investigating the effectiveness of silver diamine fluoride (SDF) and sodium fluoride varnish (NaF) in arresting active dentin caries in Chinese pre-school children (Lo et al., 2001; Chu et al., 2002). Approval from the Ethics Committee of the Faculty of Dentistry, University of Hong Kong, was obtained prior to the implementation of the study. Children with written parental consent attending eight kindergartens participated in the study. At the baseline, one trained dentist examined the kindergarten childrens upper incisors and canines. After the examination, children with dentin caries in at least 1 of their primary anterior teeth were sequentially allocated to one of five groups. For children in the first group, soft dentin in the caries lesions was removed by means of hand instruments. The cavities were then painted with a 38% SDF solution every 12 mos. Children in the second group had SDF applied to the caries lesions every 12 mos without prior removal of the carious tissue. For children in the third group, soft dentin in the caries lesions was removed, and then a 5% NaF varnish was applied to the caries lesions every 3 mos. Children in the fourth group had NaF applied every 3 mos without prior removal of caries. Water was painted onto the carious teeth in the last group of children. Follow-up examinations were carried out every 6 mos after baseline by the same examiner, who did not know the subjects group assignments. Caries was diagnosed at cavitation level and explored with a sharp sickle-shaped probe at the center of the cavity. A tooth surface could be recorded as sound, caries-active, caries-arrested, filled, or missing.
Statistical Analysis
Since the arrest times were not totally independent, 2 additive random effects were included in the model, to account for the clustering effects of the carious tooth surfaces in the same childs mouth, and of children attending the same kindergarten, namely, Bj (j = 1, 2, ... , 367) and Ck (k = 1, 2, ... , 8), respectively. The random effects Bj and Ck were assumed to follow the N(0,
and
respectively, where
and X are the observed covariates, such as group allocation. In this study, all the covariates X are coded as 1 or 0, indicating the presence or absence of a certain characteristic or treatment. Typically, a positive regression coefficient β corresponds to a higher risk of the failure being observed among those with the associated characteristic, relative to those without. Alternatively, it is natural to report a more intuitive measure, namely, the relative risks [RR = exp(β), RR > 1 indicates a higher risk of failure]. In this study, a positive β or RR > 1 corresponds to a higher chance of arrest of active dentin caries and thus expects a shorter arrest time.
The shape parameter r characterizes the shape of the distribution (r > 1 for increasing failure rate; r < 1 for decreasing failure rate; and r = 1 for constant failure rate). With the above model, the intra-cluster correlation between the logarithmic arrest times from the same child and from children attending the same kindergarten can be estimated by [ With the arrest time Tijk being interval-censored in the interval (t1, t2), conditioned on the random effects Bj and Ck, the contribution to the likelihood can be expressed as
Unconditioning the random effects is very often an intractable task in the interval-censored set-up, particularly in multilevel modeling (with more than one random effect). Hence, the Bayesian approach with MCMC algorithms was adopted, and the analysis was carried out with the software WinBUGS, version 1.3, in which Gibbs sampler was used for the generation of samples (Spiegelhalter et al., 1999). A three-level model was considered, with tooth surfaces as level 1, children as level 2, and kindergartens as level 3. In the estimation of the parameters, the first 5000 simulations were treated as burn-ins and discarded, while the estimation was based on the next 10,000 simulations. Non-informative priors were adopted in this analysis, since we did not want to impose any prior beliefs on the effects of the treatments. A graphed presentation of the model used in the analysis and the model statements used for the programming are shown in the Appendix for technical reference (readers could skip this without loss of continuity).
A total of 375 children, 209 boys (56%) and 166 girls (44%), with a mean age of 4.1 yrs (SD = 0.9) was included in the study. The mean dmfs of the children was 4.7, and the mean number of active-caries surfaces was 4.0 (Table 1
In the analysis, 1483 surfaces with dentin caries from 367 children were included. Results from 10,000 simulations, generated from the posterior distributions of the parameter estimates (Table 2 2school = 0.025; 95% credible interval = (0.001, 0.151), Table 2 2child = 2.394; 95% credible interval = (1.822, 3.066), Table 2
When we compared the results obtained from the analysis performed at the tooth-surface level using a Bayesian approach in analyzing clustered interval-censored data in this paper with the analysis performed at the subject level, reported previously (Lo et al., 2001; Chu et al., 2002), we found, in both analyses, that SDF solution applied annually to active caries lesions was more effective in arresting caries than was NaF applied every 3 mos. However, with the analysis performed at the tooth-surface level, it was also found that having the soft caries removed could shorten the arrest time. Since the correlation among the arrest times of caries lesions in tooth surfaces from the same child was found to be very strong, any analysis ignoring this correlation would yield biased or invalid results. When survival analysis is performed at the tooth-surface level, it is possible to estimate the median time for a caries-active tooth surface to become arrested. This provides more information on the effectiveness of the agents. Recently, in dental research, several approaches have been proposed for handling clustered survival data with exact failure times (Chuang et al., 2002a,b; Gilthorpe et al., 2002), or for handling clustered interval-censored data (Härkänen et al., 2000, 2002; Hannigan et al., 2001; Bogaerts et al., 2002). Both the frequentist and the Bayesian approaches have been used, different models (frailty vs. marginal) have been suggested, and different software packages (SAS, S-plus) have been recommended. To our knowledge, this is the first study in dental research to use the software package WinBUGS for analyzing multilevel (clustered) interval-censored data, and to report the correlations among the failure times. Multilevel modeling in terms of multivariate frailty can also be applied if the data structure is much more complicated—for instance, multi-stage clustering or nested design in a randomized controlled trial. The Bayesian approach rests on an essentially subjective interpretation of the observed data in the light of external evidence, judgment, and past experiences (i.e., the informative priors) and then to derive the conclusion in a manner that fits naturally with the clinical decision-making process (Spiegelhalter et al., 1994). It is well-known that turning informally expressed opinion into a mathematical prior distribution is perhaps the most difficult aspect of Bayesian analysis and therefore should be introduced with caution (Spiegelhalter, 2001). In situations where informative priors are unavailable, or to provide a kind of objective Bayesian analysis free from subjectivity, non-informative priors can be adopted, as in this study. Bayesian inference has several advantages over the frequentist approaches, particularly in the flexibility of model-building for complex data. Moreover, for many models, frequentist inference can be obtained as a special case of Bayesian inference with the use of non-informative priors (Ibrahim et al., 2001). The Bayesian approach enables us to make exact inference based on the posterior distribution for any sample size, whereas the frequentist approach relies heavily on the large sample approximation, and there is always the issue of whether the sample size is large enough for the approximation to be valid (Ibrahim et al., 2001). There is a danger that the additional complexity of Bayesian methods could lead to improper data analysis if it is not used correctly. In addition, software for implementation of Bayesian methods is still limited in user-friendliness (Spiegelhalter et al., 2004). Bayesian inference Using Gibbs Sampling (BUGS or WinBUGS) is a piece of freely available computer software for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo (MCMC) methods (Spiegelhalter et al., 1999). It is reasonably easy to use and comes with a wide range of examples (Spiegelhalter et al., 1996a,b). However, much technical statistical knowledge is required for it to be used correctly. With the abovementioned advantages and the availability of the software WinBUGS, analysis of clustered multilevel interval-censored data is made possible and simple. In conclusion, the annual application of silver diamine fluoride to caries lesions, and caries removal before the application, were found to have shortened the arrest time.
The work described in this paper was supported by a grant from the Research Grants Council of the Hong Kong SAR, China (Project No. HKU 7026/00E).
A supplemental appendix to this article is published electronically only at http://www.dentalresearch.org. Received for publication November 13, 2003. Revision received April 28, 2005. Accepted for publication May 31, 2005.
Journal of Dental Research, Vol. 84, No. 9,
817-821 (2005) This article has been cited by other articles:
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(unknown quantities that are of interest), and assuming that
(
2child) and N(0,
ijk(t) functions 





