| Sign In to gain access to subscriptions and/or personal tools. |
Applying Modern Survival Analysis Methods to Longitudinal Dental Caries Studies
1 Rolf Nevanlinna Institute, PO Box 4, FIN-00014 University of Helsinki, Finland; and Correspondence: *corresponding author, Tommi.Harkanen{at}RNI.helsinki.fi
Before the 1960s, tooth-specific caries risk was reported to be highest at 2 to 4 years after eruption. We studied the tooth-specific caries risk in three contemporary age cohorts in Finland. All together, 4072 boys and girls were followed annually from age 6 to age 18+ years in three age cohorts born in the 1960s, 1970s, and 1980s. We used a survival model and Bayesian inferential methods in the statistical analyses to establish the secular changes during this period. The analysis was based on the caries risk in individual teeth as a function of tooth age instead of summary measures, such as DMFS values. Our first finding was a marked overall decrease of caries. Moreover, analyses of the 1960 and 1970 cohorts revealed that the risk in molar teeth was highest immediately after eruption; in the youngest cohort, however, the risks of individual teeth were so low that no such dependencies on tooth age could be established.
Key Words: dental caries Bayesian inference intensity model measurement model survival analysis
Well-known indices for dental health, such as dmf(s), are summary measures of the entire dentition and therefore not particularly well-suited for describing the caries process at the level of an individual tooth or surface. This is because such indices do not distinguish between different subjects, different teeth, variations in the numbers of teeth at risk, or dental age. The classic reference for dental health determination based on dental age is the study by carlos and gittelsohn (1965) in which life table methods were used to estimate the risk of tooth failure due to caries separately for each tooth and as a function of tooth age. They reported that the caries risk was highest about two years after eruption in the second permanent molars, and one or two years later in all the other teeth. There has been a significant improvement in dental health in several economically developed countries. Both the level and the pattern of caries attack have changed: Caries in canines and mandibular incisors has virtually disappeared in young adults (Suni et al., 1998), and there is no longer any post-eruptive lag period for molar caries (Larmas et al., 1995). In this study, the teeth of Finnish boys and girls in three contemporary birth cohorts were compared by means of a Bayesian intensity model. (A more general discussion of the application of Bayesian ideas to dental research is given in Gilthorpe et al., 2000.) Our model is somewhat simpler than that described in Härkänen et al. (2000), corresponding to our focus on secular trends rather than on individuals in a heterogeneous population. We addressed the following main hypotheses: (i) There is an overall decrease of dental caries from the older age cohort, born in the 1960s, to the younger cohorts, born in the 1970s and 1980s; (ii) in molar teeth, the highest risk of failure occurs immediately after eruption in the 1960 and 1970 cohorts; and (iii) in the 1980 cohort, the risk is so low that individual variation between different teeth can no longer be established.
The present analysis is based on several data sets collected in the course of normal dental care in several municipal health centers in Finland, with annual examinations as described by Larmas et al. (1995; Appendix, www.dentalresearch.org). The use of dental records for scientific analyses was approved by the Ethical Committee of the Medical Faculty of the University of Oulu. We considered three age cohorts of boys and girls (n = 4072) born in the 1960s, 1970s, and 1980s and followed annually from age 6 to age 18+ years (see Table
The statistical model and estimation have been described by Härkänen et al. (2000). Here, only a short description without mathematical details is provided. The distribution of failure times is expressed in terms of positive-valued intensity functions (t), where t denotes subject age in years. Then, (t) dt is (approximately) the probability that a tooth which has not failed by age t will fail during the age interval [t, t+dt), where dt is a small positive constant. After the failure, the intensity function has the value of zero, which is compatible with the fact that a previously failed tooth is no longer intact and therefore cannot fail again "for the first time". The intensity is also zero before tooth emergence, because teeth cannot fail prior to their eruption. High intensity function values reflect a high risk of failure.
Let index i correspond to a subject, j to a tooth, k
b} = 1 for a<t b, and 1{a<t b} = 0 otherwise, ensuring that the intensity function is zero before eruption and again after failure. A similar model is assumed for tooth eruption times:
i models the between-subject variation: In some subjects, teeth erupt later (earlier), and for those subjects, i gets larger (smaller) values. Although a weak secular trend in the median age of eruption of some teeth was seen between the cohorts, it was assumed that the eruption processes were similar in the three birth cohorts, but possibly different between the genders (Virtanen et al., 1994). The value of the survival function at time t represents the probability that a newly erupted tooth will survive for at least t years after eruption, that is, bij>aij+t (Andersen et al., 1993). The survival function depends on the cumulative risk experienced, being a function of the integrated hazard rate, and it is therefore not sensitive to local changes in the risk level. Consequently, we use both hazard rates and survival functions in reporting our empirical results. In Bayesian inference, the posterior distribution of model parameters can be viewed as an expression of the uncertainty regarding their true values, given the prior information and the observed evidence contained in the data (Gelman et al., 1995). Due to the large amount of data in this study, the choice of the prior distributions of the parameters h and f is likely to have a negligible effect on the results. The software for the estimation is available at http://www.rni.helsinki.fi/~tth/bite.html. The eruption and failure times were interval-censored in the data: In each case, only the first examination time at which a newly erupted tooth or a new failure was found was recorded. Since the models (1) and (2) are expressed in terms of exact eruption and failure times, the original interval-censored data were augmented (Tanner and Wong, 1987) by the sampling of such times from the Poisson likelihoods defined by (1) and (2). The data augmentation as well as the estimation of the functions h and f in (1) and (2) were done iteratively by the generation of a large sample of the missing eruption and failure times and parameter values from the targeted posterior distribution of these variables, given the data. All posterior probabilities and expectations can then be approximated by suitable averages computed from that sample. For example, the posterior expectation of the unknown survival function corresponds to the mean value of that function, given the statistical model and the evidence contained in the data. For simplicity, we have here chosen to ignore in our statistical model the frailty parameters, which were used by Härkänen et al. (2000) for modeling the within-subject correlation of the failure times. In the test runs, this had only a small effect on the width of the credibility intervals.
Our results show that caries has decreased dramatically during the study period, in view of the tooth survival probabilities in the three cohorts. The difference of survival functions between the 1960 and 1970 cohorts was somewhat larger than that between the 1970 and 1980 cohorts (Fig. 1
The molar teeth also had much larger failure risks in terms of posterior expectations of the hazard rates in the 1960 cohort (Fig. 3
The classic reference for longitudinal dental caries studies in adolescents has been Carlos and Gittelsohn (1965). Their report was based on a clinical trial conducted in two towns (n = 6400) in the state of New York, USA. Their survival estimates were found to be similar to our estimates (Fig. 1
It is not clear how many of the differences in the observed patterns of caries attack between the 1960 and 1970 cohorts compared with the 1980 cohort are real changes in caries, and to what extent they should be attributed to new practices in disease prevention and treatment. The emphasis placed on preventive treatment, such as school-based fluoride rinses, the use of fissure sealants, increased use of fluoridated tooth paste, etc., in Finland had reduced the caries risk in the 1970 and especially in the 1980 cohorts. It is also probable that, as caries has diminished, the decisions to restore have been postponed (Espelid et al., 2001). The peculiar difference in the failure risk in some molar teeth (Figs. 1, 2
When the tooth-specific hazard rates were considered as functions of tooth age, some striking differences were found between our results (Fig. 3
The differences raise some questions which are difficult to answer, since we do not have access to the data of Carlos and Gittelsohn (1965). Nevertheless, some of their results can be questioned: First, the follow-up time was divided into four-month computational intervals. The risk of caries was then estimated by dividing the number of failures by the subjects' (estimated) total time at risk during that interval. In these computations, interval-censored (unobserved) occurrence times were replaced by the mid-points of the dental examination intervals, which were approximately one year. This may have surprising consequences in the life table estimates, particularly if the computational intervals are much shorter than the examination intervals. If both tooth eruption and caries attack were registered as having occurred during the same examination interval, then the unknown true value of the tooth lifetime was approximated by a value equal to half the length of that interval. As a consequence, the first computational interval can contain a failure time of a tooth only if the corresponding examination interval was less than eight months, and thus, short lifetimes are recorded (approximately) correctly in the life table estimation only if they are associated with short examination intervals. It seems that such intervals were quite rare in the study by Carlos and Gittelsohn (1965). This alone can explain why the short lifetimes were systematically associated with low risk estimates, which also explains partly the "rapid rise in susceptibility" shown by the curves in their Figs. 1 and 2 Second, Table 1 (in Carlos and Gittelsohn, 1965) indicates that only 2104 children out of approximately 7400 were included in the life table analysis. All permanent teeth that had erupted before the first examination were excluded. It is also unclear why some children had only one or two examinations. It is questionable whether appropriate life table techniques exist for dealing with such situations. The strength of our method is that it does not require intervals of equal length, and that it can also utilize data on children who were examined only a few times. From a clinical point of view, it is important to notice that our results show that the highest risk of caries attack in the molar teeth occurred soon after eruption in the 1960 and 1970 cohorts, not 2 to 4 years after eruption. Therefore, the most efficient caries-preventive measures for molar teeth were needed during the years immediately following tooth eruption. Similar results can be seen in all the other analyses of carious attack that have been conducted on Finnish children and adolescents after the 1960s (Larmas et al., 1995; Virtanen and Larmas, 1995; Virtanen et al., 1997; Suni et al., 1998). This high-risk period seems to occur at least when the caries prevalence is as high as it was in the USA before 1960 (Carlos and Gittelsohn, 1965) and in Finland in the 1960s. Although Bayesian intensity models are not the only method for analyzing survival data (cf. Hujoel et al., 1998; Hannigan et al., 2001), they constitute a unified framework for analyzing left-, interval-, or right-censored multivariate data. Frequentist methods could also be used, but non-parametric analysis of interval-censored data is not equally straightforward in that case. From a methodological perspective, the present study supports the proposition that Bayesian modeling and data analysis will become familiar features in the dental literature in the future (Gilthorpe et al., 2000). The drawback of this methodology is that the computational burden in estimation is considerably heavier than when traditional frequentist methods are applied.
This study is the Finnish part of the EUDENT project (Biomed contract BMH4-CT96-1541) of the European Union. In addition, T. Härkänen received funding from the ComBi graduate school of the University of Helsinki. Received for publication December 29, 2000. Revision received November 26, 2001. Accepted for publication November 30, 2001.
Journal of Dental Research, Vol. 81, No. 2,
144-148 (2002) This article has been cited by other articles:
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(t), where t denotes subject age in years. Then,
{1960, 1970, 1980} to a birth cohort, and l 
b} = 1 for a<t
i models the between-subject variation: In some subjects, teeth erupt later (earlier), and for those subjects, 



