Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Journal of Dental Research
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Blicher, B.
Right arrow Articles by Eke, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blicher, B.
Right arrow Articles by Eke, P.
Right arrowPubmed/NCBI databases
Medline Plus Health Information
*Dental Health
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

CRITICAL REVIEWS IN ORAL BIOLOGY & MEDICINE

Validation of Self-reported Periodontal Disease: A Systematic Review

B. Blicher1, K. Joshipura1,*,2 and P. Eke3

1 Department of Oral Health Policy and Epidemiology, Harvard School of Dental Medicine, 188 Longwood Avenue, Boston, MA 02115, USA;
2 Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA; and 3 Division of Oral Health, Centers for Disease Control and Prevention, Atlanta, GA, USA;

Correspondence: * corresponding author, kjoshipura{at}hsdm.harvard.edu


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Self-report is an efficient and accepted means of assessing many population characteristics, risk factors, and diseases, but has rarely been used for periodontal disease (chronic periodontitis). The availability of valid self-reported measures of periodontal disease would facilitate epidemiologic studies on a much larger scale, allow for integration of new studies of periodontal disease within large ongoing studies, and facilitate lower-cost population surveillance of periodontitis. Several studies have been conducted to validate self-reported measures for periodontal disease, but results have been inconsistent. In this report, we conducted a systematic review of the validation studies. We reviewed the 16 studies that assessed the validity of self-reported periodontal and gingivitis measures against clinical gold standards. Seven of the studies included self-reported measures specific to gingivitis, four included measures only for periodontitis, and five included both gingivitis and periodontal measures. Three of the studies used a self-assessment method where they provided the patient with a detailed manual for performing a self-exam. The remaining 13 studies asked participants to self-report symptoms, presence of periodontal disease itself, or their recollection of a dental health professional diagnosing them or providing treatment for periodontal disease. The review indicates that some measures showed promise, but results varied across populations and self-reported measures. One example of a good measure is, "Has any dentist/hygienist told you that you have deep pockets?", which had a sensitivity of 55%, a specificity of 90%, positive predictive value of 77%, and negative predictive value of 75% against clinical pocket depth. Higher validity could be potentially obtained by the use of combinations of several self-reported questions and other predictors of periodontal disease.

Key Words: Systematic review • self-report • validity • periodontal disease • gingivitis.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Self-report is an efficient and accepted means of assessing many diseases, such as cancer, cardiovascular disease (Newell et al., 1999), and juvenile rheumatoid arthritis (Wright et al., 1994), as well as risk factors for disease, such as diet (Willett, 1990; Rimm et al., 1992), physical activity (Wolf et al., 1994), high blood pressure (Tormo et al., 2000), and general health (Sheridan et al., 1998). In the United States, The Behavioral Risk Factor Surveillance System (BRFSS), a self-report survey system established in 1984 by the Centers for Disease Control and Prevention (CDC), is used extensively at the state and local levels to survey and track trends in diseases such as heart disease, cancer, stroke, and diabetes, and risk factors such as obesity, and has been used in recent years to monitor trends in dental visits, dental cleanings, and tooth loss (Battelle Memorial Institute, 1999). Self-report is used for overall oral health in other studies as well. For example, the Geriatric Oral Health Assessment Index (GOHAI) has been validated for use in populations diverse in ethnicity and age (Atchison and Dolan, 1990; Atchison et al., 1998; Tubert-Jeannin et al., 2003). Nonetheless, self-report has rarely been used for periodontal disease (chronic periodontitis). Investigators have questioned whether self-report can be used for this purpose. Studies evaluating the validity of self-reported measures for periodontal disease and gingivitis have reported inconsistent results.

The development, implementation, and evaluation of public health interventions for periodontal disease will require that the diseases be monitored at several levels of the population. Current measures of periodontal disease are extremely resource-intensive and cannot be used in several state-based surveillance systems. The existence and use of valid, low-cost, and low-resource self-reported measures of periodontal disease would be beneficial in a variety of ways. It would facilitate epidemiological studies of periodontal disease on a much larger scale than is feasible with the present clinical measures, since much larger study populations could be reached by surveys rather than by clinical examination. Additionally, questions regarding periodontal disease could easily be added to ongoing studies to evaluate associations with other diseases and conditions. The use of self-report would allow for an easier and low-cost method of obtaining data for research and would support the creation of oral health programs (Siegal et al., 1988; Kallio, 1996). Self-assessment can additionally serve as a motivational tool for good oral hygiene (Kallio, 1996). Finally, self-reported measures would allow for surveillance of the periodontal condition of populations over time, in national, state, or regional surveillance programs.

To date, no comprehensive review of the field has been published. In this report, we have reviewed all of the studies validating self-report of periodontal or gingival diseases. We did not necessarily expect to find a clear "yes" or "no" answer as to whether self-reported periodontal measures were valid. Our objective was not only to summarize the validity of different self-reported measures in different populations, but also to identify methods and measures which show promise for use and/or further development, testing, and refinement.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
We sought to identify all studies that evaluated the validity of self-report of periodontal and gingival diseases using clinical measures as the standard.

Literature searches were performed via the Ovid Web Gateway (2000) Internet interface for MEDLINE. The search strategy (Table 1Go) was developed by the incorporation of dental vocabulary identified from the MEDLINE Medical Subject Heading index, as well as key words. MEDLINE was searched from 1966 to June 14, 2004, by several groupings of terms, each group combined by the Boolean term "OR". Group 1 consisted of terms describing periodontal disease and gingivitis (periodontal disease, periodontitis, periodontal, gingival disease, gingivitis, and gingival), Group 2 contained terms capturing self-report (self-report, self-reported, self-assessment, and questionnaire), and Group 3 contained terms related to validation (validity, validation, comparison, and compared). Combining all three groups with "AND" resulted in 207 studies. We scanned the titles and abstracts of these studies manually to identify the studies that actually validated self-report of periodontal and/or gingival disease. Studies that used self-report but did not validate these measures, or that validated only overall or composite oral health measures, were discarded. Our prior knowledge of the literature in the field indicated to us that some studies were not included among the 10 studies identified by this search. Thus, we decided to broaden the search by combining Groups 1 and 2, resulting in a total of 749 studies. From these studies, we found 15 studies that fit our criteria. Bibliographies of these 15 studies were scanned for additional articles, which led us to one additional study, giving a total of 16 studies for review (Table 2Go).


View this table:
[in this window]
[in a new window]

 
Table 1. MEDLINE Search Strategy
 

View this table:
[in this window]
[in a new window]

 
Table 2. Characteristics of Included Studies
 
We reviewed each of these 16 studies, and extracted data from each study in the following fields: population characteristics and sampling criteria, method of self-report (self-assessment, questionnaire, interview), self-reported questions, clinical gold standards, and results of the validation study. Information regarding the patients’ self-reported signs, symptoms, perceptions, or knowledge of gingivitis or periodontal disease or treatment was included, whereas measures regarding perceived treatment needs or family history were discarded. A single abstracter performed the first abstraction, with consultation from a second author, as is accepted in the literature (Horvitz-Lennon et al., 2001). The second author subsequently verified all the results abstracted. Where there were discrepancies, they were discussed and resolved.

We synthesized the information regarding the studies and the results into three tables. Self-reported questions are grouped according to topic, and measures are described according to the specific wording of the questions to the extent provided by the authors, along with the clinical gold standards that were used for validating the self-reported questions. Results presented are generally as reported by the authors, and include p-values, percent agreement, correlation coefficients, regression coefficients, sensitivity and specificity, predictive values, and simple descriptive measures. We have calculated additional statistics based on the data provided in the manuscript when needed and possible, as noted. We were unable to perform summary analyses, such as ROC curves, of the studies under review, due to the inconsistency of statistical measures reported.

We considered a measure to have good validity when the sum of either sensitivity plus specificity or positive plus negative predictive values was 120% or above. This value was arbitrarily chosen; however, it represents the levels of the statistics that are accepted as good validity. Changing the threshold of the gold standard definition would increase specificity at the cost of sensitivity, or vice versa. In this context of validation of measures that could be used for etiologic studies, surveys, or surveillance, it is hard to know the relative importance of sensitivity and specificity. Hence, it is important to look at the combination of sensitivity plus specificity or predictive values.

The 16 studies gave us a large array of results, and we have presented only the most pertinent results from each study (Tables 3Go, 4Go). Many studies validated each self-report question with more than one clinical measure. Although we present multiple clinical measures for single self-reported measures where appropriate, when multiple similar clinical measures yielded similar results, we display only one of these clinical measures. For example, questions about periodontal status and periodontal surgery were compared with three measures of radiographic bone loss: "above median of average bone loss", "above median % of sites with score ≥ 2 mm", and "above median % of sites with score ≥ 3 mm" (Joshipura et al., 2002). The first two measures are more similar and gave comparable predictive values. Thus, we present the results using only the second and third clinical measures for each self-reported measure (Table 3Go, questions 5 and 20).


View this table:
[in this window]
[in a new window]

 
Table 3. Results from Validation of Self-reported Periodontal Disease (AQ)
 

View this table:
[in this window]
[in a new window]

 
Table 4. Results from Validation of Self-reported Gingivitis
 

    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The 16 studies are briefly summarized in Table 2Go, which describes the sample size used in the validation study, the population, the sampling methods used, and the method for self-report for each study. The sample size ranged from 63 to 1333 participants. The first row shows a study, by Glavind and Attström (1979), consisting of a self-assessment of periodontal status by 108 individuals, aged 19–75 years, who were recruited from the Royal Dental College in Århus, Denmark, among patients seeking dental treatment for periodontal disease, caries, or prosthetics, and for other reasons, such as referral from their dentist.

The studies varied by population characteristics. Six of the studies were conducted among school-age children (Nakashima et al., 1988, 1989; Schwarz, 1989; Kallio et al., 1994; Kallio, 1996; Taani and Alhaija, 2003), while ten were conducted among adults. Of the 16 reports, only three, all by the same group, were conducted in the United States (Joshipura et al., 1996, 2002; Pitiphat et al., 2002). Two of these reports consisted of populations of health professionals—one a cohort of dentists (Joshipura et al., 1996), and the other a cohort of non-dentist health professionals (Joshipura et al., 2002). The other US report (Pitiphat et al., 2002) performed separate validations among two different populations—one a group of veterans, and another, consecutive patients at a dental school clinic. Thus, there were actually 17 separate populations evaluated in the 16 publications.

Of the 16 reports, three used a specified self-assessment method (Glavind and Attström, 1979; Kallio et al., 1990; Kallio, 1996). In these studies, participants were given written manuals detailing the procedures they should use for self-assessment and were asked to report their findings on the forms provided.

The remaining 13 publications assessed symptoms or awareness of disease conditions by means of a questionnaire for self-report. The questionnaires were either administered in writing at the time of the patient visit (Kallio et al., 1994; Gilbert and Nuttall, 1999; Taguchi et al., 1999; Pitiphat et al., 2002; Taani and Alhaija, 2003), distributed by mail (Tervonen and Knuuttila, 1988; Schwarz, 1989; Joshipura et al., 1996, 2002; Unell et al., 1997; Buhlin et al., 2002), or given as an interview, conducted in person (Nakashima et al., 1988, 1989) or by telephone (Pitiphat et al., 2002).

The results of the validation studies are presented in Table 3Go for periodontal disease and Table 4Go for gingivitis. Eight studies validated self-reported periodontal disease, and 13 studies validated self-reported gingivitis and are included in both Tables. Five of these studies validated both self-reported periodontal disease and gingivitis (Glavind and Attström, 1979; Nakashima et al., 1988; Tervonen and Knuuttila, 1988; Unell et al., 1997; Gilbert and Nuttall, 1999; Buhlin et al., 2002).

Results showing good validity are indicated by gray highlighting in Tables 3Go and 4Go. An outline box indicates that the statistics given by the authors were adequate but the validity was not satisfactory, and the absence of highlighting or an outline box indicates that inadequate statistics were given for validity to be determined, and thus we were unable to evaluate these results.

As seen in Table 3Go, self-reported measures for periodontal disease were grouped into several categories: disease awareness/perception as defined by the study participant (periodontal disease or periodontal disease with bone loss), knowledge of professional diagnosis of periodontal disease, severity of periodontal disease, symptoms of periodontal disease (tooth mobility, recession), and treatment (periodontal treatment, periodontal surgery).

Question 1 in Table 3Go is from the study by Tervonen and Knuutilla (1988). Patients were asked, "Do you have gum disease?" Presence of gum disease was reported by 20% of the subjects, and clinical disease assessed by CPITN was found in 38%. No other statistics were reported; thus, the measure is not marked by highlighting or an outline box and does not give sufficient data for us to determine validity.

Question 4 in Table 3Go is from the study by Joshipura et al.(1996). Patients were asked, "Have you had periodontal disease with bone loss?" Validation by the clinical measure, " ≥ 2 sites with radiographic bone loss > 2 mm or complete loss of crestal lamina dura", gave a positive predictive value of 76% and a negative predictive value of 74%. That is, 76% of those who self-reported disease had two or more sites with bone loss greater than 2 mm, and 74% of those who self-reported no disease had fewer than 2 sites with bone loss greater than 2 mm. This measure is marked by gray highlighting, indicating good validity.

Often, questions were validated by more than one clinical gold standard. Question 2 in Table 3Go is from the study by Gilbert and Nuttall (1999). Self-reported gum disease was validated by two different clinical measures. Validation against clinical pocket depth greater than 4 mm showed sensitivity of 32% and specificity of 93%. That is, 32% of those who had any pockets greater than 4 mm self-reported gum disease, and 93% of those who had no pockets greater than 4 mm reported no gum disease. Overall, these results show good validity of the measure. Validation against mobility, defined as horizontal mobility greater than 0.2 mm, showed sensitivity of 26% and specificity of 91%.

As seen in Table 4Go, self-reported measures for gingivitis may be divided into categories similar to those for periodontal disease. The studies looked at self-reported presence of gingivitis, knowledge of professional diagnosis of gingivitis, and symptoms of gingivitis (bleeding from gums, bleeding on toothbrushing, bleeding on toothpicking, inflammation of gums). Classification of the clinical measure as periodontal disease or gingivitis was done based on the self-reported measure; therefore, several of the clinical gold standards used for the validation are the same as those used for the self-reported measures of periodontal disease.

Question 1 in Table 4Go is from the study by Schwarz (1989). Participants were asked, "Do you have gum problems?" Participants who self-reported "no gum problems" showed a gingival bleeding index (GBI%) of 6.1%, those who self-reported "gum problems now and then" showed a GBI% = 10.1%, and those participants who self-reported "gum problems often" showed a GBI% of 24.5%. No other statistics were reported; thus, the study is not highlighted or outlined and does not give sufficient data for us to determine validity.

Two measures for gingivitis showed good validity, as indicated by gray highlighting. Both measures were from the category "Bleeding from Gums". One measure was from the study by Gilbert and Nuttall (1999). The measure, "Gums have bled recently" (question 9 in Table 4Go), was validated by clinical bleeding (> 40% sites bled), and resulted in sensitivity of 35% and specificity of 88%. The second self-report measure with good validity was from the study by Buhlin et al.(2002). "Do your gums usually bleed?" (question 10 in Table 4Go) was validated with clinical presence of bleeding on probing (BOP), and resulted in sensitivity of 42%, specificity of 76%, positive predictive value of 53%, and negative predictive value of 67%.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
As indicated by gray highlighting in Tables 3Go and 4Go, some studies appear to show good validity. Since the universally accepted measures of validity [sensitivity, specificity, or predictive values) (Bossuyt et al., 2003)] were needed for the classification to be made, those studies that reported inadequate statistics could not be classified. If appropriate statistics were available, classification of studies was based on our arbitrarily pre-defined criteria (sum of sensitivity plus specificity or positive plus negative predictive values greater than 120%), since there are no universal criteria or universal definitions of valid measures.

Sixteen (80%) of the 20 self-reported measures for periodontal disease in Table 3Go provided results in the appropriate format for validity to be determined (sensitivity, specificity, or predictive values). Thirteen (81%) of these 16 self-reported measures were valid. Five (38%) of the 13 valid measures were valid against more than one clinical measure. For the 24 self-reported measures of gingivitis in Table 4Go, eight (33%) presented results in the appropriate format for validity to be determined, and only two (25%) of these eight self-reported measures were valid.

The self-report questions repeated in two or more studies often showed conflicting results. For example, self-reported bleeding from gums was reported in six studies, and appropriate statistics were given for three of these. Of these, two studies (Gilbert and Nuttall, 1999; Buhlin et al., 2002) found that measure to be valid, and one did not. The two studies that showed good validity used different clinical measures, although both were measures of gingival bleeding, and had slight differences in wording across the self-reported measures ("gums have bled sometimes" and "gums have bled recently"). However, there does not seem to be any obvious factor explaining the discrepancy between studies that did and those that did not find self-reported measures to be valid, though many factors are likely to have an effect.

Critics often question whether self-report is a valid measure at all. Self-report is considered a suitable measure in routine use for many different conditions and diseases. For example, the Behavioral Risk Factor Surveillance System (BRFSS) assessed self-reported diabetes. Studies have compared self-reported estimates of diabetes from BRFSS, with fasting serum glucose levels and medical records as gold standards (Bowlin et al., 1993; Martin et al., 2000; Nelson et al., 2001). The sensitivity values ranged from 67% to 80%, and specificity ranged from 98% to 99%, suggesting that persons without diabetes provided valid answers. Based on data for 1995, estimates for diabetes prevalence were 4.7% in BRFSS and compared well with a prevalence of 4.5% in the National Health Interview Survey, which used clinical information (Battelle Memorial Institute, 1999). Thus, self-reported diabetes appears to be valid in this context.

Other measures used in the BRFSS show validity similar to, or even weaker than, the measures for self-reported periodontal disease and gingivitis. Validation studies of self-reported hypercholesterolemia compared with clinical measurements resulted in sensitivity of 43% and specificity of 86% (sum = 129%). Comparing self-reported blood pressure with medical records yielded a sensitivity of 99% and specificity of 23% (sum = 122%). In contrast, the best measure for self-reported periodontal disease was by Buhlin et al.(2002), asking, "Has any dentist/hygienist told you that you have deep pockets?" (question 7, Table 3Go). Validation against pocket depth (number of pockets ≥ 4 mm) showed sensitivity of 55% and specificity of 90% (sum = 145%). Hence, several of the measures in our review give validation results similar to, or even better than, those given by other, more accepted, self-report measures outside of oral health.

To convey the extent of the impact of misclassification, one would need to compare the true prevalence, as defined by the clinical gold standards, with the observed prevalence from the self-reported measures. For example, in a hypothetical population of US adults, and a single self-report question with positive predictive value of 76% and negative predictive value of 74%, if the true prevalence is 35%, then the prevalence in the self-reports would be 19% (Barron, 1977; Flegal et al., 1986; Joshipura, 1995). Thus, there would be an underestimation of periodontal disease by 46%. Hence, if we use self-reports for estimating prevalence, we would need to use the formulae for correcting the prevalence estimates from self-report to arrive at the true prevalence.

Validity is likely to vary across the types of questions asked. Severe measures of periodontal disease, such as mobility, may be easier for the patients to notice in themselves. For example, self-reported "Think teeth loose or wobbly" [question 12 in Table 3Go (Gilbert and Nuttall, 1999)] shows good validity by clinical mobility. Mobility is related to severe periodontal disease and should be easy to notice, and we would expect that it is a valid measure.

Three measures in our review evaluated the validity of self-reported mobility. One measure, self-assessed highest recorded tooth mobility score compared with professionally determined mobility (Glavind and Attström, 1979), showed good validity, based on our calculations from the authors’ reported statistics, with sensitivity of 92% and specificity of 53% (SN + SP = 145%). The second measure, "think teeth loose or wobbly", compared with clinical mobility (Gilbert and Nuttall, 1999) showed good overall validity, based on our criteria of sensitivity + specificity > 120%. Although the sensitivity is only about 30% compared with the clinical gold standards, the > 90% specificity indicates that this can in fact be a reasonable measure, especially if combined with another measure that shows very high sensitivity. Hence, mobility does seem to be a relatively good measure. The third measure included mobility but combined it with bleeding, "self-assessed number of pockets exhibiting bleeding or mobility compared to (sic) clinically determined pocket depth" (Glavind and Attström, 1979), and did not report appropriate statistics for us to determine validity.

Not only is the idea behind the question important, but also the specific wording used plays a role in making a patient understand what is being asked, and affects his or her ability to answer the question. For example, asking a patient, "Do you have periodontal disease?" is different from asking him or her, "Do you have gum problems?" or even, "Do you have gum disease?". Depending on the patient’s background, different terms may have very different meanings. Wording may also help the patient to answer a question. Self-reported "Told by dentist/hygienist have gum disease" may trigger the memory of being in the dentist’s office, enhancing the patient’s ability to answer the self-report accurately. Accordingly, all three measures about professional diagnosis of periodontal disease show good validity (questions 7, 8, and 9 in Table 3Go). However, people who do not have access to dental care would feel uneasy answering the questions asking them to recall if a dentist had diagnosed them with periodontal disease. Additional details in the self-report question offer the patient a greater opportunity to recognize the disease in him- or herself, perhaps based on recognition of some key phrase that a health professional had previously discussed with them. Self-report measures asking about "gum disease" (questions 1, 2, and 3 in Table 3Go) show much less validity than those asking about periodontal disease with bone loss (questions 4, 5, and 6 in Table 3Go).

The manner in which the self-reported measure is asked or reported by the patient may also play a role in determining its validity. Having a patient follow a self-assessment protocol requires more discipline and skill, and these measures might be biased by the types of patients who can adhere to the protocol. In the reports compiled, self-assessment measures were used to determine toothpick- or toothbrush-induced bleeding (Glavind and Attström, 1979; Kallio et al., 1990; Kallio, 1996) or tooth mobility (Glavind and Attström, 1979). However, we cannot infer anything regarding the validity of self-assessment from this report: Two studies (Glavind and Attström, 1979; Kallio et al., 1990) did not report statistics appropriately for us to determine validity of their studies, and the other study (Kallio, 1996) did not show good validity. Written questionnaires might be better measures for some populations, although education level certainly determines the groups for which this would work best. It is likely that patient motivation and concentration may be lower when completing a mailed questionnaire vs. one given in person at the time of a visit to the dentist’s office. Six of the seven studies using a written questionnaire at the time of patient visit, and that reported the appropriate statistics, were valid. All four studies using a mailed questionnaire and reporting the appropriate statistics showed good validity. Telephone vs. live interviewing methods could also affect the efficacy of the self-reported measure. Adequate statistics were not given for us to compare the validity of telephone vs. personal interviewing tactics.

The population studied is likely to play a role in the validity. Population characteristics such as disease status, socio-economic status (SES), and dental care utilization are all likely to affect the validity of self-report. Subjects who face more disease might be more aware of their periodontal status. SES certainly plays a role in the utilization of dental care, and, as stated earlier, it may be difficult for a person who does not visit the dentist to answer questions regarding professional diagnosis of periodontal disease. Additionally, awareness and perceptions of periodontal condition may differ among different SES levels and different levels of dental care utilization. Several of the measures reviewed showed low sensitivity, and this may be due to low dental care access and utilization. Patients who do not visit the dentist are less likely to be aware of their periodontal condition. Presumably, the sensitivity for these measures would be higher in populations with higher levels of dental care utilization.

Dental care utilization affects a population’s ability to self-report its oral condition. For example, there is recent evidence that dental care is available and used widely in the US (Vargas et al., 2003). The 2002 National Health Interview Survey by the CDC found that 87% of adults aged 18 years or older had contacted a dentist or other dental health professional within the preceding five years (Lethbridge-Cejku et al., 2004). The 2002 BRFSS found that 71% of adults in a population over 18 (weighted to reflect the US population) had had a dental cleaning, and 70% had visited the dentist in the preceding 12 months. Only 28% had not visited the dentist or had a dental cleaning in the previous 12 months. High access to care may make an American population more adept at reporting its periodontal condition. Seven of the 12 self-reported measures for periodontal disease showing good validity were among US populations, although three of these measures were among health professionals, which might account for their better validity.

As seen in Tables 3Go and 4Go, the clinical measure used for validation also affects the validity of the self-reported measure. Many of the clinical measures give similar results. Clinically determined pocketing and mobility showed similar good validity for question 8 in Table 3Go, "Told by dentist/hygienist have gum disease" (Gilbert and Nuttall, 1999). However, several of the self-reported measures were compared against more than one clinical measure with different outcomes; thus, we can see that the clinical measure used plays a role. For example, clinically determined pocketing and mobility give different results in the validation of question 15 in Table 3Go, self-reported "Thinks teeth have moved position" (Gilbert and Nuttall, 1999). Validation with a clinical measure of mobility (any teeth with horizontal mobility > 0.2 mm) results in good validity, whereas validation with clinical pocketing (any pockets > 4 mm) does not show good validity. Similarly, different measures of radiographic bone loss, "above median % of sites with radiographic bone loss > 20%" and "> 4 teeth with radiographic bone loss > 40%", give different results for question 9 in Table 3Go, self-reported "Have you ever been told by a dentist that you have periodontal/gum disease with bone loss?" (Pitiphat et al., 2002). The second measure is valid, whereas the first measure is not.

Additionally, besides the variations that occur in validating the self-reported measures with different clinical measures, we must question which clinical measures are most appropriate. Clinical measures should be similar in the context of the self-reported measures which they validate. For example, clinical measures specific to periodontal disease, such as attachment loss or bone loss, should be used to validate questions that are specific to periodontal disease. Several studies reported validation results for clinical measures that did not match the self-reported measure in symptoms or severity of disease, and this could explain the lack of good validity in these cases. For example, question 9 in Table 4Go, regarding bleeding from gums, a sign of gingivitis, was compared with clinical bleeding, a clinical measure of gingivitis, but also with pocket depth and mobility, two measures that represent periodontal disease. Accordingly, we included only the bleeding measure in the Tables. As may be expected, good validity was seen for clinically measured bleeding against self-report, whereas comparisons with clinical pocket depth or mobility did not show good validity.

Determining which clinical measures are most appropriate for validation is a difficult task, and even more so when one considers what exactly makes a clinical measure valid. Clinical gold standards themselves lack inherent standardization; thus, the definition of periodontal disease varies according to which measure was used for diagnosis. Clinical measurements are difficult to standardize, due to variations in the exact site of placement of the probe, probing force, angulation, patient discomfort, degree of inflammation, and bleeding (Pihlstrom, 1992), and, when radiographs are used, in the angulation and technique. Additionally, there is no universally accepted threshold of periodontal disease, and comparisons with different thresholds of attachment loss, bone loss, or pocket depth will give different levels of validity.

Complicating matters more, the clinical measures could themselves be considered surrogate endpoints (Prentice, 1989). For example, obesity, hypertension, and hypercholesterolemia are surrogate endpoints for the true outcomes of cardiovascular morbidity and mortality (Psaty et al., 1999). Gingival bleeding, pocket depth, mobility, and radiographic bone loss are measures that we have available to tell us a patient’s risk for developing the true endpoints of periodontal disease. One can consider the true endpoints for periodontal disease as the health outcomes that result, such as tooth loss (Hujoel et al., 1997) leading to loss of function, pain, or loss of aesthetics. However, such endpoints cannot easily be measured in a standardized way. Loss of function and pain must be reported by the patients themselves; thus, self-report could be considered better than clinical measures if one considers function and patients’ perceptions as the gold standards of periodontal disease.

Further work must be done before the validity of self-report for periodontal disease and gingivitis can be determined. Investigators conducting future research in this area should keep in mind the results of previous studies as guidance. The studies above do not cover the breadth of possible questions that could be asked. Studies must be done in which several variations of the same question are compared, or combinations of questions are examined. The studies we have tabulated all had adequate sample size, with 60 or more participants, and were generally cross-sectional. However, we are limited by the few populations examined in the 16 studies, and future work should examine a greater diversity of populations. Only seven of the 16 studies were conducted on random samples; hence, investigators should keep in mind the greater generalizability the study may have with random sampling, when feasible. Overall, the specificity of measures was high, while sensitivity was low. The sensitivity might be improved by the addition of questions in parallel, rather than in series. If a patient incorrectly responds "no" to the original question, further questions are unable to detect the presence of a periodontal condition. Allowing each question to be asked separately, although seemingly redundant, will allow each question its fair chance at detecting the condition.

Several of the manuscripts that we reviewed did not contain all of the details that we wished to extract to make a true systematic comparison of the studies. There are several components of validation studies that would have allowed us to better analyze and compare the studies reviewed. A comprehensive format, such as that outlined in the Standards for Reporting of Diagnostic Accuracy STARD Checklist for Reporting of Diagnostic Accuracy Studies (Bossuyt et al., 2003), would ensure that population characteristics, self-reported and clinical measures, and diagnostic accuracy or validity are complete. A major limitation to the work in this review was the lack of uniformity and appropriateness in statistical analysis, and this prevented us from conducting a quantitative assessment of the validity in the field. The reporting of statistics in a standard and uniform manner is important for comparison of measures and allows for easier prediction of which measures might work best. We were unable to perform any formal quality assessments, such as those outlined by Antczak et al. (1986a,b), both due to the lack of uniform reporting among studies and because the methods are generally for the assessment of randomized control trials.

Based on the literature synthesized above, self-report shows good potential for the assessment of periodontal disease. Thirteen self-reported measures of periodontal disease showed good validity compared with clinical gold standards. Results were less supportive of the validity of self-report for gingivitis, since only two measures of gingivitis showed validity. Several measures for periodontal disease were useful and valid in the populations examined, but the results so far have not consistently proved any single measure’s superiority and qualification for use alone in a general population. Using several self-reported measures in combination may prove to be a good alternative. The best measures we found were, "Have you had periodontal disease with bone loss?" (question 4 in Table 3Go), "Do you have periodontal disease with bone loss?" (question 6 in Table 3Go), and "Has any dentist/hygienist told you that you have deep pockets?" (question 7 in Table 3Go). A few such measures, capturing different dimensions, could be combined with demographic factors and major risk factors for periodontal disease, such as smoking. The development and use of valid self-reported measures for periodontal disease and gingivitis would allow for larger-scale epidemiologic studies, population surveillance, and the integration of questions about periodontal disease and gingivitis into existing studies to evaluate associations with other diseases and conditions.


    ACKNOWLEDGMENTS
 
The authors acknowledge Dr. Jeff Hyman for conducting an initial literature search, and Dr. Shuku Fujimaki for translating the Japanese articles. Brooke Blicher was supported by NIH Training Grant DEO7151. The project was supported by the Centers for Disease Control and Prevention, Division of Oral Health.

Received for publication September 13, 2004. Accepted for publication June 10, 2005.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  • Antczak AA, Tang J, Chalmers TC (1986a). Quality assessment of randomized control trials in dental research. I. Methods. J Periodontal Res 21:305–314.[CrossRef][Medline] [Order article via Infotrieve]
  • Antczak AA, Tang J, Chalmers TC (1986b). Quality assessment of randomized control trials in dental research. II. Results: periodontal research. J Periodontal Res 21:315–321.[CrossRef][Medline] [Order article via Infotrieve]
  • Atchison KA, Der-Martirosian C, Gift HC (1998). Components of self-reported oral health and general health in racial and ethnic groups. J Public Health Dent 58:301–308.[Medline] [Order article via Infotrieve]
  • Atchison KA, Dolan TA (1990). Development of the Geriatric Oral Health Assessment Index. J Dent Educ 54:680–687.[Abstract]
  • Barron BA (1977). The effects of misclassification on the estimation of relative risk. Biometrics 33:414–418.[CrossRef][Medline] [Order article via Infotrieve]
  • Battelle Memorial Institute (1999). Evaluation of the Behavioral Risk Factor Surveillance System (BRFSS) as a source of national estimates for selected health risk behaviors: final report. Baltimore, MD.
  • Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziov PP, Irwig LM, et al. (2003). Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. Ann Intern Med 138:40–44.[Abstract/Free Full Text]
  • Bowlin SJ, Morrill BD, Natziger AN, Jenkins PL, Lewis C, Pearson TA (1993). Validity of cardiovascular disease risk factors assessed by telephone survey: the Behavioral Risk Factor Survey. J Clin Epidemiol 46:561–571.[CrossRef][Medline] [Order article via Infotrieve]
  • Buhlin K, Gustafsson A, Andersson K, Hakansson J, Klinge B (2002). Validity and limitations of self-reported periodontal health. Community Dent Oral Epidemiol 30:431–437.[CrossRef][Medline] [Order article via Infotrieve]
  • Flegal KM, Brownie C, Haas JD (1986). The effects of exposure misclassification on estimates of relative risk. Am J Epidemiol 123:736–751.[Abstract/Free Full Text]
  • Gilbert AD, Nuttall NM (1999). Self-reporting of periodontal health status. Br Dent J 186:241–244.[Medline] [Order article via Infotrieve]
  • Glavind L, Attström R (1979). Periodontal self-examination. A motivational tool in periodontics. J Clin Periodontol 6:238–251.[Medline] [Order article via Infotrieve]
  • Horvitz-Lennon M, Normand SL, Gaccione P, Frank RG (2001). Partial versus full hospitalization for adults in psychiatric distress: a systematic review of the published literature (1957–1997). Am J Psychiatry 158:676–685.[Abstract/Free Full Text]
  • Hujoel PP, Leroux BG, DeRouen TA, Powell LV, Kiyak HA (1997). Evaluating the validity of probing attachment loss as a surrogate for tooth mortality in a clinical trial on the elderly. J Dent Res 76:858–866.
  • Joshipura KJ (1995). Oral health, nutrition, and coronary heart disease. In: Epidemiology and dental medicine. Boston, MA: Harvard, p. 89.
  • Joshipura KJ, Douglass CW, Garcia RI, Valachovic R, Willett WC (1996). Validity of a self-reported periodontal disease measure. J Public Health Dent 56:205–212.[Medline] [Order article via Infotrieve]
  • Joshipura KJ, Pitiphat W, Douglass CW (2002). Validation of self-reported periodontal measures among health professionals. J Public Health Dent 62:115–121.[Medline] [Order article via Infotrieve]
  • Kallio P (1996). Self-assessed bleeding in monitoring gingival health among adolescents. Community Dent Oral Epidemiol 24:128–132.[Medline] [Order article via Infotrieve]
  • Kallio P, Ainamo J, Dusadeepan A (1990). Self-assessment of gingival bleeding. Int Dent J 40:231–236.[Medline] [Order article via Infotrieve]
  • Kallio P, Nordblad A, Croucher R, Ainamo J (1994). Self-reported gingivitis and bleeding gums among adolescents in Helsinki. Community Dent Oral Epidemiol 22(5 Pt 1):277–282.[Medline] [Order article via Infotrieve]
  • Lethbridge-Cejku M, Schiller JS, Bernadel L (2004). Summary health statistics for US adults: National Health Interview Survey, 2002. National Center for Health Statistics. Vital Health Stat 10(222):1–151.
  • Martin LM, Leff M, Calonge N, Garrett C, Nelson DE (2000). Validation of self-reported chronic conditions and health services in a managed care population. Am J Prev Med 18:215–218.[CrossRef][Medline] [Order article via Infotrieve]
  • Nakashima K, Maeda S, Shimoyama M, Karami K, Shimojima T, Watanabe Y, et al. (1988). [Epidemiological research of periodontal disease from questionnaire and pocket examination for junior and senior high school students in Kawagoe]. Nippon Shishubyo Gakkai Kaishi 30:935–946 (in Japanese).[Medline] [Order article via Infotrieve]
  • Nakashima K, Kurihara C, Kawanaga T, Kurihashi Y, Ohsawa K, Onodera O, et al. (1989). [Research into actual conditions and preventive care in periodontal disease. Relationship between questionnaire results and periodontal disease in youth]. Nippon Shishubyo Gakkai Kaishi 31:1220–1241 (in Japanese).[Medline] [Order article via Infotrieve]
  • Nelson DE, Holtzman D, Bolen J, Stanwyck CA, Mack KA (2001). Reliability and validity of measures from the Behavioral Risk Factor Surveillance System (BRFSS). Soz Praventivmed 46(Suppl 1):S3–S42.[CrossRef][Medline] [Order article via Infotrieve]
  • Newell SA, Girgis A, Sanson-Fisher RW, Savolainen NJ (1999). The accuracy of self-reported health behaviors and risk factors relating to cancer and cardiovascular disease in the general population: a critical review. Am J Prev Med 17:211–229.[CrossRef][Medline] [Order article via Infotrieve]
  • Ovid Web Gateway (2000). New York, NY: Ovid Technologies, Inc.
  • Pihlstrom BL (1992). Measurement of attachment level in clinical trials: probing methods. J Periodontol 63(12 Suppl):1072–1077.[Medline] [Order article via Infotrieve]
  • Pitiphat W, Garcia RI, Douglass CW, Joshipura KJ (2002). Validation of self-reported oral health measures. J Public Health Dent 62:122–128.[Medline] [Order article via Infotrieve]
  • Prentice RL (1989). Surrogate endpoints in clinical trials: definition and operational criteria. Stat Med 8:431–440.[Medline] [Order article via Infotrieve]
  • Psaty BM, Weiss NS, Furberg CD, Koepsell TD, Siscovick DS, Rosendaal FR, et al. (1999). Surrogate end points, health outcomes, and the drug-approval process for the treatment of risk factors for cardiovascular disease. J Am Med Assoc 282:786–790.[Free Full Text]
  • Rimm EB, Giovannucci EL, Stampfer MJ, Colditz GA, Litin LB, Willett WC (1992). Reproducibility and validity of an expanded self-administered semiquantitative food frequency questionnaire among male health professionals. Am J Epidemiol 135:1114–1126; discussion 1127–1136.[Abstract/Free Full Text]
  • Salonen LW, Frithiof L, Wouters FR, Helldén LB (1991). Marginal alveolar bone height in an adult Swedish population. A radiographic cross-sectional epidemiologic study. J Clin Periodontol 18:223–232.[Medline] [Order article via Infotrieve]
  • Schwarz E (1989). Dental caries, visible plaque, and gingival bleeding in young adult Danes in alternative dental programs. Acta Odontol Scand 47:149–157.[Medline] [Order article via Infotrieve]
  • Sheridan CL, Mulhern M, Martin D (1998). Validation of a self-report measure of somatic health. Psychol Rep 82:679–687.[Medline] [Order article via Infotrieve]
  • Siegal MD, Martin B, Kuthy RA (1988). Usefulness of a local oral health survey in program development. J Public Health Dent 48:121–124.[Medline] [Order article via Infotrieve]
  • Taani DQ, Alhaija ES (2003). Self-assessed bleeding as an indicator of gingival health among 12–14-year-old children. J Oral Rehabil 30:78–81.[Medline] [Order article via Infotrieve]
  • Taguchi A, Suei Y, Ohtsuka M, Otani K, Tanimoto K, Hollender LG (1999). Relationship between bone mineral density and tooth loss in elderly Japanese women. Dentomaxillofac Radiol 28:219–223.[Abstract]
  • Tervonen T, Knuuttila M (1988). Awareness of dental disorders and discrepancy between "objective" and "subjective" dental treatment needs. Community Dent Oral Epidemiol 16:345–348.[Medline] [Order article via Infotrieve]
  • Tormo MJ, Navarro C, Chirlaque MD, Barber X (2000). Validation of self diagnosis of high blood pressure in a sample of the Spanish EPIC cohort: overall agreement and predictive values. EPIC Group of Spain. J Epidemiol Community Health 54:221–226.[Abstract/Free Full Text]
  • Tubert-Jeannin S, Riordan PJ, Morel-Papernot A, Porcheray S, Saby-Collet S (2003). Validation of an oral health quality of life index (GOHAI) in France. Community Dent Oral Epidemiol 31:275–284.[Medline] [Order article via Infotrieve]
  • Unell L, Söderfeldt B, Halling A, Paulander J, Birkhed D (1997). Oral disease, impairment, and illness: congruence between clinical and questionnaire findings. Acta Odontol Scand 55:127–132.[Medline] [Order article via Infotrieve]
  • Vargas CM, Dye BA, Hayes K (2003). Oral health care utilization by US rural residents, National Health Interview Survey 1999. J Public Health Dentist 63:150–157.
  • Willett W (1990). Nutritional epidemiology. New York: Oxford University Press.
  • Wolf AM, Hunter DJ, Coblitz GA, Manson JE, Stampfer MJ, Corsano KA, et al. (1994). Reproducibility and validity of a self-administered physical activity questionnaire. Int J Epidemiol 23:991–999.[Abstract/Free Full Text]
  • Wright FV, Law M, Crombie V, Goldsmith CH, Dent P (1994). Development of a self-report functional status index for juvenile rheumatoid arthritis. J Rheumatol 21:536–544.[Medline] [Order article via Infotrieve]

Journal of Dental Research, Vol. 84, No. 10, 881-890 (2005)
DOI: 10.1177/154405910508401003


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Blicher, B.
Right arrow Articles by Eke, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blicher, B.
Right arrow Articles by Eke, P.
Right arrowPubmed/NCBI databases
Medline Plus Health Information
*Dental Health
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?