You are seeing this message because your Web browser does not support basic Web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.


ABOUT ARCHIVES
Advanced Search

Welcome   | My Account | E-mail Alerts | Access Rights | Sign In


  Vol. 137 No. 8, August 2001 TABLE OF CONTENTS
  Archives
  •  Online Features
  Observation
 This Article
 •Abstract
 •PDF
 •Send to a friend
 • Save in My Folder
 •Save to citation manager
 •Permissions
 Citing Articles
 •Citation map
 •Citing articles on HighWire
 •Citing articles on ISI (14)
 •Contact me when this article is cited
 Related Content
 •Related article
 •Similar articles in this journal
 Topic Collections
 •Statistics and Research Methods
 •Photosensitivity Disorders
 •Alert me on articles by topic

Reliability of Counting Actinic Keratoses Before and After Brief Consensus Discussion

The VA Topical Tretinoin Chemoprevention (VATTC) Trial

Martin A. Weinstock, MD, PhD; Stephen F. Bingham, PhD; Gary W. Cole, MD; David Eilers, MD; Mark F. Naylor, MD; James Kalivas, MD; J. Richard Taylor, MD; Hayes B. Gladstone, MD; Daniel J. Piacquadio, MD; John J. DiGiovanna, MD

Arch Dermatol. 2001;137:1055-1058.

ABSTRACT

Objective  To assess the reliability of counts of actinic keratoses (AKs) and the effect of a brief joint discussion of discrepancies on that reliability.

Design and Intervention  Seven dermatologists independently counted AKs on the face and ears before and after a brief joint discussion of discrepancies.

Setting and Patients  A volunteer sample of 9 patients from the ongoing VA (Department of Veterans Affairs) Topical Tretinoin Chemoprevention (VATTC) Trial. All participating individuals are veterans and have had 2 or more keratinocyte carcinomas (basal or squamous cell carcinoma) in the 5 years before enrollment in the study.

Main Outcome Measure  Standard deviation of estimates of the Poisson regression parameter for the dermatologists.

Results  Substantial variation was found among the dermatologists in their AK counts. The SD of the parameter estimates for the dermatologists decreased from 0.45 to 0.24 after the brief joint discussion, a 47% decrease (P = .076). The variation attributable to the dermatologists also decreased substantially ({chi}26 decrease, 94 to 12).

Conclusions  Actinic keratoses are common, and there is a continuous spectrum of lesions that ranges from sun-damaged skin to squamous cell carcinoma in situ. Clinical distinguishing features may be difficult to delineate precisely. Counts of AK are commonly performed, but appear to be unreliable, even when performed by experienced dermatologists. Joint discussion of discrepancies may enhance the reliability of these counts, although substantial variation remains. Research that relied on these counts must be reevaluated in light of the marked variation among expert observers. Future studies should consider measures to assess and enhance reliability.



INTRODUCTION
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

ACTINIC KERATOSES (AKs) are the dysplastic nevi of the keratinocyte world. They are direct precursors of squamous cell carcinoma of the skin (some have called them incipient squamous cell carcinomas), and are important indicators of risk for all 3 common types of sun-related skin cancers and markers of UV exposure in sun-sensitive skin. Because they are very common, and commonly treated, they are responsible for substantial use of health care resources. Like dysplastic nevomelanocytic nevi, they excite controversy. The regulations governing payment for their treatment by Medicare in the United States have been particularly controversial, because AKs are much more commonly treated on diagnosis than are dysplastic nevi.

The AK count is also a secondary end point of the ongoing VA (Department of Veterans Affairs) Topical Tretinoin Chemoprevention (VATTC) Trial. Larger-than-expected variation among clinical sites of this trial in baseline AK counts provided the initial impetus for this investigation.

Large-scale investigations involving AK, including clinical trials and epidemiologic investigations, typically use as their primary measure the number of AKs as counted by a dermatologist or other suitably trained individual.1-5 Despite the use of this end point, the reliability and validity of these counts have been subject to very limited study. The present investigation seeks to evaluate the reliability of these counts in the context of the high-risk population enrolled in the VATTC Trial.


SUBJECTS AND METHODS
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

The context of this investigation was the VATTC Trial, a randomized, multicenter, double-blind vehicle-controlled trial of topical 0.1% tretinoin cream applied twice daily to the face and ears. The primary outcome of this trial is time to onset of new keratinocyte carcinoma (basal or squamous cell) on the face or ears. All participants in this trial have received care at 1 of the 6 participating sites (all Department of Veterans Affairs medical centers). All have had 2 or more keratinocyte carcinomas on the face and ears in the 5 years before enrollment in the study, and met the other requirements for the trial. Among the secondary outcomes of the VATTC Trial is AK count on the face and ears. For the purpose of this trial, AK was defined as a keratotic lesion with at least some erythema that did not fit an alternative diagnosis such as seborrheic keratosis or psoriasis. The present investigation was prompted by the observation that there were marked differences among the 6 clinical sites in the mean AK count at the baseline examination. These differences suggested discrepancies among the sites in AK recognition.

To address this issue, a meeting was held to conduct the present investigation. It included investigators from the VATTC Trial (all dermatologists) and a convenience sample of trial participants (the patients). All patients were receiving 0.1% tretinoin cream or a vehicle control; at that point in the study, the typical frequency of use was once daily. On the morning of the meeting, each dermatologist independently counted and recorded the number of AKs on the face and ears of each patient in the morning group. The dermatologists were asked to use the trial AK definition, and no further instruction was given. To facilitate counting and recording, the area of the face and ears was subdivided into 12 anatomic regions. These 12 regions were combined for all analyses. Once these data were collected, the dermatologists convened as a group around a single patient and specifically identified and discussed the lesions that were or were not counted as AK. This discussion was conducted for each of the remaining patients in the morning session. The group of dermatologists reconvened after lunch to independently count the number of AKs on the face and ears of a second group of patients. None of the afternoon group of patients had participated in the morning session.

Seven dermatologists counted AKs (counters). These dermatologists included the primary investigators (G.W.C., D.E., N.F.N., J.K., and J.R.T.) at 5 of the 6 clinical sites of the VATTC Trial. Another was the collaborating investigator (H.B.G.) at the sixth site, and another was the study chairman (M.A.W.). The collaborating investigator was in fellowship training; the other 6 counters were board-certified dermatologists with a mean of 20 years of experience since certification (range, 7-30 years). All except the study chairman were actively involved in counting AKs for the VATTC Trial.

We used Stata version 6.06 to obtain maximum likelihood Poisson regression model estimates for each evaluation session. The model included patient- and counter-effect parameters. We used {chi}2 goodness of fit tests to determine if the data fit the Poisson regression model. The difference between likelihood ratio {chi}2 statistics was used to determine if there were significant counter effects. The SDs for the morning and afternoon counter parameters were compared using the F test. The VATTC Trial in which all of these participants were enrolled is a double-blind trial; that "blind" was not broken for these analyses.

Written informed consent was obtained from all patients participating in this study.


RESULTS
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

Five patients were recruited for the morning session, and 4 for the afternoon session. All were participants in the ongoing VATTC Trial at the Miami, Fla, center, and were receiving topical 0.1% tretinoin or vehicle cream. Their mean age was 68 years (range, 56-81 years); they had a mean of 7 basal cell carcinomas (median, 8; range, 0-13) and 2.4 squamous cell carcinomas of the skin (median, 1; range, 0-9). The mean of all counts of AKs on the face and ears of these participants was 12.7. Table 1 displays measures of central tendency, variability, and skewness for the AK counts for each of the 9 participants.


View this table:
[in this window]
[in a new window]
Measures of Central Tendency, Variability, and Skewness of Actinic Keratosis Counts


The morning (prediscussion of differences) and afternoon (postdiscussion) sessions were fitted to separate Poisson regression models. The morning session data did not fit the Poisson model (P = .041), but removal of a single outlying count resulted in an acceptable goodness of fit (P = .5). Poisson parameters were estimated for each of the counters. The SD of these parameter estimates was 0.45 (0.42 when the outlier was included). Variation among participants accounted for most of the overall variability in the model, but variation among counters was substantial and statistically significant ({chi}26 = 94; P<.001). The data from the afternoon session did fit the Poisson model (P = .17). The SD of the Poisson parameters was 0.24, a 47% decrease from the morning session (43% if the outlier was not excluded from the morning data). This 47% decrease was of borderline statistical significance (F6,6 = 3.51; P = .076). The variation attributable to the counters in the afternoon Poisson model ({chi}26 = 12; P = .06) was much smaller than that in the morning model. Table 1 illustrates that although the variability in counts in the afternoon session was reduced in magnitude, it nevertheless was substantial. The Poisson parameters of the dermatologist still in fellowship training was not the highest or the lowest in the morning or the afternoon session. We also performed analysis restricted to the 5 primary investigators, and the results were similar.


COMMENT
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

We have documented a substantial variation in results of AK counting by experienced dermatologists in our high-risk study population. We have also documented a substantial improvement in the consistency of AK counts after group discussion of discrepancies.

The present investigation is limited in several respects. The numbers of patients and dermatologists were small, so statistical power was modest. All of the patients were participants in the VATTC Trial, so the results may not be generalizable to the broader population, even if they apply to VATTC Trial participants. Generalization to other populations is limited by 3 important considerations. First, all of the patients were receiving the study medication or vehicle control, which might have altered the appearance of their AKs. Second, VATTC Trial participants are selected because of a diagnosis of 2 or more keratinocyte carcinomas during the 5 years before enrollment in the study. Hence, they had severe actinic damage and were at high risk for subsequent keratinocytic neoplasia. Finally, all patients were receiving care at the Miami Veterans Affairs Medical Center, so they may be different in relevant respects from populations that differ geographically, or by sex, age, or socioeconomic status. Generalizability from the study dermatologists to all practicing dermatologists also cannot be guaranteed.

Some have assumed that in experienced hands, dysplastic nevi are difficult to diagnose, but AKs are not. This assumption has not been demonstrated and may be false. Even among experts, both diagnoses may be difficult to define precisely. The increased scrutiny devoted to the limitations of accuracy of many clinical judgments in recent years is useful for properly understanding dermatologic diagnosis as well.

Prior investigations of the validity of clinical AK diagnoses have been limited. Predictive value of a positive clinical diagnosis of AK has generally been found to be reasonably good. One study confirmed the clinical diagnosis of 34 (94%) of 36 clinically typical AKs.7 A second study included a randomly chosen subset of a population-based clinical trial involving sunscreen use, and confirmed 39 (81%) of the 48 clinical diagnoses of AK.2 Other population-based studies have reported 80% and 91% histological confirmation of randomly chosen clinically diagnosed AKs.8-9 In all of these studies, the gold standard against which the clinical diagnosis was compared was a histological diagnosis, and the validity of the histological diagnosis has been questioned.10-12

Studies of the clinical diagnoses of skin cancers, however, have turned up substantial numbers of misdiagnosed AKs in some but not all contexts. In a population-based study, 13% of suspected skin cancers were histologically diagnosed as AK (12% of 514 presumed basal cell carcinomas and 39% of 44 presumed squamous cell carcinomas)13; in another, 17% were so diagnosed9; but in clinical practice setting, only 2% of suspected basal cell carcinomas were histologically confirmed as AK.14

Reliability of AK diagnoses was assessed in a general population sample. Photographs were taken of suspected AKs, and a panel of 3 dermatologists agreed with the examiner's diagnosis of AK in 86%, although the details are not entirely clear, including the extent of agreement among the 3 dermatologists.15 Other studies of classification of patients into categories of single AK, more than 1 AK, or no AKs in screening or clinic settings have observed interobserver {kappa} statistics ranging from 0.17 to 0.52 for a single AK and 0.48 to 0.78 for multiple AKs.16-18

Clinical trials and epidemiological studies involving AKs have typically used simple counts of clinically diagnosed lesions without histological confirmation (except occasionally on a small subset) or evaluation of reliability of the assessment.1-5 In light of our results, training and subsequent assessment of reliability would seem to be useful in future studies.

Counting of AKs is an imprecise business, even when performed by experts. Part of the difficulty is that there is a continuous spectrum of lesions that ranges from sun-damaged skin to squamous cell carcinoma in situ. Defining the difference between AK and sun-damaged skin on the one end, or the difference between AK and squamous cell carcinoma in situ on the other, is akin to defining the difference between green and blue: easy to recognize in the pure cases but less reliable in intermediate areas of the spectrum. Fortunately, observers working together can achieve a degree of reliability. We must be careful to interpret the existing literature involving AK with this in mind, and encourage assessments of AK in published studies to be accompanied by an estimate of the reliability of those assessments.


AUTHOR INFORMATION
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

Accepted for publication April 11, 2001.

This project was supported by grant 402 from the Cooperative Studies Program of the Department of Veterans Affairs Office of Research and Development, Washington, DC.

Presented at the Fifth Annual Meeting of the International Dermatoepidemiology Association, Chicago, Ill, May 11, 2000.

We thank Margarita Givens, MD, Kimberly Marcolivio, MEd, and Gail Kirk, MS, for their assistance with this research.

Corresponding author and reprints: Martin A. Weinstock, MD, PhD, Dermatoepidemiology Unit, Veterans Affairs Medical Center–111D, 830 Chalkstone Ave, Providence, RI 02908-4799 (e-mail: maw{at}brown.edu).

From the Dermatoepidemiology Unit, Veterans Affairs (VA) Medical Center, Providence (Dr Weinstock), and Department of Dermatology, Rhode Island Hospital and Brown University, Providence (Drs Weinstock and DiGiovanna); the VA Cooperative Studies Program Coordinating Center, Perry Point, Md (Dr Bingham); Department of Dermatology, VA Medical Center Long Beach, Long Beach, Calif (Dr Cole); Department of Dermatology, University of California–Irvine (Dr Cole); Division of Dermatology, Hines VA Medical Center, Hines, Ill (Dr Eilers); Division of Dermatology, Loyola University, Chicago, Ill (Dr Eilers); Department of Dermatology, VA Medical Center Oklahoma City and University of Oklahoma, Oklahoma City (Dr Naylor); Department of Dermatology, Carl T. Hayden VA Medical Center and University of Arizona, Phoenix (Dr Kalivas); Department of Dermatology, University of Miami, and Dermatology Service, Miami VA Medical Center, Miami, Fla (Dr Taylor); Department of Dermatology, VA Medical Center West Los Angeles, Los Angeles, Calif (Dr Gladstone); and Department of Dermatology, University of California–San Diego (Dr Piacquadio). Dr Gladstone is now affiliated with the Division of Dermatologic Surgery, Stanford University, Stanford, Calif.


REFERENCES
 Jump to Section
 •Top
 •Introduction
 •Subjects and methods
 •Results
 •Comment
 •Author information
 •References

1. Naylor MF, Boyd A, Smith DW, Cameron GS, Hubbard D, Neldner KH. High sun protection factor sunscreens in the suppression of actinic neoplasia. Arch Dermatol. 1995;131:170-175. ABSTRACT
2. Thompson SC, Jolley D, Marks R. Reduction of solar keratoses by regular sunscreen use. N Engl J Med. 1993;329:1147-1151. FREE FULL TEXT
3. Black HS, Herd JA, Goldberg LH, et al. Effect of a low-fat diet on the incidence of actinic keratosis. N Engl J Med. 1994;330:1272-1275. FREE FULL TEXT
4. Frost C, Williams G, Green A. High incidence and regression rates of solar keratoses in a Queensland community. J Invest Dermatol. 2000;115:273-277. FULL TEXT | ISI | PUBMED
5. Holman CDJ, Armstrong BK, Evans PR, et al. Relationship of solar keratosis and history of skin cancer to objective measures of actinic skin damage. Br J Dermatol. 1984;110:129-138. FULL TEXT | ISI | PUBMED
6. Stata Statistical Software: Release 6.0 [computer program]. College Station, Tex: Stata Corp; 1999.
7. Ponsford MW, Goodman G, Marks R. The prevalence and accuracy of diagnosis of non-melanotic skin cancer in Victoria. Australas J Dermatol. 1983;24:79-82. PUBMED
8. Frost CA, Green AC, Williams GM. The prevalence and determinants of solar keratoses at a subtropical latitude. Br J Dermatol. 1998;139:1033-1039. FULL TEXT | ISI | PUBMED
9. Frost CA, Green AC. Epidemiology of solar keratoses. Br J Dermatol. 1994;131:455-464. ISI | PUBMED
10. Carag HR, Prieto VG, Yballe LS, Shea CR. Utility of step sections: demonstration of additional pathological findings in biopsy samples initially diagnosed as actinic keratosis. Arch Dermatol. 2000;136:471-475. FREE FULL TEXT
11. Jones RE. Questions to the editorial board and other authorities. Am J Dermatopathol. 1984;6:301-306.
12. Heaphy MR, Ackerman AB. The nature of solar keratosis: a critical review in historical perspective. J Am Acad Dermatol. 2000;43:138-150. FULL TEXT | ISI | PUBMED
13. Kricker A, English DR, Randell PL, et al. Skin cancer in Geraldton, Western Australia: a survey of incidence and prevalence. Med J Aust. 1990;152:399-407. ISI | PUBMED
14. Lewis FM, Start RD, Gawkrodger DJ. Diagnostic accuracy for basal cell papillomas. Br J Dermatol. 1993;129:94-95. FULL TEXT | ISI | PUBMED
15. Harvey I, Frankel S, Marks R, Shalom D, Nolan-Farrell M. Non-melanoma skin cancer and solar keratoses, I: methods and descriptive results of the South Wales Skin Cancer Study. Br J Cancer. 1996;74:1302-1307. ISI | PUBMED
16. Whited JD, Hall RP, Simel DL, Hormer RD. Primary care clinicians' performance for detecting actinic keratoses and skin cancer. Arch Intern Med. 1997;157:985-990. ABSTRACT
17. Whited JD, Horner RD, Hall RP, Simel DL. The influence of history on interobserver agreement for diagnosing actinic keratoses and malignant skin lesions. J Am Acad Dermatol. 1995;33:603-607. FULL TEXT | ISI | PUBMED
18. Leffell DJ, Chen Y-T, Berwick M, Bolognia JL. Interobserver agreement in a community skin cancer screening setting. J Am Acad Dermatol. 1993;28:1003-1005. ISI | PUBMED

RELATED ARTICLE

Archives of Dermatology Reader's Choice: Continuing Medical Education
Arch Dermatol. 2001;137(8):1122-1123.
FULL TEXT  


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES

Clinical Recognition of Actinic Keratoses in a High-Risk Population: How Good Are We?
Venna et al.
Arch Dermatol 2005;141:507-509.
ABSTRACT | FULL TEXT  





HOME | CURRENT ISSUE | PAST ISSUES | TOPIC COLLECTIONS | CME | SUBMIT | SUBSCRIBE | HELP
CONDITIONS OF USE | PRIVACY POLICY | CONTACT US | SITE MAP
 
© 2001 American Medical Association. All Rights Reserved.