Two Views of Developmental Testing

(1)

Letters to the Editor

Statements appearing here are those of the writers and do not represent the official position of the American Academy

of Pediatrics, Inc. or its Committees. Comments on any topic, including the contents ofPediatrics,are invited from all

members of the profession: those accepted for publication will not be subject to major editorial revision but generally must be no more than 400 words in length. The editors reserve the right to publish replies and may solicit responses from authors and others.

• • •

Letters should be submitted in duplicate in double-spaced typing on plain white paper with name and address of sender(s) on the letter. Send them to Jerold F. Lucey, MD, Editor, Pediatrics Editorial Office, University of Vermont College of Medicine, 89 Beaumont Ave, Given Building, Room D201, Burlington, VT 05405-0068.

Two Views of Developmental Testing

To the Editor.—

I am deeply perturbed by Dr Frankenburg’s commentary.1_His

assertions are erroneous and troubling:

1. Developmental delays cannot be quantified.False! Every state,

un-der the Individuals With Disabilities Education Act, has care-fully generated definitions of developmental delays (eg, 40% delay in a single developmental area or 25% delays in 2 or more areas). Children who meet such criteria are eligible for special education services, preschool intervention programs, and an array of other assistance. Clearly, development can not only be quantified but quantified in a way that is meaningful and productive.

2. The Denver II is a developmental growth chart and not a screening

tool.If this were the case, why does the measure provide scores

such as “abnormal,” or “questionable.” Why does the Denver II remain the tool most frequently mentioned by pediatricians when asked how they detect developmental delays? More im-portantly, why does the Denver II manual and its authors not actively discourage use of the tool in screening? Finally, if the Denver II is a growth chart, why would we want to deploy it if such a process did not lead to detection of potential problems and to a proven intervention?

3. Psychometric support for the Denver II and the Prescreening

Devel-opmental Questionnaire (PDQ) is adequate.Also untrue. The

mea-sures were standardized only in Colorado and not throughout the United States. Thus, we do not know whether the test works well in diverse settings: New York City, Miami, or Bayou Goula, Louisiana. Even growth charts for height and weight were standardized on a sample that was geographically di-verse. Even more disturbing, the Denver II is not validated. The authors should have administered the test alongside a battery of criterion measures tapping the various developmental do-mains and then calibrated cutoff scores in relation to diagnostic results. Failure to do this is not only psychometrically un-sound2_{but also explains the Denver II’s limited sensitivity or}

specificity.3_{The PDQ II, a subset of Denver II items, suffers}

from similar psychometric and scoring weaknesses. A reviewer at the Buros Institute of Mental Measurement noted that “. . . the absence of reliability, validity, and comprehensive normative data, suggests the PDQ be used most cautiously, primarily as an

experi-mental or exploratory scale in research.”4

Problems with the accuracy of the Denver II were noted⬎10 years ago, and the authors were encouraged to remove the test from the market to pursue the necessary standardization and validation.2_{The tremendous influence of the Denver II on the field}

of pediatrics, nationally and internationally, places heavy respon-sibility on its authors to market a quality instrument. School failure, worsening delays, unnecessarily expensive and intensive special education services all await a child with special needs who is unidentified because of an invalid screening test. Alternatively, poorly constructed measures waste precious diagnostic resources and professional time and may cause parents needless worry and expense.

Yet the responsibility of constructing a quality measure is not onerous. The principles of test construction are well-defined by a consortium of societies devoted to psychological and educational research.3_{Many other tests have been subjected to psychometric}

studies and, unlike the Denver II, found worthy of clinical appli-cation. Table 1 (see page 1182) illustrates the differences between the Denver II and several competing screening and surveillance measures. Providers are encouraged to select tools that enjoy nationally representative standardization, validation, and high levels of accuracy (ie, sensitivity in detecting delays/disabilities and specificity in detecting typical development⬎70%).5

Frances Page Glascoe, PhD East Berlin, PA 17316

REFERENCES

1. Frankenburg WK. Developmental surveillance and screening of infants and young children.Pediatrics.2002;109:144 –145

2. American Educational Research Association, American Psychological Association, National Council on Measurement in Education.Standards for Educational and Psychological Testing.5th ed. Washington, DC: Amer-ican Educational Research Association; 1999

3. Glascoe FP, Byrne KE, Chang B, Strickland B, Ashford L, Johnson K. Accuracy of the Denver II in developmental screening.Pediatrics.1992; 89(Suppl):1221–1225

4. Michael WB. Review of the Revised Denver Prescreening Developmen-tal Questionnaire. In: Conoley JC, Impara JC, eds.The Twelfth Mental Measurements Yearbook. Lincoln, NE: Buros Institute of Mental Measurements; 1995. (See Test Number 327.)

5. Squires J, Nickel RE, Eisert D. Early detection of developmental problems: strategies for monitoring young children in the practice set-ting.J Dev Behav Pediatr.1996;17:420 – 427

In Reply.—

Dr Glascoe’s response to my commentary1_{makes a variety of}

surprising assertions and quotations showing that she misread the commentary in some cases and is not up-to-date in others. In addition, she raises so many issues that it is difficult to respond to all of them here.

1. She attributes to me the statement “developmental delay can-not be quantified.” That was can-not in my commentary. What was said was, “There is no single universally agreed upon definition of ‘developmental delay.’ ” Clinicians speaking of growth or developmental delays may have very different interpretations of the word “delay.” Although an education law may state specific percentages of delay that qualify for services, the di-versity of procedures used to determine these percentages pre-clude a universal reliable definition even among educators. 2. She implies that a developmental growth chart cannot be used

as a screening tool. Growth charts, questionnaires, and a variety of other procedures have been used as screening tools for decades. “Screening” as used in my commentary refers to the

PEDIATRICS Vol. 109 No. 6 June 2002 1181 at Viet Nam:AAP Sponsored on August 30, 2020

www.aappublications.org/news

(2)

TABLE 1. Comparison of Screening and Surveillance Tools Test Description Standardization

Validation (N

Given Diagnostic Measures) Accuracy Across Age range: Sensitivity/ Specificity Age (Range) Administration Time in Minutes Measures Using Information from Parents Ages and Stages Questionnaire ASQ (formerly Infant Monitoring System) (1994). Paul H. Brookes Publishers, PO Box 10624, Baltimore, MD 21285 (1-800-638-3775). ($130) http://www.pbrookes.com/ Clear drawings and simple directions help parents indicate children ’s skills. Separate copyable forms of 25 to 35 items for each age range (tied to well-child visit schedule). Can be used in mass mail-outs for child-find programs. In English, Spanish, French, Korean Multi-state 538 children 70% –90%, except at the 4-month level/76% –91% 0– 65 Parents ’ Evaluations of Developmental Status (PEDS). (1997) Ellsworth & Vandermeer Press, Ltd, PO Box 68164, Nashville, TN 37206. Phone: 615-226-4460; fax: 615-227-0411 http://www.pedstest.com ($38.99) Ten questions eliciting parents ’ concerns in English, Spanish and Vietnamese. Written at the 5th grade level. Determines when to refer, provide a second screen, provide patient education, or monitor development, behavior, and academic progress. Has a longitudinal tracking form Multi-state 771 74% –80%/ 70% –80% 0– 92 Prescreening Developmental Questionnaire (PDQ-II) Denver Developmental Materials Inc, PO Box 6919, Denver CO 80206-0919 Approximately 22 questions to parents at each of four age ranges. Identifies a subset of children who should be administered the complete Denver II Colorado 0 Not assessed 0– 61 0– 15 Measures Requiring Elicitation of Skills from Children Brigance Screens. Curriculum Associates, Inc. (1985) 153 Rangeway Road, N. Billerica, MA 01862 (1-800-225-0248 ($248.55). http://www.curriculumassociates.com/ Nine separate forms, one for each 12 month age range. Taps speech-language, motor, readiness and general knowledge at younger ages and also reading and math at older ages. In the 0-to 2-year age range, can be administered by parent report. At all ages, also detects advanced/gifted development Multi-state 485 70% –82%/ 70% –82% 0– 91 0– 15 Bayley Infant Neurodevelopmental Screen (BINS). San Antonio, TX. The Psychological Corporation, 1995. 555 Academic Court, San Antonio, TX 78204 (1-800-228-0752) ($195) http://www.psychcorp.com Uses 10 –13 directly elicited items per 3-to 6-month age range assess neurologic processes (reflexes, and tone); neurodevelopmental skills (movement, and symmetry) and developmental accomplishments (object permanence, imitation, and language) Multi-state 303 75% –86%/ 75% –86% 0– 21 0– 15 Denver II Denver Developmental Materials Inc, PO Box 6919, Denver, CO 80206-0919 One hundred twenty-five items tapping the various developmental domains. Produces 4 scores: abnormal, questionable, untestable, and normal. Questionable scores require repeat testing with the Denver II in 3 months and if again questionable (likely attributable to the Denver II ’s reliability) prompts a referral for diagnostic testing Colorado 0 Not assessed* 0– 61 5– 20 * A 1992 study found sensitivity of 80% if “questionable scores ” were included with abnormal scores but specificity of 46%. Alternatively, if “questionable scores ” were included with normal scores, sensitivity was 46% and specificity was 80% — in neither case a parsimonious scoring solution.

(3)

application of quick, simple procedures to examine a presump-tively normal population to ferret out those individuals who are likely to be harboring the problem in question. Screening procedures merely suggest the individual who should be fur-ther evaluated. They do not make a diagnosis. My commentary and our previous writings reflect this approach. The scoring she referred to in her text and her chart fail to reflect changes made 10 years ago.2

3. She questions whether the Denver II “works well in diverse settings.” My commentary stated, “When clinically significant developmental differences attributable to sex, ethnicity, mater-nal education or place of residence occurred, the differences were given in the Denver II Technical Manual.” This manual also documents that there is no clinically significant difference on the previously mentioned variables between the demo-graphics of the standardization population and the US 1980 census.

4. Her comments regarding the test’s validity, sensitivity, and specificity again suggest a lack of understanding of my com-mentary’s discussion of the Denver II’s validity. Because growth charts do not have sensitivity and specificity data, are they also not valid?

5. Her discussion reveals that she doesn’t know that the Denver II interpretation was revised as early as 1992, and its similarity to a growth curve was made at the time it first appeared. 6. She is correct in stating that PDQ II reliability data have not

been presented. The PDQ II is a simple, quick questionnaire to be used as a prescreen to identify children who should be further evaluated with the Denver II. It was developed using the norms of the Denver II standardization. This is similar to the way in which the PDQ and RPDQ were used in relation to the DDST by busy pediatricians.

7. Were it only as simple as Dr Glascoe suggests in Table 1 of her letter, each of the screening procedures, for which she gives sensitivity and specificity measures, would have equal sensi-tivities and specificities for every aspect of development. Per-haps, it is with the recognition, that no one developmental screening test can identify all developmental aberrations, that the AAP Committee on Children With Disabilities suggests the use of additional screens when the clinician suspects a problem such as a behavior disorder.

8. One must question the relevance of Table 1 in her letter which includes her test, the Parents Evaluation of Developmental Status (PEDS), as an example of an adequate screen. A recent scholarly review of the PEDS states that the test “suffers from some important psychometric limitations” (standardization sample, reliability, and validity data).3 _{In fact, the reviewer}

says: “At a minimum those youngsters identified by significant concerns on the PEDS should be referred for a broad spectrum, validated screening test like the Battelle Developmental Inven-tory Screening Test or the Denver II.”3

The PDQ II followed, if necessary, by the Denver II is designed to serve the busy clinician with a brief overview of a child’s development in personal-social, fine motor-adaptive, language, and gross motor areas. As in the case of a growth curve, it is designed to serve as a quick screen of development and for developmental surveillance. In the end, the clinician has to decide for him/herself which test and which deviation are most relevant. There is no substitute for clinical judgment.

William K. Frankenburg, MD, MSPH University of Colorado School of Medicine Denver, CO

REFERENCES

1. Frankenburg WK. Developmental surveillance and screening of infants and young children.Pediatrics.2002;109:144 –145

2. Frankenburg WK, Dodds J, Archer P, et al. The Denver II Training Manual.Denver, CO: Denver Developmental Materials, Inc; 1992 3. Bischoff L. Review of the Parents’ Evaluation of Developmental Status.

In: Plake BS, Impara JC, eds.Fourteenth Mental Measurements Yearbook. Lincoln, NE: Buros Institute of Mental Measurements; 2001:897–900

2-Dose Hepatitis Vaccination Versus 3-Dose

Regimen

To the Editor.—

The recent paper by Levaux et al1_{that compares a 2-dose}

hepatitis vaccination regimen against a 3-dose regimen makes some quantitative errors or assumptions that bias results in favor of the 2-dose hepatitis B regimen. First, the authors assume vac-cine seroprotection rates for completion of the 2- and 3-dose regimens are 99.2% and 98.3%, respectively. The sole study2_on

which these rates are based indicates this difference has aPvalue of .62. In fact, had one fewer person in the 2-dose arm serocon-verted, the observed seroprotection rates would be both mathe-matically and statistically equivalent. It would have been more prudent to use the weighted average of the 2 seroprotection rates for both regimens when building economic models. Second, the authors presume that the probability of receiving the second dose among those receiving the first dose is equivalent in both the 2-dose and 3-dose vaccination regimens. This assumption seems implausible. The second dose of the 3-dose regimen is scheduled 1 month after the first, whereas the second in the 2-dose regimen is scheduled 4 to 6 months after the first. Surely the probability of receiving the second dose, among those who took the first, must be higher for the 3-dose series.

The fundamental problem with building a complex economic model that involves a number of assumptions is that it is difficult to access the effect individual assumptions have on the model. It is unclear how a revised analysis would change study results. It behooves model builders to include not just the assumptions that support their own beliefs, but also to consider those held by others that may not provide the same support.

David F. Cruess, PhD

Department of Preventive Medicine and Biometrics Uniformed Services University of the Health Sciences Bethesda, MD 20814-4799

REFERENCES

1. Levaux HP, Schonfeld WH, Pellissier JM, Cassidy WM, Sheriff SK, Fitzsimon C. Economic evaluation of a 2-dose hepatitis B vaccination regimen for adolescents.Pediatrics.2001;108:317–325

2. Cassidy WM, Watson B, Ioli VA, Williams K, Bird S, West DJ. A randomized trial of alternative two- and three-dose hepatitis B vacci-nation regimens in adolescents: antibody responses, safety, and immu-nologic memory.Pediatrics.2001;107:626 – 631

To the Editor.—

The study by Levaux et al1_{of adolescent hepatitis B}

immuni-zation strategies has numerous limitations. First, their model as-cribes greater seroprotection at regimen completion for the 2-dose regimen, despite a lack of evidence that any such difference exists. Second, the authors assume adolescents are equally likely to re-ceive a second dose of the 2-dose regimen as a second dose of the 3-dose regimen. This is unlikely because the second dose of the 3-dose regimen is due only 4 weeks after the first, whereas the second of the 2-dose regimen is not due until 4 to 6 months after the first. Third, the assumed lifetime risk of 5% among persons age 11 to 15 years with no previous hepatitis B vaccination or hepatitis B exposure is overstated. Although they provide no reference for this estimate, all their hepatitis B clinical and economic effects derive from one previous cost-effectiveness analysis of the 3-dose regimen.2_{Importantly, that paper states a 4.16% risk for members}

of a 1991 cohort who are at least 6 years old. Furthermore, the number of reported hepatitis B infections among persons at least 15 years old has decreased from 16 980 in 1991 (L. Finelli, Centers for Disease Control and Prevention, Division of Viral Hepatitis, Hepatitis Surveillance Branch, personal communication, August 17, 2001) to 7414 in 1998,3 _{or 56%. Using these data, a more}

reasonable estimate of lifetime hepatitis B risk for this population is 1.83%, or 63% less than that used by the authors.

Assuming equivalent seroprotection at regimen completion and a more accurate hepatitis B risk estimate, the data used by Levaux et al results in the 3-dose regimen, not the 2-dose regimen, obtaining greater cost-effectiveness in each of the three analyses.

LETTERS TO THE EDITOR 1183

at Viet Nam:AAP Sponsored on August 30, 2020

(4)

DOI: 10.1542/peds.109.6.1181

2002;109;1181

Pediatrics

Frances Page Glascoe

Two Views of Developmental Testing

Services

Updated Information &

http://pediatrics.aappublications.org/content/109/6/1181

including high resolution figures, can be found at:

References

http://pediatrics.aappublications.org/content/109/6/1181#BIBL

This article cites 4 articles, 2 of which you can access for free at:

Subspecialty Collections

l_issues_sub

http://www.aappublications.org/cgi/collection/development:behaviora

Developmental/Behavioral Pediatrics

following collection(s):

This article, along with others on similar topics, appears in the

Permissions & Licensing

http://www.aappublications.org/site/misc/Permissions.xhtml

in its entirety can be found online at:

Information about reproducing this article in parts (figures, tables) or

Reprints

http://www.aappublications.org/site/misc/reprints.xhtml

Information about ordering reprints can be found online:

(5)

Two Views of Developmental Testing

Letters to the Editor

Two Views of Developmental Testing

2-Dose Hepatitis Vaccination Versus 3-Dose

Regimen

DOI: 10.1542/peds.109.6.1181

2002;109;1181

Pediatrics

Frances Page Glascoe

Two Views of Developmental Testing

Services

Updated Information &

http://pediatrics.aappublications.org/content/109/6/1181

including high resolution figures, can be found at:

References

http://pediatrics.aappublications.org/content/109/6/1181#BIBL

This article cites 4 articles, 2 of which you can access for free at:

Subspecialty Collections

l_issues_sub

http://www.aappublications.org/cgi/collection/development:behaviora

Developmental/Behavioral Pediatrics

following collection(s):

This article, along with others on similar topics, appears in the

Permissions & Licensing

http://www.aappublications.org/site/misc/Permissions.xhtml

in its entirety can be found online at:

Information about reproducing this article in parts (figures, tables) or

Reprints

http://www.aappublications.org/site/misc/reprints.xhtml

Information about ordering reprints can be found online:

DOI: 10.1542/peds.109.6.1181

2002;109;1181

Pediatrics

Frances Page Glascoe

Two Views of Developmental Testing

http://pediatrics.aappublications.org/content/109/6/1181

located on the World Wide Web at:

The online version of this article, along with updated information and services, is

by the American Academy of Pediatrics. All rights reserved. Print ISSN: 1073-0397.

the American Academy of Pediatrics, 345 Park Avenue, Itasca, Illinois, 60143. Copyright © 2002

has been published continuously since 1948. Pediatrics is owned, published, and trademarked by

Pediatrics is the official journal of the American Academy of Pediatrics. A monthly publication, it