September 2005 — Industry Perspective

Print this article | Email this article

Click here to receive your FREE subscription to T.H.E. Journal

Test Bias or Real Differences?

A test publisher’s perspective on closing the achievement gap.

Even with the years of effort dedicatedMargaret A. Jorgensen to equalizing education, the achievement gap in the US persists. The question now facing test developers is whether the gaps are the result of true differences in achievement or of bias in the tests. Test publishers have developed methods of detecting and eliminating bias in their assessment products; however, despite all the attention and resources directed at closing the achievement gap, a disparity in performance still remains. But an “item design” would enable assessments not only to measure a student’s academic achievement, but also to provide insight and detailed information about what learning objectives a student has not yet reached. By both serving as a thermometer and offering educational prescriptions, standardized assessments can be a tool for narrowing the achievement gap.

With professional and ethical responsibilities to remove barriers of bias in their assessments, test publishers extensively rely on industry standards to produce highquality assessment instruments. For a published test to be fair and unbiased, it must measure a student’s achievement without being affected by extraneous factors such as student gender, culture, ethnicity, geography, or soci'economic status. Deeply underpinning this discussion is that the clear purpose of educational assessments is to measure the differences in student achievement. There is no legitimate reason to build and administer a test that confirms sameness. Only by understanding the differences between students can we customize instruction for each student.

Given best practices in the industry for eliminating bias, what can test publishers do to contribute and facilitate a deeper understanding of the root causes of these differences? Addressing this requires a shift in the traditional function and design of standardized assessment systems and individual items. What if items were constructed differently so that they would reveal where learning breaks down for individual students?

Multiple-choice items have a long history of helping educators and policymakers understand what students know and can do. Basically, items are created to sort students into two categories: those who know the content and those who do not. They are not written to provide systematic insights into where students are in their thinking; valuable information for teachers whose job it is to help move students who do not answer the item correctly to an understanding of the standard.

Harcourt (harcourtassessment.com) has developed a new item type that helps teachers understand not only which students know the content, but also where the students who do not know the content are in their understanding. Given a typical multiple-choice item, we have the opportunity to sort students into four levels of achievement. By building assessments that provide increasingly precise information about why students choose the incorrect answer, test publishers make a direct contribution to the improvement of classroom instruction. Teachers can know where learning has broken down and where instruction should be targeted.