Hoboken Curriculum Project: Design Flaw Suspected In Texas Standardized Tests

Thursday, September 6, 2012

Design Flaw Suspected In Texas Standardized Tests - Morgan Smith (Texas Tribune)

A recent story by Morgan Smith of the Texas Tribune has gained a fair amount of attention over the past few weeks. It highlights the work of a colleague of mine, Walter Stroup, on high-stakes, large-scale state student accountability testing in Texas, with possible implications nationwide. -Dr. Petrosino

In 2006, a math pilot program for middle school students in a Dallas-area district returned surprising results.

The students’ improved grasp of mathematical concepts stunned Walter Stroup, the University of Texas at Austin professor behind the program at Richardson Independent School District. But at the end of the year, students’ scores had increased only marginally on state standardized TAKS tests, unlike what Stroup had seen in the classroom.

A similar dynamic showed up in a comparison of the students’ scores on midyear benchmark tests and what they received on their end-of-year exams. Standardized test scores the previous year were better predictors of their scores the next year than the benchmark test they had taken a few months earlier.

Now, in studies that threaten to shake the foundation of high-stakes test-based accountability, Stroup and two other researchers said they believe they have found the reason: a glitch embedded in the DNA of the state exams that, as a result of a statistical method used to assemble them, suggests they are virtually useless at measuring the effects of classroom instruction.

Pearson, which has a five-year, $468 million contract to create the state’s tests through 2015, uses “item response theory” to devise standardized exams, as other testing companies do. Using IRT, developers select questions based on a model that correlates students’ ability with the probability that they will get a question right.
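For readers unfamiliar with item response theory, here is a minimal sketch of the kind of model the article describes, one that ties a student's ability to the probability of answering a question correctly. It assumes the common two-parameter logistic form; the article does not say which IRT model Pearson actually uses, and the function name `p_correct` and the item parameters below are hypothetical, chosen only for illustration.

```python
import math


def p_correct(theta: float, a: float, b: float) -> float:
    """Probability of a correct answer under a two-parameter logistic item model.

    theta: student ability (assumed latent scale, 0 = average)
    a:     item discrimination (how sharply the item separates abilities)
    b:     item difficulty (ability at which P(correct) is 0.5)
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item, not taken from any real test.
item_discrimination = 1.2
item_difficulty = 0.0

for ability in (-2.0, 0.0, 2.0):
    prob = p_correct(ability, item_discrimination, item_difficulty)
    print(f"ability {ability:+.1f}: P(correct) = {prob:.2f}")
```

In this sketch, a student whose ability equals the item's difficulty has an even chance of answering correctly, and the probability rises with ability. How questions are then selected using such a model is not detailed in the article, so the sketch stops at the model itself.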

That produces a test that Stroup said is more sensitive to how it ranks students than to measuring what they have learned. Such a design flaw could also explain why Richardson students’ scores on the previous year’s TAKS test were a better predictor of performance on the next year’s TAKS test than the benchmark exams were, he said. The benchmark exams were developed by the district, the TAKS by the testing company.

Stroup, who is preparing to submit the findings to multiple research journals, presented them in June at a meeting of the House Public Education Committee. He said he was aware of their implications for a widely used and accepted method of developing tests, and for how the state evaluates public schools.

“I’ve thought about being wrong,” Stroup said. “I’d love if everyone could say, ‘You are wrong, everything’s fine,’ ” he said. “But these are hundreds and hundreds of numbers that we’ve run now.”

Gloria Zyskowski, the deputy associate commissioner who handles assessments at the Texas Education Agency, said in a statement that the agency needed more time to review the findings. But she said that Stroup’s comments in June reflected “fundamental misunderstandings” about test development and that there was no evidence of a flaw in the test.

After a lengthy back and forth at the meeting, the committee’s chairman, Rob Eissler, suggested a “battle of the bands,” a hearing where the test vendors and researchers would trade questions. Eissler, Republican of The Woodlands, said recently that he found Stroup’s research “very interesting” and that he was weighing another hearing.

Stroup’s research comes as growing opposition to high-stakes standardized testing in Texas is creating an alliance between parents, educators and school leaders who wonder how the tests affect classroom instruction and small-government conservatives who question the expense and bureaucracy they impose.

This is not the first time that the way the state uses standardized test scores in the accountability system has been questioned. In 2009, the state implemented the Texas Projection Measure, a formula that factored a prediction of students’ future performance on state exams into schools’ accountability ratings. Lawmakers, led by state Rep. Scott Hochberg, attacked the measure, saying it allowed schools to count students as passing who did not. After outcry prompted the education agency to issue ratings with and without the measure in 2010, the state dropped it completely the next year.

Hochberg, D-Houston, has since proposed legislation aimed at reforming the role of standardized testing in public schools because of the data he saw as he led the charge against the measure. The data showed that a student’s test score in one year strongly predicted that student’s score the next.

“I have for a long time said that the accountability system doesn’t give us all the information that the numbers are used to generate,” Hochberg said, adding that basing accountability “more on the kid’s history than the specifics of what happened in the classroom that year may make us feel good but it doesn’t give us any true information.”

Picture: Dr. Stroup being interviewed for this article.

Internet Copyright Notice & Guidelines

Any redistribution or reproduction of part or all of the contents in any form is prohibited other than the following:

· you may print or download to a local hard disk extracts for your personal and non-commercial use only

· you may copy the content to individual third parties for their personal use, but only if you acknowledge the website as the source of the material

You may not, except with my express written permission, distribute or commercially exploit the content. Nor may you transmit it or store it in any other website or other form of electronic retrieval system.

DISCLAIMER

This is a moderated blog. Comments should be respectful and pertain to the topic posted. Blog moderators reserve the right to remove any comment determined not in keeping with these guidelines.

Posts made on or before August 31, 2009 were uploaded when this site was known as "The Hoboken Curriculum Project." At that time, the site operated with the knowledge and awareness of the Hoboken School Board. However the content and opinions posted may or may not have represented their views personally or collectively, nor did it attempt to represent the official viewpoint of Hoboken School District administrators or employees.

Posts uploaded on or after September 1, 2009 are simply the thoughts, ideas, and opinions of Dr. Anthony Petrosino, and do not reflect the opinion or position of any educational boards or institutions with which I am associated or affiliated.

The information contained in this website is for general information purposes only. The information is provided by Dr. Anthony Petrosino and while I attempt to keep the information up to date and correct, I make no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability or availability with respect to the website or the information, products, services, or related graphics contained on the website for any purpose. Any reliance you place on such information is therefore strictly at your own risk.

In no event will I be liable for any loss or damage including without limitation, indirect or consequential loss or damage, or any loss or damage whatsoever arising from loss of data or profits arising out of, or in connection with, the use of this website.

Through this website you are able to link to other websites which are not under the control of Dr. Anthony Petrosino. I have no control over the nature, content and availability of those sites. The inclusion of any links does not necessarily imply a recommendation or an endorsement of the views expressed within them.

Every effort is made to keep the website up and running smoothly. However, Dr. Anthony Petrosino takes no responsibility for, and will not be liable for, the website being temporarily unavailable due to technical issues beyond his control.
