Development and Implementation (1998 - 2003)
Claudia Krenz, Ph.D. (datafriend @ gmail-.-com)
Under NCLB, one state's science achievement test could measure knowledge of geology and another's, creationism. NCLB's tolerance for state curricular definitions contrasts with reports of "faith" (and oil) based initiatives--like federal scientists studying climate change being told not to report what their models predict or they observe, the National Park Service being ordered to remove books about the geological history of the Grand Canyon, and the the National Institutes of Health having to remove the word "condom" from its HIV health prevention pages.
≠
|
Click on a link to your right to go to that section. The text in this file
is organized sequentially by these links (with the URLs mentioned in text
organized alphabetically in the Bibliography). The first two links to the
right are before the present paragraph, the rest, in order, after it: this
first section looks at "accountability" as defined by the NCLB.
The second section describes the HSGQE's development, official definition, different administrations, student performance, and what little's known about the pre-2002 administrations. Issues common to test development are illustrated with the Texas Assessment of Academic Skills (TAAS), required there since 1990--to overview the testing process and highlight issues (most certainly not to suggest that Texas sets an example to emulate)--and what might have been learned. |
|
While NCLB leaves the states free to establish their own content and performance standards for their own students, the U.S. Department of Education (USED) has various means of influencing said content. In one example, in 2005, one state, Utah, openly defied USED: USED Secretary Spellings called Utah state legislators names and threatened to cut funding for its neediest schools.
"The letter warned that 'the consequences of enacting and implementing this bill would be so detrimental to students in Utah' .... In the letter's 25-line third paragraph, Mrs. Spellings warned that the department probably would cut off at least $76 million in federal funds this year for low-income school districts and teacher training" (Archibald, G., Washington Times, 20 April 2005). Secretary Spellings is also quoted as saying that "Under the state test, 74 percent of white eighth-graders and 47 percent of Hispanics were reported 'proficient or advanced' in math, while just 34 percent of white and 7 percent of Hispanic eighth-graders were 'proficient or advanced' on the NAEP math test."
The Secretary of Education's threats were quite real: As a Utah newspaper put it, the day the state legislature rebelled against the NCLB
"also happened to be the anniversary of the opening battles of the Revolutionary War. Two-hundred-thirty years ago, Massachusetts militiamen at Lexington and Concord faced off with British troops, firing the famous shot heard 'round the world .... At Lexington and Concord, and in Salt Lake City, the central players were taking a risk. The colonists were risking the wrath of a global superpower--Britain--and the hangman's noose for treason. Utah's stake is bloodless, but a risk nevertheless" (Provo Daily-Herald, 21 April 2005). |
The Secretary's argument was "one heck of a good one:" Comparing percentages of 8th grade Utah students proficient on the federal National Assessment of Educational Progress (NAEP) and Utah's own state achievement test scores (Utah called its the "UPASS"; Alaska, the "HSGQE") is possible since test scores are only numbers. Comparing scores on tests that have never been standardized to each other--and are not necessarily even measuring the same types of achievement (or have the same definitions of "proficiency")--however makes no sense: There are so many possible explanations for the observed differences in percentages reported on the two tests that it's impossible to single out a particular one: what would be a wonder would be if the scores had been the same!
--
|
USED responded more rapidly to Hurricane Katrina displaced children than FEMA,
instructing schools to enroll them as "migrant" students so the schools could be reimbursed (NCLB expressly prohibits creation of a "single nationwide database" except "to better coordinate services" for students switching schools).
Alaska has doubtless already given its K-12 data to USED and the time is |
12:00:59 |
NCLB is reminiscent of the old Greek myth about the Trojan horse: while everyone, including me, was squawking about irrational accountability testing--individually identifiable data about US public K12 students are being silently sent behind secure socket layers. How will USED use these data? Targeted advertising is possible. I doubt these data will be used for educational research. If this relatively well funded data collection part of the NCLB were on the up-and-up, why would USED indulge in secretive and threatening messages (just so, if the NSA says it is not intercepting international email, one wonders; if the NSA says it is intercepting international email, there's little doubt).
| Alaska Summary |
In 1998, the Alaska State Legislature passed its "Quality Schools Initiative," compared to most states in the continental U.S., Alaska students were doing well compared to their counterparts in the other states. Click here to a sorted list showing percents of 8th graders proficient on the National Assessment of Educational Progress' math test (the closest the US had to gold standard or way of evaluating student achievement levels across the country. They were certainly doing better than students in California and Texas, home states of the whining oil execs who convinced the Alaska leg that its "work force " wasn't up to becoming good employees. I don't think the leg noticed that its students were doing better before it decided to inflict accountability testing on real students in real time.
Accountability Testing in Alaska under Tony.
This page is a compilation of links about educational testing in
Alaska since 1998 and implementation of the "Quality Schools
Initiative" from an educational testing and measurement perspective.
Alaska was one of the last states to implement standards-based reforms: the leg passed a law, as had most other States earlier, mandating that schools a) administer a battery of tests to students [3 partial days/academic year for the testing alone] and b) aggregate those scores over local schools to hold each "accountable." This "accountability testing" was on top of what has always normally occurred in the classroom (the typically enormous lag between testing and scoring made pedagogical use of these scores unlikely). Students took CTB tests of unknown merit (insofar as of unknown relevance to state-articulated performance objectives); their scores were aggregated over individual local schools and made public, typically on the web. Here's a 1) sorted list of how the other states compared to Alaska in degree of accountability testing in 1998. Two years before the leg enacted the Quality School's Initiative, Alaska's students had scored higher on the National Assessment of Educational Progress (NAEP) than students in the home states of the oil execs who convinced the leg that Alaska's students were insufficiently educated. Specifically, the percentage proficient and above on the NAEP math exceeded CA's and TX's by about a decile:
A state that didn't recognize the caliber of its own students--used whatever the oil execs said as its litmus test--wouldn't and didn't ask whether implementing such a testing program--often described w/ phrases like "standards-based reforms" and "high-stakes testing" and above all "accountability"--was a wise move, whatever the caliber of its students. Had the AK leg looked, it might have noticed, from correlating scores on two variables--degree of accountability testing grade (as assigned each state by EWW) with % proficient on 8th-grade math (as assigned by the NAEP)--the following Pearson product-moment correlation: 3) -.39. This statistically significant correlation is in the *wrong* direction insofar as the accountability testing model would predict that imposing its reforms enhanced, increased, or accelerated achievement.
Accountability Testing in Alaska under Frank. The national NCLB law is based on Texas' high school exit exam, the TAAS: USED Secretary Paige--former Houston principal--doubtless did not know that the Houston education miracle *poof* was fraud (created by the miracle of cooking the high school dropout data, not dissimilar to cooking enron's books--misleading accounting policies to say the least). It also mandates the impossible: that in less than a dozen 100% of all students everywhere score proficient (no accidents, no boycotts, everyone--perfect everywhere).
The NCLB acknowledges what has been observed in countless empirical studies: a high ses/white -- lo SES/not white difference on a lot of different standardized tests. There are numerous interpretations of these test score differences: Under one common positivist paradigm, scores on what for-profit companies say a test measures is what they actually measure, whether w/ "iq" or "achievement." The Alaska leg typifies this kind of naive empiricism. Others argue that such scores mean nothing at all until at least quasi validated. Ironically, as schools that can afford to opt-out, those bearing most of the burden may be those most initially in need of help. In the sense that children get left behind everyday--poverty, abusive parents, the list is long--and in the sense that the law at least superficially addresses the problem resulting from a climate of "low expectations" by requiring test scores be reported disaggregated by groups like the vanilla census 2000 ethnicity categories--which are of little relevance to Alaska. No surprise--being based on the "Texas education miracle"--the law does not require the states to provide disaggregated dropout data. Targeted advertising--based on purchases at the old Carr's/new Safeway's--has already begun on the Kenai Peninsula: how will the data on the little kids be used? Certainly for advertising; certainly for more Texas--Enron, the Houston *poof* education miracle--math; who knows to what totalitarian ends. I am not optimistic. These data will not be used for educational research, which USED regards as being conducted by the "the dullest yeomen". USED presents the PBDMI as the solution. I see it as a Trojan Horse. If this relatively well funded data collection part of the NCLB were on the up-and-up, why would USED indulge in secretive and threatening messages? There is no question that Alaska isn't unique: but to look at approval of Alaska's plan in isolation is to miss the forest for the trees: The NCLB has two components, an under-funded accountability part--17 plans for which were approved in the recent Rose Garden ceremony--and a fully funded data collection part, which is little known. There is no question that Alaska isn't smart: one parent who telephoned said that AK was one of the few states whose public schools he'd allow his kids to attend (he was mad about academic time lost to testing, his kids' time wasted). A page at the whitehouse.gov site says that public support for the NCLB is "rock solid" although that doesn't seem so in states like Utah (Alaska officials proclaim the state leaders in obeying the NCLB [while simultaneously insisting on noting the numerous discrepancies between the law and reality]). Go to an automatically generated news site (news.google or news.yahoo) and paste NCLB into its search window to find out what local communities external to your own are saying.
| Accountability Testing in Alaska under Frank
(contd.): The part of the NCLB no one talks about, The
Performance Based Data Management Initiative. That the accountability
part of the NCLB is like the HSGQE and the TAAS on steroids keeps the
schools preoccupied (so many ways to fail, busy schools writing
self-improvement plans, and after a few sequential years of not making
AYP, so much money to be made assisting those "failing").
The other part of the NCLB I came across at a whitehouse.gov site in the process of updating this page: The plan to create a single nationwide database on every U.S. public K-12 kid. No one, of course, speaks of it because there is much money to be made in educational technology these days: so many software applications for entering NCLB-compliant data, custom interfaces, lots of money. Type "hsgqe" into a google window--and see a testing industry advertisement; type in "nclb" and see many, many ads. Although the NCLB requires the local schools to provide scores disaggregated by groups, USED wants individual test scores, as well as names, addresses, and phone numbers. Several examples of state pages talking about data entry are shown below. One problem--in the grand tradition of unfunded federal mandates-- is funding for the local schools, the "first responders" in the present context. Sometimes local schools have to fork over the bucks for the data entry software themselves:
I see no other interpretation of NCLB data entry anecdotes--listed below under the red panic button (yes, my hair is on fire)-- than that, although expressly prohibited, USED is creating a single nationwide database: call it soft- and hardware evolution, Moore's law, whatever. It is now very easy for the feds to collect data on every kid in the country--especially when the states do their own data entry ("clerk work" as Rep. Dayton put it; "no way in hell are we going to do all they're asking" said another Utah official). Individually identifiable student data about every Alaska public K-12 kids would fit onto a 1.4 MB floppy. But, in Alaska, officials face this orwellian threat w/ quasi-bovine tranquility: as they do all else, silently. All the more tragic because Alaska children will be more adversely impacted than those in other states (kids get SSNs young up here). I interpret all of USED's bombast about rigor--the importance of "scientifically valid and reliable data"--to be about wanting to collect accurate data. Reading through the e-gov documents, I see little evidence that those--too often former doubleclick ad execs--speaking w/ the voice of USED understand quantitative longitudinal research (one discussion of randomization I read merits an A+, as farce in a literature class--so target rich, the documents of these advertising industry execs--and there's, of course, the new "Institute of Education Science" and so many methodological gaffes). Many in Alaska are more than comfortable w/ the idea of not being particularly easy to find ("we don't need no stinking rules" says a local radio dj). It's hard to understand why no one is concerned about individually identifiable data about their children. Perhaps it's related to their local officials telling them that the feds are only getting the aggregated data, not the individually identifiable data. Not through the pipeline crudely but silently behind SSLs. C lick on the "e-gov" link to read an appeal to the Alaska gov about USED's data collection efforts, the other part of the NCLB, whose funding has not been cut since enactment (how convenient to have data on your citizens in the new American century of total information dominance). |
|
What happens when you ignore reality? It doesn't go away. What do you
call a state that ignores reality? Fool, alaska. What happens when you
discuss reality? You make more informed decisions. What do you call a
state that faces the issues? Leader, Utah.
Alaska is the leader in obeying the NCLB: the gov and his daughter support it, as does the rest of the congressional delegation. The gov says he has fully funded Alaska education. If you believe that, I'd like to sell you the Brooklyn Bridge. What to call a government that gives its K-12 data to USED and its senior data to Wells Fargo Bank? An enemy of the people? Pretty damn dumb? What do you call a national law based on the *poof* Houston education miracle, one of numerous Texas frauds so obvious even the NYT noticed? The gov and his daughter favor fraud? No, I think them just deluded. After all, Alaska schools have no bechtel toilets that pump sewage when flushed. We're not at that level of incompetence yet. As usual, the real losers are the students. I am happy for Alaska's pot smokers, that they have a right to privacy under the state Constitution, that they have civil rights: too damn bad the rest of us don't. Alaska media can be counted on to obfuscate the issues: Regardless of merit--not based on suddenly coming up to previously articulated standards--ALL 17 unapproved State accountability plans, including Alaska's, were approved en masse, on 10 June 2003, the day before a congressional deadline, in a special Rose Garden ceremony: To the left below, read what Education Week on the Web recently said of that Rose Garden event; to the right below, read what Anchorage Daily News said of the same event.
There is no question that Alaska isn't unique: but to look at approval of Alaska's plan in isolation is to miss the forest for the trees: The NCLB has two components, an under-funded accountability part--17 plans for which were approved in the recent Rose Garden ceremony--and a fully funded data collection part, which is little known. There is no question that Alaska isn't smart: one parent who telephoned said that AK was one of the few states whose public schools he'd allow his kids to attend (he was mad about academic time lost to testing, his kids' time wasted). It is interesting that the PBDMI was renamed EDEN--and interesting the online forms showing the data the states must supply--they take as long to fill out as they did on paper--showing spaces to insert this and that aggregate data: the data entry tables are so convoluted that words like "gender" appear more than a dozen times in a single form. In a digital world--ignorance of reality is not an excuse for the state whose self image is open source but whose government is a province of redmond--where it is easy for anyone who really wants to intrude into any networked system: in this world, this current reality situation, are you comfortable with forms that aren't transparent, on their face understandable? I could imagine questions whose answers could only be obtained w/ the individually identifiable data--which means the individually identifiable data would be in the state.client computer's cache and could, even w/o an overt request for said data, it could be whisked out of the client alaska.state computer to the host.USED computer. How Like w/ Patriot, the credit goes to the people of Alaska. And so now comes the responsibility. To not act now is to make a decision. If my local newspaper is to be trusted, the courts--remember, part of the "balance of powers"--have recently ruled that an Alaska citizen's right to privacy under the Alaska Constitution is so great as to trump federal marijuana statutes. If the pot smokers have a right to privacy, what would you say about the little kids who currently have no voice? Not New: CTB's Spring 2003 tech report is out. Table 39 lists inter-rater reliability coefficients for the 9 constructued-response writing items (subtract the number of multiple choice items in Table 1 from the numer of multiple-choice (MC) items to get the number of constructed response items [35-26=9). These coefficiens range from a low of about 70 to highs of about 85% (remarkably, this table indicates that students did not submit responses--questions were left blank--from 15 to 20% of the time for each of the 9 items). And yet the students of Alaska have no substantive right of appeal or due process. CTB also published a Spring 2002 HSGQE and Benchmark technical report.
The local papers and radio talk-in shows contain an increasing number of critical if not outright hostile discussions of the underfunded accountability part of the NCLB. Often mentioned are the unintended bad side-effects of the increased testing. There is no discussion of the fully-funded--I think its budget even increased in fy04--Performance Based Data Management Initiative. Do you who distrust the accountability part of the NCLB that you understand so well trust the fully-funded data collection initiative you probably don't understand (people's eyes often glaze over at the mention of computers and data, numbers, matrices)? Especially since a file of individually identifiable Alaska K-12 data would be less than 1.4 MB--in the context of the today's common GE 80 MB HDDs, it would hardly be noticeable. As in every other state in the union, the schools enter the individual student data and send it to a central state repository: the schools do not know what happens to the data after being submitted to the state. Does the State know? Do you trust what the state says because it says it? To each of my email buds who've remarked on AK's stand on Patriot, I reply back that the leg deserves no credit (like any shapeshifter, it easily assumes different postures). The credit belongs to the people of Alaska. Go to google--I use it as an example search engine because it is currently the best--and paste in Performance Based Data Management Initiative--or PBDMI or its newer name, EDEN (Education Data Exchange Network). The state noted, in a document that is no longer online, that "In order to initiate an appeal to a student's Fall 2003 Alaska Scale Score, a re-score of a student's test book is required. An appeal to a student's Alaska Scale Score, in one or more of the Reading, Writing and Mathematics content areas will incur processing charges. A handling charge of $54.00 per student, per content area, is required for each appeal and must be submitted with a purchase order. If the re-scored student's performance level changes from NON- PROFICIENT to PROFICIENT, CTB will provide the District with a new student report, at no charge. If the re-scored student's performance level does not change, CTB will invoice the District for the cost of the re-score process. The Individual Student Report is the only report that may be appealed. Summary reports will not be re-generated to accommodate changed student scores." Why Alaskans are allowing their incompetent leaders to railroad their own kids is beyond my understanding. And to give away the data on your own kids boggles any sense of "fair play" I ever understood. Ignorance needn't be a bastion of the last frontier: functionally it is so. I hear from a reader that CTB is outsourcing exam construction--go to elance.com: who knows who will be doing what to generate tests items and test scores and test statistics--for which there is no appeal for most students--reflecting Alaska's "unique" performance objectives--it won't be me since I've pissed off the phrass [and elance doesn't work]. Jeez, how long until we found out that Halliburton is Alaska's new testing contractor? DEEDs has proposed replacing CTB as its testing contractor, putting out an rfp last December. A seven-member state committee will review proposals from five other contractors--also w/ preexisting contracts w/ other "large" (read more lucrative) states. The abysmal turn-around time is mentioned as one reason for change, as is--it's only taken 5 years for ak "leaders" to notice--the importance of tests being aligned w/ state standards. Who's going to decide what's a good proposal when no one in AK DEEDs has expertise--or shows leadership in ignoring (let's thank them)--psychometric issues common to all high-stakes testing? having a new test publisher will not magically resolve existing problems. Why now? To raise dust, make it more difficult for people to see what's going on (anyone who says the NCLB doesn't contain provisions written to benefit the test publishers has not read the law). The losers, of course, are the kids. Since your leaders refuse to take responsibility--lead the pack in ignoring reality--who is left? I wish you luck, but I do not think that will suffice. . New: "Security was tight when Texas State Board of Education members were given results ... from ... new ... test. Guards stood outside their locked meeting room, and board members were asked to sign a secrecy pledge ... 'The results were grim' ... [but] Federal officials ... are satisfied lower standards" aren't an early unintended NCLB byproduct (Dillon, 03). |
| The Single Nationwide Database |
According to NCLB Sec. 9531, there is a "Prohibition on nationwide database." However, under the not well known fully funded "Performance-Based Data Management Initiative"--such a database is being created. Searches on the pbdmi are turning up state.us pages like the following:
Nebraska districts used to "enter data ... within their districts and once per month export that data on a disk to the central repository in Lincoln." Now they use a "client/server operation ... utilize a web browser to ... enter data along with generating necessary reports" (August 02). Oregon says "This year ... grant access to kindergarten teachers to enter data ... due at the Department by Nov. 1" (September 02) A New Jersey NCLB reference manual says "Federal regulations require LEAs to collect and submit data to the NJDOE. The data is compiled and forwarded to the USDOE." |
Before the NCLB, Nebraska backed up its student data on floppies and sent them to its capital in Lincoln: now or soon, from there into a single nationwide database. I'd guess that all the states are extremely pro-active when it comes to the anonymity of student data in their individual home systems--just a hunch (just like I'd guess that that the "N" in "NPMIS" stands for "Nebraska," and one of the "M's" in "MMARS" numbers stands for "Minnesota"--and that there's an "alaska" somewhere in Alaska's OASIS numbers: many abbreviations are easy to figure out). My impression is that the single nationwide database has yet to be completed, that there is still time--but not much--to stop prevent, block transmitting individually identifiable AK student data to USED for inclusion in the single nationwide database.
It was not helpful that the governor's office "announced the retirement of Shirley Holloway, commissioner of the Department of Education and Early Development, effective March 3. Also Thursday, Susan Stitham, chairwoman of the State Department of Education, said she received a message on her answering machine from Murkowski's office telling her she was relieved from duty and thanking her for time served" (Pesznecker, 2/21/03). Not helpful to scatter resource people in these NCLB resource consumptive times.
Some think state bashing--AK and most of the other 49 were already on the accountability bandwagon before the NCLB became law and USED one to whom one is accountable: like a joke I heard when I lived in Montana about why North Dakota dogs had stub noses (from chasing parked cars). |
E-gov
You can also go online and read--for yourself--USED describing said database in its own words. Check out an email I sent to the gov (the last "cc:" bounced: I'd assumed, having accepted the seat and the office, she would keep the email address: so much for assumptions). You can paste the blue URLs below into your browser or click them: be sure to read about the "educationadvisor" site belonging to "Evaluation Software," a USED-funded TX data collection company, which mentions variables like "last name" (do a backwards directory crawl and find other HottestTopics like "student IDs"). And don't forget the new USED research office which espouses "cognitive variables."
a) From: claudia <msdata @ srv.net> Date: Fri Feb 28, 2003 9:09:59 AM America/Anchorage To Governor @ gov.state.ak.us Cc: Lt_governor @ gov.state.ak.us, stevens @ senate.gov, donyoung @ house.gov, murkowski @ senate.gov Subject: a single national USED database containing individually identifiable data on every AK K-12 kid February 28, 2003 Dear Governor Murkowski, Since DEEDs and the state school board are currently rudderless, I guess that makes you the adult in charge: I write to express my concern about the second and not well known part of the NCLB, the "Performance-Based Data Management Initiative." I speak as a citizen, as someone who regularly conducts and interprets data or statistical analyses, and as someone who has maintained a web page on the HSGQE since its inception in 1998 (the proud mom of a 33-year-old, I am not part of AK K-12). The goal of this initiative, which is not mentioned in USED's "NCLB Desktop Reference" (www.ed.gov/admins/lead/account/nclbreference/reference.pdf), is a single national database consisting of individually identifiable data --is last name sufficiently specific? (www.educationadvisor.com/documents/OCIO2001/SEAMtgNotesDC_06_03_02.doc) --on every K-12 kid in the country (http://www.whitehouse.gov/omb/budget/fy2003/bud13.html, www.whitehouse.gov/omb/budget/fy2003/bud13.html; commdocs.house.gov/committees/edu/hedcew5-73.000/hedcew5-73.htm ), with data from other federal databases to be merged to it by next year. In one of the preceding URLs, a USED spokesperson disingenuously noted that the states have recognized the value of individual student data for ascertaining student learning: that statement is true insofar as student-level data is the only way to examine student learning (something which Alaska currently does not do). However, since, amongst other data points, USED vaporized its educational research office (OERI) last November--replacing it with an "Institute of Research" headed by someone who uses scientific-looking graphs to summarize literature reviews--I cannot imagine the intended use of said database being "educational research" It will be, intentionally or not, ignoring Orwellian concerns, a "hot" commercial property, and, once created, were the NCLB later repealed, it will continue to exist, its data being bought and sold. I suspect you are not a statistics or internet user, Governor Murkowski. You may, however, be a science fiction reader, in which case you'll understand what I mean when I say that said database will follow those in it--Alaska kids and every other kid in the country--throughout their lives like a plague of flies in a William Gibson novel. Individually identifiable student data has always existed, heretofore at a local level: I cannot imagine it being anything but abused at a national level. Sincerely,
Dr. Claudia Krenz
b) From: Governor <office_of_the_governor @ gov.state.ak.us>
Date: Mon Mar 10, 2003 2:48:24 PM America/Anchorage
To: claudia <msdata @ srv.net>
Subject: Re: a single national USED database containing individually
identifiabledata on every AK K-12 kid
Received: from gov.state.ak.us ([127.0.0.1]) by jnumail1.state.ak.us
(Netscape Messaging Server 4.15) with ESMTP id HBK4SP00.JN7; Mon, 10 Mar
2003 14:48:25 -0900
Message-Id: <3E6D2448.32F37C5A@gov.state.ak.us>
Organization: Alaska Office of the Governor
X-Mailer: Mozilla 4.79 [en]C-CCK-MCD {SillyDog} (Windows NT 5.0; U)
X-Accept-Language: en
Mime-Version: 1.0
Thank you for writing to Alaska Governor Frank H. Murkowski. The concerns, opinions, and/or information you have sent is important and valuable to the Governor. Although he is for obvious reasons unable to respond to each and every email himself, your message has been received and is being reviewed by the appropriate staff person in this office who can best address your need, suggestion, or comment. c) From: claudia <msdata @ srv.net> Date: Mon Mar 17, 2003 7:42:18 AM America/Anchorage To: Governor <office_of_the_governor @ gov.state.ak.us> Subject: Re: a single national USED database containing individually identifiable data on every AK K-12 kidTO: Office of the Governor, Juneau FROM: Dr. Claudia Krenz, Nikiski Since the schools are also now otherwise occupied, I am heartened that "the appropriate staff member" is looking into the creation of a nationwide database under the fully funded "Performance-Based Data Management Initiative" part of the NCLB [the most efficient way I've found to search the new law is to go to google advanced, paste in "legislation," "ESEA," your query term (e.g., "privacy"), and "ed.gov" as domain]. Although the www.ed.gov/policy/elsec/leg/esea02/ pages assert the confidentiality and privacy of student records (and FERPA is specific about the aggregated data), I am concerned, because a) The federal government has an atrocious record of protecting data entrusted it (USED all the more since it runs on Microsoft products--why not, in these budget-cutting times, switch to the more reliable and considerably more economical Linux?). b) Despite a general prohibition against such a database (SEC. 9531, www.ed.gov/policy/elsec/leg/esea02/pg112.html#sec9531), the URLs listed in my earlier email speak of entering data and creating a database without restrictions like "coordinating migrant education activities" (SEC. 1308, www.ed.gov/policy/elsec/leg/esea02/pg8.html#sec1308). I for one used to be confident in the NAEP's reported assessment results, less so now (SEC. 411, www.ed.gov/legislation/ESEA02/pg97.html). Although a nationwide database could be used for educational research, it could also be otherwise used: imagine, for example, a new entry in your medical record, "inflamm. of R troch. bursa," being copied into a UHaul database in Reno and then your email "inbox" being flooded with independent advertisements purporting "inflamm. of R troch. bursa" cures (inconsequential insofar as blocked by another spam filter). What does "Nothing in this section shall be construed to ... prohibit the distribution of scientifically or medically true or accurate materials" mean (SEC. 9526, www.ed.gov/policy/elsec/leg/esea02/pg112.html#sec9526)? Less banal uses than this example are also imaginable. I hope that "the appropriate staff member" understands the NCLB's implications with reference to what used to be under local control: information about individual students--because I don't; the little I've observed, however, from reading USED pages discussing creating such a database (without reference to "migrant education") and state pages talking about data entry (again, without restriction) are a concern. Finally, I found: "Not later than April 30, 2003, the Secretary shall report to the Committee on Health, Education, Labor, and Pensions of the Senate and the Committee on Education and the Workforce of the House of Representatives the Secretary's findings and recommendations regarding the maintenance and transfer of health and educational information for migratory students by the States" ("the Secretary" being former Houston school superintendent Rod Paige). Again, I write to express my concern; only someone of the governor's stature can provide the digital leadership needed to prevent AK K-12 data being transferred into an initially single nationwide database housed in USED computers on the east coast. |
What do you think?
Were I a modern K-12 parent--although prohibited under Sec. 9531--I'd worry that if it looks like a duck and quacks like a duck, it might be a duck: are you willing to assume the data collection effort is limited to migrant students (Sec. 1308(b) (www.ed.gov/policy/elsec/leg/esea02/pg8.html#sec1308)? The search engines--yes! internet as a repository library!!--have, as summarized in this section, served up USED pages talking about creating a database; they have served up state pages talking about entering individually identifiable data about their K-12 kids and sending it--through SSLs (makes no noise and wouldn't, especially in Alaska's case, be that big a file by today's standards)--to USED: You're the parent, the adult in charge.
Did you know that, under SEC. 9506, "a private school [or home school] that does not receive funds or services under this Act ... are to be excluded from assessment" (www.ed.gov/policy/elsec/leg/esea02/pg111.html#sec9506)? |
Ironically, in 1996, AK students outscored students in the home states of the oil execs--whose whining is credited for convincing the leg to pass the initiative: 30% of Alaska's 8th graders scored at or above the proficient level on the venerable National Assessment of Educational Progress (NAEP) math test--compared to only 17% of CA's and 21% of TX's. |
*Alaska's performance standards (for writing, reading, and math) are the closest it has to a "gold standard." The courts think that--in the interest of fairness (not to mention common sense)--diploma-sanctioning tests must be aligned with the curriculum (e.g., Debra P. vs. Turlington): the burden of proof lies on CTB, AK's test contractor, to so demonstrate. Demonstrating that a test measures what you want it to measure is an essential but, alone, fairly low standard for meaningfulness. CTB doesn't meet it: its Spring 2000 and Fall 2000 and Spring 2001 technical reports were so lacking in detail one could not tell whether they're about the "HSGQE" or the "AHSGE" (Alabama's exit exam); one hopes that student test scores are a function of their responses--but, given CTB's quality control problems, such an assumption may be unwarranted. This is a problem of fact that will not disappear, because it is unnoticed and not reported (even Hawaii's tests are aligned with its content and performance standards).
*Although the effective date of the HSGQE was postponed, the ship of State nevertheless blindly assigns AK students CTB scores and then aggregates them to assign schools "report cards." Reminds me of the old joke about the physicist, chemist, and economist marooned with a case of canned food but no openers: The physicist started throwing rocks at trajectories that might puncture metal; the chemist started looking for plants that might dissolve metal; and the economist said "Let us pretend the cans are open." Just so, the relationship between Alaska's performance standards and CTB's tests: assuming the one is measured by the other it not helpful.
*In 2001, the federal government passed the "Leave No Child Behind" law (NCLB), jumping onto the standards bandwagon initiated by the states. The first part of the NCLB is called "accountability," and it has the states busy: USED initially rejected the accountability plans of 45 of the 50 states--including AK (4 Feb 03). States are also required to show "adequate yearly progress" (AYP), as defined by USED, most recently in last December's Federal Register. Since the impact of chance increases monotonically with test unreliability--and CTB has not shown evidence supporting the alignment of its tests with AK performance standards--for reasons having to do with the "sampling distribution of the mean" and the "law of large numbers," you'd expect half a state's schools to not show AYP, just by chance. The title "Randomly Accountable" summarizes the issue: there is increasing anecdotal evidence that schools judged meritorious by a variety of measures--including presidential proclamation--are not showing AYP (Dillon, 03; Winerip, 03). IMHO the states are too busy to notice that, just by chance--whose impact is inversely related to reliability--half their schools aren't going to show AYP no matter how well they're doing.
There is a second and not well known part of the NCLB, the "Performance-Based Data Management Initiative," whose goal is a single nationwide database consisting of individually identifiable data on every public K-12 kid in the country. Stands to reason that those most likely to be directly impacted by the existence of such a database would be the most geographically stable. Glad my kid is out of the current data-collection sweep [her UNIX is better than mine, too, so I know she knows how to stay out of trouble].
*As a parent, I had it easy: my daughter was in public junior high when I started my doc work in educational testing and measurement ... whenever an irrational test popped up--the schools had to get parental permission to administer them back then--I'd give my blanket approval (whatever paperwork needed) for her to use her time--testing occurs in real time--as she chose. Your kids though have it harder, the impact of standards-based reform being cumulative (3 days a year times 6, that many fewer learning days [minus, too, "test prep" time])
| Nikiski Elementary: A School that Worked |
| HSGQE: Spring 2002 Retrospectively |
|
Not until an empirical link between the HSGQE and state performance standards has been established. It's possible [that's what I learned how to do in grad school]; it should be part of CTB's technical reports to the state--but its submitted reports are too vague to be useful ... I recently came across an article about CTB "quality control problems" [read and be astonished that CTB's technical reports have gone 100% uncommented upon]: All the more reason not to take HSGQE results on faith. |
| HSGQE: Development |
|
*The content of this examination is grounded
in Alaska's
educational vision and goals for its youth, which educators
translated into more specific content
standards--originally for English/language
arts, math, science, geography, history, healthy life skills, arts, world languages, and technology.
Alaska has also developed performance standards in three of these domains: reading, writing, and math, whose final versions were approved by the Board in January 1999).
It is from the performance standards in these three content domains (reading, writing, and math) that HSGQE items were developed. Thus, although Alaska has articulated content standards for, say, geography, there are no geography items on the HSGQE. The performance standards in each of the three domains were broken into four age groups:
|
*Alaska planned, from the
beginning, to use test results to rank individual schools and
districts.1 The school accountability
section of the
1998
Alaska Statutes
(Sec. 14.03.123) states that
I have yet to locate any information about the nature of these other measures. In 1998, Alaska budgeted 1.5 to develop the high school exit exam and $16 to cover next year's cost of the new education funding statute (Information Exchange, 25(12), 98). CTB-McGraw Hill was awarded the contract to develop the test (Information Exchange, 24 (21), 98). The proposed test-development schedule was:
CTB/McGraw Hill, of course, develops high school exams for other states.
|
1999. Initial HSGQE items field tested. During this period, Alaska high school students spent three days taking a draft HSGQE: they were not, however, assigned scores based on their test performance. On each subtest--reading, writing, and math--they answered a combination of multiple-choice and constructed-response (short and long) items. CTB presumably used these Alaska student data to select and refine items for the first official HSGQE--but then, of course, reading, writing and math are the same here as outside.
| HSGQE: Administration |
2000. HSGQE administered as diploma sanctioning test. This time Alaska students were assigned scores based on their HSGQE test performance, and those scores were posted on their transcripts. Although CTB has stated that, at any one time, only one form or version of the HSGQE was administered, it's likely that the tests between Spring 2000 and Fall 2001 were somewhat different (being a test publisher, it is expected that CTB would use banks of items that had been equated)--and, of course, new test items would be field tested at different administrations.
The reading subtest was based on the 8 performance standards established by Alaskans; the writing subtest--and the the math, 31, of which 24 were covered on the Spring 2000 test administration. |
On its assessment information page, DEEDs lists proposed and actual testing dates from 1999 to 2005 (another posting, for the year 2000, shows how much coordination and thus time is required to administer these tests). CTB scored the tests, no doubt spending less time on the multiple choice and more on the HSGQE constructed response items (check out an overview).
Alaskans set cut scores. CTB rank ordered each HSGQE subtest's individual items by difficulty level (based, presumably, on the Spring 2000 data) and presented these results, in booklet form, to three committees of Alaskans charged with setting the "cut" or lowest passing score--DEEDs has posted a description of the process it presented to the Legislature. One Alaska parent, who sat on the HSGQE math cut scoring setting subcommittee, took notes (Weiss, 00):
The subcommittee contained few math teachers or people with math backgrounds: its members first took the HSGQE and were then given answer sheets and scoring rubrics. They were next given a booklet of the rank ordered math items and asked to independently draw a line, within their booklet, at the place where they thought the cut score should be, i.e., where they no longer found items they thought it reasonable to expect Alaska seniors to know. These first-round results were discussed, followed by two more rounds of individual bookmarking and discussion; as typically occurs, cut scores converged over the three rounds. Some subcommittee members disagreed with the difficulty ordering presented in the booklet, noting that some items at the hard end of the spectrum seemed easy and some easy ones, hard--concluding that the rank ordering in the booklet was likely due to its being based on scores from "sophomores who hadn't taken a lot of the math courses that are covered in the test itself." |
Cut scores adjusted to account for the "Standard Error of Measurement." The SEM is a mathematical expression of error: Tests are not perfect, and taking that fact into account is just common sense. Think of an individual student's score as being comprised of two parts, truth and error. In the context of the HSGQE, the former refers to the student's mastery of Alaska's performance standards and the latter, to anything influencing the student's score that's not related to his/her mastery.
Some error is systematic--often called "bias:" Wouldn't you expect that students who had had the opportunity to learn the material covered by the test to do better than those who hadn't? Some error is random: wouldn't you expect that a student whose cat hadn't died the day before the test would do better than a student of equal ability whose cat had? The SEM is a way of statistically quantifying that error. Using a computationally simplified example, suppose the cut score for a test was set at 355 and its SEM calculated to be 5, that you got a score of 360, someone else got a score of 350, and I, a score of 340. Under classical test theory, it is expected that--were the test taken an infinite number of times--your true score would fall in the interval 360 ± 5, the other student's, in the interval 350 ± 5, and mine, 340 ± 5. There'd be no question that you passed and I flunked, but what about that student whose true score is expected to occur in the same interval as the cut score? Should the inherent error in every test be taken into account when deciding who fails? Would you be making distinctions an eighth of an inch wide if the markings on your ruler were a quarter-inch thick? No, I wouldn't either. Adjusting the cut score to take measurement error into account makes sense. |
Taking error into account is not "dumbing down" the test: Assuming that student test scores perfectly reflect student achievement is though dumb.
Expect error to creep into the estimation of true scores from the most irrelevant of differences: suppose that one school refers to the amount that remains after one quantity is subtracted from another as the "remainder" and another as the "difference;" suppose also that a state cares only whether students can derive the correct answer when subtracting one number from another --not what that answer is labeled; suppose now that "difference" is the word used in the state's math subtest ... all things being equal, wouldn't you expect students from the first school to have a greater chance of being confused by the wording of the test questions than those in the second school--just because their subtraction lessons had used a synonym to the word actually used on the test ("adjectives" and "modifiers" offering another illustration)? Factors influencing obtained scores--whether individually or aggregated at the school level--that are irrelevant to a state's performance objectives add error to those scores. Without as much as looking out the window, one can conclude that error's an inevitable component of test scores --and that its existence should be taken into account when interpreting them. |
Expect error even in quantitative measures of error: a test's SEM is after all an average of scores aggregated over all students. Would a particular group's SEM be different from the overall SEM? This is an answerable question in the sense that the SEM can be computationally estimated--and reasonably so when error is evenly distributed--by multiplying the standard deviation of the group's individual test scores by the square root of the quantity (1 minus its test reliability coefficient). I'd want to consider that plus what the data suggest about a test's reliability and validity before interpreting results obtained by administering it, to examine the variability in group scores before interpreting a measure of central tendency like a mean or median.
2001. Alaska State Legislature postponed HSGQE's effective date from 2002 to 2004. Alaska sophomores taking it the previous Spring--when they were taking it-- thought they had to pass it to get their diplomas in 2002--they didn't know that the legislature was going to grandfather them--and students slated to graduate in 2003--out.
Some students in that initial cohort dropped out of high school, convinced by their HSGQE scores that they couldn't make the grade. I don't think anyone knows how many. |
Although students graduating before 2004 were no longer required to pass the HSGQE, they still had to take it: Those who did not pass continued to take it, and results were posted on their diplomas and transcripts (Information Exchange, 29 (8), 29 Mar 01). By current Alaska law, students completing high school are now required to pass the HSGQE to receive a high school "diploma;" those not doing so are to receive a "certificate of achievement" (initially called a "certificate of attendance").
The State Legislature also somewhat changed the emphasis of HSGQEs to be administered from Spring 2002 onwards (not surprisingly, the biggest changes were in the math subtest). DEEDs has posted practice tests for the current HSGQE [and the individual benchmarks: reading (grades 3 and 6), writing (grades 3 and 6), math (grades 3 and 6), and all three (grade 8)]. DEEDs has also posted a discussion of accommodations that could be expected (in particular, guidelines for participation of special education and LEP students and a form for petitioning a change in testing location). Also online were lists of Quality Schools contacts, Alaska teacher, administrator, and school standards. DEEDs' Statistics Home Page (which includes many reports, among them educational "report cards" for the pre-HSGQE years, concluding with 1998-99), and School Designator Committee minutes (Feb 22-23, April 13-14, Oct 25-26 2000; Jan 16-17, March 28-29 2001). The School Designator Committee is charged with labeling Alaska's schools, based at least in part on HSGQE and benchmark test results.
2002. New version of HSGQE administered. "On the math test, geometry [is] worth 10 percent ... instead of 21 percent" (Peninsula Clarion, 27 Sept 01). Note that the original 31 math performance standards were collapsed into 6 broad categories.
Since test emphasis changed, Alaskans needed to set new cut scores (Information Exchange, 30(15), 19 Jun 02) and CTB, to analyze the resulting data. DEEDs is currently developing a new division of accountability and assessment (out of current staff) and has requested more monies to meet its assessment mandates: Alaska's assessment contract with CTB "increased by $498,900 for FY 2002 and by an additional $770,000 for FY 2003" (Information Exchange, 30(10), 26 Apr 02). |
Alaska State Legislature postpones labeling of individual schools from 2002 to 2004. Since the School Designator Committee thought the available basis for labeling so incomplete that it did not want to see the labels applied, this seems a sound move (Peszsnecker, 02).
| HSGQEs Spring 2000 to Fall 2001: Student performance |
Student passing rates aggregated 2 statewide are available for Spring and Fall 2000 and Spring 2001. Links to other score breakdowns--by district, ethnicity, and gender--are posted under "Assessment Results" on DEEDs' Assessment Site Map.
What do these scores mean? There are many possible explanations. High failure rates--certainly the case for the HSGQE math subtest--would be expected
| HSGQEs Spring 2000 to Fall 2001: What do the test scores mean? |
I've looked, online and off, and found little supporting the realiability and validity of the HSGQE. The publicly available evidence suggests rather the opposite. The data presented in CTB's two technical reports--covering the Spring 2000 and Fall 2000 and Spring 2001 test administrations--are not promising. CTB, for example, presents its opportunities-to-teach and to-learn surveys in support of the HSGQE's curricular validity, but their data are difficult to interpret both because specifics like survey dates and response rates are not stated and because they're not organized by the same performance standards. 3
RELIABILITY
Statistical data about the reliability of the HSGQE show that the percentage agreement rate between the first pair of CTB employees scoring each constructed response, Spring 2000, ranged from a high of 75% to a low of 40% and, Fall 2000, from a high of 50% to a low of 20% (Table 17 in the lst and Table 39 in the 2nd CTB report). These low rates of inter-rater reliability (the reports say nothing about how scorer disagreements were resolved) are a concern, because about a fourth of the total reading, a fifth of the math, and an eighth of the writing items were constructed responses. Inter-rater reliability is, however, but one form of reliability.
The HSGQEs' coefficients alpha--which show how well a test hangs together--are close to or greater than .9 (shown in Table 10 in the lst and Tables 6 and 36 in the 2nd), 1.0 being perfect. Alphas for the writing subtest were a tad lower than those for reading and math across all three test administrations. |
Formally speaking, a test's validity coefficient cannot be larger than the square root of its reliability coefficient. CTB's technical reports suggest a coefficient somewhere in the interval between the square root of the HSGQE's inter-rater and alpha reliability coefficients.
VALIDITY
Another study available online 4 raises questions about the HSGQE's construct and predictive validity (based on year 2000 Anchorage data). This study compared nationally normed CAT percentiles with HSGQE cut scores: the corresponding CAT percentiles were the 25th, 60th and 91st for reading, writing, and math. The authors reasoned that, in a national sample of sophomores,
about 75% would have passed the state's reading test, about 40% would have passed the state's writing test, and about 19% would have passed the state's math test. |
A minimum competency exam would not be soso on writing, easy on reading and super hard on math. At the very least, this study calls into question the HSGQE administered Spring 2000.
In summary, in 1998 the Alaska State legislature passed a law requiring
that Alaska students pass a qualifying exam to receive a diploma. In 1999
students took a draft HSGQE but were not assigned scores. In 2000 the
HSGQE was first administered as a diploma sanctioning test. In 2001 the
legislature changed the HSGQE's effective date to 2004. In 2002 a new
version of HSGQE was administered and the legislature postponed the
labeling of individual schools until 2004.
|
It costs money, and it takes time to take and administer the HSGQE. Yet the preponderance of publicly available evidence suggests that the initial HSGQE was not a reliable and valid test. Assigning students scores according to their performance on an unreliable test is conceptually similar to assigning them scores based on the spins of roulette wheels ... It does not seem inappropriate to advocate a new civil liberty: freedom from irrational testing. |
What do the numbers being posted on their transcripts mean?
| ||||
| Where are we now? |
One thing we do know is that the HSGQE isn't going to go away, given the new Federal Leave No Child Behind law. 6 Alaska already meets many NCLB requirements: It has reading and math tests for grades 3 and 8. It has already articulated content standards for science--and will need to articulate performance objectives (as it has already had experience doing by setting them for reading, writing, and math) and test them. DEEDs has been submitting "report cards" to the public for years, and the School Designator Committee is already developing a labeling system for schools. DEEDs has already added National Assessment of Educational Progress (NAEP) links to its Assessment web site. Alaska will also need to define basic, proficient, and advanced levels for the three subjects on both benchmark tests (as it has already had experience doing by setting cut scores for the HSGQE),
For now, proficiency at the 4th, 5th, 7th, and 9th grades has been defined (by whom I know not): at least 61% of a school's students must correctly answer at least half the questions asked on CTB's off-the-shelf tests, the CAT and Terra Nova (Peszsnecker, 02b). Schools with less than 60% answering correctly are labeled as having made "no progress" and publicly identified, 50 so far. Results from the 90 schools with less than 11 students are not reported to maintain student confidentiality. |
It is assumed that Alaska will meet all NCLB reporting requirements.
Like the other 49, Alaska will send the U.S. Department of Education blocks of text and links--in whatever the specified e-format--which will be pasted between chunks of USED text-and-link boilerplate to create 50 annually updated web pages available at the nclb.gov web site. |
Like the other 49, Alaska will struggle with problems like a lack of viable alternatives: whether you live in an area where there is no other school in your "district" or in a city where there are none with empty seats, you have no choice. Tutoring, however, is currently being provided in many areas by "'supplemental' educational service providers."
It is to be hoped that Alaska will accord the numbers required by the Feds the attention they merit--and that Alaska will do what's needed to make sure that student test results accurately reflect the state of education in Alaska: Tests should be shown to be aligned with the curriculum--and to be reliable and valid measures of it [nothing's perfect, but the questions at least should be asked]. It is also to be hoped--when addressing questions like whether a test item is biased for or against a particular group--that Alaska will use ethnicity categorizations descriptive of its student population instead of the generic printed ones from the Census 2000 form.
In their only two publicly available technical documents about the HSGQE and benchmarks, CTB divides Alaska's 134,391 K-12 public students into five categories: "African-American, Asian-Pacific Islanders, Hispanic, Alaskan Native, and White." This does not make sense, because the corresponding percentages are 4.5, 5.2, 3.2, 24.9, and 62.2 (NAEP state profile, 00). Grouping culturally and linguistically diverse groups--that constitute a quarter of Alaska's students--into a single "Alaskan Native" category doesn't make sense statistically either: within group diversity would be expected to increase statistical variabiity, which would make detecting biased items more difficult. |
Finally, it is to be hoped that Alaska will look at passing rates at the level of individual performance standards--seems more useful to say "good on a/work on b" than "you passed/flunked."
| TAAS: Constructing an exit exam |
I overview some of the issues involved in constructing an exit examination like Alaska's with links to another test, the Texas Assessment of Academic Skills (TAAS), which, incidentally, is made by the Psychological Corporation. I do this because Texas has required a high school exit examination since 1990. I describe the steps in developing their test only to overview the process--most definitely not to suggest that Texas sets an example that Alaska should emulate.
|
Not that Texas would so agree: "In our perspective, Texas is the model of the nation for standards and accountability," said Linda Edwards, a spokesperson for Governor Bush (Houston Press, 98), who has been a strong TAAS supporter for at least several years (Houston Chronicle, 15 Oct 97). |
Exit exams are not inexpensive: , including 47 for yearly costs like developing new test questions and for printing, grading and distributing the TAAS; 55 for cash bonuses--$750 each for educators whose aggregated TAAS scores were in the top quartile--and the rest for developing performance standards in additional content areas and testing more students (Brooks, P.A., Austin American-Statesman, 13 Dec 98).
| The test development process begins with the articulation of specific goals. Those goals form a "blueprint" from which the individual items on the test are constructed: If one cares only about apples, then questions about oranges are not appropriate. | ||
After an exam question, i.e., test item,
is developed, it's reviewed: is it relevant to the State's stated
goals? is it unbiased? These questions must be answered whatever
the response format of the item. Items that "pass" initial muster
are combined to form tests which are then administered to students.
New items are continually being developed because multiple versions
of the test are necessary. These items are "tested" by including
them in student test booklets but not in the calculation of student
scores.
Using reasonable tests does not solve all problems. It is also essential that tests be sufficiently reliable and valid. If, for example, a written essay is to be scored, will different judges grade it the same? Does writing an essay have anything to do with the state's goal of increasing literacy? |
The preceding are technical questions that must be answered empirically. If the answer to any of these questions is not affirmative, then the test is garbage--and, as we know, garbage in, garbage out.
|
Even mundane details of test administration cannot be left to chance. It won't do to have some students get 45 minutes to take the test while others get however much time they want. Nor would it be acceptable if one student's test were lost or confused with another's: a lot of paper is generated when every graduating senior in a state takes a test (wonder how it'd work out in pounds): someone has to keep it organized. Obviously, test administration includes (or should) quality assurance measures. What happens after the tests are administered is of equal concern. Take a recent example from New York State, where 8,000 students were mistakenly sent to summer school and 3,500 held back because of incorrectly reported scores (Saltpeter, J. & Foster K., TechLearning, n.d.) |
Another decision to be made somewhere along the line is what's a "passing" score? Cut scores must be set because an exit examination is a "minimum competency" examination. The State does not care how George did relative to Dmitri--only how well George did relative to the cut score--which itself is usually determined somewhat arbitrarily: well, let's see, how about, say, 70%?...no, too low...75%...Great!
The State also needs to determine what to report to students and the public. How about test items themselves? What about the raw data? I'm still looking for last year's HSGQE data (with, of course, all identifying information removed). No one has refused my request--but you know how email tag works. Texas, on the other hand, has a page to welcome researchers (if you're going to have to make something public, you might as well do it gracefully).
|
In Texas, actual scores are reported to students, then aggregated over classrooms, schools, and districts and made freely available to all. If memory serves, Texas did not release actual test items until a sufficient number of people were sufficiently annoyed; currently, they release the previous year's items (indeed, you can download all of the 1999 TAAS tests--and read discussions of their accuracy). And for awhile, thanks to the Texas Business and Education Alliance, you could take a version of the TAAS online. I took the test and show, below, a question I was asked. 3. A teacher took 6.25 gallons of water on a field trip for her 25 students and 2 chaperones to drink. By noon, the students drank 2.75 gallons of water. How much water was left for the remainder of the field trip for the students and chaperones to drink? A. 4.5 gallons B. 3.5 gallons C. 9 gallons D. 3.75 gallons E. Not Here Much to my astonishment, "A" was scored as the "correct" answer. Just goes to emphasize that the devil is always in the details. |
| My personal favorite is the one where there were 5 times more erasures on one school's score sheets compared to others'--that 90% of those erasures were from changing incorrect into correct answers--and that "follow-up investigation of older tests by a forensic scientist showed tests had been tampered with for years" (Bushweller, K., American School Board Journal, 97). The school with the erasures was in Maryland, and the test was the Iowa Test of Basic Skills. Other states also have their share of test tamperers--among them, Florida (Miami-Herald, 6 Nov 98) and Texas (Philadelphia Inquirer, 7 April 99). |
In summary, the basis for any exit exam is goals. What do you think is important to test? The ongoing translation of goals into test items is by no means a straightforward process. Many technical criteria must be met before we can conclude that the test means anything at all. If the test is not technically adequate or if it is improperly administered, then results are garbage.
| TAAS: Living with an exit exam |
Texans do not agree that their exit exam is a valuable part of their educational system:
|
*State officials are solidly behind the TAAS.
The Texas Commissioner of Education thinks the state of education in the State of Texas is good and wants to expand the use of the TAAS (Press Release 98).
State officials were also quick to defend the TAAS against bias charges brought by the Mexican-American Legal Defense and Education Fund (Amarillo Globe-News, 24 Oct 97). Keep in mind that what we're talking about here is elected officials. The 1998 Texas Democratic Party Platform said to "Use the TAAS test as one measuring stick of success, not the sole factor in determining whether a child passes from one grade to the next, and rely on the classroom teacher's judgment of the child's overall performance to determine whether a child advances to the next grade ...." But then the Democrats didn't get elected. |
*The Texas Public Policy Foundation (98), thinks Texas education is going down the tubes, pointing out that
Others note that the TAAS itself is focused at too low a level. Examples of old TAAS items like
sure seem more akin to 6th grade. Maybe better items is what Texas ex-Governor Bush meant when he said he wanted a more rigorous TAAS (Stutz, T., Dallas Morning News, 5 Dec 98). Ironically, the questions on a high school math exam parody are more challenging. |
Conceptually, it was intended to do so.
|
The TAAS was instituted "in order to provide parents and teachers information about the progress of students on specific objectives; to help identify both outstanding and under performing schools; to provide accountability to taxpayers about the money spent on public education; and to restore the value and meaning of a high school diploma," (TFT Legislative Hotline, 27 May 98). |
If you're a student, you've got to take the test. If you're an educator, your career will be affected by the TAAS:
|
"Teacher evaluations are linked in part to their school's overall performance. Superintendents are largely judged by their districts' scores" (Melendez, M., Fort Worth Star-Telegram, 98). It's a numbers game, and concern over TAAS performance dominates numerous educators' discussions: "That is a problem when you're looking at taking "at risk" students first. If you don't think one student makes a difference, go look at Hurst Junior High School where one student made them a low-performing campus one year...It only takes one student to get you there." [my emphasis] (quoted in a report from the Texas Public Policy Institute). Tsk, pesky students. |
Texas itself is plastered with TAAS "stuff" (for lack of a better word).
|
Don't be surprised to see a list of local schools and their "grades" in the housing section of the classifieds in the Austin American-Statesman or a searchable TAAS database in the Dallas Morning News. TAAS scores influence property values. There's even a cottage industry that's grown up around the TAAS--TAAS preparation materials: AMSCO School Publications and Sleek Software Corporation (with its amazing TAAS tutor) would like to help you prepare for the TAAS (just use your Visa), as would their competitors. There are probably even ads on radio and TV, in magazines and newspapers, maybe on billboards and lamp posts, too. |
The orbit around the TAAS is faster for Texas schools with low TAAS scores.
|
As noted in the Nacogdoches High School Dragon Echo, attempts to raise TAAS scores included both
|
Texas teachers are encouraged to "teach to the test."
| Texas teachers are expected to organize daily
instruction around the TAAS (called "teaching to the test"). The
State Commissioner of Public Instruction advocates doing so. Other
Texans though are diametrically opposed: In a
letter to the editor of the Star-Telegram , P. Burkham
forcefully restated a point raised
for several years (Amarillo Today, 25 July 97):
|