![]() |
||
|
Wrightslaw l No Child Left Behind l IDEA 2004 l Fetaweb l Yellow Pages for Kids l Harbor House Law Press |
||
| Home > Articles > Tests and Measurements for the Parent, Teacher, Advocate & Attorney by Peter Wright & Pamela Wright |
|
Advocate & Attorney and Pamela Darr Wright, M.A., M.S.W. "If something exists, it exists in
some amount. I. Introduction II. The Process
of Educational Decision-Making III. Statistics:
General Principles IV. Using the Bell Curve to Measure
Progress VI. Parents
"To-Do" List Wrightslaw
Note:
This comprehensive article was updated and revised in December 2007. For more information about the bell curve and how to use tests and measurements to measure and monitor a child's progress, please read Chapters 10 and 11 in Wrightslaw:
From Emotions to Advocacy, 2nd Edition. Most parents
of special needs children know that they must understand the law
and their rights. Few parents know that they must also understand
the facts. The "facts" of their child's case are contained in the
various tests and evaluations that have been administered to the child.
Changes in test scores over time provide the means to assess educational
benefit or regression. As an attorney who specializes in representing special education children, many parents consult with me after they decide that their child's special education program is not appropriate. These parents are often right. However, in most cases they do not have the evidence to support their belief, nor do they know how to interpret and use the evidence contained in educational and psychological tests. They need evidence to support their beliefs. These parents are convinced that a special education program is not providing sufficient help for the child --- that under the present special education program, the child is failing to make adequate progress and has fallen further behind. These parents experience a sense of urgency --- the child has usually received special education for several years and time is running out. Critical educational decisions are often based on the subjective beliefs of parents and educators. As a parent, you may believe that your child is not making adequate progress in a special education program. The special education staff may firmly believe that he is doing as well as he can --- or that your expectations are too high. Without objective information, both sides take positions that are based on their emotions --- and tempered by their hopes and fears. Appropriate educational decision-making must be based on objective information and facts, not subjective emotional reactions and beliefs. Before you can participate in the development of an appropriate special education program, you must have a thorough understanding of the child's strengths and weaknesses. This information is contained in tests that are used to measure the child's abilities and educational achievement. Tests administered to children fall into several categories: intellectual or cognitive tests; educational achievement tests; projective personality tests, questionnaires and surveys; speech and language tests; and neuropsychological tests. The United States Supreme Court
In Florence County School District Four v. Shannon Carter, 510 U. S.7, 114 S. Ct. 361, (1993), the Supreme Court issued a landmark decision. In Carter, the school system defaulted on their obligation to provide a free appropriate education to Shannon Carter, a child with learning disabilities and an Attention Deficit Disorder. Let's look at how the courts viewed the facts and the law in the Carter case. Background When Shannon
was in the seventh grade, her parents talked to the public school staff
and expressed their concerns about Shannon's reading and other academic problems.
Shannon was evaluated by a public school psychologist who described her
as a "slow learner" who was lazy, unmotivated and needed to be pressured
to try harder. Her parents pressured her to work harder. In the ninth grade, despite intense
pressure, Shannon failed several subjects.
The school district developed an IEP for Shannon's tenth grade year. This IEP proposed that after a year of special education, Shannon would read at the 5.8 grade equivalent level and perform math at the 6.8 grade equivalent level. In other words, after a full year of special education to remediate her learning disabilities, Shannon was expected to make only four months of progress in reading and math. Progress would be measured by her scores on the Woodcock-Johnson and KeyMath educational achievement tests. She would progress from the 5.4 to 5.8 grade level in reading and from the 6.4 to 6.8 grade levels in math. Shannon's parents insisted that their daughter needed a more intensive program so she could learn the necessary reading, writing and math skills. They felt that the proposed program was inadequate, and worried that Shannon would still be functionally illiterate when she graduated from high school three years later. Emory Carter insisted that the school teach his daughter to read, write and do arithmetic at a high school level when she graduated from high school. Although Emory and Elaine Carter shared their concerns and wishes with the public school officials, the administrators took a "take it or leave it" position. They refused to provide Shannon with a more intensive special education program where she would receive remediation in reading, writing, and arithmetic. The parents requested a special education due process hearing. The Hearing Officer ruled that the public school IEP was appropriate. The parents appealed this decision to a Review Panel. The Review Panel upheld the decision of the Hearing Officer. At that point,
Emory and Elaine Carter withdrew Shannon from the public school
and enrolled her in Trident Academy in Mt. Pleasant, South Carolina. Trident Academy is a private school that specializes in educating and remediating children with
learning disabilities, including dyslexia. When Shannon
graduated from Trident Academy three years later, her reading and math
scores were on a high school level. After hearing testimony and reviewing the transcripts and documents from the Due Process and Review Hearings, Judge Houck found that the school district's IEP was "wholly inadequate" to meet Shannon's needs. He ruled that Shannon had received an appropriate education at Trident and ordered Florence County to reimburse Shannon's parents for the costs of her education. (Read Judge Houck's decision) On what basis did Judge Houck decide that the IEP proposed by Florence County was inappropriate? What evidence caused him to decide that Shannon received an appropriate education at Trident Academy? The decisions
in Shannon's case, and in most special education cases, are based on the evidence
provided by tests and evaluations of the child. When
Judge Houck wrote that Florence County's IEP was "wholly inadequate"
to meet Shannon's needs, he was relying on test results. Judge Houck knew the importance of accurately interpreting
test scores. He charted Shannon's test scores and included this information in his decision. (See also Hall v. Vance, 555
EHLR 437, (E.D. NC 1983), affirmed at 774 F. 2d 629, 557 EHLR 155, (4th
Cir. 1985)) in which U. S. District Court Judge Dupree charted out James
Hall's test scores to support his decision that Vance County, North
Carolina did not provide James with an appropriate education.) A three judge panel of the Fourth Circuit affirmed Judge Houck's decision about the inadequacy of Florence County's proposed IEP. Florence County then appealed to the United States Supreme Court. On October 6, 1993, Pete Wright argued Shannon's case before the Court. Thirty-four days later, on November 9, 1993, the Supreme Court issued a unanimous favorable decision on Shannon's behalf. In the Carter decision, written by Justice Sandra Day O'Connor, the Court upheld the lower court decisions, ruled against Florence County School District Four, and ordered them to reimburse Shannon's parents for the costs of her tuition, room and board, and attorney's fees. The Individuals with Disabilities Education Act (IDEA) requires IEPs to include measurable annual goals and a description of how the child's progress toward meeting the annual goals will be measured. (III) a description of how the child's progress toward meeting the annual goals described in subclause (II) will be measured and when periodic reports on the progress the child is making toward the annual goals ... will be provided ... (See 20 U.S.C. 1414(d)(1)(A)(i)(II and III; page 99 in Wrightslaw: Special Education Law, 2nd Edition, page 99. See also the "Definition of individualized education program" in the federal special education regulations, Volume 34 of the Code of Federal Regulations, 34 CFR 300.320(a)(2)(i) and (a)(3)(i); page 245 in Wrightslaw: Special Education Law, 2nd Edition) The U. S. District Court and the Fourth Circuit found that the proposed gain of four months after a full year of special education was "wholly inadequate." In an effort to avoid Florence County's fate, many school districts around the country develop IEPs that include no objective measures of the child's progress. Instead of developing goals where the child's progress is measured using objective tests, as Florence County did, many schools now devise IEPs that rely subjective teacher observations of the child's progress. Let's see how this works. Johnny is a child who has a learning disability that is affects his ability to read. In reading, Johnny is below grade level. Instead of developing an IEP that will measure his progress in reading on specific objective tests, the special education staff may write a goal like this: "Johnny will make measurable progress in reading, as measured by teacher observation and teacher made tests at 80% accuracy." In that case, "objective measurement of progress" becomes the teacher's subjective observation about whether her student made progress in reading, writing, or arithmetic. The criteria for determining progress becomes 80% of a subjective belief or opinion. When parents object and request a more intensive program that includes clear objective scores, they are often rebuffed or criticized. In many areas, school board counsel and state departments of education advised schools to stop using objective measurements of progress (tests) to measure progress for special education children. If you believe that the special education your child is receiving is inadequate, you must have evidence to support your position. You will find this evidence in the public school and private sector testing that has been or will be completed on your child. To master the material in this article, you should expect to read the article at least three times. When you take this step, you will know what tests and evaluations measure and how test results are reported. You will know how to convert scores on different tests into numbers that are easily understood. And, you will know how to measure a child's educational progress or lack of progress, i.e. regression. Michael Three years ago, your eight-year-old son Mike began to have serious difficulties in school. By the time he entered third grade, you were deeply concerned about his difficulties in learning to read. His handwriting was nearly illegible. Homework was a nightmare. You consulted with Mike's teacher about these problems several times. Eventually, the teacher sent Mike's "case" to a special education committee. You attended a meeting of this committee. The committee recommended that Mike be evaluated through the school's special education department. Relieved that something would be done to help you child, you consented to these tests. According to the evaluations, your son has a learning disability. In Mike's case, he has visual-perceptual problems and visual-motor problems that negatively affect his ability to read and write. Based on the results of the evaluations, your son was found eligible for special education services at his neighborhood school. After Mike was found eligible for special education, you attended a meeting to develop his Individualized Education Program (IEP). This IEP provided for Mike to receive one period of special education in an "LD Resource" class every day. It was your understanding that Mike would receive individualized help in reading and writing from a teacher who was specially trained to remediate his learning disability problems. Three years have passed. Mike hasn't made much progress, despite the special education help. He still has difficulty reading aloud. His spelling is poor, and his handwriting is unreadable. He is behind most of the children in his class. His attitude has changed. He is angry and depressed and says he "hates school." When you discussed your concerns about Mike's lack of progress with his special education teacher, she reassured you that he was "making progress" and told you to be patient. From your perspective, patience is not the issue. You are worried that your son will never master the basic academic skills. You are worried about his future. At a recent IEP meeting, you reiterated your concerns about Mike's lack of progress and expressed your belief that he needs more help than he is getting in the school's Resource program. The IEP team disagreed with you. One person said that Mike was getting all the help he needs and that he was really doing quite well. Another member said that your expectations were too high --- and if you don't accept Mike's "limitations," you will damage him emotionally. What should
you do? You believe that Mike's new "emotional problems" are caused by shame and embarrassment because he is not successful in school. How can you, a parent, prove this to the staff at Mike's school so that they will develop an appropriate educational program for him? How will you know when he is getting the help he needs? Many parents assume that interpreting their child's test data is beyond their competence, that this is the responsibility of the school personnel. If parents do not accept the responsibility of learning this information, they leave interpreting the test data to the school psychologist. The school psychologist often has very little information about your child, aside from scores on tests administered years ago. The basic principles of tests and measurements are not difficult to learn. As you read this article, you will see that you are already familiar with many of the concepts discussed. Statistics and statistical terms are used in many other areas of life, from business and sports to medicine. Newspaper and magazine articles use statistics to inform readers about change or lack of change. You read articles about changes in the population, the climate, the economy. Even public opinion polls include statistical information to inform you or persuade you of a point. As a parent, you need to expend time and effort to develop a basic understanding of statistics. To accomplish this goal, you should reread parts of this article several times. Underline, make margin notes, and use a highlighter. Be patient and put in the time. The time you spend now will help to change your child's life forever. As you study this material, you are likely to see some terms and concepts that are confusing at first ---terms like standard deviation, standard scores, and grade and age equivalents. Other concepts will be familiar --- averages, percentages. When you master this information, you will understand the educational and psychological
tests administered to your child. You will be able to use this
information to make wise decisions about your child's education. You'll find that your
newfound knowledge and expertise exceeds that of many of the special education
staff and IEP team members. Katie is fourteen years old and in the ninth grade. She "hates school" and is failing several subjects. As a young child, Katie was bright, happy, and curious. When she entered third grade, her attitude began to change. Now, she locks herself in her room, lies on her bed, and listens to music for hours. She is sullen and angry and says she can't wait to quit school. In desperation, Katie's parents took her to a child psychologist for testing. At a meeting to interpret the test results to Katie and her parents, the psychologist explained that Katie scored two "standard deviations" above the mean on the Similarities subtest of the Wechsler Intelligence Test for Children, Fourth Edition (WISC-IV) and two and a half "standard deviations" below the mean on the spontaneous writing sample of the Test of Written Language, Third Edition (TOWL-3). Wrightslaw Note: Test publishers
update and revise their tests fairly often. This article does not focus on any test's strengths or weaknesses, since weaknesses are often corrected in the next edition of that test. The Woodcock Johnson battery of tests was known as the Woodcock Johnson Psycho-Educational Battery. The WJPEB included educational achievement testing and cognitive ability testing. The current version is called the Woodcock-Johnson III Tests of Achievement (WJ-III) measures academic achievement. The Woodcock-Johnson III Tests of Cognitive Abilities (WJ III) measures general intellectual ability and specific cognitive abilities. Dr. Woodcock also produced the Woodcock Reading Mastery Test, now called the Woodcock-Johnson III Diagnostic Reading Battery (WJ III-DRB). Tip: For information about thousands of tests, go to Testlink from the Educational Testing Service (ETS) at http://www.ets.org/testcoll/index.html Parents are often surprised to learn that tests do not necessarily measure what they purport to measure. As you will see, a child's score on a push-up test can be represented as an overall fitness score, a measure of arm strength, an upper body measurement score, a measure of perseveration and persistence, or a measure of a child's motivation. A score may measure only one variable or it may accurately reflect all of the above. Let's look at tests that measure reading ability. One test that purports to measure a child's reading ability actually measures the child's ability to correctly read aloud and pronounce isolated words out of context, i.e., a word recognition test. This test includes a list of words, i.e., cat, tree, dog, house, person, etc. It does not measure reading and may be adversely affected by a child's speech or word finding problems. Another reading
test has the child read a passage of text, then
answer a series of multiple choice questions about the passage. In this
case, the score may be a measure of the child's ability to eliminate certain answers in the multiple choice format, i.e., reasoning, not reading. Some very bright children may need to
recognize and interpret only a few words to discern the total context.
Others have excellent word recognition abilities but cannot link
or interpret the words in a body of text or passage. Tip: You need to know what tests were used and what the tests measure. To learn more about reading tests, read Reading Tests: What They Measure, and Don't Measure by Dr. Melissa Farrall. When we first discussed Katie, we saw that she scored two "standard deviations" above the mean on the Similarities subtest of the Wechsler Intelligence Test for Children, Fourth Edition (WISC-IV) and two and a half "standard deviations" below the mean on the spontaneous writing sample of the Test of Written Language, Third Edition (TOWL-3). Do these
test scores explain the academic problems Katie is having? Do they have
anything to do with her moodiness and her intense dislike of school? (Answers:
Yes and Yes.) After you learn the material in this article, you will be able to interpret your child's test scores. You will be able to read the preceding paragraph and understand the significance of Katie's scores. You will have acquired skills that will allow you to answer questions like these:
To clarify these points, let's change the facts. You can measure your child's physical growth with a measuring tape and a bathroom scale. You can measure growth by charting how much height increases, as measured in inches, and how much weight increases, as measured by pounds, over a period of months or years. Using these tools, you can document his physical growth. You don't need to be a doctor to understand that increases in these measurements prove that your child is growing. Assume that your child's height was five feet, three inches last year. This year, the child is five feet, six inches tall. You can report this information in several ways. You can say that last year, your child was sixty-three inches tall and is now sixty-six inches tall. Or, you can say that your child was 5.25 feet tall and is now five and a half feet tall. You can even say that a year ago, your child was 160 centimeters tall and is now 168 centimeters tall. Or, that your child was 1.75 yards tall and is now 1.83 yards tall! If you or your child's pediatrician have measured your child's height and weight at regular intervals, you can create a chart or graph that documents changes in height or weight over time. Your child's pediatrician has growth charts you can use to compare your child's growth with the growth of the "average" child. Academic or educational growth can be measured and charted too. The yardsticks used for measurement are different, but the principles are the same. Measuring educational growth or progress is not much different from measuring physical growth. Instead of a tape measure and scales, you need the psychological and educational achievement test results. Where will you find the information you need? How can you measure change? Most school districts test their students on standardized educational achievement tests at regular intervals. The results of these tests provide information about how well the school district is accomplishing the mission of educating children. The information contained in group standardized tests can provide you with some basic information. Standardized educational achievement tests are general measures. The information they provide is similar to that provided by medical screening tests. Medical screening tests can suggest that a problem exists. In most cases, additional testing is necessary before the problem can be accurately identified and a treatment plan developed. Children's learning problems are identified in a similar manner. In most schools, individual ability and achievement tests that identify academic or functional problems are administered by school psychologists and educational diagnosticians. As you continue
on your advocacy journey, you need to understand the exact nature of your
child's disabling condition(s). How does the disability affect her? In
what areas? How serious is it? What are her strengths and weaknesses?
Does she need special education? What educational issues need to be addressed?
How will you know if she is making progress? How much progress is sufficient?
Many parents believe they cannot understand these tests. Usually, their reasoning goes like this: Gosh. I'm just a parent. I didn't even finish college. I don't have any training in education or special education so I can't understand that stuff!If you believe that you "can't" understand your child's testing, it's time to change your beliefs. You may be reading this article because your son or daughter is performing poorly in school --- or has been identified with learning problems --- and now believes that he or she "can't" read or write or do arithmetic. Your child must overcome these false beliefs about learning new or difficult material. And, so must you. Statistics are simply ways to measure things and to describe relationships between things, using numbers. Part of the confusion that many people experience when they begin to learn statistics is because of the terms and concepts are unfamiliar. As we learned in the earlier discussion about measuring physical growth, there are several ways to report the same information (inches, feet, yards, centimeters, etc.) This can be confusing. First, let's look at a familiar situation that many of us deal with regularly --- how to measure our car's gas mileage. Remember: When we use statistics, we can use several terms to describe the same concept. If you want to describe your car's gas mileage, you can make any of the following statements:
When you have this information, you can make decisions. When will you need to buy more gas? You know that your car has a 15 gallon gas tank. According to the gas gauge, your tank is slightly below the halfway mark. You've been driving in the city. You'll be driving on the highway for the rest of your trip. You have used a precise amount of gas and have a precise amount of gas left in your tank. You can describe and define this information in several ways --- gallons used, gallons remaining, miles driven, miles to go, percentage full, and so forth. Using this information, you can do some simple math calculations to learn that your car averages between 17 to 23 miles to a gallon of gas, depending on driving conditions. Using this information or data, you can also measure change. If you compare your car's present or current mileage to the mileage you obtained last month, before you had your car tuned up, you can measure miles per gallon before and after the tune-up. In this way, you can measure the impact of the tune-up on your car's gas consumption. You can also compare your car's mileage performance to that of other vehicles. Let's look at another common way in which we use tests and measurements. When you last visited your doctor, you mentioned that you were feeling tired and sluggish. Your doctor asked several questions, then recommended that you have some lab work. After reviewing the test results, the doctor explained that your blood glucose level was moderately elevated. To lower your blood glucose level, the doctor recommended a plan of treatment that included a special diet and a daily program of moderate exercise. After a month, you return for a follow-up visit. More lab work is completed. If your glucose level has returned to normal, it is unlikely that you will require additional treatment. But, if your glucose level remains high despite the diet and exercise program, you may need more intensive treatment. By measuring change after an intervention with "data based documentation of repeated assessments ... at reasonable intervals, reflecting formal assessment of progress" (34 C.F.R. 300.309(b)(2), you and your doctor can make rational decisions about your medical treatment. Remember: The principles that allow you to compute your car's gas mileage and make medical decisions will also allow you to measure and monitor educational progress. When you measure educational progress (as when you measure your gas mileage and blood levels), the test scores can be reported and compared in several different ways. Because educational test scores are often reported in different formats and compared in different ways, it is essential for parents and advocates to understand all of the scoring methods used in measuring and evaluating educational progress, including:
Let's look at the performance of a group of children. You need to understand how an individual child scores when compared with other children who are his age or in his grade --- and what this means. First, we'll examine a single component of physical fitness in a group of elementary school students. Our group or sample consists of 100 fifth grade students. These children are enrolled in a physical fitness class to prepare them to take the President's Physical Fitness Challenge. We will assume that the average chronological age (CA) of these children is exactly ten years, zero months. (CA=10-0) The children are tested in September, at the beginning of the school year. To qualify as "physically fit," each child must meet several goals. Push-ups are one measure of upper body strength. Each child must complete as many push-ups as possible in a period of time. Each child's raw score is the number of push-ups completed. The term raw score is simply another way of describing the number of items correctly answered or performed. After all of the fifth grade students complete the push-up test, their scores are listed. The results are as follows:
Again, two-thirds of the children in this fifth grade class were able to complete between 7 and 13 push-ups. The remaining third of the children did fewer than 7 or more than 13 push-ups. Nearly all of the children --- 98 out of 100 --- were able to complete between 4 and 16 push-ups. Click here to view the bell curve chart. The test results provide us with a sample of data. As we analyze the data in our sample, we can compare the performance of any individual child with that of the entire group. As we make these comparisons, the data will enable us to recognize any child's strengths and weaknesses when compared with the peer group of similar youngsters. If we conduct an identical push-up test with children in other grades, we can compare our original group of 100 fifth grade children with other groups of youngsters --- children who are older, younger, in different grades, in different schools. If we gather enough information or data from other sources, we can compare our original group of fifth graders --- or an individual child within our group --- to a national population of children who are being tested for their upper body strength as measured by their ability to do push-ups. In nature, traits and characteristics distribute themselves along theoretical curves. For our purposes, the most important curve is called the normal frequency distribution or bell curve. Because the percentages along the bell curve are well-known and thoroughly researched, they become our frame of reference. By using the bell curve, we can develop a diagram or graph of the children's push-up scores. This map --- on the bell curve --- provides us with additional information. We can see what percentage of children were able to complete specific numbers of push-ups. When we use the bell curve, we can visually demonstrate where any particular child scores, when compared with other children who are the same age or in the same grade. Likewise, with educational test scores, we can visually demonstrate scores and change over time. If we compare the push-up scores obtained by children who attend different schools, we can determine whether the physical fitness of children, as measured by their ability to do push-ups, varies in different schools, neighborhoods, states, or countries. We can also measure progress over time --- with push-ups and with improvement in reading skills. Let's look at our class of fifth graders again. We want to gather information as to whether the physical fitness class is effective --- whether the children's fitness levels improve. How can we answer this question? To measure the effectiveness of the fitness class, we will measure the children's number of push-ups before they take the class and compare this score with their score after they take the class. If the class is effective, we should see individual improvement and group improvement. Some children will have minimal improvement --- these children will fall further behind the peer group. Other children who performed below their peers may show significant improvement. Some children will improve so much that they now perform as well or better than the "average" youngster. We will measure the children's progress on one or more occasions as they progress through the class. If the fitness class is "working," that is, if the children's' fitness levels are improving, their ability to perform fitness skills should improve measurably over time. In our example, physical fitness improvement is being measured with "technically sound instruments" that "are valid and reliable" (34 C.F.R. §300.404(b)(c)) and use "Data-based documentation of repeated assessments of achievement at reasonable intervals, reflecting formal assessment of student progress ..." (34 C.F.R. §300.309(b)(2)) Because of its value and usefulness in measuring educational progress, we will return to the subject of the bell curve repeatedly throughout this article. On all bell
curves, the bottom or horizontal line is called the X axis. In
our sample of fifth graders, the X axis represents "number of push-ups."
And, on all bell curves, the up- and- down vertical line is called the
Y axis. In our sample, the Y axis represents the number of children
who earned a specific score (number of push-ups completed).
As you can see in the diagram above, the highest point of the bell curve on the X axis is equal to a score of 10 push-ups. You recall that more children completed 10 push-ups than any other number. Thus, the highest point on this bell curve represents a score of 10. The next most frequently obtained scores were 9 and 11, followed by 8 and 12. This pattern continues out toward the far ends of the bell curve. In our example, the ends occurred at 1 and 19 push-ups. Using the
bell curve, we can now chart each child's score and compare it to the
score achieved by all 100 students in the class. Look at the bell curve
above, and find 10 push-ups. We know that Amy completed 10 push-ups so
her raw score was 10. Ten push-ups placed her squarely in the middle of
the class. Half of the youngsters in Amy's class earned a score of 10
or more; half of the children scored 10 or less. If you look at the bell
curve diagram (below), you see that Amy's score of 10 placed her at the
50% level. The individual's percent level is referred to as their percentile
rank (PR). Amy's percentile rank is 50 (PR=50).
Erik completed 13 push-ups. Looking at the bell curve above, you see that his score of 13 placed him at the 84th percent level. Erik's percentile rank is 84 (PR=84). Erik's ability to do push-ups placed him in the 84th position out of the 100 fifth grade children tested on upper body strength. Sam completed 7 push-ups. His raw score of 7 placed him at the (bottom) 16 percent. Sam's percentile rank was 16 (PR=16). Out of our sample of 100 fifth grade children, 84 children earned a higher score than Sam. Larry completed 6 push-ups. We can convert his raw score of 6 to a percentile rank of 9 (PR=9). 91 children scored higher and 8 children scored lower than Larry in upper body strength as measured by the ability to do push-ups. Oscar completed 2 push-ups. His raw score of 2 placed him in the bottom 1 percent of fifth graders tested (PR=1). Nancy's raw score of 17 placed her at the upper 99 percent. We say that Nancy scored at the 99th percentile rank (PR=99). You can see the relationship between the number of push-ups completed and the child's percentile rank (PR) reproduced in the table below: Click here to see the table The bell curve is a powerful tool. When you use the bell curve, you can objectively compare any child's percentile rank to that of a group of children. You can also compare a single child's progress or regression when compared to the group. Using the bell curve, you can compare a single child's score to the scores obtained by other children who are older or younger or in different grades. Let's see how this works. Again, we will measure the children's upper body strength by the number of push-ups they can perform. In this case, we decide to evaluate all children in all the elementary grades, from Kindergarten through fifth grade. We will assume that the average chronological age of these elementary school children is exactly eight years (CA=8-0 years). After we test the third graders, we find that the average or mean score of our sample of 100 eight year old third graders is 6 push-ups. This means that the "average" third grade child (who is 8 years old) can do 6 push-ups. We can also compare an individual child's score on arithmetic problems answered correctly with the average number answered correctly by children the same age. How can we compare children from different groups? Let's look at Larry who was a member of our original group of fifth graders. Although the average fifth grader performed 10 push-ups, Larry only completed 6 push-ups. His raw score of 6 converts to a percentile rank of nine (PR=9). When we compare
Larry's performance to all elementary school students, we learn that Larry
(a fifth grader) is functioning at the level of the average third
grader --- who is also eight years old --- in the ability to do push-ups.
Therefore, we see that Larry's age equivalent score is 8 years
(AE=8-0) and his grade equivalent score is at the third grade level
(GE=3-0).
Look again at the table of scores above and find Frank's name. You see that Frank earned a raw score of 15 push-ups which converts to a percentile rank of 95 (PR=95). Frank's score looks great --- until we remember that Frank was "held back" three times. Although he is in the fifth grade, Frank is 13 years old! With this new information, let's take another look at Franks' performance. The average score for 8th graders (who are 13 years old) is 15. Frank scored 15. Frank had a grade equivalent score of 8th grade (GE = 8.0) and an age equivalent score of 13 years (AE = 13-0). When we compare Frank with other children in his expected grade, we see that his achievement is in the average range. Frank is in the 95th percentile level when compared to fifth graders, not when compared to eighth graders. Frank's case brings up some additional questions. Frank (age 13) was included in our sample of 5 th graders who had an average age of 10. When compared to this group of children who were younger than him, Frank scored at the 95% percentile rank (PR) level. Question: If we compare Frank's performance to that of children who are three years younger than him, will this comparison provide us with an accurate picture of his physical fitness? Answer: No. In Frank's case, statistics inform us of two facts. First, we see that Frank performs at a superior level when compared with other children in his grade. Second, we see that he performs at an average level when compared with children who are his age. When you evaluate the significance of data from tests, you must know how the scores are being reported. Test scores can be reported using percentile ranks, age equivalents, grade equivalents, raw scores, scale scores, subtest scores, or standard scores. Remember: Although Frank's performance was superior for his grade, it was average for his age. If you did not know Frank's age and grade, you would have been misled as to Frank's actual achievement. But --- if Frank was an 8 year old 3rd grader, his scores would be in the superior range, using both age equivalent and grade equivalent measures. The number of push-ups each child completed was his or her raw score. Let's assume that we want to obtain an overall fitness score. To obtain an overall or composite score, we will measure three skills (sit-ups, push-ups, a timed 50 yard dash) and obtain scores on each of these skills. In educational testing, the child's overall score (in reading, math, etc.) is often a composite of several subtest scores. Next, we will develop a weighting system that will convert each child's raw score to a scale score. After we convert the raw scores to scale scores, we will be able to compare each of the three scores to each other (number of push-ups, number of sit-ups, seconds to complete the 50 yard dash). How do we convert raw scores into scale scores? One way to convert scores is by developing a rank order system. In rank order scoring, the child who scores highest in an event (most push-ups, most sit-ups, fastest run) receives a scale score of 100; the lowest receives a score of 1. The other 98 children receive their respective "rank" as their scale score. After each child's raw scores are converted to scale scores, we can easily compare an individual child to the group and to all children who are the same age or in the same grade. We can also compare an individual child's performance at different times, i.e. before and after completing the fitness course. Was the child able to do significantly more push-ups after taking the fitness course? Was the child reading better after receiving reading remediation? You can see that after we develop a global composite score, the individual child's raw scores on each of the three fitness subtests have less significance. This is exactly what happens with educational achievement and psychological tests. Most educational tests are composites of several subtests; the subtest scores are combined to develop composite scores. More about this shortly. Let's look at how composite scores can be used and some of the problems that arise when we rely on them. John is a member of our original group of 100 fifth graders. He has good muscular strength (he scored at the 70% PR level in push-ups and at the 78% PR | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||