Possible Futures for Assessing Educational Achievement Richard J. - - PowerPoint PPT Presentation

possible futures for assessing educational achievement
SMART_READER_LITE
LIVE PREVIEW

Possible Futures for Assessing Educational Achievement Richard J. - - PowerPoint PPT Presentation

Possible Futures for Assessing Educational Achievement Richard J. Shavelson Stanford University Invited Talk Learning Research and Development Center University of Pittsburgh January 14, 2000 Overview of Talk An Achievement Framework


slide-1
SLIDE 1

Possible Futures for Assessing Educational Achievement

Richard J. Shavelson Stanford University

Invited Talk Learning Research and Development Center University of Pittsburgh January 14, 2000

slide-2
SLIDE 2

12/2/99 Assessment & Accountability 2

Overview of Talk

  • An Achievement Framework

– What does it mean to “achieve” in a domain like science? – How might we measure achievement in science? – What evidence is there that we’re measuring different aspects of achievement?

  • Items for the Design of Accountability Systems
  • Concluding Comments
slide-3
SLIDE 3

12/2/99 Assessment & Accountability 3

Toward An Achievement Framework: Knowledge Components

Extent

(How much?)

Structure

(How is it organized?)

Others

(Precision? Efficiency? Automaticity?)

Declarative Procedural Strategic Knowledge Knowledge Knowledge

Cognitive Cognitive Tools: Tools:

Planning Planning Monitoring Monitoring

Characteristics That Vary According to Proficiency Level

Low High

(Knowing the “that”) (Knowing the “how”) (Knowing the “which,” “when,” and “why”)

Domain-specific content:

  • facts
  • concepts
  • principles

Production system-- condition- action rules Problem schemata/ strategies/

  • peration systems
slide-4
SLIDE 4

12/2/99 Assessment & Accountability 4

Linking Assessments to Achievement Components

Declarative Procedural Strategic Knowledge Knowledge Knowledge

Performance Assessments Conceptual Maps

  • Performance

Assessments

  • Interviews
  • M-C Tests
  • Multiple-Choice
  • Fill-in

Procedural Maps Models/ Mental Maps

Extent Structure

Others

slide-5
SLIDE 5

12/2/99 Assessment & Accountability 5

Extent of Declarative Knowledge

Air is made up of many gases. Which gas is found in the greatest amount?

  • A. Nitrogen
  • B. Oxygen
  • C. Carbon Dioxide
  • D. Hydrogen

Source: TIMSS Pop. 2

slide-6
SLIDE 6

12/2/99 Assessment & Accountability 6

Structure of Declarative Knowledge

water rain rivers clouds soil sun

  • ceans

is falling water comes from contain goes into rivers flow to contain shines on

From White & Gunstone: “Probing Understanding” (1992, p. 16). Eleven- Year-Old’s Concept Map

slide-7
SLIDE 7

12/2/99 Assessment & Accountability 7

Procedural Knowledge: Performance Assessments for FOSS Curriculum

release position

slide-8
SLIDE 8

12/2/99 Assessment & Accountability 8

Procedural Knowledge: Performance Assessment: TIMSS Pop. 2

PULSE

At this station you should have

A watch A step on the floor to climb on

Read ALL directions carefully. Your task:

Find out how your pulse changes when you climb up and down on a step for 5 minutes.

This is what you should do:

  • Find your pulse and be sure you know how to count it. IF YOU CANNOT FIND YOUR PULSE

ASK A TEACHER FOR HELP

  • Decide how often you will take measurements starting from when you are at rest.
  • Climb the step for about 5 minutes and measure your pulse at regular intervals.

1. Make a table and write down the times at which you measured your pulse and the measurements you made. 2. How did your pulse change during the exercise? 3. Why do you think your pulse changed in this way?

slide-9
SLIDE 9

12/2/99 Assessment & Accountability 9

Strategic Knowledge: Mental Models

  • What causes day and night?
  • A. The earth spins on its axis (.66)
  • B. The earth moves around the sun (.26)
  • C. Clouds block out the sun (.03)
  • D. the sun goes round the earth (.04)

Source: Sadler, P.M. (1998) Psychometric models of student conceptions in science: Reconciling qualitative studies and distractor-driven assessment instruments. JRST, 35(3), 265-296.

slide-10
SLIDE 10

12/2/99 Assessment & Accountability 10

Strategic Knowledge: Mental Models

  • (A) A rocket is moving along sideways in deep space, with its engine off, from

point A to point B. It is not near any planets or other outside forces. Its engine is fired at point B and left on for 2 sec while the rocket travels from point B to point C. Draw in the shape of the path from B to C. (Show your best guess for this problem even if you are unsure of the answer.)

  • (B) Show the path from C after the engine is turned off on the same drawing.

A

B C

Incorrect

A C B

Correct

Source: Clement, J. (1982). Students’ preconceptions in introductory mechanics. American Journal of Physics, 50(1), 66-71.

slide-11
SLIDE 11

12/2/99 Assessment & Accountability 11

Some Empirical Evidence: Links between Knowledge and Measurement Methods

Correlations from Shultz’s Dissertation (N=109 6th Graders Studying Ecology):

– Reading and Multiple-Choice: 0.69 – Reading and Concept Map: 0.53 – M-C and CM: 0.60 – Reading and Performance Assessment: 0.25 – M-C and PA: 0.33 – CM and PA: 0.43

Declarative Knowledge Declarative vs. Procedural Knowledge

slide-12
SLIDE 12

12/2/99 Assessment & Accountability 12

Some Other Empirical Evidence

  • Correlations (covariances) tell you about

rank ordering on different measures--that’s

  • nly part of the story
  • Evidence is needed for the cognitive claims

that different methods measure somewhat different aspects of achievement:

– Talk aloud – Focus group – Group work

slide-13
SLIDE 13

12/2/99 Assessment & Accountability 13

Cognitive Validity Framework

Comparing assessment tasks on three dimensions

Inferred Cognitive Activity Scores Obtained Intended Task Demands

Congruence Congruence

slide-14
SLIDE 14

12/2/99 Assessment & Accountability 14

Talk Aloud with Concept Maps: Method Variation

0.1 0.2 0.3 0.4 0.5 0.6

Explanation Monitoring Conceptual Errors No-Code

Contruct-A-Map Fill-In-Nodes Fill-In-Lines

Verbal Codes Proportion Scores

Correspondence Between “Directedness” and Inferred Activity

Low Directed High Directed

slide-15
SLIDE 15

12/2/99 Assessment & Accountability 15

Talk Aloud with Concept Maps: Performance Level

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Explanation Monitoring Conceptual Error No Code

Teachers High Low

Construct-A-Map

Proportion Scores

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Explanation Monitoring Conceptual Error No Code

Teachers High Low

Fill-In-The-Nodes

Proportion Scores

slide-16
SLIDE 16

12/2/99 Assessment & Accountability 16

Accountability’s Good and Bad News

  • Good News: Demand for accountability is warranted

and if done well, could improve teaching and learning

  • Bad News: If current K-12 high-stakes

accountability systems serve as models, the demand for accountability will harm not benefit education by significantly narrowing the curriculum and the diversity of teaching methods

slide-17
SLIDE 17

12/2/99 Assessment & Accountability 17

Accountability Based On Faulty Logic

  • Accountability must be inferred from observing outcomes in any

system where all actions cannot be observed directly.

  • To do this “inferencing,” the performance measure is an indicator of

the desired behavior, not the behavior itself.

– In business,

  • There is a clear performance measure--revenue or stock price--to guide

business decisions and action.

  • You can’t manage a business if you can’t measure it’s performance.

– In education,

  • Performances (goals) are many and debated.
  • The performance indicator--most often a multiple-choice achievement test--is

but a distant proxy for the desired outcome.

  • When this indicator becomes an end in itself, and it does in education, well-

intentioned accountability may very well distort the system it was intended to improve.

slide-18
SLIDE 18

12/2/99 Assessment & Accountability 18

Linking Assessment To Learning Activities

Formative Summative Type Purpose Agency Learning Certification Accountability student teacher individual external tests sample surveys teacher external tests external tests individual

Paul Black 3/98

slide-19
SLIDE 19

12/2/99 Assessment & Accountability 19

California Learning Assessment System: Data Sources

B.

Standardized Curriculum-Embedded Assessments

Student’s Score

A.

“On Demand” Matrix Sampling

  • f Tasks & Tests

C.

Portfolios

slide-20
SLIDE 20

12/2/99 Assessment & Accountability 20

Conceptual Framework For CLAS

Aggregate Level of Performance

A.

Matrix Sample Benchmark: Multiple-Choice & Performance-Based Assessment “Moderated” Score: Individual, School & District Score Individual Level of Performance

B.

Standardized Curriculum-Embedded Assessments

C. Portfolios

Teacher Moderation

Teacher Calibration & Professional Development Sample from Class for Aggregation

slide-21
SLIDE 21

12/2/99 Assessment & Accountability 21

Governor Wilson’s Rationale For Discontinuing CLAS

SB 1273 [CLAS] takes a different approach . . . . Instead of mandating individual student scores first, with performance-based assessment incorporated into such scores as this method is proven valid and reliable, it mandates performance-based assessment now and treats the production of individual student scores as if it were the experimental technology-- which it clearly is not. In short, SB 1273 stands the priority for individual scores on its head.

slide-22
SLIDE 22

12/2/99 Assessment & Accountability 22

Concluding Comments on Assessment and Accountability

  • A broader framework for achievement testing is feasible and is needed to

improve teaching and assessment

  • Teacher formative and summative evaluation practices, and accountability

systems should include assessments linked to this broader framework

  • Accountability systems should link formative and summative assessment of

achievement to bring about education consistent with goals – Teachers’ formative classroom assessments are poorly developed but, if well developed, could raise standards significantly – Student self-assessment is poorly developed but, if well developed, would contribute most to self-directed, meaningful learning – Teachers’ summative assessments are currently problematic but if improved might be used for accountability purposes

  • more productive instructionally, and
  • more dependable for assessment purposes