Comparative Judgement An alternative approach to essay grading MEET - - PowerPoint PPT Presentation

comparative judgement
SMART_READER_LITE
LIVE PREVIEW

Comparative Judgement An alternative approach to essay grading MEET - - PowerPoint PPT Presentation

Randomly Distributed Comparative Judgement An alternative approach to essay grading MEET THE research team Dr. Cox Mornie Sims Dr. Eckstein Dr. Hartshorn Judson Hart Dr. Wilcox Col Reliability consistency d Validity authenticity


slide-1
SLIDE 1

An alternative approach to essay grading

Comparative Judgement

Randomly Distributed

slide-2
SLIDE 2

MEET THE

research team

  • Dr. Eckstein

Dr. Hartshorn Mornie Sims

  • Dr. Cox

Judson Hart

  • Dr. Wilcox
slide-3
SLIDE 3

Col d War

Reliability Validity

consistency authenticity

slide-4
SLIDE 4

1880s – inconsistent scoring reliability → ? validity indirect → MC testing component skills highly reliable strongly correlated with writing grades

reliability?

slide-5
SLIDE 5

1961 Study – opposite effect spurious correlations (# of bathrooms) teacher focus on component skills (Braddock, et al.) writing → active skill MC → passive, undue attention to less important features

validity?

slide-6
SLIDE 6

direct writing

Rubrics Training Double-rating Adjudication MFRM

assessme nt

RELIABILITY IN

slide-7
SLIDE 7
  • Absolute judgment
  • External standard
  • Training/calibration

rubric

THE METHOD

slide-8
SLIDE 8
  • Comparison
  • Relative choice
  • Instinctual skill

RANDOMLY DISTRIBUTED

comparativ e judgment

slide-9
SLIDE 9

“There is no absolute judgment. All judgments are comparisons of one thing to another.”

[Donald Laming]

slide-10
SLIDE 10

Explicit comparison Minimizes training Minimizes bias Inherent algorithm Implicit comparison Training for consensus Unavoidable bias MFRM

RR

&

RDCJ

slide-11
SLIDE 11

works.

HOW IT

slide-12
SLIDE 12

demo

nomoremarking.com

https://www.nomoremarking.com/demo1

slide-13
SLIDE 13

test it!

nomoremarking.com

https://www.nomoremarking.com/judges/reg/sLRRwmGAe65Wx3mbv

slide-14
SLIDE 14

CJ

RATIONALE

Steedle and Ferrara, 2016

CJ eliminates common scoring biases Strictness vs leniency Central or extreme tendencies Additionally it is less cognitively demanding/time consuming per judgment it requires less training evidence suggests that it is highly accurate (Gill & Bramley, 2008)

slide-15
SLIDE 15

comparative judgment

Reliable and Practical? and Can we trust the results?

…is a promising alternative, BUT is it…

slide-16
SLIDE 16

research question

How does traditional rubric rating compare with MFRM (many facet Rasch model) and RDCJ (randomly distributed Comparative Judgment) in an ESL setting in terms of reliability, validity, and practicality?

slide-17
SLIDE 17

Analysis

Rater Group B 4 Novice 4 Experienced Rater Group A 4 Novice 4 Experienced Essay Set 1 (n=37) Essay Set 2 (n=38) Rubric Rating (RR) MFRM Fair Average Randomly Distributed Comparative Judgment (RDCJ) RDCJ True Score 20% ANCOVA

  • I. Samples t Tests

Spearman's Rho Figure 2. Study design to compare traditional rubric rating (RR) to multi-facet Rasch modeling (MFRM) and randomly distributed comparative judgment (RDCJ). Analysis of variance (ANOVA) run to test for effects on rating time and Spearman’s rho used to correlate between MFRM adjusted fair average, the study rubric rating fair averages, and RDCJ true scores to show evidence of validity.

Raters Essays Ratings

slide-18
SLIDE 18

Essays

SELECTED

slide-19
SLIDE 19

WITHOUT MFRM

Rubric Ratin g

slide-20
SLIDE 20

RELIABILITY & VALIDITY

Evidence

slide-21
SLIDE 21

Practicality

DATA

slide-22
SLIDE 22

d

COHEN’S

slide-23
SLIDE 23

t TESTS

slide-24
SLIDE 24

ANALYSIS OF

Covarianc e

slide-25
SLIDE 25

ANALYSIS OF

Covarianc e

slide-26
SLIDE 26

essay

LENGTH & RATINGS

slide-27
SLIDE 27

CJ

APPLICATIONS

Barkhaoui, 2016 Bramley, 2015 Christodolou, 2016 Heldsinger & Humphrey, 2013

Especially suited to productive tasks Portfolios, essays, short answer Many subject areas English, ESL, History, Geography Interesting Applications Mathematical problem solving Peer Assessment (highly reliable & correlated with expert ratings)

slide-28
SLIDE 28

SUBJECT

Areas

slide-29
SLIDE 29

ASSESSMENT

Peer

slide-30
SLIDE 30

ASSESSMENT

(cont)

Peer

slide-31
SLIDE 31

EXEMPLARS

calibrate d

slide-32
SLIDE 32

thank you!

Comparative Judgment

Mornie Sims eslmornie@gmail.com

  • Dr. Troy Cox

Troy_cox@byu.edu

  • Dr. Matthew Wilcox

wilcoxmp@byu.edu

  • Dr. Grant Eckstein

grant_eckstein@byu.edu

  • Dr. K. James Hartshorn

James_Hartshorn@byu.edu Judson Hart hatuhart@gmail.com

slide-33
SLIDE 33

essay prompt

Identify one improvement that would make your city a better place to live for people your age and explain why people your age would benefit from this change. Use specific reasons and examples to support your opinion and describe the potential immediate and long-term consequences of this

  • improvement. You have 30 minutes to write your response.
slide-34
SLIDE 34

STUDY

Rubric