the new frontier for assessment
play

the New Frontier for Assessment Development: Rich New Soil or a - PowerPoint PPT Presentation

Process Data the New Frontier for Assessment Development: Rich New Soil or a Quixotic Quest? Stephen Provasnik National Center for Education Statistics, U.S. Department of Education 6 May 2019 Overview What we discussed at the ETS


  1. Process Data — the New Frontier for Assessment Development: Rich New Soil or a Quixotic Quest? Stephen Provasnik National Center for Education Statistics, U.S. Department of Education 6 May 2019

  2. Overview • What we discussed at the ETS symposium on Process Data in Washington, DC last December • What I think we might be able to agree on in regards to uses for Process Data • Where we could go by venturing into this new land

  3. Logfiles vs. process data • Logfiles - everything captured in a digital-based assessment (DBA) – from the order and speed of inputs (e.g., clicks and keystrokes) to the VPN of the device used to take the assessment. • Process Data - the empirical data that reflect the process of working on a test question – reflecting cognitive and noncogitive, particularly psychological, constructs.

  4. TIMSS Video Studies (late 1990s) Recordings of selected classroom lessons were coded to indicate (among other many things): • the assigned type of work, with categories of “whole -class work,” “individual work,” “pair/partner work,” and “small -group work” • the “number of words” the classroom teacher used in a ratio to the number of words students used, when talking to the whole class • the proportion of the lesson spent on review of previous content vs. spent explaining new content

  5. Conclusions from the last December’s symposium 1. Develop a systemic approach to logfiles — to answer the question of what exactly logfiles should capture 2. Develop a theory for Process Data — to answer the question of how to use process data 3. Develop guidelines and standards for how to convert logfiles into process data

  6. Spandrels of San Marco

  7. Spandrels of San Marco Spandrel 1 Spandrel 2

  8. Spandrels of San Marco Dome Arch 1 Arch 3 Arch 2

  9. Larynx Source: https://www.cell.com/current-biology/pdf/S0960-9822(08)00371-0.pdf

  10. Larynx Source: https://www.cell.com/current-biology/pdf/S0960-9822(08)00371-0.pdf

  11. Ongoing Evolution in Assessment Past Present Future Item Development Labor Intensive Labor Intensive Automatized Item types Generic Enhanced Real-life Test design Static Semi-static Data-driven Test assembly Labor Intensive Semi-automatized Automatized Accessibility Limited Universal design Adaptive Timing Not measurable Measured Used Pathways Not observable Observable Modeled Validity Content/core-based Construct based Process based Feedback Summative Summative Diagnostic

  12. Diagnostic or forensic applications These include using logfiles and process data to improve data quality • by helping understand how items function and what variables make items more difficult or more reliable items • by distinguishing among “missing” answers which are – “not reached” (never seen) – “omitted” (seen, taken time over, but ultimately skipped) – “not attempted” (seen, but not time taken before being skipped) • by identifying student guessing or cases that are outliers, which may indicate possible cases of cheating, or cases of programming error

  13. Visualization of NAEP reading patterns from sampled logfiles Each sampled student represented by a blue dot. Pages of text represented by “roofs” indicating which page being looked at. Ten test questions represented by “buckets”

  14. Visualization of NAEP reading patterns from sampled logfiles

  15. Diagnostic or forensic applications These include using logfiles and process data to improve data quality • by helping understand how items function and what variables make items more difficult or more reliable items • by distinguishing among “missing” answers which are truly “not reached” (never seen), which should be “omitted” (seen, taken time over, but ultimately skipped), and which are “not attempted” (seen, but not time taken before being skipped) • by identifying student guessing or cases that are outliers, which may indicate possible cases of cheating, or cases of programming error

  16. Research into understanding respondent behaviors and cognitive strategies For example • to improve teaching and learning with specific information on how different students think/perform • to better understand factors that distinguish high- and low- performers, or expert from novice strategies • to better understand the relationship of motivation and performance.

  17. Use of Process Data from NAEP Writing Essay Length by Writing Time

  18. Expanded Use of Process Data Essay Length by Writing Time

  19. Expanded Use of Process Data Essay Length by Writing Time

  20. Expanded Use of Process Data Essay Length by Writing Time

  21. Expanded Use of Process Data Essay Length by Writing Time

  22. NCES Example of Process Data Analysis

  23. Test Development Before DBA Field Test data Field Test data Main Study data Main Study data Framework Framework collection collection collection collection Score Score Score Score Item Writing Item Writing results results results results Cog lab or Cog lab or IRT scaling, IRT scaling, Review Item stats Review Item stats piloting of piloting of weighting weighting and parameters and parameters items items Select final item Select final item Analysis and final Analysis and final Create FT Create FT pool pool report report booklets booklets Make final booklets Make final booklets Release dataset Release dataset 23

  24. Test Development for DBA Field Test data Field Test data Main Study data Main Study data Framework Framework collection collection collection collection Review Review logfiles and logfiles and Item Writing Item Writing extract extract Score results Score results Score results Score results process data process data Coders Coders Cog lab or Cog lab or render render Review Item stats and Review Item stats and IRT scaling, weighting IRT scaling, weighting piloting of piloting of Use Use item item parameters parameters and logfiles and logfiles process process items items data for data for Select final item pool Select final item pool Analysis and final report Analysis and final report Create FT Create FT scaling scaling Coders program items Coders program items booklets booklets and testlets and testlets Coders program final Coders program final Make final booklets Make final booklets Analyze process data for reporting Analyze process data for reporting Release dataset Release dataset instruments and logfiles instruments and logfiles Anonymize process data for release Anonymize process data for release 24 in dataset in dataset Device Device Device Device management management Release dataset Release dataset management management 24

  25. Test Development for DBA Main Study data Main Study data Framework Framework Field Test data collection Field Test data collection collection collection Review Review logfiles and logfiles and Item Writing Item Writing extract extract Score results Score results Score results Score results Coders Coders process data process data render render Review Item stats and Review Item stats and Cog lab or Cog lab or IRT scaling, weighting IRT scaling, weighting Use Use item item parameters parameters and logfiles and logfiles piloting of items piloting of items process process data for data for Select final item pool Select final item pool Analysis and final report Analysis and final report Create FT Create FT Coders program items Coders program items scaling scaling booklets booklets and testlets and testlets Coders program final Coders program final Make final booklets Make final booklets Analyze process data for reporting Analyze process data for reporting Release dataset Release dataset instruments and logfiles instruments and logfiles Anonymize process data for release in Anonymize process data for release in 25 dataset dataset Device Device Device Device Release dataset Release dataset management management management management 25

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend