SLIDE 60 Introduction 100 years of research in readability Recipes for a readability model Main issues and challenges The modelling step
Example of the performance
Performance remains unsatisfactory for commercial usage in most studies !
Étude ♯ cl. lg. Acc.
R RMSE [Collins-Thompson and Callan, 2004a] 12 E. / / 0.79 / [Heilman et al., 2008a] 12 E. / 52% 0.77 2.24 [Pitler and Nenkova, 2008] 5 E. / / 0.78 / [Feng et al., 2010] 4 E. 70% / / / [Kate et al., 2010] 5 E. / / 0.82 / [François, 2011] 6
49% 80% 0.73 1.23 [François, 2011] 9
35% 65% 0.74 1.92 [Vajjala and Meurers, 2012] 5 E. 93.3% / / 0.15
Comparison between various models in [Nelson et al., 2012] :
Best model from [Nelson et al., 2012] is SourceRater [Sheehan et al., 2010] − → ρ = 0.860 on Gates-MacGinite corpus REAP achieve lower scores than classic models, such as DRP or Lexile.
60/119