Automating Second Language Acquisition Research: Integrating - PowerPoint PPT Presentation

Introdution Dataset System Case study Conclusions Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou University of Cambridge Visualisation of Linguistic Patterns EACL 2012 Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset System Case study Conclusions Outline 1 Introdution 2 Dataset 3 System 4 Case study 5 Conclusions Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Introduction Common European Framework of Reference for Languages (CEFR) International benchmark of language attainment at different stages of learning Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Introduction Common European Framework of Reference for Languages (CEFR) Divides learners into three broad divisions: A Basic User A1 Breakthrough or beginner A2 Waystage or elementary B Independent User B1 Threshold or intermediate B2 Vantage or upper intermediate (e.g., can produce clear, detailed text on a wide range of subjects and explain a viewpoint on a topical issue giving the advantages and disadvantages of various options ) C Proficient User C1 Effective Operational Proficiency or advanced C2 Mastery or proficiency Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Introduction Common European Framework of Reference for Languages (CEFR) International benchmark of language attainment at different stages of learning English Profile (EP) research programme Enhance the learning, teaching and assessment of English as an additional language Reference level descriptions of the language abilities expected at each learning stage Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Introduction Common European Framework of Reference for Languages (CEFR) International benchmark of language attainment at different stages of learning English Profile (EP) research programme Enhance the learning, teaching and assessment of English as an additional language Reference level descriptions of the language abilities expected at each learning stage Goal Understand the linguistic abilities that characterise different levels of attainment and, more generally, developmental aspects of learner grammars Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Theory-driven approach Approach Theory-driven approach Linguistic intuition Literature on learner English Hypotheses that are well understood Target language determiner systems cause problems for learners whose native language doesn’t utilise determiners Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Theory-driven approach Approach Theory-driven approach Linguistic intuition Literature on learner English Hypotheses that are well understood Target language determiner systems cause problems for learners whose native language doesn’t utilise determiners Risks ’finding the obvious’ Large-scale databases How can we extract data efficiently and reliably to evaluate linguistic hypotheses? How can we make ”observations” or extract patterns that may lead to new hypotheses? Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset Goal System Theory-driven approach Case study Data-driven approach Conclusions Data-driven approach Our approach More empirical perspective for linguistic hypotheses on learner grammars Machine Learning Advantages Partially automate the process of hypothesis creation Alternative route to learner grammars Useful adjunct to hypothesis-driven approach Powerful methodology for exploring a large hypothesis space Data-driven approaches quantitatively very powerful Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset System First Certificate in English (FCE) exam Case study Conclusions First Certificate in English (FCE) exam FCE Writing Component CEFR level: vantage or upper-intermediate (B2) Two tasks eliciting free-text answers, each one between 120 and 180 words (e.g. ‘write a short story commencing ...’) Answers annotated with mark (in the range 1–40), fitted to a RASCH model (Fischer and Molenaar, 1995) Manually error-coded using a taxonomy of ∼ 80 error types (Nicholls, 2003) Meta-data Candidate’s grades Native language Age Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Dataset System First Certificate in English (FCE) exam Case study Conclusions First Certificate in English (FCE) exam – cont. FCE Writing Component Manually error-coded using a taxonomy of ∼ 80 error types (Nicholls, 2003) Examples It is a very beautiful place and the people there < NS type=‘AGV’ > < i > is < /i > < c > are < /c > < /NS > very kind and generous . I will give you all < NS type=‘MD’ > < c > the < /c > < /NS > information you need. Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Machine Learning Dataset Feature Set System Information Visualisation Case study Visual User Interface Conclusions Machine Learning Discriminative Learning Supervised discriminative machine learning methods to automate the assessment of the FCE exam (Briscoe et al., 2010) Binary classifier that best discriminates passing from failing FCE scripts (trained on FCE scripts) Linear Perceptron classifier Feature set: lexical and part-of-speech (POS) ngrams (among other feature types) Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Machine Learning Dataset Feature Set System Information Visualisation Case study Visual User Interface Conclusions Highly Ranked Discriminative Feature Instances Feature Type Example VM RR (+) POS bigram could clearly , because ( − ) word bigram , because of how to ( − ) word bigram * teach the others how to dance necessary (+) word unigram it is necessary that the people ( − ) word bigram * the people are clever probably (+) word unigram we are probably going VV ∅ VV ∅ ( − ) POS bigram * technology keep develop NN2 VVG (+) POS bigram children smiling II VVN ( − ) POS bigram * I want to gone Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Introdution Machine Learning Dataset Feature Set System Information Visualisation Case study Visual User Interface Conclusions Discriminative Instances Issues Hundreds of thousands of discriminative feature instances Proxies to aspects of the grammar and need interpretation Evaluate higher-level, more general and comprehensible hypotheses Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou Automating Second Language Acquisition Research

Automating Second Language Acquisition Research: Integrating - PowerPoint PPT Presentation

Introdution Dataset System Case study Conclusions Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou University of Cambridge

First Language Acquisition: Inherent Difficulty of Language Acquisition Theories and Evidence

Psycholiguistics Lecture 7 By Dr. Chelli Introduction to second language acquisition Lecture

Automating the Automating the configuration of flow configuration of flow monitoring probes

Automating batch fecundity measurements Automating batch fecundity measurements using digital

REDHAT KICKSTART REDHAT KICKSTART Automating Linux Installation Automating Linux Installation

Automating MySQL Deployments on Kubernetes Calin Don & Flavius Mecea Presslabs Automating

Automating Authority Work Automating authority work, or, Be your own authority control vendor

Automating Production of Cross Media Automating Production of Cross Media Content for

RANDOMIZING AND RANDOMIZING AND AUTOMATING ASSESSMENT AUTOMATING ASSESSMENT WITH R WITH R exams

FY1 0 Second Quarter FY1 0 Second Quarter FY1 0 Second Quarter FY1 0 Second Quarter

Linguistics 101 Language Acquisition Language Acquisition All (normal) human children...

Stages of Language Acquisition Learning In-Utero Stages of Babbling Stages of Phonemic

Second Language Learners and Language Development How long does it take to learn a second

Portfolio Acquisition Portfolio Acquisition Portfolio Acquisition from from from Safe Harbor

Land Acquisition and Relocation Process Presented by: Lynn Green, Director of Acquisition

E-COMPASS ACQUISITION CORP. Acquisition of NYM Holding, Inc. Investor Presentation August 2016

Towards a theory of multivariate generating functions Mark C. Wilson University of Auckland

Lecture Slides - Part 4 Bengt Holmstrom MIT February 2, 2016. Bengt Holmstrom (MIT) Lecture

LEAKAGE - RESILIENT PUBLIC - KEY ENCRYPTION FROM OBFUSCATION Dana Dachman-Soled, S. Dov

Generic component-based middleware for a peer-to-peer flexible robot architecture What is MT

Decision aid methodologies in transportation Lecture 6 (part-2): Container terminal management

Lecture 04 (11.11.2013) Hazard Analysis Techniques Christoph Lth Christian Liguda SQS, WS

A FINE-GRAINED APPROACH TO ALGORITHMS AND COMPLEXITY Virginia Vassilevska Williams Stanford

I n s o l i d a r i t y w i t h t h e p e o p l e o f F e r g u s

Sambuz

Useful Links

Newsletter

Mail Us

Automating Second Language Acquisition Research: Integrating - PowerPoint PPT Presentation

Introdution Dataset System Case study Conclusions Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning Helen Yannakoudakis, Ted Briscoe, Theodora Alexopoulou University of Cambridge

First Language Acquisition: Inherent Difficulty of Language Acquisition Theories and Evidence

Psycholiguistics Lecture 7 By Dr. Chelli Introduction to second language acquisition Lecture

Automating the Automating the configuration of flow configuration of flow monitoring probes

Automating batch fecundity measurements Automating batch fecundity measurements using digital

REDHAT KICKSTART REDHAT KICKSTART Automating Linux Installation Automating Linux Installation

Automating MySQL Deployments on Kubernetes Calin Don &amp; Flavius Mecea Presslabs Automating

Automating Authority Work Automating authority work, or, Be your own authority control vendor

Automating Production of Cross Media Automating Production of Cross Media Content for

RANDOMIZING AND RANDOMIZING AND AUTOMATING ASSESSMENT AUTOMATING ASSESSMENT WITH R WITH R exams

FY1 0 Second Quarter FY1 0 Second Quarter FY1 0 Second Quarter FY1 0 Second Quarter

Linguistics 101 Language Acquisition Language Acquisition All (normal) human children...

Stages of Language Acquisition Learning In-Utero Stages of Babbling Stages of Phonemic

Second Language Learners and Language Development How long does it take to learn a second

Portfolio Acquisition Portfolio Acquisition Portfolio Acquisition from from from Safe Harbor

Land Acquisition and Relocation Process Presented by: Lynn Green, Director of Acquisition

E-COMPASS ACQUISITION CORP. Acquisition of NYM Holding, Inc. Investor Presentation August 2016

Towards a theory of multivariate generating functions Mark C. Wilson University of Auckland

Lecture Slides - Part 4 Bengt Holmstrom MIT February 2, 2016. Bengt Holmstrom (MIT) Lecture

LEAKAGE - RESILIENT PUBLIC - KEY ENCRYPTION FROM OBFUSCATION Dana Dachman-Soled, S. Dov

Generic component-based middleware for a peer-to-peer flexible robot architecture What is MT

Decision aid methodologies in transportation Lecture 6 (part-2): Container terminal management

Lecture 04 (11.11.2013) Hazard Analysis Techniques Christoph Lth Christian Liguda SQS, WS

A FINE-GRAINED APPROACH TO ALGORITHMS AND COMPLEXITY Virginia Vassilevska Williams Stanford

I n s o l i d a r i t y w i t h t h e p e o p l e o f F e r g u s

Sambuz

Useful Links

Newsletter

Mail Us

Automating MySQL Deployments on Kubernetes Calin Don & Flavius Mecea Presslabs Automating