Marie Garnier
CAS/IRIT, Université de Toulouse, France EuroCALL 2012, 22-25 August, Gothenburg
Aut utom
- mat
atic ic Cor
- rrecti
ection
- n of
- f Adv
dver erb Pl Plac acem emen ent t Er Error
- rs
s for
- r CAL
Aut utom omat atic ic Cor orrecti ection on of of Adv dver - - PowerPoint PPT Presentation
Aut utom omat atic ic Cor orrecti ection on of of Adv dver erb Pl Plac acem emen ent t Er Error ors s for or CAL ALL Marie Garnier CAS/IRIT, Universit de Toulouse, France EuroCALL 2012, 22-25 August, Gothenburg Background
2
Persistence of adverb placement errors at an intermediate
Adverbs in V-Adv-O order are markedly more frequent, mostly
Survey of 11 common grammar checkers: adverb placement
Tricky grammatical issue
3
French/English: partial overlap between canonical positions for
Possible positions for adverb lentement (Fr.) / slowly (Eng.):
V-Adv-O accepted/necessary in some cases in English
May
4
Objective: design automatic detection and correction strategies
6
Detecting errors and predicting correct placement means
Lack of definitive rules: Multifactorial syntactic phenomenon (Garnier, 2012):
semantic type of adverb and scope (VP/clause) weight and composition of the AdvP (long/short adverb, very/too/more) sentence structure (transitive/intransitive VP, information packaging
choices)
prosody (prosodically integrated/detached adverbs)
7
Huddleston and Pullum (2002). CGEL, p. 576
Indicative study: 3 native English speakers (2 AmEng, 1 BrEng) Objective: assess the influence of specific syntactic parameters
Parameters: Weight and structure of the AdvP (long/short adverb, modification by
very, too, more, etc.)
Presence and weight of verb complements (direct and indirect
Verbs followed by prep. phrases Presence of auxiliaries
8
9
56 sentences organized in 13 sets of 3 to 5 sentences
NS asked to identify all correct placements + choose the most
Results:
10
Creation of detection patterns and rewriting rules 22 detection patterns associated with 1 to 3 rewriting rules
Resources:
Lexicons: adverbs, aux., verbs, prep., adjectives, nouns, determ. Grammars: Noun Phrase, Adjective Phrase
11
Error She must remove carefully the bandage. Detection pattern NP1 (Aux) (Aux) Vlex (Prep) ADV NP2 Rewriting rules
Corrected sentence 1. She must carefully remove the bandage.
Table 1. Schematized example of a pattern/rewriting rule
12
Patterns and rules implemented within <TextCoop> <TextCoop>: discourse analysis platform programmed in
Logic-based programming means model can be implemented
forme(corr-adva, [plus(aux(AUX,_)), verb(V,_), opt(prep(P1,_)), adv(ADV,manner),
[conc(VE1,E2,E1), conc(NP11,S,E5), card(NP11, T), (T < 5)], ['<erreur3a>', AUX, VE1, P1, ADV, P2, NP11, '</erreur>',' <correct>', AUX, ADV, VE1, P1, P2, NP11, '</correct>']). forme(corr-advb, [plus(aux(AUX,_)), verb(V,_), opt(prep(P1,_)), adv(ADV,manner),
[conc(VE1,E2,E1), conc(NP11,S,E5), card(NP11, T), (T < 5)], ['<erreur3b>', AUX, VE1, P1, ADV, P2, NP11, '</erreur>',' <correct>', AUX, VE1, P1, P2, NP11, ADV,'</correct>']).
Table 2. Example of a pattern/rewriting rule in <TextCoop>
14
Evaluation on an 80,000-word corpus of native English (British
and American online newspapers, British and American blog posts, scientific publications)
279 occurrences of manner adverbs False positives = < 4% Main cause: words belonging to several grammatical categories
(ex. check n./v., use n./v.)
15
Evaluation on a modified 20,000-word corpus of learner and
The availability of appropriate learner/user corpora for testing is
16
2nd proposition: 70% precision Simplest patterns (i.e. adverb without modification in finite sentences
with NPs of less than 5 words) are used in 83% of all cases
Among them, V-Adv-O patterns used in 48% of all cases Patterns are mutually exclusive (no conflicts)
17
Number of syntactic configurations that can be recognized:
e.g. embedded that-clauses, ditransitive VPs
Words belonging to more than one category:
e.g. check, use, purchase, being, etc.
Adverb with more than one semantic type:
e.g. He looked at her sadly. (= manner) Sadly, they were arguing about money again. (= act-related)
Requires mostly grammatical input, but:
Intermediate users/learners: overall sentence structure is usually
mastered
Patterns use categories, which allows for a margin of error in the
input text
18
Directly portable to other romance languages (Italian, Spanish) Architecture is adaptable to any kind of L1, with specific
Creation of new patterns and new feedback messages is easy
Design of specific feedback for each error type is simplified
19
Five key steps:
20
In this sentence, the manner adverb [separately] is placed between the verb [tested] and the object [all the parameters]. In English, manner adverbs are not generally found between a verb and a following object. There are 2 ways to correct this sentence:
and the verb. Examples:
21
Learner: looking for help with her/his learning steps 1, 2, 3, 4, 5 without the correction, in order to elicit
Curious: looking for more information on the error or the
steps 1, 2, 3, 4, 5 Careful: wants to check why the segment was flagged as an
steps 1, 2, 5 Rushed: only wants to see the error and its correction(s) step 1
22
Profile is selected in advance by the user of the system Feedback is displayed on an html page when user clicks on the
Specific messages for each pattern + include the actual adverb,
23
Complete implementation of feedback messages and conduct
Evaluate usability and adequacy of feedback system with
Creation of a resource for adverbs: identification of major
Extension of the system to other errors :
Use of also word-order errors in the NP, i.e. N+N constructions
Integration in a tutorial/grammar-checking system
24
Foster, Jennifer and Oistein Andersen (2009). "GenERRate: Generating errors for use in grammatical error detection". Actes du workshop ACL Use of NLP for building educational applications. Garnier, Marie. (2011). "Explanation and corrective feedback in grammar checking systems". Proceedings of the 6th International ExACt workshop at IJCAI'11, 81-90. Garnier, Marie. (2012). "Automatically correcting adverb placement errors in the writings of French users of English". Procedia – Social and Behavioral Sciences 34, 59-63. Amsterdam: Elsevier. Granger, Sylviane and Estelle Dagneaux, Fanny Meunier and Magali Paquot, éds. (2009). International Corpus of Learner English v.2. Louvain La Neuve : Presses Universitaires de Louvain. Heift, Trude. (2004). "Corrective feedback and learner uptake in CALL". ReCALL, 16:2, 416- 431. Heift, Trude and Mathias Schulze. (2007). Errors and Intelligence in Computer-Assisted Language Learning: Parsers and Pedagogues. Routledge, New York. Huddleston, Rodney and Geoffrey Pullum (2002). The Cambridge Grammar of the English
Meurers, Detmar and Vanessa Metcalf (2006), "Towards a treatment of word-order errors in CALL: When to use deep processing and when not to". Large-Scale Grammar Development and Grammar Engineering Research Workshop of the Israel Science Foundation Odlin, Terence (1989). Language Transfer: Cross-linguistic influence in language learning. New York: Cambridge University Press. Osborne, John (2008). "Adverb placement in post-intermediate learner English: a contrastive study of learner corpora". Linking up Contrastive Linguistics and Interlanguage Research, Gaëtanelle Gilquin, Szilvia Papp, María Belén Díez-Bedmar. Londres : Rodopi. Saint-Dizier, Patrick (2011). "TextCoop : un analyseur de discours basé sur des grammaires logiques". Actes du colloque TALN.