Strengths and Weaknesses of Corpus Linguistics in Legal Analysis: A Case Study of the Law and Language at the European Court of Justice Project
Karen McAuliffe University of Birmingham k.mcauliffe@bham.ac.uk @dr_KMcA
Strengths and Weaknesses of Corpus Linguistics in Legal Analysis: A - - PowerPoint PPT Presentation
Strengths and Weaknesses of Corpus Linguistics in Legal Analysis: A Case Study of the Law and Language at the European Court of Justice Project Karen McAuliffe k.mcauliffe@bham.ac.uk University of Birmingham @dr_KMcA Goldfarb, Neal, Corpus
Karen McAuliffe University of Birmingham k.mcauliffe@bham.ac.uk @dr_KMcA
28 Member States 24 Official Languages
28 Member States 24 Official Languages
BG ES CS DA DE ET
EL
EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV BG ES CS DA DE ET
EL
EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV
JUDGMENT WORKING LANGUAGE: FRENCH
Case brought before CJEU Allocated to judge rapporteur (and AG where relevant) Documents translated into French Report of the judge rapporteur prepared by référendaire (in French) Where relevant, AG and référendaires prepare opinion (in pivot languages) First version of judgment drajed by référendaire (in French) Secret deliberaFons (in French) Final judgment drajed (in French) Judgment translated into language of the case (authenFc version of judgment and version signed by judges) and all other official languages
McAuliffe, K (2011) “Hybrid Texts and Uniform Law? The producFon of a mulFlingual jurisprudence by the Court of JusFce of the European Union” Interna&onal Journal for the Semio&cs of Law 24(1), 97-115 McAuliffe, K (2013) “The LimitaFons of a MulFlingual Legal System” Interna&onal Journal for the Semio&cs of Law 26(4) 861-882
PIVOT LANGUAGES
EN DE ES FR IT (PL)
BG ES CS DA DE ET EL EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV
….
(2208 texts, > 10 million words) (341 texts, 2 million words) (276 texts, > 2,5 million words) (367 texts, > 3 million words) (2357 texts, > 10 million words) (410 texts, > 2,5 million words) (339 texts, > 3,5 million words) (345 texts, > 3,5 million words)
0.468 0.468 1.146 0.86 0.849 0.2 0.4 0.6 0.8 1 1.2 1.4 before 2004 ajer 2004
English corpus lexical variety
Judgments NaFve opinions Non NaFve opinions 0.43 0.43 1.103 0.893 0.788 0.2 0.4 0.6 0.8 1 1.2 before 2004 ajer 2004
French corpus lexical variety
Judgments NaFve opinions Non NaFve opinions
52.01 55.07 33.63 45.01 36.22 10 20 30 40 50 60 ajer 2004 before 2004
Mean sentence length of English corpus
Judgments NaFve opinions Non naFve opinions 57.32 64.13 33.92 46.24 36.09 10 20 30 40 50 60 70 ajer 2004 before 2004
Mean sentence length of French corpus
Judgments NaFve opinions Non naFve opinions
(e.g. when, than, because, etc.)
0.86 0.88 0.9 0.92 0.94 0.96 0.98 1 1.02 1.04 1.06 English Judgments Opinions before 2004 NaFve opinions ajer 2004 Non naFve opinions ajer 2004 0.88 0.9 0.92 0.94 0.96 0.98 1 1.02 French Judgments Opinions before 2004 NaFve opinions ajer 2004 Non naFve opinions ajer 2004
English corpora 100% matching fragments Opinions Before 2004 Ajer 2004 NATIVE Ajer 2004 NON NATIVE All matching fragments 10,832 100% 4,145 100% 29,218 100% ≥ 10 words 331 3% 1,972 23.17% 2,028 27.81% ≥ 30 words 85 0,7% 1,000 11.75% 897 12.30% ≥ 100 words 5 0.04% 45 0.52% 41 0.56% French corpora 100% matching fragments Opinions Before 2004 Ajer 2004 NATIVE Ajer 2004 NON NATIVE All matching fragments 5,407 100% 7,155 100% 6,139 100% ≥ 10 words 389 7.19% 1,438 20.09% 1,269 20.08% ≥ 30 words 115 2.12% 559 7.81% 421 6.66% ≥ 100 words 8 0.14% 24 0.33% 21 0.33%