Using Corpus Linguistics in Legal Research: Lessons from the Law and Language at the European Court of Justice Project
Karen McAuliffe University of Birmingham k.mcauliffe@bham.ac.uk @dr_KMcA
Using Corpus Linguistics in Legal Research: Lessons from the Law and - - PowerPoint PPT Presentation
Using Corpus Linguistics in Legal Research: Lessons from the Law and Language at the European Court of Justice Project Karen McAuliffe k.mcauliffe@bham.ac.uk University of Birmingham @dr_KMcA Goldfarb, Neal, Corpus LinguisFcs in Legal
Using Corpus Linguistics in Legal Research: Lessons from the Law and Language at the European Court of Justice Project
Karen McAuliffe University of Birmingham k.mcauliffe@bham.ac.uk @dr_KMcA
Producing a multilingual jurisprudence Development of ‘precedent’ in ECJ judgments The changing role of the AG
Case law analysis ObservaFonal data Interview data Corpus linguisFcs analysis SystemaFc literature reviews
28 Member States 24 Official Languages
28 Member States 24 Official Languages
Case brought before CJEU Allocated to judge rapporteur (and AG where relevant) Documents translated into French Report of the judge rapporteur prepared by référendaire (in French) Where relevant, AG and référendaires prepare opinion (in pivot languages) First version of judgment drahed by référendaire (in French) Secret deliberaFons (in French) Final judgment drahed (in French) Judgment translated into language of the case (authenFc version of judgment and version signed by judges) and all other official languages
TRANSLATION PROCESS
BG ES CS DA DE ET EL EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV BG ES CS DA DE ET EL EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV
JUDGMENT WORKING LANGUAGE: FRENCH
BG ES CS DA DE ET
EL
EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV BG ES CS DA DE ET
EL
EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV
JUDGMENT WORKING LANGUAGE: FRENCH
PIVOT LANGUAGES
1835 4939 1558 751 358 621 2085 1114 861 992 705
1000 2000 3000 4000 5000 6000
CJEU_fr REF_fra REF_bg CJEU_de REF_deu REF_at CJEU_en REF_uk REF_ir CJEU_it REF_it
Language shapes how those drahing the judgments talk and think about EU law The judgments expressed in that language shape the development of EU law. New concepts are developed and expressed in that hybrid/new language The language is used/repeated by those drahing the judgments.
1835 4939 1558 751 358 621 2085 1114 861 992 705
1000 2000 3000 4000 5000 6000
CJEU_fr REF_fra REF_bg CJEU_de REF_deu REF_at CJEU_en REF_uk REF_ir CJEU_it REF_it
Case brought before CJEU Allocated to judge rapporteur (and AG where relevant) Documents translated into French Report of the judge rapporteur prepared by référendaire (in French) Where relevant, AG and référendaires prepare opinion (in pivot languages) First version of judgment drahed by référendaire (in French) Secret deliberaFons (in French) Final judgment drahed (in French) Judgment translated into language of the case (authenFc version of judgment and version signed by judges) and all other official languages
before delivering judgments (in relevant cases)
which principles of EU law are developed
the ECJ’s jurisprudence
layered authorship
McAuliffe, K (2011) “Hybrid Texts and Uniform Law? The producFon of a mulFlingual jurisprudence by the Court of JusFce of the European Union” Interna,onal Journal for the Semio,cs of Law 24(1), 97-115 McAuliffe, K (2013) “The LimitaFons of a MulFlingual Legal System” Interna,onal Journal for the Semio,cs of Law 26(4) 861-882
EN DE ES FR IT (PL)
BG ES CS DA DE ET EL EN FR GA HR IT LV LT HU MT NL PL PT RO SK SL FI SV
What implicaCons might changes in the linguis7c aspect of the AG’s role have for the construcCon and consolidaCon of ECJ jurisprudence?
usefulness of opinions?
and their arguments more constrained by language as a result of the fact that they no longer drah in their mother tongue?
in pivot languages?
usefulness of opinions?
and their arguments more constrained by language as a result of the fact that they no longer drah in their mother tongue?
in pivot languages?
DeterminaFon
complexity and fluency Comparison
drahed before 2004……….……………………………….…. drahed aher 2004 from naFve AGs……….... drahed aher 2004 from non-naFve AGs…..
….
drahed before 2004…………………………………………… drahed aher 2004 from naFve AGs…………. drahed aher 2004 from naFve AGs………....
(2208 texts, > 10 million words) (341 texts, 2 million words) (276 texts, > 2,5 million words) (367 texts, > 3 million words) (2357 texts, > 10 million words) (410 texts, > 2,5 million words) (339 texts, > 3,5 million words) (345 texts, > 3,5 million words)
In English In French
0.468 0.468 1.146 0.86 0.849 0.2 0.4 0.6 0.8 1 1.2 1.4 before 2004 aher 2004
English corpus lexical variety
Judgments NaFve opinions Non NaFve opinions 0.43 0.43 1.103 0.893 0.788 0.2 0.4 0.6 0.8 1 1.2 before 2004 aher 2004
French corpus lexical variety
Judgments NaFve opinions Non NaFve opinions
52.01 55.07 33.63 45.01 36.22 10 20 30 40 50 60 aher 2004 before 2004
Mean sentence length of English corpus
Judgments NaFve opinions Non naFve opinions 57.32 64.13 33.92 46.24 36.09 10 20 30 40 50 60 70 aher 2004 before 2004
Mean sentence length of French corpus
Judgments NaFve opinions Non naFve opinions
HYPOTACTIC STRUCTURES
subordinate conjuncFons
(e.g. when, than, because, etc.)
the wordlist and of its frequency
subordinate conjuncFon included in the considered list
2882 + 2767 + ……. = In the French corpus:
0.86 0.88 0.9 0.92 0.94 0.96 0.98 1 1.02 1.04 1.06 English Judgments Opinions before 2004 NaFve opinions aher 2004 Non naFve opinions aher 2004 0.88 0.9 0.92 0.94 0.96 0.98 1 1.02 French Judgments Opinions before 2004 NaFve opinions aher 2004 Non naFve opinions aher 2004
English corpora 100% matching fragments Opinions Before 2004 Aher 2004 NATIVE Aher 2004 NON NATIVE All matching fragments 10,832 100% 4,145 100% 29,218 100% ≥ 10 words 331 3% 1,972 23.17% 2,028 27.81% ≥ 30 words 85 0,7% 1,000 11.75% 897 12.30% ≥ 100 words 5 0.04% 45 0.52% 41 0.56% French corpora 100% matching fragments Opinions Before 2004 Aher 2004 NATIVE Aher 2004 NON NATIVE All matching fragments 5,407 100% 7,155 100% 6,139 100% ≥ 10 words 389 7.19% 1,438 20.09% 1,269 20.08% ≥ 30 words 115 2.12% 559 7.81% 421 6.66% ≥ 100 words 8 0.14% 24 0.33% 21 0.33%