SLIDE 1 CorALit: a Tool for Discovering Lithuanian Academic Identity
Aurelia Usonienė Birutė Ryvitytė Jolanta Šinkūnienė aurelia@usonis.lt birute.ryvityte@flf.vu.lt jolanta2palmeris@gmail.com
SLIDE 2
Background (Gotti 2007)
Globalisation & extensive scientific
collaboration promotes
spread of E as lingua franca in
international communication
possible transfer of Anglocentric textual
models
move towards global communicative
models
? Hybrid identity of academic discourse
SLIDE 3
TOOL
CADIS - Corpus of Academic English (2005)
to analyse the most significant macro- microlinguistic variants in terms of identity, evaluation and interpretation
speech acts expressing positive/negative
evaluation;
the pragmatic, interpersonal plane of
discourse (stance, hedging, politeness);
the function of verbal and lexical modality;
SLIDE 4
KIAP project (Norway 2002 – 2006) Cultural Identities and Academic Voices (Fløttum e.a. 2007)
Corpus-based analysis (3 million words)
E vs. F vs. NO
Disciplines
economics – linguistics - medicine
Features analysed
Markers of epistemic modality
SLIDE 5
CorALit: purpose
Corpus of Academic Lithuanian
to be used for linguistic research of academic
discourse
to facilitate understanding of identity-forming
features
to reveal presence/absence of signs of (cultural)
resistance to ‘hybridisation’
to explore the impact of E as lingua franca on
the identity of academic LT
SLIDE 6
Corpus design and structure
funded by the Lithuanian State Science and Studies Foundation (2007-2009)
Specialised
biomedical, humanities, social, physical,
technological sciences
Comparable (across disciplines) Encoded (TEI 5P guidelines) Written texts Synchronic
SLIDE 7
Size and structure
5 221 874 1 225 TOTAL: 547 030 81 T 1 459 098 344 S 430 806 70 P 2 114 543 449 H 670 397 281 B No of words No of texts Science areas
SLIDE 8
Structure: Text types 1 868 074 84
books
3 167 834 990
articles
130 571 144
reviews, summaries, abstracts, etc. No of words No of texts Text type
SLIDE 9
Lexical saturation
SLIDE 10 Academic discourse
“involves a complex cognitive and linguistic
activity whereby writers construct ideal texts with particular reading audiences in mind, and seek, in realising those ideal texts, to design their discursive structures and to realise their lexico-grammatical textualisations to match the conventions of the genres within which they are writing and the <...> expectancies and competencies of the readership they have in mind.”
(Hyland 2000:xv)
SLIDE 11
Author’s presence in academic LT
ATTITUDE
Epistemic qualification
Probability Source of info/knowledge
Evaluation Subjectivity
SLIDE 12
Realizations of Ep in LT Epistemic adverbials in LT
Modal words (gal, galbūt, turbūt) Particles (bemaž, kažin, gal, turbūt) Adverbs (mažne, tikriausiai)
SLIDE 13
Frequency lists
Grumadienė and Žilinskienė 1997;
a lemmatised frequency list of the 1, 2M-
word corpus of written Lithuanian
Andrius Utka
frequency list 60M-word Corpus of
Contemporary Lithuanian Language (CCLL: http://donelaitis.vdu.lt/)
SLIDE 14
Top frequency
Particle gal (‘perhaps/maybe/whether’)
99th
CCLL:
gal 92; galbūt (‘maybe’) 388 turbūt (‘probably’) 758
SLIDE 15
Academic vs. News (f/1000)
0.05 0.03 REGIS (‘apparently’) 0.04 0.10
AKIVAIZDU (‘evidently’)
0.11 0.05 TURBŪT (‘probably’) 0.20 0.12 GALBŪT (‘maybe’) 0.71 0.11 GAL (‘perhaps’)
News LT CCLL Academic LT CorALit Epistemic Adverbials
SLIDE 16 LT: Academic vs. News (f/1000)
0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8
gal galbūt turbūt akivaizdu regis
Academic CorALit News CCLL
SLIDE 17 GAL (‘perhaps’) in CorALit
LT: Tokia klasifikacija gal ir nėra labai tiksli, tačiau šių šalių valdymo bruožai skiriasi (S) E: ‘‘Perhaps this classification is not very exact, but the features of management
- f those countries are different’ (S)
SLIDE 18
AKIVAIZDU (‘evidently’) in CorALit
LT: Šiuolaikinės istoriografijos istorijos kritinis potencialas, akivaizdu, nėra suprantamas kaip istorikų rūšiavimas į „baltuosius“ ir „juoduosius“. (H) E: ‘The critical potential of contemporary historiographical history is evidently not understood as sorting historians into “white” and “black”.’ (H)
SLIDE 19
REGIS (‘apparently’) in CorALit
LT: Tokia meno tradicijos samprata estetui, regis, buvo priimtiniausia. (H) E: ‘Such a concept of art tradition apparently was the most acceptable for the aesthete.’ (H)
SLIDE 20 BNC: Academic vs. News (f/1000)
0,05 0,1 0,15 0,2 0,25 0,3 0,35
perhaps maybe probably evidently apparently
Academic BNC News BNC
SLIDE 21 Academic LT: across science areas
0,02 0,04 0,06 0,08 0,1 0,12 0,14 0,16 0,18
gal galbūt turbūt akivaizdu regis
H S P B T
SLIDE 22 Specialized vocabulary (f/1000)
CorALit
sistemin(is/ė)
0.10 0.01
sistemišk(as/a)
0.01 0.00(260)
sisteming(as/a)
0.03 0.01
sistematin(is/ė)
0.00(9) 0.00 (43)
sistemating(as/a)
0.00 (9)
sistematišk(as/a)
0.00(1) 0.00 (21)
SLIDE 23
Across science areas (CorALit) (f/1000)
0.00
sistematišk(as/a) sistemating(as/a)
0.01 0.01 0.00
sistematin(is/ė)
0.01 0.02 0.05 0.03 0.02
sisteming(as/a)
0.00 0.00 0.02 0.01 0.01
sistemišk(as/a)
0.07 0.35 0.09 0.04 0.14
sistemin(is/ė) T P S H B ADJ + ADV
SLIDE 24
Term Bank: LT
sisteminė analizė, rinkmena sistemingasis nuokrypis sistemingoji paklaida, atranka
(http://terminai.vlkk.lt:10001/pls/tb/tb.search)
SLIDE 25
BNC academic (f/1000)
systemic
0.01
systematic
0.04
systematical
SLIDE 26
Specialized vocabulary (f/1000)
0.00 0.00 problematišk(as/a) 0.00 0.00 problematin(is/ė) 0.01 0.02 problemišk(as/a) 0.01 0.04 problemin(is/ė) CCLL CorALit
SLIDE 27
BNC academic (f/1000)
problematic
0.03
problematical
0.00
SLIDE 28 Concluding observations
# Academic discourse is neutral and
Author’s presence Internationalisation of specialised
terminology