What is a written word? And if so, how many?
Martin Evertz-Rittich | University of Cologne
/gʁafematik/ Grapholinguistics in the 21st century | 17.06.2020
What is a written word? And if so, how many? Martin Evertz-Rittich | - - PowerPoint PPT Presentation
What is a written word? And if so, how many? Martin Evertz-Rittich | University of Cologne / gafematik / Grapholinguistics in the 21st century | 17.06.2020 Outline 1. Defining the written word in alphabetical writing systems 2. Properties of
Martin Evertz-Rittich | University of Cologne
/gʁafematik/ Grapholinguistics in the 21st century | 17.06.2020
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Part I
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
(e.g. Coulmas 1999, 550; Jacobs 2005, 22; Fuhrhop 2008, 193f.)
(1) A graphematic word is a string of graphemes that is bordered by spaces and may not be interrupted by spaces. Problems:
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
(Zifonun et al. 1997, 259; my translation)
(1) A graphematic word is a string of graphemes that is bordered by spaces and may not be interrupted by spaces. (2) A graphematic word is a string of graphemes that is preceded by a space and may not be interrupted by spaces. Problems:
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
classes (Bredel 2009)
quotation marks, exclamation marks
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Evertz (2016a, 391-392 based on works of Bredel; my translation)
(3) A graphematic word is a sequence of slot-filler-pairs surrounded by empty slots in which at least one filler must be a letter. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 m
h e r
n
a w!
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Evertz (2016a, 391-392)
graphematic word
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Part II
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
phonology and graphematics are hierarchically organized
hierarchy is composed of one
immediately lower category
(cf. Nespor & Vogel 1986, 7)
Word level Foot level Syllable level Grapheme level Segmental level Feature level
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
(4) A graphematic word consists of at least one graphematic foot. (5) A graphematic foot consists of at least one graphematic syllable.
formedness constraints of syllables and feet
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Evertz (2016b)
(6) Content words must have more than two letters.
(e.g. Cook 2004, 57)
graphematic minimal weight (it must be bimoraric)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
and feet (5-6) are violable
Mr., Mrs., vs., Dr.
BA, MA, no.
Part III
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Fuhrhop (2008), Fuhrhop & Peters (2013), Evertz (2016a)
corresponds to the morphological or syntactical word in German
empty slots in between
spaces morphologically
spaces syntactically
wohlgeraten ‘great, outstanding‘
wohl geraten ‘probably guessed‘
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
length (cf. Sanchez-Stockhammer 2018)
avoid ambiguity
corresponds to the syntactic and morphological word
Part IV
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
useful for (most of) alphabetical writing systems
the definition in (3) cannot apply
spoken languages or because of certain features of these writing systems
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
syllable
(Chen 1996, 46)
characters in length (token frequency; Lexicon of Common Words in
Contemporary Chinese Research Team, 2008)
80% (Li 1977)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Coulmas (2003, 59)
Zhōngguó zhè jǐ nián de biànhuà díquè hěn dà China these several years GEN change really very big ‘China underwent big changes during the past several years‘
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Hoosain (1992), Chen (1996), Packard (2000, 2015)
register and dialect
Chinese (Hoosain 1992, 120; Chen 1996, 46)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
monomorphematic, thus words and characters were almost congruent (Hoosain 1992, 119; Li et al. 2015, 232)
imported from the West at the beginning of the twentieth century (Packard, 1998)
(Packard 2000)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Li et al. (2015, 232-233)
variability in alphabetic languages
word segmentation might occur is significantly reduced in Chinese
challenge in Chinese than in English (given English had no empty slots)
efficient reading in Chinese
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Chinese, but did not interfere with reading in adult readers
(Inhoff et al. 1997; Bai et al. 2008)
inserting a space before a word did not facilitate processing and in fact may even interfere with its integration into sentential meaning as indicated by total reading times
(Li & Shen, 2013; Liu & Li, 2014)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
e.g. Joyce & Masuda (2018)
and kanji
such as postpositions and inflectional endings
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
easily differentiate between content and grammatical elements (Joyce & Masuda 2016)
(Rogers 2005, 66)
serve as effective segmentation cues to signal word boundaries (Sainio et al. 2007)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Shibatani (1990, 129), Rogers (2005, 66) K = kanji, hg = hiragana, kk = katakana, rom = Roman
K hg kk hg K hg rom hg
Hanako wa a no biru de hatari- i- te- i- ru
su Hanako topic that building at work- ing OL is ‘Hanako is an OL (office lady) working in that building‘
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Sainio et al. (2007)
reading texts written exclusively in syllabic kana…
kana and kanji
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
congruent
are therefore unnecessary
Part V
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
be defined in alphabetical writing systems
graphematic hierarchy
syntactic word
spacing because of linguistic features or because they already have cues to word boundaries that make spacing unnecessary
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Oberfläche und Performanz. Tübingen, 117–135.
Chinese text: Evidence from eye movements. Journal of Experimental Psychology: Human Perception and Performance 34, 1277–1287.
Journal of Speech, Language and Hearing 1(1), 43-54.
(Mass.).
Domahs (eds), Laut – Gebärde – Buchstabe. Berlin/ New York, 377-397.
Research 5(1), 1–23.
für Sprachwissenschaft 27, 189–228.
Language processing in Chinese. Amsterdam, 111-130.
Chinese sentence reading. Memory & Cognition 33, 1345-1356.
multi-script nature of the Japanese writing system. Presentation given at ‘Understanding writing systems: From core issues to implications for written language acquisition’ – 10th International Workshop on Written Language and Literacy, 12–13 May, Radboud University, Nijmegen, The Netherlands.
word processing. In H. Pae, (ed.) Writing Systems, Reading Processes, and Cross-Linguistic Influences. Reflections from the Chinese, Japanese and Korean Languages. Amsterdam, 179-200.
text: The case of the Japanese script. Vision Research 41, 2503–2510.
Chinese reading. In Alexander Pollatsek & Rebecca Treiman (Eds.), Oxford library of psychology. The Oxford handbook of reading (p. 232–244). Oxford University Press.
in Chinese reading. Journal of Research in Reading 36(S1), S64–S77.
Evidence from eye movements. British Journal of Psychology 105, 57–68.
formation: Morphology, phonology and the lexicon in modern and ancient Chinese. Berlin, 1-34.
Cambridge.
(eds.), The Oxford Handbook of Chinese Linguistics, 263-274.
Blackwell.
reading Japanese: An eye movement study. Vision Research 47, 2575–2584.
Language). Cambridge.
Berlin/New York.
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
(Bredel 2009)
same class. Examples: <abc-def>, <abc>
marks, exclamation marks
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
syllabification
the onsets of syllables
̯ t] vs. *[ˈtiː.raːɐ ̯ t]
unto each other
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Fuhrhop (2008, 224)
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Fuhrhop (2008, 193)
er fängt mit dem Schreiben an he starts with the.DAT writing PTCL ‘he starts writing‘
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Sanchez-Stockhammer (2018, 352), my emphasis
Hyphenated Adverb (well-nigh) Verb (chain-smoke)
Open
Hyphenated
Solid Accuracy: 61%-80.7% depending on corpus
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Danvivathana (1981, 269), Smyth (2014, 1-2), Kasisopa et al. (2016, 72)
commas or full stops
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Slayden (2010)
will appear on the second consonant of such a cluster
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
(Kohsom & Gobet,1997)
consonants may be used as cues to word boundaries
(Reilly et al. 2005, Kasisopa et al. 2016)
fixation) that makes opportunistic use of available statistical cues to the location of words and their centers
(Kasisopa et al. 2016, 80)
characters assist in directing Thai readers to an optimal viewing position just left of word center
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
words
writing system
/gʁafematik/ | Martin Evertz-Rittich | 17.06.2020
Denis (2016): Eye movements while reading an unspaced writing system: The case of Thai. Vision Research 86, 71–80.
English: Effects on Reading. Proceedings of the Cognitive Science Society, 19, 388–393.
(2005): Comparing reading in English and Thai: The role of spatial word unit segmentation in distributed processing and eye movement control. Paper presented to ECEM 13. University of Bern, 13–18 August, 2005.
http://www.thai-language.com/ref/breaking-words (retrieved 17.06.2020).