Between complex predicates and regular phrases: German collocational - - PowerPoint PPT Presentation
Between complex predicates and regular phrases: German collocational - - PowerPoint PPT Presentation
Between complex predicates and regular phrases: German collocational clusters Philippa Cook Freie Universit at Berlin 21st HPSG conference SUNY at Buffalo, 28th - 29th August 2014 Between complex predicates and regular phrases: German
Between complex predicates and regular phrases: German collocational clusters
Outline
Introduction to the notion of collocational cluster Statistical Evidence for the collocational tuple Analysis Function Composition for collocational selection Lexical Entries Modifier-collocation-cluster Schema Extensions and Ramifications Conclusion Appendix Analysis of light-verb-phrases without FC More on the LogDice association measure
Philippa Cook 2014, Freie Universit¨ at Berlin 1/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
multiple-fronting strings 1
(1)
[weltweit] worldwide [f¨ ur Aufregung] for upset sorgt provides eine an Werbekompagne advertising-campaign von from Benetton1 Benetton ‘A benetton advertising campaign is causing international concern’
◮ What is the status of the string before the finite verb ?
1http://woodz.schwarzwaelder-bote.de/alltag/lifestyle/8422-benetton-zieht-
kuss-foto-von-papst-zurueck.html, checked 21st Aug. 2014
Philippa Cook 2014, Freie Universit¨ at Berlin 2/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
multiple-fronting strings 2
(2)
[Heftig] heavy [in into die the Kritik] criticism geriet fell der the Kostenrechnungsbericht finance report des the Jugendamtes youth service f¨ ur for 20022 2002
‘The youth service’s 2002 financial report got slated’
◮ What is the status of the string before the finite verb ?
2COSMAS, RHZ03/SEP.09166 Rhein-Zeitung, 12.09.2003 Philippa Cook 2014, Freie Universit¨ at Berlin 3/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
multiple-fronting strings 3
(3)
[Richtig] right [Geld] money wird is nur
- nly
im in Briefgesch¨ aft letter.business verdient3 earned ‘You can only make real money with letters’
3taz 28./29.10.2000, p. 5, taken from M¨
uller (2005)
Philippa Cook 2014, Freie Universit¨ at Berlin 4/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
multiple-fronting strings 3
(3)
[Richtig] right [Geld] money wird is nur
- nly
im in Briefgesch¨ aft letter.business verdient3 earned ‘You can only make real money with letters’
◮ Do (1) - (3) involve a violation of the V2 constraint ? ◮ Or could we have a single constituent in the front field ? ◮ If a single constituent, what kind of structure exactly ?
(cf. (M¨ uller, 2003, 2005) for an analysis as a VP-constituent with empty head, I provide an alternative)
3taz 28./29.10.2000, p. 5, taken from M¨
uller (2005)
Philippa Cook 2014, Freie Universit¨ at Berlin 4/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
My Proposal: a chunk I call a ”collocational cluster”
◮ Strings such as heftig in die Kritik (heavy into the criticism)
- etc. are collocational clusters (”chunks”)
◮ collocation = a relationship of co-occurrence between
words/lemmas (cf. Firth (1957), Sinclair (1991), Evert (2008))
◮ I introduce a schema for building such collocational clusters in
the syntax, inspired by Function Composition
Philippa Cook 2014, Freie Universit¨ at Berlin 5/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
Further evidence for proposing collocational clusters
◮ The material in the cluster prefers to permute (scramble)
together rather than individually (4)
weil because heftig heavy in Kritik in criticism der the Bericht report geriet fell ‘because the report got slated’
(5) ?? weil heftig der Bericht in Kritik geriet (6) ?? weil in Kritik der Bericht heftig geriet
Philippa Cook 2014, Freie Universit¨ at Berlin 6/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
The notion of tuples in the collocational cluster4
Actually there is a third element involved in the collocational cluster that we haven’t yet mentioned
◮ the Verb
3see Stubbs (1995)/Stubbs (2001)’s notion of inter-collocation, Zinsmeister
and Heid (2003)’s tuples, also Almela (2011), Almela et al. (2013) on non-binary collocations and the notion of co-collocations
Philippa Cook 2014, Freie Universit¨ at Berlin 7/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
The notion of headword in lexicographic approaches to collocation
Wortprofil (Didakowski and Geyken (2013)) and Sketch Engine (Kilgarriff et al. (2014)) incorporate grammatical relations into collocation measures and have a notion of head:-
◮ In a modifier-noun collocation, the noun is headword ◮ In a verb-object collocation, the verb is headword ◮ relevant for the analysis
Philippa Cook 2014, Freie Universit¨ at Berlin 8/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster
The notion of tuples in the collocational cluster
richtig Geld verdienen [lit. real money earn] ’to make heaps’ has three collocations of interest
- 1. (richtigadj + GeldN) = ’real’ as a modifier of ’money’
- 2. (GeldN + verdienenV ) = ’money’ as an object of ’earn’
- 3. (richtigadv + verdienenV ) = ’really’ as a modifier of ’earn’
◮ note that adjectives like richtig can be used as adverbs with no
difference in form in German. This ambiguity is presumably a driving factor behind collocation clusters
Philippa Cook 2014, Freie Universit¨ at Berlin 9/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster Statistical Evidence for the collocational tuple
Statistical measures for the collocational tuple
Philippa Cook 2014, Freie Universit¨ at Berlin 10/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster Statistical Evidence for the collocational tuple
Association Measures for richtig Geld verdienen ’really make money’
◮ Using the LogDice measure (see Rychl´
y (2008) for the formula and advantages, see Evert (2008) for association measurement in general
◮ offered via the Wortprofil tool of the DWDS corpus5 item association using LogD frequency richtig as modifier of Geld 5.07 241 Geld as object of verdienen 11.51 22226 richtig as modifier of verdienen 6.09 332
5accessible at www.dwds.de, approx. 1.8 billion tokens, Wortprofil links collocational information to syntactic functions, cf. Didakowski and Geyken (2013) Philippa Cook 2014, Freie Universit¨ at Berlin 11/46
Between complex predicates and regular phrases: German collocational clusters Introduction to the notion of collocational cluster Statistical Evidence for the collocational tuple
Association Measures for heftig in (die) Kritik geraten ’get slated ’and international f¨ ur Aufregung sorgen ’cause international outrage’
association using LogD frequency heftig as modifier of Kritik 11.12 9882 Kritik as object of geraten 9.27 2453 heftig as modifier of geraten 5.8 174 association using LogD frequency weltweit as modifier of Aufregung 3.51 16 Aufregung as object of sorgen 9.13 3774 international as modifier of sorgen 4.41 107
Philippa Cook 2014, Freie Universit¨ at Berlin 12/46
Between complex predicates and regular phrases: German collocational clusters Analysis Function Composition for collocational selection
Function Composition for collocational selection
Philippa Cook 2014, Freie Universit¨ at Berlin 13/46
Between complex predicates and regular phrases: German collocational clusters Analysis Function Composition for collocational selection
Forward and Backward Function Composition in CCG
(7) Forward Function Composition A/B ∗ B/C = A/C (8) Backward Function Composition B\A ∗ C\B = C\A
Philippa Cook 2014, Freie Universit¨ at Berlin 14/46
Between complex predicates and regular phrases: German collocational clusters Analysis Function Composition for collocational selection
Forward and Backward Function Composition in CCG
(7) Forward Function Composition A/B ∗ B/C = A/C (8) Backward Function Composition B\A ∗ C\B = C\A
◮ A/B combines by FC with B/C yielding A/C; a category
requiring a C in order to be complete
◮ The need for a C at the initial level is postponed to the next
level.
◮ Backwards FC similarly postpones saturation of A to the next
level
◮ cf. Argument Inheritance, Hinrichs and Nakazawa (1994) and
Jacobson (1990) for Raising
Philippa Cook 2014, Freie Universit¨ at Berlin 14/46
Between complex predicates and regular phrases: German collocational clusters Analysis Function Composition for collocational selection
The spirit of backwards FC for collocational cluster formation
◮ The intensifier richtig collocationally selects the verb
verdienen (which it also modifies): richtig\verdienen (cf. B\A)
◮ The bare noun Geld collocationally selects richtig. Geld is
headword of modifier-noun string: Geld\richtig (cf. C\B)
Philippa Cook 2014, Freie Universit¨ at Berlin 15/46
Between complex predicates and regular phrases: German collocational clusters Analysis Function Composition for collocational selection
The spirit of backwards FC transferred to collocational cluster formation
◮ Backwards FC combines richtig + Geld, postponing the
requirement for verdienen GeldC \ verdienenA richtigB \ verdienenA GeldC \ richtigB
Philippa Cook 2014, Freie Universit¨ at Berlin 16/46
Between complex predicates and regular phrases: German collocational clusters Analysis Lexical Entries
Lexical Entries for richtig, Gas and geben
◮ I will exemplify with richtig Gas geben [lit. real Gas give]
’increase effort’
◮ richtig ’real(ly)’ ◮ Gas ’Gas’ ◮ geben ’give’ Philippa Cook 2014, Freie Universit¨ at Berlin 17/46
Between complex predicates and regular phrases: German collocational clusters Analysis Lexical Entries
Lexical entry for the intensifier richtig ’really’
word phon
- richtig
- ss|loc
cat head mod V lid 4
- lid
richtig-intensifier
- subcat
cont intensify 4
- coll|lid 4 geben-idiomatic
◮ lid (lexical identifier) feature appropriate for the sort head
identifies specific instantiations of words (Richter and Sailer (1999); Soehn (2004); Sag (2012); Spencer (2005))
◮ coll feature encodes in the lexical entry that it collocates
with a particular word (cf. Sailer (2003), Richter and Sailer (1999)), appropriate for the sort word and cluster = collocational selection
◮ The intensifier is lexically encoded as a verb modifier and it
collocates with that verb
◮ I could generalize the lexical entry so that the intensifier
Philippa Cook 2014, Freie Universit¨ at Berlin 18/46
Between complex predicates and regular phrases: German collocational clusters Analysis Lexical Entries
Lexical entry for Gas in the light verb phrase use
word phon Gas ss|loc cat head lid Gas-idiomatic subcat spr
-
cont index non-referential coll|lid richtig-intensifier
◮ The idiomatic bare noun collocates with the intensifier richtig ◮ The idiomatic noun is non-referential and cannot take a
specifier (must be saturated)
◮ A parallel lexical entry in which the value of coll|lid is
gebenidio handles occurrences of the light verb phrase Gas geben without richtig
Philippa Cook 2014, Freie Universit¨ at Berlin 19/46
Between complex predicates and regular phrases: German collocational clusters Analysis Lexical Entries
Lexical entry for the light verb geben ’give’
word phon
- geben
- ss|loc
- cat
- head
- lid geben-idiomatic
- subcat
np-nom 5
- cont
- rels
- increase-effort
agent
5
- coll|lid Gas-idiomatic
◮ Constitutes a new treatment of light verb phrases: Light verb
collocationally selects the (athematic) object but selects the subject NP via subcat
◮ It cannot undergo personal passive in the idiomatic use
Philippa Cook 2014, Freie Universit¨ at Berlin 20/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
Overview of what is to come
- 1. The composition of modifier-collocational-clusters using
the modifier-collocational-schema e.g. richtig Gas [lit. really gas]
- 2. The composition of the modifier-collocational-cluster e.g.
richtig Gas with the idiomatic verb geben
Philippa Cook 2014, Freie Universit¨ at Berlin 21/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
Recall the spirit of backwards FC
Gas \ geben = mod-coll-cluster non-head-dtr richtig \ geben head-dtr Gas \ richtig
◮ The modifier has to ’wait’ until it finds the verb it modifies ◮ What I analyze as a mod-coll-cluster is therefore not a type of
head-adjunct-structure
◮ This makes sense because the tuple is more like one complex
- lexeme. richtig-intensifier is not a normal modifier but
somewhere between modifier and argument (= a collocational modifier)
Philippa Cook 2014, Freie Universit¨ at Berlin 22/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
Modifier-collocational-cluster Schema
modifier-coll-cluster → ss loc|cat|head mod 4 lid 1
- coll|lid 4
cont 6 non-hd-dtr ss|loc|cat head|mod 4 lid 2
- coll|lid 4
hd-dtr ss|loc|cat head|lid 1 spr
- coll|lid 2
cont 6
◮ hd-dtr, e.g. Gas, coll. selects the non-hd-dtr, e.g. richtig ◮ non-hd-dtr coll. selects the verb it modifies ◮ mthr inherits coll|lid and mod values from non-hd-dtr
= postponement
◮ mthr inherits cont value of the hd-dtr; cf. Semantics
Principle
◮ The mth inherits lid value from hd-dtr; cluster is a special
version of the lexeme Gas-idiomatic
Philippa Cook 2014, Freie Universit¨ at Berlin 23/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
Sub-tree for ’richtig Gas’ licensed by the mod-coll-cl schema
modifier-coll-cluster phon richtig gas ss|loc|cat|head mod V lid 4
- lid
Gas-idiomatic
- cont
index non-referential coll|lid
4 geben-idiomatic
Adj
word phon richtig ss|loc cat head mod V
- lid 4
- lid
richtig-intensifier
- subcat
cont intensify 4
- coll|lid 4 geben-idiomatic
richtig-intensifier N
word phon Gas ss|loc cat head lid Gas-idiomatic subcat spr
-
cont
- index non-referential
- coll|lid richtig-intensifier
Gas-idiomatic
Philippa Cook 2014, Freie Universit¨ at Berlin 24/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
How the verb geben combines with the collocational cluster richtig Gas
◮ The mod-coll-cluster richtig Gas, headed by Gas,
collocationally selects the (idiomatic) verb geben but it also selects it via mod
◮ The mod-coll-cluster and the verb can combine via the
head-adjunct-schema
◮ The idiomatic semantics of the verb (hd-dtr) percolate to
the mthr and the (postponed) semantic modification of the verb can apply
Philippa Cook 2014, Freie Universit¨ at Berlin 25/46
Between complex predicates and regular phrases: German collocational clusters Analysis Modifier-collocation-cluster Schema
Sub-tree for cluster richtig Gas and geben
phrase phon
- richtig gas geben
- ss|loc
- cat
- head
- lid geben-idio
- subcat
np-nom 5
- cont
- rels
- increase-effort
agent
5
- coll
-
mod-coll-Cl
mod-coll-cluster phon richtig gas ss|loc|cat|head mod V lid 4
- lid
Gas-idiomatic
- cont
- index non-referential
- coll|lid
4 geben-idiomatic
richtig Gas V
word phon geben ss|loc
- cat
- head
lid geben-idiomatic subcat np-nom 5
- cont
- rels
- increase-effort
agent
5
- coll|lid Gas-idiomatic
geben-idiomatic
Philippa Cook 2014, Freie Universit¨ at Berlin 26/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Extensions and Ramifications
Philippa Cook 2014, Freie Universit¨ at Berlin 27/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Collocation Tuples: more examples for which I propose a similar analysis
◮ The tuples vary in degrees of schematicity and form a
continuum from full idioms to near-compositional phrases
◮ Details of the analysis will vary for lexical strings of different
degrees of schematicity
Licht ins Dunkel bringen ’bring light into the dark = shed light onto sth.’ richtig Gas geben ’really give Gas = increase effort’ hart ins Gericht gehen ’go hard into court = roast s.o.’ ihm zur Seite stehen ’stand by him’ am billgsten in XP kommen ’get to X the cheapest (way)’ trocken durch XP kommen ’come dry through X’ postiv/negativ auf XP wirken ’react positively/negatively to X’
Philippa Cook 2014, Freie Universit¨ at Berlin 28/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Topicalization of collocational clusters
◮ Analogous to fronting of coherent verbal clusters ◮ Alternative analysis for (some) multiple fronting constructions
(9)
[richtig right Gas] gas gibt gives er he immer always ‘he always gives his all’
(10)
[zu schlafen to sleep versucht] try hat has er he ‘he tried to sleep’
zu schlafen versuchtverbal−cluster richtig Gasmod−coll−cluster
Philippa Cook 2014, Freie Universit¨ at Berlin 29/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Extension of the analysis to (free) Datives
(11)
[Ihm] he-DAT [zur Seite] to.the side steht stands als as stellvertretender acting Vorstandschef ceo Gerd Gerd Tenzer6 Tenzer ‘Gerd Tenzer is helping him out as acting CEO’
◮ I treat the dative as a benefactive modifier, addable to the
arg-struc of any verb in German
◮ The dative is also possessor of the noun Seite; i.e. also a
modifier introducable for any noun
6taz, 18.07.2002, S. 7, taken from M¨
uller (2005)
Philippa Cook 2014, Freie Universit¨ at Berlin 30/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Seite \ stehen ihm \ stehen Seite \ ihm
Philippa Cook 2014, Freie Universit¨ at Berlin 31/46
Between complex predicates and regular phrases: German collocational clusters Extensions and Ramifications
Extension of the light verb analysis (appendix) to Integrated objects and subjects cf. (Jacobs, 1993, 1999)
◮ Verb-close objects e.g. der Nachwelt hinterlassen ’left to the
afterlife’, Zeitung lesen ’newspaper read’ (semi-arguments/part of predicate)
◮ accounts for lower syntactic mobility of the verb-adjacent
argument alone and for increased mobility of the cluster
◮ captures that there is no intervention of modifier/negation
between integrated objects and the verbs
Philippa Cook 2014, Freie Universit¨ at Berlin 32/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
Conclusion 1
Significance for ideas about constituency and usage/frequency and the modifier-argument distinction
◮ We know there is a close relation between frequently
co-occurring elements and standard constituents but we must also capture units beyond those standardly acknowledged up to now
◮ Collocationally selected modifiers are situated inbetween
arguments and true modifiers
◮ The availability of collocationally selected items seems to
generalize to form a pattern,to provide a slot fillable by material of a certain grammatical class (cf. Dowty (2003))
Philippa Cook 2014, Freie Universit¨ at Berlin 33/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
Conclusion 2
Advantages of this analysis for HPSG and specifically for the analysis of German
◮ Interfaces usage data and a usage-based view of ’constituency’
with the HPSG formalism, cf. Bybee and Cacoullos (2009); Beckner and Bybee (2009); Bod (1998)
◮ Captures analogy between verb clusters (cluster – chunk) and
these (non-standard) constituents in German - is this something specific to German ?
◮ Offers a syntactic solution for handling Integration of nouns
and PPs as discussed in (Jacobs, 1993, 1999) [also subject-V integration too]
Philippa Cook 2014, Freie Universit¨ at Berlin 34/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
Almela, Mois´
- es. 2011. The case for
verb-adjective collocations: corpus-based analysis and lexicographical treatment. Revista de Ling¨ u´ ıstica y Lenguas Aplicada 6, 39 – 51. Almela, Mois´ es, Cantos, Pascual and S´ anchez, Aquilino. 2013. Collocation, Co-collocation, Constellation ... Any Advances in Distributional Semantics? Procedia Social and Behavioral Sciences 5th International Conference on Corpus Linguistics (CILC2013)(95), 231 – 240. Beckner, Clay and Bybee, Joan. 2009. A usage-based account of constituency and reanalysis. Language Learning 59, 27–46. Bod, Rens. 1998. Beyond Grammar. An Experience-Based Theory of
- Language. csli: CSLI.
Bybee, Joan L. and Cacoullos, Rena Torres. 2009. The role of prefabs in grammaticization: How the particular and the general interact in language change. In Hamid Ouali Roberta L. Corrigan, Edith A. Moravcsik and Kathleen Wheatley (eds.), Formulaic Language: Volume 1. Distribution and historical change, pages 187 – 217, Amsterdam: John Benjamins. Didakowski, J¨
- rg and Geyken,
- Alexander. 2013. From DWDS
corpora to a German Word Profile
Philippa Cook 2014, Freie Universit¨ at Berlin 34/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
– methodological problems and
- solutions. In Network Strategies,
Access Structures and Automatic Extraction of Lexicographical
- Information. 2nd Work Report of
the Academic Network ¨ Internet Lexicography”, OPAL - Online publizierte Arbeiten zur Linguistik X/2012, pages 43 – 52, Mannheim: Institut f¨ ur deutsche Sprache. Dowty, Davis. 2003. The Dual Analysis of Adjuncts and Complements in Categorial
- Grammar. In Ewald Lang, Claudia
Maienborn and Cathrine Fabricius-Hansen (eds.), Modifying Adjuncts, pages 1 – 22, de Gruyter. Evert, Stefan. 2008. Corpora and
- Collocations. In Anke L¨
udeling and Merja Kyt¨
- (eds.), Corpus
- Linguistics. An International
- Handbook. Article 58, pages 1212
– 1248, Berlin: Mouton de Gruyter. Firth, J. R. 1957. A synopsis of linguistic theory 1930–55. In Studies in Linguistic Analysis, pages 1 – 32, Oxford: The Philological Society. Hinrichs, Erhard W. and Nakazawa,
- Tsuneko. 1994. Linearizing AUXs
in German Verbal Complexes. In John Nerbonne, Klaus Netter and Carl J. Pollard (eds.), German in Head-Driven Phrase Structure Grammar, CSLI Lecture Notes,
- No. 46, pages 11–38, Stanford:
CSLI Publications. Jacobs, Joachim. 1993. Integration. In
Philippa Cook 2014, Freie Universit¨ at Berlin 34/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
Marga Reis (ed.), Wortstellung und Informationsstruktur, page 63–116, T¨ ubingen: Niemeyer. Jacobs, Joachim. 1999. Informational
- Autonomy. In Robert van
der Sandt Peter Bosch (ed.),
- Focus. Linguistic, Cognitive and
Computational Perspectives, pages 56–81, Cambridge University Press. Jacobson, Pauline. 1990. Raising as Function Composition. Linguistics and Philosophy 13, 423–475. Kilgarriff, Adam, Baisa, V´ ıt, Buˇ sta, Jan, Jakub´ ıˇ cek, Miloˇ s, Kov´ aˇ r, Vojtˇ ech, Michelfeit, Jan, Rychl´ y, Pavel and Suchomel, V´ ıt. 2014. The Sketch Engine: ten years on. Lexicography pages 1–30. M¨ uller, Stefan. 2003. Mehrfache
- Vorfeldbesetzung. Deutsche
Sprache 31(1), 29–62. M¨ uller, Stefan. 2005. Zur Analyse scheinbarer V3-S¨
- atze. In
Franz-Josef d’Avis (ed.), Deutsche Syntax: Empirie und Theorie. Symposium G¨
- teborg 13–15 Mai
2004, G¨
- teborger Germanistische
Forschungen, No. 46, pages 173–194, Acta Universitatis Gothoburgensis. Richter, Frank and Sailer, Manfred.
- 1999. LF Conditions on Expressions
- f Ty2: An HPSG Analysis of
Negative Concord in Polish. In Robert D. Borsley and Adam Przepi´
- rkowski (eds.), Slavic in
Head-Driven Phrase Structure
Philippa Cook 2014, Freie Universit¨ at Berlin 34/46
Between complex predicates and regular phrases: German collocational clusters Conclusion
Gram- mar, pages 247 – 282, Stanford: CSLI. Rychl´ y, Pavel. 2008. A Lexicographer-Friendly Association
- Score. In RASLAN 2008, pages
6–9, Brno: Masarykova Univerzita. Sag, Ivan. 2012. Sign-based Construction Grammar: An informal Synopsis. In Hans Boas and Ivan Sag (eds.), Sign-based Construction Grammar, pages 69 – 202, Stanford: CSLI. Sailer, Manfred. 2003. Combinatorial Semantics and Idiomatic Expressions in Head-Driven Phrase Structure Grammar. Dissertation (2000), Eberhard-Karls-Universit¨ at T¨ ubingen, Working Papers of the SFB 340, Vol. 161. Sinclair, John. 1991. Corpus, Concordance, Collocation. Oxford: Oxford university Press. Soehn, Jan-Philipp. 2004. License to Coll: How to Bind Bound Words and Readings to their Contexts. In Stefan M¨ uller (ed.), Proceedings of the 11th International Conference
- n Head-Driven Phrase Structure
Grammar, Center for Computational Linguistics, Katholieke Universiteit Leuven, pages 261 – 273, Stanford: CSLI. Spencer, Andrew. 2005. Generalized Paradigm Function Morphology – A synopsis. In Alexandra Galani and Beck Sinar (eds.), York Papers in Linguistics, volume 2 of Papers from the York-Essex Morphology
Philippa Cook 2014, Freie Universit¨ at Berlin 34/46
Between complex predicates and regular phrases: German collocational clusters Appendix Analysis of light-verb-phrases without FC
Meeting, 2003, pages 93 – 106, University of York, UK. Stubbs, M. 2001. Words and Phrases: Corpus Studies of Lexical
- Semantics. Language in Society,
Oxford: Blackwell Wiley. Stubbs, Michael. 1995. Collocations and Semantic Profiles: On the Cause of the Trouble with Quantitative Studies. Functions of Language 2(1), 23 – 55. Zinsmeister, Heike and Heid, Ulrich.
- 2003. Significant triples: Adjective
+ noun + verb combinations. In Proceedings of the 7th Conference
- n Computational Lexicography
and Text Research, COMPLEX 2003, pages 92 – 101, Budapest, Hungary.
Philippa Cook 2014, Freie Universit¨ at Berlin 35/46
Between complex predicates and regular phrases: German collocational clusters Appendix Analysis of light-verb-phrases without FC
Composition of Gas + geben as a light-verb-coll-cl
phrase phon
- gas geben
- ss|loc
- cat
- head
- lid geben-idio
- subcat
np-nom 5
- cont
- rels
- increase-effort
agent
5
-
N
word phon
- Gas
- ss|loc
cat head
- lid Gas-idio
- subcat
spr
-
cont index non-referential coll|lid geben-idiomatic
Gas-idiomatic V
word phon
- geben
- ss|loc
- cat
- head
- lid geben-idiomatic
- subcat
np-nom 5
- cont
- rels
- increase-effort
agent
5
- coll|lid Gas-idiomatic
geben-idiomatic
Philippa Cook 2014, Freie Universit¨ at Berlin 35/46
Between complex predicates and regular phrases: German collocational clusters Appendix Analysis of light-verb-phrases without FC
Schema for light-verb-collocational-clusters
integrated-object-phrase → ss|loc|cat head|lid 2 subcat
- 4
- non-hd-dtr
ss|loc|cat head|lid 1 spr
- coll|lid 2
hd-dtr ss|loc|cat head|lid 2 subcat
- 4
- coll|lid 1
◮ dtrs collocationally select each other, viz. 1 and 2 ◮ This schema also licenses integrated objects, e.g Zeitung lesen
’read the paper’ (Jacobs, 1993, 1999) and (with some changes) PP-light verb phrases
◮ mthr inherits idiomatic semantics from hd-dtr in keeping
with the Semantics Principle
◮ mthr lexeme inherits lid value from hd-dtr, viz.2
Philippa Cook 2014, Freie Universit¨ at Berlin 36/46
Between complex predicates and regular phrases: German collocational clusters Appendix More on the LogDice association measure
Features of LogDice Measure Rychl´ y (2008)
◮ Theoretical maximum is 14, in case when all occurrences of X
co-occur with Y and all occurrences of Y co-occur with X. Usually the value is less then 10.
◮ Value 0 means there is less than 1 co-occurrence of XY per
16,000 X or 16,000 Y
◮ We can say that negative values means there is no statistical
significance of XY collocation.
◮ Comparing two scores, plus 1 point means twice as often
collocation, plus 7 points means roughly 100 times frequent collocation.
◮ The score does not depend on the total size of a corpus. The
score combine relative frequencies of XY in relation to X and Y
Philippa Cook 2014, Freie Universit¨ at Berlin 37/46
Between complex predicates and regular phrases: German collocational clusters Appendix More on the LogDice association measure
Association Measures for richtig Bischof ¨ ubernehmen ’really take over the bishop’
A lexical string which intuitively does not include the collocate triple yields low association measures, as expected
association using LogD frequency richtig as modifier of Bischof 0.11 6 Bischof as object of ¨ ubernehmen 0.21 7 richtig as modifier of ¨ ubernehmen 1.28 13
Philippa Cook 2014, Freie Universit¨ at Berlin 38/46