Harnessing Bayesian phylogenetics to test a Greenbergian universal
Gerhard Jäger1 Ramon Ferrer-i-Cancho2
Tübingen University1 Universitat Politècnica de Catalunya2
52nd Annual Meeting of the Societas Linguistica Europaea
Leipzig, August 21, 2019
Harnessing Bayesian phylogenetics to test a Greenbergian universal - - PowerPoint PPT Presentation
Harnessing Bayesian phylogenetics to test a Greenbergian universal Gerhard Jger 1 Ramon Ferrer-i-Cancho 2 Tbingen University 1 Universitat Politcnica de Catalunya 2 52nd Annual Meeting of the Societas Linguistica Europaea Leipzig, August
Gerhard Jäger1 Ramon Ferrer-i-Cancho2
Tübingen University1 Universitat Politècnica de Catalunya2
52nd Annual Meeting of the Societas Linguistica Europaea
Leipzig, August 21, 2019
1 / 30
2 / 30
But: Dryer (1992)
3 / 30
1 2 1 3 2 1
multiple authors, ... (Ferrer-i-Cancho and Liu, 2014)
4 / 30
V N Adj N Adj
1 1 3 1
D=6 V N Adj N Adj
2 1 4 1
D=8 V N Adj N Adj
1 1 2 1
D=5 V N Adj N Adj
1 1 2 1
D=5
V1 Vmed Vfin AdjN NAdj
D=8 V N Adj N Adj
2 1 4 1
V N Adj N Adj
1 1 3 1
D=6
DDm provides functional motivation for Universal 17 and its mirror image.
5 / 30
NAdj AdjN V1 Vfin Vmed Vfin Vmed V1 V1 Vmed Vfin AdjN AdjN AdjN NAdj NAdj NAdj
6 / 30
NAdj AdjN V1 V1 Vfin Vfin Vmed Vmed V1 Vmed Vfin AdjN AdjN AdjN NAdj NAdj NAdj
7 / 30
V1 Vmed Vfin NAdj AdjN 8 / 30
same pattern, these are not two independent data points ⇒ we need to control for phylogenetic dependencies
(from Dunn et al., 2011) 9 / 30
“If the A-distribution for a given typology cannot be as- sumed to be stationary, a distributional universal cannot be discovered on the basis of purely synchronic statistical data.” “In this case, the only way to discover a distributional universal is to estimate transition probabilities and as it were to ‘predict’ the stationary distribution on the basis
10 / 30
11 / 30
Markov process
12 / 30
Markov process Phylogeny
12 / 30
Markov process Phylogeny Branching process
12 / 30
13 / 30
Markov model
13 / 30
14 / 30
Independent model Dependent model
Adj-N N-Adj Vmed V1
VVfin
VN-Adj/Vmed N-Adj/V1
VN-Adj/Vfin
VAdj-N/Vmed Adj-N/V1
VAdjN/Vfin
V15 / 30
16 / 30
17 / 30
CTMC trees1 data1 trees2 data2 trees3 data3 trees4 data4 trees1 data1 trees2 data2 trees3 data3 trees4 data4 CTMC4 CTMC3 CTMC2 CTMC1
lineage-specific universal
18 / 30
CTMC trees1 data1 trees2 data2 trees3 data3 trees4 data4 trees1 data1 trees2 data2 trees3 data3 trees4 data4 CTMC4 CTMC3 CTMC2 CTMC1 trees1 data1 trees2 data2 trees3 data3 trees4 data4 CTMC4 CTMC3 CTMC2 CTMC1 hyper-parameter
lineage-specific universal hierarchical
18 / 30
distribution f
cross-family variation → can be
trees1 data1 trees2 data2 trees3 data3 trees4 data4 CTMC4 CTMC3 CTMC2 CTMC1 hyper-parameter
19 / 30
distribution f
cross-family variation → can be
trees1 data1 trees2 data2 trees3 data3 trees4 data4 CTMC4 CTMC3 CTMC2 CTMC1 hyper-parameter
19 / 30
for each lineage
Principle)
distribution
20 / 30
21 / 30
Independent model Dependent model
Adj-N N-Adj Vmed V1
VVfin
VN-Adj/Vmed N-Adj/V1
VN-Adj/Vfin
VAdj-N/Vmed Adj-N/V1
VAdjN/Vfin
V1In the abstract we reported the opposite conclusion, but there we used a non-hierarchical universal model.
22 / 30
17
0.00 0.25 0.50 0.75 1.00
0.00 0.25 0.50 0.75 1.00
23 / 30
groups:
1 negative or no correlation
Nuclear Macro-Je, Mande, Siouan, Pama-Nyungan, Austronesian, ...
2 positive correlation
Uto-Aztecan, Afro-Asiatic, Indo-Euroean, Dravidian, Austroasiatic, Otomanguean, ...
0.0 0.5
correlation lineages
word order correlation: lineage-wise posterior distribution
24 / 30
groups:
1 negative or no correlation
Nuclear Macro-Je, Mande, Siouan, Pama-Nyungan, Austronesian, ...
2 positive correlation
Uto-Aztecan, Afro-Asiatic, Indo-Euroean, Dravidian, Austroasiatic, Otomanguean, ...
0.00 0.25 0.50
correlation
25 / 30
0.0 0.2 0.4
correlation
26 / 30
27 / 30
Pama-Nyungan
N-Adj/Vmed N-Adj/V1
V
N-Adj/Vfin
V
Adj-N/Vmed Adj-N/V1
V
AdjN/Vfin
V
Austroasiatic
N-Adj/Vmed N-Adj/V1
V
N-Adj/Vfin
V
Adj-N/Vmed Adj-N/V1
V
AdjN/Vfin
V
28 / 30
29 / 30
noun-adjective order
30 / 30
Matthew S. Dryer. The Greenbergian word order correlations. Language, 68(1):81–138, 1992. Michael Dunn, Simon J. Greenhill, Stephen Levinson, and Russell D. Gray. Evolved structure of language shows lineage-specific trends in word-order universals. Nature, 473(7345): 79–82, 2011. Ramon Ferrer-i-Cancho and H. Liu. The risks of mixing dependency lengths from sequences of different length. Glottotheory, (5):143–155, 2014. Joseph Greenberg. Some universals of grammar with special reference to the order of meaningful elements. In Universals of Language, pages 73–113. MIT Press, Cambridge, MA, 1963. Harald Hammarström, Robert Forkel, Martin Haspelmath, and Sebastian Bank. Glottolog 2.7. Max Planck Institute for the Science of Human History, Jena, 2016. Available online at http://glottolog.org, Accessed on 2017-01-29. Martin Haspelmath, Matthew S. Dryer, David Gil, and Bernard Comrie. The World Atlas of Language Structures online. Max Planck Digital Library, Munich, 2008. http://wals.info/.
Elena Maslova. A dynamic approach to the verification of distributional universals. Linguistic Typology, 4(3):307–333, 2000. Mark Pagel and Andrew Meade. Bayesian analysis of correlated evolution of discrete characters by reversible-jump Markov chain Monte Carlo. The American Naturalist, 167(6): 808–825, 2006. Søren Wichmann, Eric W. Holman, and Cecil H. Brown. The ASJP database (version 17). http://asjp.clld.org/, 2016. 30 / 30