Introduction to Dialectometry III
Wilbert Heeringa
German Academic Exchange Service – DAAD University of Bielefeld, Faculty of Linguistics and Literary Studies Frisian Academy
Abidjan, December, 19–23, 2016
1
Introduction to Dialectometry III Wilbert Heeringa German Academic - - PowerPoint PPT Presentation
Introduction to Dialectometry III Wilbert Heeringa German Academic Exchange Service DAAD University of Bielefeld, Faculty of Linguistics and Literary Studies Frisian Academy Abidjan, December, 1923, 2016 1 Topics Gabmap Literature 2
German Academic Exchange Service – DAAD University of Bielefeld, Faculty of Linguistics and Literary Studies Frisian Academy
Abidjan, December, 19–23, 2016
1
Topics
Gabmap Literature
2
3
What is Gabmap?
Doing dialect analysis on the web
freely distributed since 2004
2011.
4
What is Gabmap?
http://www.let.rug.nl/~kleiweg/L04/webapp
¸agri C ¸¨
http://www.gabmap.nl/ and maintained by Martijn Wieling.
5
6
Gabmap running on USB stick
https://github.com/pebbe/Gabmap-docker and which enables us to run Gabmap without internet connection.
based on the Linux kernel.
you to create and view dialect data tables.
7
How to boot from the USB stick
then right after this press F9 or F12 or ESC or ...;
then hold the Alt/Option key as soon as you hear the Macs startup chime.
3.0’.
username: guest password: guest
8
How to boot from the USB stick
support, ?)
9
10
11
Input
12
Input: map
names should be spelled exactly as in your data file!
13
Input: map
http://www.let.rug.nl/~kleiweg/L04/kml/manual.html and with Google Maps: http://coltekin.net/cagri/courses/leuven/
14
Input: map
via Google Maps.
15
16
17
18
19
20
21
Input: map
throughout the whole document.
by ‘<Polygon><outerBoundaryIs><LinearRing>’
by ‘</LinearRing></outerBoundaryIs></Polygon>’
22
Input: dialect data
https://westonruter.github.io/ipa-chart/keyboard/ for finding the Unicode characters.
23
24
25
Input: dialect data
encoded as Unicode (UTF-8 or UTF-16).
CSV (.csv)’ in the lower right corner of the window ‘Save’.
26
27
28
Input: dialect data
UTF-16) and the tab as separator.
29
30
31
32
33
Input: dialect data
categorical data.
34
35
Input: dialect data
36
Input: feature definition file
and allows that:
37
Input: feature definition file
the weight is 0.3.
primary stress, secondary stress, minor (foot) group, major (intonation) group, syllable break, linking (absence of a break).
However, be careful when changing IPA.def.
38
Running Gabmap
39
40
41
42
Literature (1)
Goebl, H. (1982). Dialektometrie; Prinzipien und Methoden des Einsatzes der numerischen Taxonomie im Bereich der Dialektgeographie. Wien: Verlag der ¨
Goebl, H. (1984). Dialektometrische Studien anhand italoromanischer, r¨ atoromanischer und galloromanischer Sprachmaterialien aus AIS und ALF. Volume 1. (Volumes 2 and 3 contain maps and tables). T¨ ubingen: Max Niemeyer. Goebl, H. (2010a). Dialectometry and quantitative mapping. In Language and Space. An International Handbook of Linguistic Variation. Volume 2: Language Mapping. Handb¨ ucher zur Sprach- und Kommunikationswissenschaft [HSK], edited by Alfred Lameli, Roland Kehrein and Stefan Rabanus, 30.2, 433–457, 2201–2212. Berlin: de Gruyter Mouton. Goebl, H. (2010b). Dialectometry: Theoretical prerequisites, practical problems, and concrete applications (mainly with examples drawn from the “Atlas Linguistique de La France”, 1902–1910). Dialectologia. Special Issue, I(2010): 63–77. Goebl, H. (2006). Recent Advances in Salzburg Dialectometry. Literary and Linguistic Computing 21(4), 411–435.
43
Literature (2)
Gooskens, Ch, Beijering, K. and Heeringa, W. (2008). Phonetic and lexical predictors of intelligibility. International Journal of Humanities and Arts Computing, 2(1–2): 63–81. Gooskens, Ch. and Heeringa W. (2004). Perceptive Evaluation of Levenshtein Dialect Distance Measurements using Norwegian Dialect Data. Language Variation and Change, 16(3), 189–207. Heeringa, W. (2004). Measuring dialect pronunciation differences using Levenshtein distance. Phd thesis, University of Groningen. Heeringa, W., Kleiweg, P., Gooskens, Ch. and Nerbonne, J. (2006). Evaluation of String Distance Algorithms for Dialectology. In Linguistic Distances Workshop at the joint conference of International Committee
edited by John Nerbonne and Erhard Hinrichs, 51–62. Stroudsburg PA: The Association for Computational Linguistics (ACL). Kessler, B. (1995). Computational dialectology in Irish Gaelic. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, 60–67, Dublin. EACL.
44
Literature (3)
Kruskal, J B. (1999). An overview of sequence comparison. In Time Warps, String edits, and Macromolecules. The Theory and Practice of Sequence Comparison edited by D. Sankoff, and J. Kruskal, 2nd ed., 1–44. Stanford: Center for the Study of Language and Information. 1st edition appeared in 1983. Levenshtein, V. I. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Cybernetics and Control Theory, 10(8): 707–710. Leinonen, T., C ¸¨
¸. and J. Nerbonne (2016): Using Gabmap. Lingua 178(2016), special issue on Linguistic Infrastructure edited by Jan Odijk, 71–83. Nerbonne, J. (2009). Data-driven dialectology. Language and Linguistics Compass 3(1), 175–198. Nerbonne, J. (2010). Mapping aggregate variation. In A. Lameli, R. Kehrein och S. Rabanus (eds.), Language and Space Vol. 2. Language Mapping. Berlin: De Gruyter, 476–495. Nerbonne, J., Colen, R., Gooskens, Ch., Kleiweg, P. and Leinonen, T. (2011). Gabmap – a web application for dialectology. Dialectologia: revista electr`
edited by John Nerbonne, Stef Grondelaers, Dirk Speelman & Maria-Pilar Perea, 65–89.
45
Literature (4)
Nerbonne, J. & W. Heeringa (2010). Measuring dialect differences. In J. E. Schmidt and P. Auer (eds.), Language and Space Vol. 1. Theories and Methods. Berlin: De Gruyter, 550–567. Snoek, C. (2014). Review of Gabmap: Doing Dialect Analysis on the Web. Language Documentation and Conservation 8, 192–208. Spruit, M.R., Heeringa, W. and Nerbonne, J. (2009). Associations among Linguistic Levels. In Lingua, special issue on The Forests behind the Trees, edited by John Nerbonne and Franz Manni, 119(11): 1624–1642. Wieling, M. (2012). A Quantitative Approach to Social and Geographical Dialect Variation. PhD dissertation, University of Groningen. Wieling, M., Bloem, J., Mignella, K., Timmermeister, M. and Nerbonne, J. (2014). Measuring foreign accent strength in English. Validating Levenshtein Distance as a Measure.Language Dynamics and Change 4(2): 253–269. Wieling, M., Margaretha, E. and Nerbonne, J. (2012). Inducing a measure of phonetic similarity from dialect
46
Literature (5)
Wieling, M., Proki´ c, J. and Nerbonne, J. (2009). Evaluating the Pairwise String Alignments of Pronunciations. In Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH SHELT&R 2009) EACL Workshop edited by Lars Borin and Piroska Landvai, 18–25.
47
Final remarks
48
49
50