Transcribing the Digital Archive of Southern Speech: Methods and - - PowerPoint PPT Presentation

transcribing the digital archive of southern speech
SMART_READER_LITE
LIVE PREVIEW

Transcribing the Digital Archive of Southern Speech: Methods and - - PowerPoint PPT Presentation

Transcribing the Digital Archive of Southern Speech: Methods and Preliminary Analysis Rachel Miller Olsen, Michael L. Olsen, Joseph A. Stanley & Margaret E.L. Renwick The University of Georgia SECOL 84 2 Introduction u Large-scale


slide-1
SLIDE 1

Transcribing the Digital Archive of Southern Speech: Methods and Preliminary Analysis

Rachel Miller Olsen, Michael L. Olsen, Joseph A. Stanley & Margaret E.L. Renwick The University of Georgia SECOL 84

slide-2
SLIDE 2

Introduction

2

u Large-scale transcribed audio corpora are available

u Buckeye Corpus, Santa Barbara Corpus, etc.

u How do these come to be? What’s the on-the-

ground process of building such a corpus?

u Here we discuss:

u Methods for large-scale transcription u Early data & analysis resulting from transcription

slide-3
SLIDE 3

Digital Archive of Southern Speech (DASS)

3

u 64 interviews u 2.5-10hrs, µ=5.75 u 372 hours of audio

slide-4
SLIDE 4

LAGS Protocols

4

u Pilot Study:

u 1031 words/spkr x 10 = 10,310 words à u Searchable time-aligned corpus of 132,000 words

slide-5
SLIDE 5

Transcribing DASS

5

u 35 undergraduate student workers u Each student worker is assigned one interview u One reel at a time u 408 reels/files, µ=54mins

slide-6
SLIDE 6

Transcriber

(Boudahmane et al. 1998–2008)

6

u Create & edit time-

aligned orthographic transcriptions

u Easy-to-use graphical

user interface

u .trs (native .xml) u trans.sourceforge.net

slide-7
SLIDE 7

Guidelines

7

u Transcriber protocols

(~25 pages)

u Phrase Dictionary u Two-phase listening u Daily files + Multiple

backups Codes Meaning {D: } Doubt {X} Unintelligible {C: } Comment {NW} Non-word (e.g. laugh, cough) {NS} Non-speech (e.g. dog barking)

slide-8
SLIDE 8

Workflow

8

Transcription (i.e. 2 listens) complete Spot-checked for consistency File conversion via LaBB-CAT scripts (Fromont & Hay 2012)

labbcat.sourceforge.net

.trs (.xml) à .txt .trs à .TextGrid Automatic phonetic analysis!

slide-9
SLIDE 9

Forced Alignment

9

u Forced-aligned with DARLA (Reddy & Stanford 2015)

slide-10
SLIDE 10

Phonetic Analysis

10

u Formant extraction: four different methods

u In-house Praat script (Boersma & Weenink 2016) u DARLA (Reddy & Stanford 2015) u out-of-the-box FAVE (Rosenfelder et al. 2011)

u based on ANAE means

u modified FAVE (Rosenfelder et al. 2011)

u based on Southern means

slide-11
SLIDE 11

11

slide-12
SLIDE 12

Preliminary Findings: Glide weakening

12

slide-13
SLIDE 13

Glide weakening (cont.)

13

slide-14
SLIDE 14

Observations

14

u Large-scale transcription

u Time to transcribe

u Estimated: 10:1; Reality:13:1

u Phonetic Analysis

u Comparison of formant measurements

u In-house Praat script no good u DARLA filtered out 53% u Too early to tell if FAVE modifications were better

slide-15
SLIDE 15

References

15

Boersma, Paul & David Weenink. 2016. Praat: Doing phonetics by computer [Computer program], Version 5.4.08. Retrieved from http://www.praat.org. Boudahmane, Karim, Mathieu Manta, Fabien Antoine, Sylvian Galliano & Claude Barras. 1998. Transcriber v. 1.5.2. http://trans.sourceforge.net/. Fromont, Robert & Jen Hay. 2012. LaBB-CAT. Proceedings of the Australasian Language Technology Workshop, vol. 10, 113–117. Dunedin, New Zealand. Gorman, Kyle, Jonathan Howell & Michael Wagner. 2011. Prosodylab-Aligner: A Tool for Forced Alignment of Laboratory Speech. Canadian Acoustics 39(3). 192–193. Kretzschmar, William A. 2011. Linguistic Atlas Project. www.lap.uga.edu. Labov, William, Ingrid Rosenfelder & Josef Fruehwald. 2013. One hundred years of sound change in Philadelphia: Linear incrementation, reversal, and reanalysis. Language 89(1). 30–65. Pederson, Lee, Susan L. McDaniel, & Carol M. Adams, eds. 1986-93. Linguistic Atlas of the Gulf States. 7 vols. Athens, GA: University of Georgia Press. Reddy, Sravana & James N. Stanford. 2015. Toward completely automated vowel extraction: Introducing DARLA. Linguistics Vanguard 1(1). 15–28. doi:10.1515/lingvan-2015-0002. Renwick, Margaret E.L. and Rachel M. Olsen. 2016. Voices of coastal Georgia. Proceedings of Meetings on Acoustics, 25, 60004. doi:10.1121/2.0000176. Rosenfelder, Ingrid, Joe Fruehwald, Keelan Evanini & Jiahong Yuan. 2011. FAVE (Forced Alignment and Vowel Extraction) Program Suite. http://fave.ling.upenn.edu.

slide-16
SLIDE 16

Thank you!

16

This work is supported by NSF grant #1625680 Automated Large-Scale Phonetic Analysis: DASS Pilot PIs: Drs. William Kretzschmar & Margaret Renwick.

slide-17
SLIDE 17

Discussion

17

u Great free software available. u Easy to use, even for novices. u Linguistic Atlas data has much to offer! u Large audio corpora can/should be built & can be

analyzed.

slide-18
SLIDE 18

Glide weakening

18

slide-19
SLIDE 19

Example Vowel Spaces

19

i i i i i i i i i i i æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ i i i i i i i i i i i u u u

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

æ æ æ æ æ æ æ æ æ æ ææ æ æ u u u u ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ eɪ eɪ eɪ eɪ eɪ eɪeɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪeɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ ɑ ɑ ɑ ɑɑɑ ɑ ɑ ɑ i i i i i i i i ɛ ɛ ɛ ɛ ɛ ɛ ɛ u u u u u u u u u u u u u u u ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ i i i i i i ieɪ eɪ eɪ ɛ ɛ ɛɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ

  • ʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ

ɑ ɑ ɑ ɑ ɑ ɑ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪɪ ɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ

ɑ ɑ ɑ ɑ ɑ ɑ ɪ ɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

eɪ eɪ eɪ eɪeɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ i i i i i i i i i i i eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊʊ ʊʊ ʊ ʊ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

u u u ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ u u u u u u u u u u u u u u u u u u u u u u u u u ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɪɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌʌʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ʊ ʊ ʊʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ

400 600 800 1000 1000 1500 2000 2500

F2 (Hz) F1 (Hz)

Speaker 195 (male, b. 1894, age 80)

æ æ æ æ æ

  • ʊ

ʊ ʊ ʊ ʊ ʊ ʊ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ i i ɑ ɑ ɑ

  • ʊ

æ u ɔ ɔ ɔ eɪ eɪ eɪ eɪ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ u ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɑ ɑ ɑ ɑ ɑ ɑ ɑ i i i i i eɪ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ ɑ ɑ ɑ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɪ eɪ eɪ eɪ eɪ eɪ eɪ

  • ʊ
  • ʊ
  • ʊ

eɪ eɪ eɪ i i i i i i eɪ ɑ ɑ ʊ ʊ ʊ ʊ ʊʊ ɛ ɛ ɛ

  • ʊ

u u u u u u u

  • ʊ

ɔ ɔ ɔ u u u u u u u ɛ ɛ ɛ ɛ ɪ ɪ ɪɪ ʌ ʌʌ ʌ ʌ ʌ ʌ ɔ ɛ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ i i i i i i i i ii ʊ ʊ ʊ ʊ ʊ ʊ ʊ ɛ ɛ

400 500 600 700 800 900 1000 1500 2000 2500

F2 (Hz) F1 (Hz)

Speaker 202 (female, b. 1919, age 55)

æ æ æ ʊ ʊ ʊ ʊ ɔ ʊ ɔ ʊ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ ɔ i i i ɑ ɑ

  • ʊ
  • ʊ

æ ɔ ɔ ɔ ɔ u eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ ɑ ɑ ɑ i i i i ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ u ɪ ɪ ɪ ɪ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ i i i eɪ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ æ æ æ æ æ æ æ ææ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɑ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ɪ ɪ ɪ ɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊoʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊoʊ oʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ɑ ɑ ɑ ɑ ɑ ɑ ɑ ɪ ɪ ɪ

  • ʊ
  • ʊ

eɪ eɪ eɪ eɪ eɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

eɪ eɪ eɪ eɪ i i i i i i i i ii i i i i i i i i i i eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ eɪ ɑ ɑ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊʊ ʊʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ɛ ɛ ɛ ɛ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

u uu u u u u

  • ʊ
  • ʊ

ɔ ɔ ɔ u u u u u u ɛ ɛɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ʌ ɔ ɛ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ ɪ i i i i i i i i i i i i ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ʊ ɛ ɛ

400 600 800 1000 1000 1500 2000 2500

F2 (Hz) F1 (Hz)

Speaker 200 (female, b. 1900, age 74)

æ æ æ æ æ æ æ ʊ ʊ ɔ ɔɔ ɔ ɔ ɔ ɔ ɔ u ɑ ɑ

  • ʊ
  • ʊ

æ æ æ ɔ ɔ ɔɔ ɔ ɔ u u eɪ eɪ eɪ eɪ eɪ ɑ ɑ i i ɛ ɪ ɪ ɪ ɑ ɑ i i eɪ ɛ ɛ ɛ ɛ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ æ ɑ ɑ

  • ʊ
  • ʊ
  • ʊ

ɪ ɪ ɪ ɪ

  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊoʊ
  • ʊ
  • ʊ
  • ʊ oʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊoʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ
  • ʊ

ɑ eɪ eɪ eɪ eɪ eɪ ɑ ɑ ɑ ɑ ʊ ʊ ʊ ʊʊ ʊ ʊ ɛ ɛ ɛ

  • ʊ
  • ʊ
  • ʊ

u u u u u u u u ɔ ɔ ɔ u u ɛ ɛ ɛ ɛ ɛ ɛɛ ɛ ɛ ɪ ɪ ɪ ʌ ʌ ʌ ʌ ɛ ɛ ɛ ɛ ɛ ɛ ɪ ɪ ɪ ɪ ɪ ɪ i i i i i i ʊ ʊ ʊ ʊ ʊ ɛ ɛ ɛ ɛ ɛ

300 500 700 900 1000 1500 2000 2500

F2 (Hz) F1 (Hz)

Speaker 201 (female, b. 1944, age 23)

slide-20
SLIDE 20

LAGS Speaker Area AK

20

u LAGS Protocols:

u 1031 tokens/spkr x 10 spkrs = 10,310 tokens

u Full transcription of interviews:

u Searchable time-aligned corpus of 132,000 words

slide-21
SLIDE 21

Linguistic Atlas of the Gulf States (LAGS)

21