Fuori dalla torre di Babele: interoperabilità e sistemi grafici




slide-1
SLIDE 1

Paolo Monella

Fuori dalla torre di Babele: interoperabilità e sistemi grafici pre-moderni

Out of the Tower of Babel: interoperability and pre-modern writing systems

December 4, 2019

slide-2
SLIDE 2

The interoperability issue

slide-4
SLIDE 4

The interoperability issue

  • uenenū
slide-5
SLIDE 5

The interoperability issue

  • uenenū

Diplomatic

  • Historical documentation
  • Visualization
slide-6
SLIDE 6

The interoperability issue

  • uenenū
slide-7
SLIDE 7

The interoperability issue

  • uenenū
  • venenum
slide-8
SLIDE 8

The interoperability issue

  • uenenū
  • venenum
  • Processing
  • Search
  • Collation
  • NLP (lemma, PoS etc.)
  • Statistics (dist. reading)
slide-9
SLIDE 9

The interoperability issue

  • uenenū
  • venenum

venenum

  • Processing
  • Search
  • Collation
  • NLP (lemma, PoS etc.)
  • Statistics (dist. reading)
slide-13
SLIDE 13

The interoperability issue

  • My focus: European Medieval handwriting
  • Pre-Gutenberg

Handwriting

Early print → imitating handwriting

  • Alphabetic writing systems

Latin script (Italian, English...), Greek, Cyrillic...

No Cuneiform, Arabic, Chinese etc.

slide-14
SLIDE 14

a b c d e f g h i j l m n o p q r s t u v z . , ; : !

a b c d e f g h i l m n o p q r s t u z · ;

MS A MS B

The interoperability issue

slide-15
SLIDE 15

a b c d e f g h i j l m n o p q r s t u v z . , ; : !

a b c d e f g h i l m n o p q r s t u z · ;

OCR from Teubner

The interoperability issue

OCR from Loeb

slide-16
SLIDE 16

Out of the Tower

slide-17
SLIDE 17

Unicode: the short way out (as TEI says)?

  • Solution for new digital texts
  • Not enough for pre-modern writing systems

Ligatures

  • & (U+0026; ASCII 38)
  • Have I encoded that it is equivalent to “e + t” in that MS?

Allographs

  • ſ (U+017F) / s (U+0073; ASCII 115)
  • Have I encoded that they are variants of grapheme <s>?

Grapheme set

  • u (U+0075; ASCII 117)
  • Have I encoded whether it “covers” (or not) <u> and <v>?
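
The three questions above can be made concrete with a minimal sketch using Python's standard `unicodedata` module. It only checks what Unicode compatibility normalization (NFKC) does with the code points quoted on this slide; the constant and helper names are illustrative assumptions, not part of the talk.

```python
import unicodedata

# Code points quoted on the slide.
AMPERSAND = "\u0026"  # &
LONG_S = "\u017f"     # ſ
ROUND_S = "\u0073"    # s
LETTER_U = "\u0075"   # u


def nfkc(text: str) -> str:
    """Apply Unicode compatibility normalization (NFKC)."""
    return unicodedata.normalize("NFKC", text)


# Unicode records long s as a compatibility variant of s ...
long_s_folds_to_s = nfkc(LONG_S) == ROUND_S
# ... but it does not say that '&' stands for 'e' + 't' in a given manuscript,
ampersand_unchanged = nfkc(AMPERSAND) == AMPERSAND
# ... nor whether 'u' also covers what a modern edition prints as 'v'.
u_unchanged = nfkc(LETTER_U) == LETTER_U
```

Only the first relation is available from the character standard; the other two would have to be stated by the encoder for each manuscript.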

slide-18
SLIDE 18

Project-specific solutions

  • Disposable home-made solutions

TEI: theory-agnostic

Diplomatic: Modelling pre-modern writing systems

Normalized: Normalization models / software

slide-19
SLIDE 19

Diplomatic/normalized: the surrender?

  • venenum
  • uenenū

Diplomatic Normalized

  • Processing
  • Search
  • Collation
  • NLP (lemma, PoS etc.)
  • Statistics (distant reading)...
  • Historical documentation
  • Visualization
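
The two-layer approach on this slide can be sketched as a token that carries both forms. This is a minimal illustration under assumptions: the `Token` class and the `search` helper are hypothetical names, not the data model of any specific project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    diplomatic: str  # what the manuscript shows (documentation, visualization)
    normalized: str  # what search, collation and NLP tools expect


def search(tokens: list[Token], query: str) -> list[Token]:
    """Return the tokens whose normalized layer equals the query."""
    return [t for t in tokens if t.normalized == query]


tokens = [Token(diplomatic="uenenū", normalized="venenum")]
hits = search(tokens, "venenum")
```

The link between the two layers is stored word by word here, which is exactly the limitation the slide title calls a surrender: nothing in the data says why `ū` corresponds to `um`.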
slide-20
SLIDE 20

Modelling: the (long) way out

  • Scholarly discussion on modelling
  • Documentation on project-specific modelling

formal (data models, software code, tables)

prose

  • Shared models
  • Reusable software libraries
slide-21
SLIDE 21

In a nutshell

slide-22
SLIDE 22

In a nutshell

  • The interoperability issue
  • Interoperability through modelling
  • Open issues
slide-23
SLIDE 23

A structuralist approach to digital modelling for pre-modern writing systems

slide-24
SLIDE 24

Modelling

slide-25
SLIDE 25

Modelling

  • co̊paraƐur uł adſe uładalium
  • Comparatur vel ad se vel ad alium
slide-27
SLIDE 27

Modelling

  • co̊paraƐur uł adſe uładalium
  • Comparatur vel ad se vel ad alium

Digital modelling

slide-28
SLIDE 28

Modelling

  • co̊paraƐur uł adſe uładalium
slide-30
SLIDE 30

System / text

  • co̊paraƐur uł adſe uładalium

Syntagmatic (text, process) Paradigmatic (langue, system)

slide-31
SLIDE 31

System / text

  • co̊paraƐur uł adſe uładalium

Text System

<z> <y> <x> <t> <s>

slide-32
SLIDE 32

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities

slide-33
SLIDE 33

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities

“If [we are] given anything…, it is the as yet unanalyzed text in its undivided and absolute integrity” (“deduction”, Prol. Ch. 4)

slide-34
SLIDE 34

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-35
SLIDE 35

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

Digital modelling

slide-36
SLIDE 36

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

Text

<z> <y> <x> <t> <s>

Entities Analysis

slide-37
SLIDE 37

Modelling/“analysis”: entities

  • co̊paraƐur uł adſe uładalium

Text System

<z> <y> <x> <t> <s>

Entities Analysis

slide-38
SLIDE 38

Graphemes as entities?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-39
SLIDE 39

Graphemes as entities?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

“Thus the same linguistic form may also be manifested in writing… Here is a graphic ‘substance’… Describing the actually present expression... system” (Prol. Ch. 21)

slide-40
SLIDE 40

Graphemes as entities?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-41
SLIDE 41

Graphemes as entities?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Abbreviations
slide-42
SLIDE 42
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Abbreviations

Graphemes as entities?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-43
SLIDE 43
  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

paradigm

Chains and paradigms

chain

slide-44
SLIDE 44
  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

paradigm

Chains and paradigms

chain “chain”→sequence?

slide-45
SLIDE 45

Functions

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function

slide-47
SLIDE 47

Graphemes/allographs

slide-48
SLIDE 48

The commutation test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

slide-49
SLIDE 49

«σ»

The commutation test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «√»

slide-50
SLIDE 50

«σ»

Function Substitution: → No change in “meaning” daƐur / daσur

The commutation test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function Commutation: → Change in “meaning” annus / annys

«√»
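
The criterion on this slide can be sketched as a small function. It is only an illustration under assumptions: `classify` and `MEANING` are hypothetical names, and the labels `word-1` to `word-3` are placeholders that only record whether two written forms count as the same word, not actual glosses.

```python
def classify(form_a: str, form_b: str, meaning: dict[str, str]) -> str:
    """Apply the slide's criterion to a pair of forms differing in one sign.

    Same "meaning"      -> "substitution" (the two signs are variants).
    Different "meaning" -> "commutation"  (the two signs are invariants).
    """
    if meaning[form_a] == meaning[form_b]:
        return "substitution"
    return "commutation"


# Placeholder labels (assumption): equal labels mean "same word".
MEANING = {
    "daƐur": "word-1",
    "daσur": "word-1",
    "annus": "word-2",
    "annys": "word-3",
}

t_shapes = classify("daƐur", "daσur", MEANING)
u_versus_y = classify("annus", "annys", MEANING)
```

In practice the `meaning` table is the scholar's judgement; the function only makes explicit how that judgement separates allographs from graphemes.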

slide-51
SLIDE 51

Substitution: → No change in “meaning”

«σ»

The commutation test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Commutation: → Change in “meaning” Variants Invariants

«√»

slide-52
SLIDE 52

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<t> <s>

Variants Invariants

Invariant → <grapheme> <t> (class, paradigm, syncretism)

Variants → «allographs» «σ» «Ɛ» «√» (components, members)

«√»

slide-53
SLIDE 53

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<t> <s>

Variants Invariants

«√»

Grapheme   Allographs
t          σ | Ɛ | √
u          u | v
z          z
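
The grapheme/allograph table above can be written as a lookup, sketched here in Python. The symbols σ, Ɛ and √ are the stand-ins used on the slides for the manuscript's shapes of t; the names `ALLOGRAPHS`, `TO_GRAPHEME` and `graphemes` are assumptions made for this example.

```python
# Grapheme -> its allographs, copied from the table on the slide.
ALLOGRAPHS = {
    "t": ["σ", "Ɛ", "√"],
    "u": ["u", "v"],
    "z": ["z"],
}

# Inverted table: allograph -> grapheme.
TO_GRAPHEME = {
    allograph: grapheme
    for grapheme, allographs in ALLOGRAPHS.items()
    for allograph in allographs
}


def graphemes(text: str) -> str:
    """Replace each allograph with its grapheme; leave other signs untouched."""
    return "".join(TO_GRAPHEME.get(sign, sign) for sign in text)


same_word = graphemes("daƐur") == graphemes("daσur")
```

Because the table is data rather than code, it can be documented and shared per manuscript, which is the kind of formal model the talk argues for.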

slide-54
SLIDE 54
  • Digital (discrete)
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Ligatures
  • Abbreviations

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√» «σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Variants Invariants

«√»

slide-57
SLIDE 57

Punctuation

slide-58
SLIDE 58

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Punctuation

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

“content-form” (Prol. Ch. 13)

«√»

slide-59
SLIDE 59

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Punctuation

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Including larger units, such as sentences “content-form” (Prol. Ch. 13)

«√»

slide-60
SLIDE 60

Substitution: → No change in “meaning”

Commutation: → Change in “meaning”

Punctuation

Including larger units, such as sentences

“content-form” (Prol. Ch. 13)

  • Truly I tell you, today you will be with me in paradise
  • Truly I tell you today, you will be with me in paradise

slide-62
SLIDE 62

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√»

Punctuation

  • co̊paraƐur uł adſe uładalium ·

<z> <y> <x> <t> <s> <.> «σ» «√»

slide-63
SLIDE 63

Substitution: → No change in “meaning”

<z> <y> <x> «√» <z> <y> <x>

Commutation: → Change in “meaning”

«√»

Punctuation

  • co̊paraƐur uł adſe uładalium ·

<z> <y> <x> <t> <s> <.> «σ» «√»

  • Digital (discrete)
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Ligatures
  • Abbreviations
slide-65
SLIDE 65

Ligatures

slide-66
SLIDE 66

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function “Ligature” (entities, parts of a chain)

slide-67
SLIDE 67

«√»

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «σ»

Varieties

slide-68
SLIDE 68

«√»

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «σ»

Varieties: Solidal variants

slide-69
SLIDE 69

«√»

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «σ»

Varieties: Solidal variants

<seg type="lig">tu</seg>    "lig"/tu → Ɛ+Ц → t+u
<g ref="ligTU"/>            "ligTU" → Ɛ+Ц → t+u
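
Both encodings on this slide resolve to the same two graphemes, t + u. A minimal sketch of that resolution with Python's standard `xml.etree.ElementTree` follows; the `LIGATURES` table, the `expand` helper and the wrapping `<w>` element are assumptions made for the example, not a prescribed TEI processing model.

```python
import xml.etree.ElementTree as ET

# Hypothetical project table: ligature identifier -> component graphemes.
LIGATURES = {"ligTU": ["t", "u"]}


def expand(fragment: str) -> str:
    """Return the grapheme sequence of a word containing ligature markup."""
    root = ET.fromstring(f"<w>{fragment}</w>")  # wrap so the fragment parses
    parts = [root.text or ""]
    for element in root:
        if element.tag == "g":
            # <g ref="..."/> is empty: look the components up in the table.
            parts.append("".join(LIGATURES[element.get("ref")]))
        else:
            # <seg type="lig"> already contains the component letters.
            parts.append(element.text or "")
        parts.append(element.tail or "")
    return "".join(parts)


from_seg = expand('compara<seg type="lig">tu</seg>r')
from_g = expand('compara<g ref="ligTU"/>r')
```

With `<seg>` the components live in the text itself; with `<g>` they live in a separate table, which then has to be published alongside the edition.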

slide-70
SLIDE 70

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Varieties: Solidal variants

  • Digital (discrete)
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Ligatures
  • Abbreviations
slide-72
SLIDE 72

Abbreviations

slide-73
SLIDE 73

Abbreviations

  • co̊paraƐur uł adſe uładalium
slide-74
SLIDE 74

Abbreviations: one grapheme?

  • co̊paraƐur uł adſe uładalium

<ō> <ô> <o> <p> <q>

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
slide-75
SLIDE 75

Abbreviations: one grapheme?

  • co̊paraƐur uł adſe uładalium

<ō> <ô> <o> <p> <q>

Principles: simplicity, economy, reduction (Prol. Ch. 6)

“lowest possible number of elements” (Ch. 13)
slide-76
SLIDE 76

Abbreviations: function

  • co̊paraƐur uł adſe uładalium

Function (entities)

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
slide-77
SLIDE 77

Abbreviations: function → ligature?

  • co̊paraƐur uł adſe uładalium

Function “Ligature” (Solidal variants)

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
slide-78
SLIDE 78

Abbreviations: function → ligature?

  • co̊paraƐur uł adſe uładalium

Function “Ligature” (Solidal variants)

Solidal: both mandatory

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
slide-79
SLIDE 79

Abbreviations: function → selection?

  • co̊paraƐur uł adſe uładalium

Function Selection:

  • one is optional
  • one governs the other

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selection (one governs the other)
slide-80
SLIDE 80

Abbreviations: function → selection?

  • co̊paraƐur uł adſe uładalium

Function Selection:

  • one is optional
  • one governs the other

“chain”→sequence?

slide-81
SLIDE 81

Abbreviations: function → complementarity?

  • co̊paraƐur uł adſe uładalium

Function Complementarity (interdependence in a system)

  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selection (one governs the other)
  • 2C. Complementarity (interdependence)
slide-82
SLIDE 82

Abbreviations: function → complementarity?

  • co̊paraƐur uł adſe uładalium

Function Complementarity (interdependence in a system)

Example of complementarity: [case + gender + number]

Alt-us      nom + masc + sing
Alt-orum    gen + masc/neut + plur
Alt-arum    gen + fem + plur

slide-83
SLIDE 83

Abbreviations: function → complementarity?

  • co̊paraƐur uł adſe uładalium

<m> <l> <n> <p> <q> <zero> <~> <¯> <^>

slide-84
SLIDE 84

Abbreviations

  • co̊paraƐur uł adſe uładalium
  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selection (one governs the other)
  • 2C. Complementarity (interdependence)
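
Option 2 treats an abbreviation as two entities: a base letter and a mark, where the mark may also be absent (the `<zero>` of the previous slide). A minimal sketch of that split, assuming the marks are encoded as Unicode combining characters, can use canonical decomposition (NFD) from the standard `unicodedata` module. The function name `base_and_marks` is an assumption for this example.

```python
import unicodedata


def base_and_marks(text: str) -> list[tuple[str, str]]:
    """Split text into (base letter, combining marks) pairs.

    An empty string in the second position plays the role of <zero>.
    """
    pairs: list[tuple[str, str]] = []
    for char in unicodedata.normalize("NFD", text):
        if unicodedata.combining(char) and pairs:
            base, marks = pairs[-1]
            pairs[-1] = (base, marks + char)  # attach mark to preceding base
        else:
            pairs.append((char, ""))
    return pairs


pairs = base_and_marks("uenen\u016b")  # "uenenū", with precomposed ū
```

The split itself says nothing about which of 2A, 2B or 2C describes the relation between base and mark; it only exposes the two entities so that the relation can be modelled.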
slide-85
SLIDE 85

Abbreviations

  • co̊paraƐur uł adſe uładalium
  • Digital (discrete)
  • Allographs
  • (Capitalization)
  • Grapheme set
  • (Punctuation)
  • Ligatures
  • Abbreviations
slide-87
SLIDE 87

Open issues

slide-88
SLIDE 88

Issues: 1. Abbreviations → functions

  • co̊paraƐur uł adſe uładalium
  • 1. One entity
  • Whole abbreviation = grapheme (invariant)
  • 2. Two entities (function)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selection (one governs the other)
  • 2C. Complementarity (interdependence)
slide-89
SLIDE 89

Issues: 2. Abbreviations → one:many

  • co̊paraƐur
  • co̊paraσur

1:1

slide-90
SLIDE 90

Issues: 2. Abbreviations → one:many

  • co̊paraƐur
  • comparaσur

2:2 (2 ≠ 2)

slide-91
SLIDE 91

Issues: 2. Abbreviations → one:many

  • p̄positio
  • praepositio

2:4

slide-92
SLIDE 92

Issues: 2. Abbreviations → one:many

  • ꝑfecta
  • perfecta

1:3

slide-93
SLIDE 93

Issues: 2. Abbreviations → one:many

  • ꝑfecta
  • perfecta

1:3

Alphabemes (alphabetical letters) Graphemes
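
The ratios on the last few slides can be reproduced by counting signs on each side of an alignment. The sketch below counts Unicode code points, which is an assumption: it matches the slide figures only if the abbreviation marks are encoded as separate combining characters. `ALIGNMENTS` and `ratio` are illustrative names.

```python
# Abbreviated segment -> expanded segment, taken from the slides above.
ALIGNMENTS = [
    ("o\u030a", "om"),    # o with combining ring above vs "om"
    ("p\u0304", "prae"),  # p with combining macron vs "prae"
    ("\ua751", "per"),    # p with stroke through descender vs "per"
]


def ratio(abbreviated: str, expanded: str) -> tuple[int, int]:
    """Count Unicode code points on each side of one alignment."""
    return (len(abbreviated), len(expanded))


ratios = [ratio(short, long) for short, long in ALIGNMENTS]
```

Even where the counts are equal, as in the first pair, the signs do not correspond one to one, so an alignment has to be stored per segment rather than per character.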

slide-94
SLIDE 94

Issues: 3. Ligatures

(syntagmatic)

slide-95
SLIDE 95

Issues: 3. Ligatures

& (U+0026; ASCII 38)

slide-96
SLIDE 96

Issues: 3. Ligatures

Historical/“etymological” considerations

& (U+0026; ASCII 38)

slide-98
SLIDE 98

Issues: 4. Grapheme definition

(paradigmatic)

.

Full stop Abbreviation mark

slide-99
SLIDE 99

Issues: 5. Allograph definition (metasemiology)

slide-101
SLIDE 101

«σ»

  • co̊paraƐur uł adſe uładalium

Allographs (variants)

«√»

Issues: 5. Allograph definition (metasemiology)

slide-102
SLIDE 102

«σ»

Issues: 5. Allograph definition (metasemiology)

  • co̊paraƐur uł adſe uładalium

«√» «√» « » √¼ « √ » «σ» «σ» « σ » «σ» «σ»

slide-103
SLIDE 103

Open issues

  • 1. Abbreviations → functions
  • 2. Abbreviations → one:many
  • 3. Ligatures
  • 4. Grapheme definition
  • 5. Allograph definition (metasemiology)
slide-104
SLIDE 104

In a nutshell

slide-105
SLIDE 105

In a nutshell

  • The interoperability issue
  • Interoperability through modelling
  • Open issues