Fuori dalla torre di Babele: interoperabilit e sistemi grafjci - - PowerPoint PPT Presentation

fuori dalla torre di babele interoperabilit e sistemi
SMART_READER_LITE
LIVE PREVIEW

Fuori dalla torre di Babele: interoperabilit e sistemi grafjci - - PowerPoint PPT Presentation

Paolo Monella Fuori dalla torre di Babele: interoperabilit e sistemi grafjci pre-moderni Out of the Tower of Babel : interoperability and pre-modern writjng systems December 4, 2019 In a nutshell In a nutshell Digital scholarly


slide-1
SLIDE 1

Paolo Monella

Fuori dalla torre di Babele: interoperabilità e sistemi grafjci pre-moderni

Out of the Tower of Babel : interoperability and pre-modern writjng systems December 4, 2019

slide-2
SLIDE 2

In a nutshell

slide-3
SLIDE 3

In a nutshell

  • Digital scholarly editjons: diplomatjc/normalized
  • Hjelmslev’s “analysis” as digital modelling for pre-modern

writjng systems

  • Open issues
  • Interoperability through modelling
slide-4
SLIDE 4

My focus

  • European Medieval handwritjng
  • Pre-Gutenberg

Handwritjng

Early print → imitatjng handwritjng

  • Alphabetjc writjng systems

Latjn script (Italian, English...), Greek, Cyrillic...

No Cuneiform, Arabic, Chinese etc.

slide-5
SLIDE 5

Diplomatjc/normalized

slide-6
SLIDE 6

Diplomatjc/normalized

slide-7
SLIDE 7

Diplomatjc/normalized

  • uenenū
slide-8
SLIDE 8

Diplomatjc/normalized

  • uenenū

Diplomatic

  • Historical documentation
  • Visualization
slide-9
SLIDE 9

Diplomatjc/normalized

  • venenum
  • uenenū

Diplomatic

  • Historical documentation
  • Visualization
slide-10
SLIDE 10

Diplomatjc/normalized

  • venenum
  • uenenū

Diplomatic Normalized

  • Processing
  • Search
  • Collation
  • NLP (lemma, PoS etc.)
  • Statistics (distant reading)...
  • Historical documentation
  • Visualization
slide-11
SLIDE 11

Diplomatjc/normalized

slide-12
SLIDE 12

Diplomatjc/normalized

slide-13
SLIDE 13

Diplomatjc/normalized

  • Comparatur vel ad se vel ad alium
  • co̊paraƐur uł adſe uładalium
slide-14
SLIDE 14

Hjelmslev’s “analysis” as digital modelling for pre-modern writjng systems

slide-15
SLIDE 15

System / text

  • co̊paraƐur uł adſe uładalium

Syntagmatic (text, process) Paradigmatic (langue, system)

slide-16
SLIDE 16

System / text

  • co̊paraƐur uł adſe uładalium

Text System

<z> <y> <x> <t> <s>

slide-17
SLIDE 17

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities

slide-18
SLIDE 18

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities “If [we are] given anything…, it is the as yet unanalyzed text in its undivided and absolute integrity” (“deduction”, Prol. Ch. 4)

slide-19
SLIDE 19

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-20
SLIDE 20

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

Digital modelling

slide-21
SLIDE 21

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

Text

<z> <y> <x> <t> <s>

Entities Analysis

slide-22
SLIDE 22

Modelling/“analysis”: entjtjes

  • co̊paraƐur uł adſe uładalium

Text System

<z> <y> <x> <t> <s>

Entities Analysis

slide-23
SLIDE 23

Graphemes as entjtjes?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-24
SLIDE 24

Graphemes as entjtjes?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis “Thus the same linguistic form may also be manifested in writing… Here is a graphic ‘substance’… Describing the actually present expression... system” (Prol. Ch. 21)

slide-25
SLIDE 25

Graphemes as entjtjes?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-26
SLIDE 26

Graphemes as entjtjes?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons
slide-27
SLIDE 27
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Graphemes as entjtjes?

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Entities Analysis

slide-28
SLIDE 28
  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

paradigm

Chains and paradigms

chain

slide-29
SLIDE 29
  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

paradigm

Chains and paradigms

chain “chain”→sequence?

slide-30
SLIDE 30

Functjons

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function

slide-31
SLIDE 31

Ligatures

slide-32
SLIDE 32

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function “Ligature” (entities, parts of a chain)

slide-33
SLIDE 33

«√»

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «σ»

Varieties

slide-34
SLIDE 34

«√»

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «σ»

Varieties: Solidal variants

slide-35
SLIDE 35
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Varieties: Solidal variants

slide-36
SLIDE 36
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Ligatures

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Varieties: Solidal variants

slide-37
SLIDE 37

Graphemes/allographs

slide-38
SLIDE 38

The commutatjon test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

slide-39
SLIDE 39

«σ»

The commutatjon test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s> «√»

slide-40
SLIDE 40

«σ»

Function Substitution: → No change in “meaning” daƐur / daσur

The commutatjon test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Function Commutation: → Change in “meaning” annus / annys

«√»

slide-41
SLIDE 41

Substitution: → No change in “meaning”

«σ»

The commutatjon test

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Commutation: → Change in “meaning” Variants Invariants

«√»

slide-42
SLIDE 42

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<t> <s>

Variants Invariants Invariant → <grapheme> <t> (class, paradigm, sincretism) Variants → «allographs» «σ¼σ¼» (components, members) «σ¼ » Ɛ¼ «σ¼√»

«√»

slide-43
SLIDE 43
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√» «σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Variants Invariants

«√»

slide-44
SLIDE 44
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√» «σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Variants Invariants

«√»

slide-45
SLIDE 45
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)?
  • Abbreviatjons

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√» «σ»

Graphemes / allographs

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Variants Invariants

«√»

slide-46
SLIDE 46

Punctuatjon

slide-47
SLIDE 47

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Punctuatjon

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

“content-form” (Prol. Ch. 13)

«√»

slide-48
SLIDE 48

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«σ»

Punctuatjon

  • co̊paraƐur uł adſe uładalium

<z> <y> <x> <t> <s>

Including larger units, such as sentences “content-form” (Prol. Ch. 13)

«√»

slide-49
SLIDE 49

Substitution: → No change in “meaning” Commutation: → Change in “meaning”

Punctuatjon

Including larger units, such as sentences “content-form” (Prol. Ch. 13) Truly I tell you, today you will be with me in paradise Truly I tell you today, you will be with me in paradise

slide-50
SLIDE 50

Substitution: → No change in “meaning” Commutation: → Change in “meaning”

Punctuatjon

Including larger units, such as sentences “content-form” (Prol. Ch. 13) Truly I tell you, today you will be with me in paradise Truly I tell you today, you will be with me in paradise

slide-51
SLIDE 51

Substitution: → No change in “meaning”

<z> <y> <x>

Commutation: → Change in “meaning”

«√»

Punctuatjon

  • co̊paraƐur uł adſe uładalium ·

<z> <y> <x> <t> <s> <.> «σ» «√»

slide-52
SLIDE 52
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Substitution: → No change in “meaning”

<z> <y> <x> «√» <z> <y> <x>

Commutation: → Change in “meaning”

«√»

Punctuatjon

  • co̊paraƐur uł adſe uładalium ·

<z> <y> <x> <t> <s> <.> «σ» «√»

slide-53
SLIDE 53
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Substitution: → No change in “meaning”

Punctuatjon

  • co̊paraƐur uł adſe uładalium ·

<z> <y> <x> <t> <s> <.> «σ» «√» <z> <y> <x>

Commutation: → Change in “meaning”

«√»

slide-54
SLIDE 54

Abbreviatjons

slide-55
SLIDE 55

Abbreviatjons

  • co̊paraƐur uł adſe uładalium
slide-56
SLIDE 56

Abbreviatjons: one grapheme?

  • co̊paraƐur uł adſe uładalium

<ō> <ô> <o> <p> <q>

  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
slide-57
SLIDE 57

Abbreviatjons: one grapheme?

  • co̊paraƐur uł adſe uładalium

<ō> <ô> <o> <p> <q>

Principles: simplicity, economy, reduction (Prol. Ch. 6) “lowest possible number

  • f elements” (Ch. 13)
slide-58
SLIDE 58

Abbreviatjons: functjon

  • co̊paraƐur uł adſe uładalium

Function (entities)

  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
slide-59
SLIDE 59

Abbreviatjons: functjon → ligature?

  • co̊paraƐur uł adſe uładalium

Function “Ligature” (Solidal variants)

  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
slide-60
SLIDE 60

Abbreviatjons: functjon → ligature?

  • co̊paraƐur uł adſe uładalium

Function “Ligature” (Solidal variants) Solidal: both mandatory

  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
slide-61
SLIDE 61

Abbreviatjons: functjon → selectjon?

  • co̊paraƐur uł adſe uładalium

Function Selection:

  • one is optional
  • one governs the other
  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selectjon (one governs the other)
slide-62
SLIDE 62

Abbreviatjons: functjon → selectjon?

  • co̊paraƐur uł adſe uładalium

Function Selection:

  • one is optional
  • one governs the other

“chain”→sequence?

slide-63
SLIDE 63

Abbreviatjons: functjon → complementarity?

  • co̊paraƐur uł adſe uładalium

Function Complementarity (interdependence in a system)

  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selectjon (one governs the other)
  • 2C. Complementarity (interdependence)
slide-64
SLIDE 64

Abbreviatjons: functjon → complementarity?

  • co̊paraƐur uł adſe uładalium

Function Complementarity (interdependence in a system) [case + gender + number] Alt

  • us

nom+masc+sing Alt

  • rum

gen+m/neu+plur Alt

  • arum

gen+ fem + plur

Example of complementarity

slide-65
SLIDE 65

Abbreviatjons: functjon → complementarity?

  • co̊paraƐur uł adſe uładalium

<m> <l> <n> <p> <q> <zero> <~> <¯> <^>

slide-66
SLIDE 66

Abbreviatjons

  • co̊paraƐur uł adſe uładalium
  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selectjon (one governs the other)
  • 2C. Complementarity (interdependence)
slide-67
SLIDE 67
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Abbreviatjons

  • co̊paraƐur uł adſe uładalium
slide-68
SLIDE 68
  • Digital (discrete)
  • Ligatures
  • Allographs
  • (Capitalizatjon)
  • Grapheme set
  • (Punctuatjon)
  • Abbreviatjons

Abbreviatjons

  • co̊paraƐur uł adſe uładalium
slide-69
SLIDE 69

Open issues

slide-70
SLIDE 70

Issues: 1. Abbreviatjons → functjons

  • co̊paraƐur uł adſe uładalium
  • 1. One entjty
  • Whole abbreviatjon = grapheme (invariant)
  • 2. Two entjtjes (functjon)
  • 2A. Ligature (solidal variants in a chain)
  • 2B. Selectjon (one governs the other)
  • 2C. Complementarity (interdependence)
slide-71
SLIDE 71

Issues: 2. Abbreviatjons → one:many

  • co̊paraƐur
  • co̊paraσur

1:1

slide-72
SLIDE 72

Issues: 2. Abbreviatjons → one:many

  • co̊paraƐur
  • comparaσur

2:2 (2 ≠ 2)

slide-73
SLIDE 73

Issues: 2. Abbreviatjons → one:many

  • positjo

p̄positip

  • praepositjo

2:4

slide-74
SLIDE 74

Issues: 2. Abbreviatjons → one:many

  • fecta

ꝑfectb

  • perfecta

1:3

slide-75
SLIDE 75

Issues: 2. Abbreviatjons → one:many

  • fecta

ꝑfectb

  • perfecta

1:3

Alphabemes (alphabetical letters) Graphemes

slide-76
SLIDE 76

Issues: 3. Ligatures

(syntagmatic)

slide-77
SLIDE 77

Issues: 3. Ligatures

& (U+0026; ASCII 38)

slide-78
SLIDE 78

Issues: 3. Ligatures

Historical/“etymological” considerations

& (U+0026; ASCII 38)

slide-79
SLIDE 79

Issues: 3. Ligatures

Historical/“etymological” considerations

& (U+0026; ASCII 38)

slide-80
SLIDE 80

Issues: 4. Grapheme defjnitjon

(paradigmatic)

.

Full stop Abbreviation mark

slide-81
SLIDE 81

Issues: 5. Allograph defjnitjon (metasemiology)

slide-82
SLIDE 82

Issues: 5. Allograph defjnitjon (metasemiology)

slide-83
SLIDE 83

«σ»

  • co̊paraƐur uł adſe uładalium

Allographs (variants)

«√»

Issues: 5. Allograph defjnitjon (metasemiology)

slide-84
SLIDE 84

«σ»

Issues: 5. Allograph defjnitjon (metasemiology)

  • co̊paraƐur uł adſe uładalium

«√» «√» « » √¼ « √ » «σ» «σ» « σ » «σ» «σ»

slide-85
SLIDE 85

Open issues

  • 1. Abbreviatjons → functjons
  • 2. Abbreviatjons → one:many
  • 3. Ligatures
  • 4. Grapheme defjnitjon
  • 5. Allograph defjnitjon (metasemiology)
slide-86
SLIDE 86

In the Tower of Babel

slide-87
SLIDE 87

a b c d e f g h i j l m n o p q r s t u v z . , ; : !

a b c d e f g h i l m n o p q r s t u z · ;

MS A MS B

In the Tower of Babel: “diplomatjc”

slide-88
SLIDE 88

a b c d e f g h i j l m n o p q r s t u v z . , ; : !

a b c d e f g h i l m n o p q r s t u z · ;

OCR from Teubner

In the Tower of Babel: “normalized”

OCR from Loeb

slide-89
SLIDE 89

In the Tower of Babel

  • Disposable home-made solutjons (project-specifjc)

TEI: theory-agnostjc

Diplomatjc: Modelling pre-modern writjng systems

Normalized:Normalizatjon models / sofuware

slide-90
SLIDE 90

Out of the Tower

slide-91
SLIDE 91

The (long) way out of the Tower

  • Scholarly discussion on modelling
  • Documentatjon on project-specifjc

modelling

formal (data models, sofuware code, tables)

prose

  • Shared models
  • Reusable sofuware libraries
slide-92
SLIDE 92

Is Unicode the short way out (as TEI says)?

  • Solutjon for new digital texts
  • Not enough for pre-modern writjng systems

Ligatures

  • & (U+0026; ASCII 38)
  • Have I encoded that it is equivalent to “e + t” in that MS?

Allographs

  • ſ (U+017F) / s (U+0073; ASCII 115)
  • Have I encoded that they are variants of grapheme <s>?

Grapheme set

  • u (U+0075; ASCII 117)
  • Have I encoded whether it “covers” (or not) <u> and <v>?

slide-93
SLIDE 93

In a nutshell

slide-94
SLIDE 94

In a nutshell

  • Digital scholarly editjons: diplomatjc/normalized
  • Hjelmslev’s “analysis” as digital modelling for pre-modern

writjng systems

  • Open issues
  • Interoperability through modelling