Melodic Segmentation Across Cultural Traditions M ARCELO E. R - - PowerPoint PPT Presentation
Melodic Segmentation Across Cultural Traditions M ARCELO E. R - - PowerPoint PPT Presentation
Melodic Segmentation Across Cultural Traditions M ARCELO E. R ODRGUEZ -L PEZ & A NJA V OLK Department of Information and Computing Sciences Utrecht University, Interaction Technology Group
2/38
What is Segmentation
3/38
What is Segmentation
Segmenting = Chunking = Grouping psychology: process of grouping individual units of information into larger units. computing: process of dividing data into smaller meaningful units.
4/38
What is Segmentation
Segmenting = Chunking = Grouping Psychology: process of grouping individual units of information into larger units. computing: process of dividing data into smaller meaningful units.
5/38
What is Segmentation
Segmenting = Chunking = Grouping Psychology: process of grouping individual units of information into larger units. Computing: process of dividing data into smaller meaningful units.
6/38
Music Segmentation: DM Task Definition
In Digital Musicology (DM), segmentation is the task
- f dividing a musical piece/melody/section into
‘smaller’ structural units.
7/38
Music Segmentation: Scope
◮ Type of Music and Segment Granularity
- input: monophonic music (most generally melodies)
- encoding: symbolic
- music-theoretic segment parallel: phrases
8/38
Melody Segmentation: an Example
Aihu renmin zidibing
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
2
&# 4
##
9/38
Melody Segmentation: Task
◮ Task
- identify segment boundary locations
- pair boundaries (begin, end)
- label boundaries
10/38
Melody Segmentation: Task
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
2
&# 4
##
phrase boundary
11/38
Models of Melody Segmentation
- development: +30 years
- # of models: ≈27
- comparative studies: 4, 3-6 models evaluated
- most successful: Gestalt-based
12/38
Gestalt Models: Basic Overview
- attempt to quantify (visual) Gestalt principles
- use system of preference rules
- Gestalt proximity is modeled as ‘discontinuity’
detection (breaks in the melodic flow)
13/38
Local Discontinuities Detection: an Example
14/38
Local Discontinuities Detection: an Example
H
q q q q q q q q q q q q e
Q Q Q Q EQ E Q
Q q q q.
15/38
Local Discontinuities Detection: an Example
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
16/38
Local Discontinuities Detection: Input
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
e1
. . .
ei
. . .
eN
24 48 72 84 96 120 2322 69 69 68 66 66 69 64
. . . . . . . . . . . .
pitch:
- nset:
17/38
Local Discontinuities Detection: Input
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
.5 .5 .5 .25 .25 .5 .5 1 .5 .5 .5 .5 2 .75 .25 .5 .5 .5 1 .5 .5 .5 .5 .5 .5 0 1 2 0 3 5 0 5 1 2 3 2 5 2 2 3 3 5 2 3 7 1 2 3 5
p-inv : ioi :
18/38
Local Discontinuity Detection: Profiles
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
p-inv : ioi :
19/38
Local Discontinuity Detection: Profiles
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
p-inv : ioi :
20/38
Local Discontinuity Detection: Output
e
H
q q q q q q q q q q q q
Q Q Q Q EQ E Q
Q q q q.
0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0
Output :
21/38
Gestalt Models: Assumptions
- discontinuity as a distance in some perceptual space
- discontinuity can be treated as a local phenomenon
- discontinuity is universal/idiom-independent
22/38
Statistical Study: Research Questions
- relevance of discontinuity as a boundary cue?
- relevance of discontinuity parametric representation?
- appropriateness of locality for modeling?
- appropriateness of universality assumption?
23/38
Statistical Study: Data
ESSEN FOLK SONG COLLECTION (EFSC)
- 20,000 annotated vocal folk melodies
- Representative subsets of German and Chinese songs
EXPERIMENTAL SUBSETS
- ≈ 5k German songs, ≈ 30k phrases, 1 . . . 29 notes
- ≈ 2k Chinese songs, ≈ 12k phrases, 1 . . . 169 notes
FILTERED
- ≈ 1.2k German songs, ≈ 5.5k phr-pairs, 13 . . . 21 notes
- ≈ 1.4k Chinese songs, ≈ 4.6k phr-pairs, 2 . . . 28 notes
24/38
Statistical Study: Processing
e
2
&# H
4 q q q q q q q q
q q q q
Q Q Q Q EQ E Q
Q q q q.
##
ph a ph b c(ph a) j(ph ab) c(ph b) p-inv : Nioi :
.35 .5 .15 2 5 2
25/38
Results 1:
pitch duration Germany China d
- 32.41 %
43.73 %
- d
49.57 % 72.56 % d
- 04.51 %
22.53 %
- d
24.13 % 57.70 % nd
- 31.49 %
28.53 %
- nd
23.26 % 09.04 %
26/38
Results 1:
pitch duration Germany China d
- 32.41 %
43.73 %
- d
49.57 % 72.56 % d
- 04.51 %
22.53 %
- d
24.13 % 57.70 % nd
- 31.49 %
28.53 %
- nd
23.26 % 09.04 %
27/38
Results 1:
pitch duration Germany China d
- 32.41 %
43.73 %
- d
49.57 % 72.56 % d
- 04.51 %
22.53 %
- d
24.13 % 57.70 % nd
- 31.49 %
28.53 %
- nd
23.26 % 09.04 %
28/38
Results 1:
pitch duration Germany China d
- 32.41 %
43.73 %
- d
49.57 % 72.56 % d
- 04.51 %
22.53 %
- d
24.13 % 57.70 % nd
- 31.49 %
28.53 %
- nd
23.26 % 09.04 %
29/38
Results 2:
J(pab) C(pa) C(pb) 5 10 15 20 Corpus: Germ
any
| case: All phab | Pitch Analysis PI (Semitones) J(pab) C(pa) C(pb) 5 10 15 20
Corpus: China | case: All ph
ab | Pitch Analysis
PI (Semitones) Join Interval and Context Intervals Join Interval and Context Intervals
30/38
Results 2:
J(pab) C(pa) C(pb) 5 10 15 20 Corpus: Germ
any
| case: All phab | Pitch Analysis PI (Semitones) J(pab) C(pa) C(pb) 5 10 15 20
Corpus: China | case: All ph
ab | Pitch Analysis
PI (Semitones) Join Interval and Context Intervals Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Corpus: Germ
any
| case: All phab | Duration Analysis N IOI Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.2 0.4 0.6 0.8 Corpus: China | case: All phab | Duration Analysis N IOI Join Interval and Context Intervals
31/38
Results 2:
J(pab) C(pa) C(pb) 5 10 15 20 Corpus: Germ
any
| case: All phab | Pitch Analysis PI (Semitones) J(pab) C(pa) C(pb) 5 10 15 20
Corpus: China | case: All ph
ab | Pitch Analysis
PI (Semitones) Join Interval and Context Intervals Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Corpus: Germ
any
| case: All phab | Duration Analysis N IOI Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.2 0.4 0.6 0.8 Corpus: China | case: All phab | Duration Analysis N IOI Join Interval and Context Intervals
32/38
Results 2:
J(pab) C(pa) C(pb) 5 10 15 20 Corpus: Germ
any
| case: All phab | Pitch Analysis PI (Semitones) J(pab) C(pa) C(pb) 5 10 15 20
Corpus: China | case: All ph
ab | Pitch Analysis
PI (Semitones) Join Interval and Context Intervals Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Corpus: Germ
any
| case: All phab | Duration Analysis N IOI Join Interval and Context Intervals J(pab) C(pa) C(pb) 0.2 0.4 0.6 0.8 Corpus: China | case: All phab | Duration Analysis N IOI Join Interval and Context Intervals
33/38
Results 3:
◮ group phrase pairs in respect to size(pha) ◮ pairwise t-test between m-j(pha,b) and m-c(pha,b) ◮ China PI: 19/26, NIOI: 26/26 ◮ Germany PI: 6/19, NIOI: 15/19
34/38
Results 3:
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65
Length of pha
PI
NIOI
3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
PI
Chinese Germanic
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65 3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
NIOI Length of pha
35/38
Results 3:
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65
Length of pha
PI
NIOI
3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
PI
Chinese Germanic
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65 3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
NIOI Length of pha
4.5 0.45 0.55 0.65 3 3.5 4 4.5 0.45
36/38
Results 3:
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65
Length of pha
PI
NIOI
3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
PI
Chinese Germanic
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 23 24 25 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 28 29 0.15 0.25 0.35 0.45 0.55 0.65 3 6 7 9 10 12 1.5 2 2.5 3 3.5 4 4.5 5 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.15 0.25 0.35 0.45 0.55 0.65
NIOI Length of pha
4.5 0.45 0.55 0.65 3 3.5 4 4.5 0.45 2.5 0.25 2 2.5 0.25 0.35
37/38
Conclusions
Under the scope of our study:
◮ discontinuities in pitch
- are weak predictors of melodic phrase boundaries
- can not be considered universal
◮ discontinuities in duration
- are strong predictors of phrase boundaries
- remain stable independent of cultural origin and size
38/38