International Laboratory for Brain, Music and Sound Research
Measuring & Modeling Musical Expression
Douglas Eck University of Montreal Department of Computer Science BRAMS Brain Music and Sound
Measuring & Modeling Musical Expression Douglas Eck University - - PowerPoint PPT Presentation
Measuring & Modeling Musical Expression Douglas Eck University of Montreal Department of Computer Science BRAMS Brain Music and Sound International Laboratory for Brain, Music and Sound Research Overview Task: human-realistic music
International Laboratory for Brain, Music and Sound Research
Douglas Eck University of Montreal Department of Computer Science BRAMS Brain Music and Sound
Douglas Eck douglas.eck@umontreal.ca
2
Douglas Eck douglas.eck@umontreal.ca FRANTIC RESTFUL SPOOKY
3
Faster → more dangerous →
VICTORIOUS
Douglas Eck douglas.eck@umontreal.ca Music Composition (from video game composer) FRANTIC RESTFUL SPOOKY
P e r f
m a n c e m
e l
4
Faster → more dangerous →
VICTORIOUS
Douglas Eck douglas.eck@umontreal.ca Music Composition (from video game composer) FRANTIC RESTFUL SPOOKY
P e r f
m a n c e m
e l
5
Faster → more dangerous →
VICTORIOUS
Douglas Eck douglas.eck@umontreal.ca Music Composition (from video game composer) FRANTIC RESTFUL SPOOKY
P e r f
m a n c e m
e l
6
Faster → more dangerous →
VICTORIOUS ????
Douglas Eck douglas.eck@umontreal.ca
wordset useful for music recommendation (Eck et al. NIPS 07)
7
This work is a part of Sun Labs “Project Aura” recommendation framework.
Douglas Eck douglas.eck@umontreal.ca
wordset useful for music recommendation (Eck et al. NIPS 07)
7
This work is a part of Sun Labs “Project Aura” recommendation framework.
Deadpan (no expressive timing or dynamics) Human performance (Recorded on Boesendorfer ZEUS)
Differences limited to:
Douglas Eck douglas.eck@umontreal.ca
Beethoven minuet (Sonata op 31 no 3)
10
R
T
!
N
!
N
1
6 5 .
9 c q
8
.
1
9 .
8
6 .
M I N U E T
! I I I [ I I [ I I I I I
I
I E : 3 q 5 6 7 8 9 1 8 t ! 1
3 , t q t 5 1 6
B R R N O .
TRIO
I I I I I I I I I I I I I
t7 18 19 aO Et EE E3 Eq
g6
a8
30
3 1 3 E 3 3 3 q 3 5 3 6 3 7 3 8
B R R N O .
CODA
I I I I I I I
3 9 q a q t q E q 3 q q q 5
BRR ND.
F I G . 3 . G r a n d a v e r a g e t i m i n g p a t t e r n
1 5 h u m a n
f
m a n c e s , w i t h r e
t s p l
t
e p a r a t e l y . T h e l
i n t e r v a l i n t h e C
a ( a r r
) i s 1 5 3 8
ms.
I . R e p e a t s
I t i s e v i d e n t , f i r s t , t h a t r e p e a t s
t h e s a m e m a t e r i a l h a d
e x t r e m e l y s i m i l a r t i m i n g p a t t e r n s . T h i s c
s i s t e n c y
p r
e s s i
a l k e y b
r d p l a y e r s w i t h r e s p e c t t
e t a i l e d t i m i n g
patterns has been noted many times in the literature, begin-
ning with Seashore ( 1938, p. 244). The
systematic de-
v i a t i
s
c u r r e d i n b a r 1 a n d a t p h r a s e e n d i n g s ( b a r s 8 , 1 5
and 23/37-24/38), where the music was, in fact, not
i d e n t i c a l a c r
s r e p e a t s ( s e e F i g . 1 ) : I n b a r 1 , B e e t h
e n
a d d e d a n
n a m e n t ( a t u r n
E
l a t ) i n t h e r e p e a t ( b a r 1 B ) ,
w h i c h w a s s l i g h t l y d r a w n
t b y m
t p i a n i s t s . I n b a r 8 A ,
w h i c h l e d b a c k t
h e b e g i n n i n g
t h e M i n u e t , t h e u p b e a t
was prolonged, but in bar 8B, which led into the second
s e c t i
t h e M i n u e t , a n a d d i t i
a l r i t a r d
c u r r e d
t h e
p h r a s e
i n a l ( s e c
d ) b e a t . S i m i l a r l y , a u n i f
m r i t a r d w a s p r
u c e d i n b a r 1 6 A , w h i c h l e d b a c k t
h e b e g i n n i n g
t h e
s e c
d M i n u e t s e c t i
, a n d a n e v e n s t r
g e r r i t a r d
c u r r e d
t h e p h r a s e
i n a l ( f i r s t a n d s e c
d ) n
e s
b a r 1 6 B ,
w h i c h c
s t i t u t e d t h e e n d
t h e M i n u e t , w h e r e a s t h e t h i r d
note constituted the upbeat to the Trio and was taken shorter. Bar 15 anticipated these changes, which were more
pronounced in the second playing
the Minuet, following
the Trio. Similarly, bar 37 anticipated the large ritard in bar
6 2 8 J . A c
s t . S e c . A m . , V
. 8 8 , N
2 , A u g u s t 1 9 9 B r u n
. R e p p : E x p r e s s i v e t i m i n g i n a B e e t h
e n m i n u e t 6 2 8
Grand average timing patterns of performances with repeats plotted separately. (From B. Repp “Patterns of expressive timing in performances of a Beethoven minuet by nineteen famous pianists”,1990)
Douglas Eck douglas.eck@umontreal.ca
components
yield pleasing performance
component(s) can yield pleasing performance
musical structure
11
A d a p t e d f r
R e p p ( 1 9 9 )
600- B00- 700- 600- 500-
FACTOR I
I I I I I I I I
I E 3 q 5 6 7 8 BRR
I I I I I I I
9 10 11 1E 13 lq 15 16
NO. D U R T I O N I N 800 788- 608- 580- 900- 088- 708-
608-
500-
FACTOR 2
./
I I I I I I I I I I I I I I I I
I E :3 q 5 6 7 8 El 18 11 1E 13 lq 15 BRR NO,
FACTOR 3
I [ I I I I I I I I I I I I I I
I E 3 q 5 6 7 8 9 10 11 IE 13 tq 15 16
BRR NO,
were quite rare in the present composition. Eighth-notes
were common but provided less information, since they re-
duced the four-beat pulse to a two-beat pulse. Some mea-
surement problems were also encountered. Nevertheless, some data were
about the temporal microstructure
at this level.
1. Measurement procedures
Sequences
two sixteenth-notes
in several places
(bars 7, 20, and 34), but proved very difficult to measure; the
second note could usually not be found in the
acoustic
were re-
stricted to single sixteenth-notes following a dotted eighth-
note. Such notes
in bars 0/8A, 1, 4, and 8B/16A of the
Minuet, in bar 23/37
the Trio, and throughout the Coda.
With four repeats
Minuet and two
Trio in most
performances, there were generally four independent mea-
sures available for each of the four sixteenth-note occur-
rences in the Minuet and for the single
in the Trio
(the latter really being two similar
each repeat-
ed twice). For the Coda,
course,
a single set
mea-
surements was available for each artist, but there were 11
632 J. Acoust. Sec. Am., Vol. 88, No. 2, August 1990 Bruno H. Repp: Expressive timing in a Beethoven minuet 632
600- B00- 700- 600- 500-
FACTOR I
I I I I I I I I
I E 3 q 5 6 7 8 BRR
I I I I I I I
9 10 11 1E 13 lq 15 16
NO. D U R T I O N I N 800 788- 608- 580- 900- 088- 708-
608-
500-
FACTOR 2
./
I I I I I I I I I I I I I I I I
I E :3 q 5 6 7 8 El 18 11 1E 13 lq 15 BRR NO,
FACTOR 3
I [ I I I I I I I I I I I I I I
I E 3 q 5 6 7 8 9 10 11 IE 13 tq 15 16
BRR NO,
were quite rare in the present composition. Eighth-notes
were common but provided less information, since they re-
duced the four-beat pulse to a two-beat pulse. Some mea-
surement problems were also encountered. Nevertheless, some data were
about the temporal microstructure
at this level.
1. Measurement procedures
Sequences
two sixteenth-notes
in several places
(bars 7, 20, and 34), but proved very difficult to measure; the
second note could usually not be found in the
acoustic
were re-
stricted to single sixteenth-notes following a dotted eighth-
note. Such notes
in bars 0/8A, 1, 4, and 8B/16A of the
Minuet, in bar 23/37
the Trio, and throughout the Coda.
With four repeats
Minuet and two
Trio in most
performances, there were generally four independent mea-
sures available for each of the four sixteenth-note occur-
rences in the Minuet and for the single
in the Trio
(the latter really being two similar
each repeat-
ed twice). For the Coda,
course,
a single set
mea-
surements was available for each artist, but there were 11
632 J. Acoust. Sec. Am., Vol. 88, No. 2, August 1990 Bruno H. Repp: Expressive timing in a Beethoven minuet 632
Douglas Eck douglas.eck@umontreal.ca
performances of Chopin’s op. 10 No 3; Added 9 graduate student performances
highest, followed by E11, S1, S3, S9, S2, and SA.
12
Douglas Eck douglas.eck@umontreal.ca
13
Phrasing Phrase arch Create arch-like tempo and sound level changes over phrases Final ritardando Apply a ritardando in the end of the piece High loud Increase sound level in proportion to pitch height Micro-level timing Duration contrast Shorten relatively short notes and lengthen relatively long notes Faster uphill Increase tempo in rising pitch sequences Metrical patterns and grooves Double duration Decrease duration ratio for two notes with a nominal value of 2:1 Inégales Introduce long-short patterns for equal note values (swing) Articulation Punctuation Find short melodic fragments and mark them with a final micropause Score legato/staccato Articulate legato/staccato when marked in the score Repetition articulation Add articulation for repeated notes. Overall articulation Add articulation for all notes except very short ones Tonal tension Melodic charge Emphasize the melodic tension of notes relatively the current chord Harmonic charge Emphasize the harmonic tension of chords relatively the key Chromatic charge Emphasize regions of small pitch changes Intonation High sharp Stretch all intervals in proportion to size Melodic intonation Intonate according to melodic context Harmonic intonation Intonate according to harmonic context Mixed intonation Intonate using a combination of melodic and harmonic intonation Ensemble timing Melodic sync Synchronize using a new voice containing all relevant onsets Ensemble swing Introduce metrical timing patterns for the instruments in a jazz ensemble Performance noise Noise control Simulate inaccuracies in motor
Table 1.
An overview of the rule system
From: A. Friberg, R. Bresin & J. Sundberg (2006). Overview of the KTH rule system for musical
Psychology, 2(2-3):145-161.
From: A. Friberg, R. Bresin & J. Sundberg (2006). Overview of the KTH rule system for musical
Cognitive Psychology, 2(2-3): 145-161.
Figure 2.
The resulting IOI deviations by applying Phrase arch, Duration contrast, Melodic charge, and Punctuation to the Swedish nursery tune “Ekorr’n satt i granen”. All rules were applied with the rule quantity k=1 except the Melodic charge rule that was applied with k=2.
Douglas Eck douglas.eck@umontreal.ca
16
tempo at each note—notes above the 1.0 line are shortened relative to the tempo of the piece, notes below 1.0 are lengthened. A perfectly regular performance with no timing deviations would correspond to a straight line at y = 1.0.
RULE TL2: abstract_duration_context = equal-longer & metr_strength 1 ⇒ ritardando “Given two notes of equal duration followed by a longer note, lengthen the note (i.e., play it more slowly) that precedes the final, longer one, if this note is in a metrically weak position (‘metrical strength’ 1).”
From: G. Widmer (2003). Discovering simple rules in complex data: A meta- learning algorithm and some surprising musical
Intelligence 146:129-148.
Douglas Eck douglas.eck@umontreal.ca
18
Douglas Eck douglas.eck@umontreal.ca
audio-to-score is hard
19
Acoustics, Speech and Signal Processing (ICASSP), 1313-1316.
Douglas Eck douglas.eck@umontreal.ca
20
When exact note durations are known (i.e. when a score is available) we use a binary input encoding.
Douglas Eck douglas.eck@umontreal.ca
divergence
21
Douglas Eck douglas.eck@umontreal.ca
22
slower faster time (measures) →
Mean values predictions shown as red squares
Douglas Eck douglas.eck@umontreal.ca
24
Pianist pedaling was
pedaling from note timing
slower faster time (measures) →
Model was not trained on this piece.
Douglas Eck douglas.eck@umontreal.ca
26
Douglas Eck douglas.eck@umontreal.ca
27
Douglas Eck douglas.eck@umontreal.ca
nineteen famous pianists. Journal of the Acoustical Society of America, 88: 622-641.
preliminary experiments. Music Perception, 14: 419-444.
some surprising musical discoveries. Artificial Intelligence 146:129-148.
Information Processing Systems (NIPS) 14.
embedded in a psychoacoustic space. In Proceedings of the 22nd International Conference on Machine Learning (ICML), Bonn, Germany.
expression based on Langner’s tempo-loudness animation. In Proceedings of the International Computer Music Conference (ICMC).
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1313-1316.
28