Austrian Research Institute for Artificial Intelligence (OFAI)
International Computer Music Conference (ICMC 2012), Ljubljana/Slovenia
Visualization of perceptual qualities in textural sounds
Thomas Grill, Arthur Flexer
Fundamental questions
Grill and Flexer: Visualization of perceptual qualities in textural sounds ICMC 2012, Ljubljana/Slovenia
ringing cheeping gasping smashing piercing peeping whooping tinkling raucous chattering crooning bellowing sobbing bumping snarling growling pitch crying thumping burping croaking clattering yapping keening splashing yelping rustling volume squealing howling barking sniveling moaning pealing tone rattling grunting clanging coughing quacking whining gagging fizzing wheezing honking hissing bawling trumpeting swishing sneezing rumbling bubbling ripping cooing chirping shouting shuffling tearing popping roaring thunderous scratching snorting crashing crunching cackling tolling clucking silent tapping soothing crowing tranquil melodious cacophonous singing quiet tune loud tinkling noisy rhythmic mumbling twittering din beat blaring cawing racket chattering murmuring whistling clapping booming whispering mewing snapping snoring yelling mooing crackling sighing
(sound origin, recording context, etc.)
especially for abstract sounds or use in acousmatic composition
Grill, Flexer and Cunningham: Identification of perceptual qualities in textural sounds using the repertory grid method. Proceedings of the 6th Audio Mostly Conference, 2011
➡Repertory grid technique used to elicit qualities (personal constructs) "ex nihilo", for a specific selection of subjects (interviewees) and objects under examination (items)
between two randomly chosen sound examples
➡Bipolar qualities spanning the range from one sound to the other
using own personal constructs
Construct poles (first pole – second pole):
motion – static
textural – coherent
impulse – continuous
high – low
eccentric – contained
evolutionary – repetitive
well-defined – diffused
regular – irregular
narrative – static
pitched – non-pitched
smooth – porous

Ratings on a scale of 1 … 5:
A  4 4 4 1 2 4 4 2 4 3 3
B  5 3 5 5 5 1 3 1 5 2 1
C  4 5 2 2 4
D  4 2 5 4 3 4 4 3 4 2 3
E  2 4 1 1 2 4 1 5 5 3 5
F  1 1 2 2 2
G  5 5 5 5 5 2 1 2 5 1 1
H  4 3 3 1 2 5 1 1 5 2 4
I  4 2 2 2 2 5 2 2 4 1 4
J  2 1 5 3 1
K  5 2 4 4 4 4 3 1 5 4 2
L  1 1 1 3 1
M  4 5 5 1 2 2 3 2 5 3 2
N  3 1 4 4 1 4 4 5 5 4 2
O  4 2 4 3 3
P  2 2 3 3 3 4 5 3 5 5 4
Q  5 5 5 3 5
R  3 3 4 2 3 2 2 3 4 2 3
S  2 2 5 2 3 4 4 4 2 3 2
T  1 1 4 4 1 4 3 2 3 5 2
high/low
http://grrrr.org/test/classify
Construct                      Agreement α (core group)*   Agreement α (all n ≥ 10)
high – low                     0.588                       0.519
[…]                            0.556                       0.447
natural – artificial           0.551                       0.492
smooth – coarse                0.527                       0.420
tonal – noisy                  0.523                       0.435
homogeneous – heterogeneous    0.519                       0.416
dense – sparse                 0.492                       0.342
edgy – flowing                 0.465                       0.376
static – dynamic               0.403                       0.383
near – far                     0.252                       0.249
* nine subjects who took part in the elicitation process
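The agreement values above can be illustrated with a small computation. The following is a minimal sketch, assuming that α denotes Krippendorff's alpha with an interval (squared-difference) metric; the slide itself does not name the coefficient or the metric. The function name and the input layout (one list of ratings per sound, missing ratings simply left out) are hypothetical.

```python
from itertools import permutations


def krippendorff_alpha_interval(units):
    """Krippendorff's alpha for interval data (assumed metric, see lead-in).

    units: list of lists; each inner list holds the ratings that
    different subjects gave to one sound for one construct.
    """
    # Only sounds rated by at least two subjects contribute pairable values.
    units = [u for u in units if len(u) >= 2]
    n = sum(len(u) for u in units)
    if n < 2:
        raise ValueError("need at least one sound with two ratings")

    # Observed disagreement: squared differences within each sound.
    d_o = 0.0
    for u in units:
        within = sum((a - b) ** 2 for a, b in permutations(u, 2))
        d_o += within / (len(u) - 1)
    d_o /= n

    # Expected disagreement: squared differences across all pairable ratings.
    values = [v for u in units for v in u]
    d_e = sum((a - b) ** 2 for a, b in permutations(values, 2)) / (n * (n - 1))
    if d_e == 0:
        raise ValueError("ratings show no variation; alpha is undefined")

    return 1.0 - d_o / d_e


if __name__ == "__main__":
    # Three sounds, two subjects each, in full agreement.
    print(krippendorff_alpha_interval([[1, 1], [3, 3], [5, 5]]))
```

A value of 1 means perfect agreement, 0 means agreement no better than chance, and negative values indicate systematic disagreement.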
➡Auditory characteristics
➡Clusters, similarities, dominating characteristics
Lawrence Marks: On Perceptual Metaphors. Metaphor and Symbolic Activity 11(1), 39–66, 1996
➡very rare, asymmetric, individual
Wolfgang Köhler: Gestalt Psychology, 1929
personal constructs
➡Tiled map: Iconic representation of individual sounds, map for structural aspects of the collection
(bright yellow–dark red)
(on grid–deviating from grid)
(colorful–gray)
(smooth–jagged)
(no variation–much variation)
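Each of the bipolar visual ranges above can be read as an interpolation between two extremes. The following is a minimal sketch of that idea for two of the ranges (bright yellow to dark red, colorful to gray), assuming a construct rating normalized to 0..1. The RGB values, the function names and the linear blending are illustrative assumptions, not the mapping used by the authors.

```python
def lerp(a, b, t):
    """Linear interpolation between a and b for t in 0..1."""
    return a + (b - a) * t


def clamp01(t):
    """Restrict t to the range 0..1."""
    return min(1.0, max(0.0, t))


# Assumed endpoint colors (hypothetical RGB values).
BRIGHT_YELLOW = (255, 240, 80)
DARK_RED = (120, 0, 0)


def color_for(value):
    """Map a normalized rating to a color between bright yellow and dark red."""
    t = clamp01(value)
    return tuple(round(lerp(a, b, t)) for a, b in zip(BRIGHT_YELLOW, DARK_RED))


def desaturate(rgb, amount):
    """Blend a color toward the gray of equal mean brightness (colorful to gray)."""
    t = clamp01(amount)
    gray = sum(rgb) / 3
    return tuple(round(lerp(c, gray, t)) for c in rgb)


if __name__ == "__main__":
    print(color_for(0.0), color_for(1.0))
    print(desaturate(color_for(1.0), 1.0))
```

The remaining ranges (grid deviation, contour jaggedness, amount of variation) could be driven by the same kind of normalized value.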
high–low
chaotic
tonal–noisy
smooth–coarse
homogeneous–heterogeneous
http://grrrr.org/test/texvis
current sound, the generation parameters of the other four examples are taken from a uniform random distribution
current sound, the other four represent other existing sounds of the pool
group                                                          voters / votes   mean RMS error (random: 0.243)   correctness (random: 20%)
non-musicians, ≥ 20 votes                                      19 / 876         0.178                            33.9%
classical musical training, ≥ 20 votes                         29 / 1570        0.163                            40.0%
electronic music practice, ≥ 20 votes                          48 / 2811        0.137                            45.2%
electronic music practice, good listening conditions, ≥ 20 votes   36 / 2019    0.133                            46.4%
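With five candidate images per vote, random guessing yields 1/5 = 20% correctness, which matches the baseline in the table. The following is a minimal sketch of how the two scores could be computed from a list of votes. It assumes that each image is described by a vector of generation parameters in 0..1 and that the error of a vote is the RMS difference between the parameters of the selected image and those of the reference image; this definition and all names are assumptions, since the slides do not spell it out.

```python
import math


def rms_error(selected, reference):
    """RMS difference between two equally long parameter vectors."""
    squared = [(s - r) ** 2 for s, r in zip(selected, reference)]
    return math.sqrt(sum(squared) / len(squared))


def score_votes(votes):
    """Return (correctness, mean RMS error) for a list of votes.

    votes: list of (selected_params, reference_params) tuples, where
    each element is a tuple of parameter values (assumed layout).
    """
    if not votes:
        raise ValueError("no votes to score")
    correct = sum(1 for selected, reference in votes if selected == reference)
    mean_rms = sum(rms_error(s, r) for s, r in votes) / len(votes)
    return correct / len(votes), mean_rms


if __name__ == "__main__":
    votes = [
        ((0.2, 0.4), (0.2, 0.4)),  # correct choice, zero error
        ((0.0, 0.0), (0.6, 0.8)),  # wrong choice
    ]
    print(score_votes(votes))
```

A wrong choice that is visually close to the reference still earns a small RMS error, which is why the table reports both measures.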
Survey B: electronic music practitioners, good listening conditions, ≥ 10 votes
Figure: matrix of selected versus reference constructs, color scale 0.0 to 1.0.
Constructs on both axes (selected, reference): high–low, smooth–coarse, tonal–noisy, homogeneous–heterogeneous
Cell values:
0.71
0.65 0.43 0.36 0.54
0.43 0.68 0.54 0.27
0.37 0.55 0.69 0.20
0.55 0.26 0.18 0.62
Figure: average RMS error plotted against time per vote (s).
users=94, x-y correlation=-0.13 @ significance(p=0.05)=0.20, mean duration=13.83 (6.65)
Figure: average RMS error plotted against perceived difficulty.
sounds=100, x-y correlation=0.484 @ significance(p=0.05)=0.197
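The significance thresholds quoted with the two correlations depend only on the sample size. The following is a minimal sketch that approximates the critical correlation at p=0.05 (two-sided) with the Fisher z-transform; the slides do not say which test was used, so this is an assumption, and the function names are hypothetical. For n=100 and n=94 it gives roughly 0.196 and 0.203, close to the reported 0.197 and 0.20.

```python
import math
from statistics import NormalDist


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def critical_r(n, p=0.05):
    """Approximate two-sided critical |r| for n samples (Fisher z-transform)."""
    z = NormalDist().inv_cdf(1.0 - p / 2.0)
    return math.tanh(z / math.sqrt(n - 3))


if __name__ == "__main__":
    for n, r in ((94, -0.13), (100, 0.484)):
        threshold = critical_r(n)
        print(n, round(threshold, 3), abs(r) > threshold)
```

Read this way, the correlation between time per vote and error (-0.13) stays below its threshold, while the correlation between perceived difficulty and error (0.484) clearly exceeds it.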
Thomas Grill: Constructing high-level perceptual audio descriptors for textural sounds. Proceedings of the 9th Sound and Music Computing Conference, 2012
i.e. metaphoric descriptions
covering 100 textural sounds
➡Use a uniform underlying time-frequency representation
➡Small number of adjustable parameters for each descriptor
➡Parameters to be tuned so that the descriptors correlate well with human perception
http://grrrr.org/test/texvis/map.html