Music Representations, Meinard Müller, International Audio Laboratories Erlangen


slide-1
SLIDE 1

Music Processing Meinard Müller

Lecture

Music Representations

International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de

slide-2
SLIDE 2

Book: Fundamentals of Music Processing

Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de

slide-5
SLIDE 5

Chapter 1: Music Representations

1.1 Sheet Music Representations
1.2 Symbolic Representations
1.3 Audio Representation
1.4 Further Notes

Musical information can be represented in many different ways. In Chapter 1, we consider three widely used music representations: sheet music, symbolic, and audio representations. This first chapter also introduces basic terminology that is used throughout the book. In particular, we discuss musical and acoustic properties of audio signals including aspects such as frequency, pitch, dynamics, and timbre.

slide-6
SLIDE 6

Music Representations

slide-7
SLIDE 7

Music Representations

  • Sheet music representation
      – visual description of a musical score
      – image format (printed or scanned)

  • Symbolic representations
      – description based on entities with explicit musical meaning
      – given in a digital format that can be parsed by a computer

  • Audio representation
      – physical description
      – encoding of a sound wave

slide-8
SLIDE 8

Sheet Music Representation

  • Graphical-textual encoding of musical parameters
      – notes (onsets, pitches, durations)
      – tempo, measure, dynamics
      – instrumentation
      – …
  • Guide for performing music
  • Leaves freedom for various interpretations
slide-9
SLIDE 9

Sheet Music Representation

slide-10
SLIDE 10

Sheet Music Representation

Piano keyboard and notes

slide-11
SLIDE 11

Sheet Music Representation

Piano keyboard and notes

A4: A ≙ pitch spelling attribute, 4 ≙ octave number

slide-12
SLIDE 12

Sheet Music Representation

Piano keyboard and notes

A4: A ≙ pitch spelling attribute, 4 ≙ octave number. Also marked: A3

slide-13
SLIDE 13

Sheet Music Representation

Chromatic circle and Shepard’s helix of pitch

Pitch classes on the chromatic circle: C, C♯/D♭, D, D♯/E♭, E, F, F♯/G♭, G, G♯/A♭, A, A♯/B♭, B

Tone height ≙ octave number
Chroma ≙ pitch spelling attribute

slide-14
SLIDE 14

Sheet Music Representation

Staff, staff with G-clef, staff with F-clef

slide-15
SLIDE 15

Sheet Music Representation

Musical score of a C-major scale

slide-16
SLIDE 16

Sheet Music Representation

Musical score of a C-major scale
Musical score of a C-minor scale (key signature consisting of three flats)

slide-17
SLIDE 17

Sheet Music Representation

Time signature

slide-18
SLIDE 18

Sheet Music Representation

Time signature

measure (bar), bar lines

slide-19
SLIDE 19

Sheet Music Representation

Time signature

measure (bar), bar lines
Four quarter notes per measure
Six eighth notes per measure

slide-20
SLIDE 20

Sheet Music Representation

Time signature

measure (bar), bar lines, upbeat
Four quarter notes per measure
Six eighth notes per measure

slide-21
SLIDE 21

Sheet Music Representation

Note durations

Different durations of notes: whole note, half note, quarter note, eighth note, sixteenth note
Different durations of rests: whole rest, half rest, quarter rest, eighth rest, sixteenth rest
Parts of a note: note head, stem, flag, beam

slide-22
SLIDE 22

Sheet Music Representation

Staff systems
Piano: right hand, left hand
Strings: violin, viola, violoncello

slide-23
SLIDE 23

Sheet Music Representation

Dynamics and articulation

crescendo, diminuendo, piano, forte, legato, staccato, lyrics

slide-24
SLIDE 24

Sheet Music Representation

slide-25
SLIDE 25

Sheet Music Representation

slide-26
SLIDE 26

Sheet Music Representation

slide-27
SLIDE 27

Sheet Music Representation

Types of score

  • Full score: shows music for all instruments and voices; used by conductors
  • Piano (reduction) score: transcription for piano
    Example: Liszt transcription of Beethoven symphonies
  • Short score: reduction of a work for many instruments to just a few staves
  • Lead sheet: specifies only melody, lyrics, and harmonies (chord symbols); used for popular music to capture essential elements of a song

slide-28
SLIDE 28

Symbolic Representation

  • Symbolic description of music
      – based on entities that have an explicit musical meaning
      – given in some digital format
      – can be parsed by a computer
  • Note: the following are not regarded as symbolic music formats
      – scanned sheet music (based on pixels)
      – digital audio files (based on samples)

slide-29
SLIDE 29

Symbolic Representation

MusicXML

slide-30
SLIDE 30

Symbolic Representation

Piano roll representation

slide-31
SLIDE 31

Symbolic Representation

Piano roll representation

slide-32
SLIDE 32

Symbolic Representation

Piano roll representation

  • Piano roll: music storage medium used to operate a player piano
  • Perforated paper rolls
  • Holes in the paper encode the note parameters: onset, duration, and pitch
  • First pianola: 1895

slide-33
SLIDE 33

Symbolic Representation

Piano roll representation

slide-34
SLIDE 34

Symbolic Representation

Piano roll representation

slide-35
SLIDE 35

Symbolic Representation

MIDI representation

  • Musical Instrument Digital Interface (MIDI)
  • Standard protocol for controlling and synchronizing digital instruments
  • Standard MIDI File (SMF) is used for collecting and storing MIDI messages
  • An SMF is often called a MIDI file
slide-36
SLIDE 36

Symbolic Representation

MIDI representation

MIDI note numbers (MNN) ≙ piano keys

Keyboard section from C3 to C5 with its MIDI note numbers:

White keys: C3 = 48, D3 = 50, E3 = 52, F3 = 53, G3 = 55, A3 = 57, B3 = 59, C4 = 60, D4 = 62, E4 = 64, F4 = 65, G4 = 67, A4 = 69, B4 = 71, C5 = 72
Black keys: C♯3/D♭3 = 49, D♯3/E♭3 = 51, F♯3/G♭3 = 54, G♯3/A♭3 = 56, A♯3/B♭3 = 58, C♯4/D♭4 = 61, D♯4/E♭4 = 63, F♯4/G♭4 = 66, G♯4/A♭4 = 68, A♯4/B♭4 = 70
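The correspondence between note names and MIDI note numbers can be sketched in a few lines. The helper names `midi_to_name` and `name_to_midi` are hypothetical and not part of the lecture; the sketch assumes the octave convention used on these slides (C4 = 60, A4 = 69), sharp spellings only, and single-digit octave numbers.

```python
# Pitch-class names (sharp spelling), indexed by MIDI note number modulo 12.
CHROMA = ["C", "C♯", "D", "D♯", "E", "F", "F♯", "G", "G♯", "A", "A♯", "B"]


def midi_to_name(p: int) -> str:
    """Return the note name for MIDI note number p, e.g. 60 -> 'C4'."""
    octave = p // 12 - 1  # assumed convention: p = 60 is C4
    return f"{CHROMA[p % 12]}{octave}"


def name_to_midi(name: str) -> int:
    """Return the MIDI note number for a name such as 'A4' or 'C♯3'.

    Only sharp spellings and single-digit octaves are handled.
    """
    chroma, octave = name[:-1], int(name[-1])
    return 12 * (octave + 1) + CHROMA.index(chroma)


if __name__ == "__main__":
    for p in (48, 58, 60, 69, 72):
        print(p, midi_to_name(p))
```

Flat spellings such as D♭3 would need an extra lookup table, since they share a MIDI note number with the corresponding sharp.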

slide-37
SLIDE 37

Symbolic Representation

MIDI representation

  • MIDI note number (pitch)
      – p = 21, …, 108 ≙ piano keys
      – p = 69 ≙ concert pitch A4
  • Key velocity ≙ intensity
  • MIDI channel ≙ instrument
  • Note-on / note-off events
      – onset time & duration
  • Tempo measured in clock pulses or ticks (each MIDI event has a timestamp)
  • Absolute tempo specified by
      – ticks per quarter note (musical time)
      – microseconds per tick (physical time)
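As a rough illustration of how musical time (ticks) maps to physical time (seconds), the following sketch converts a tick count using the two tempo quantities. It assumes the tempo is given as microseconds per quarter note, as in the set-tempo event of Standard MIDI Files, from which microseconds per tick follows by division. The function names and the example values (120 ticks per quarter note, 500000 microseconds per quarter note) are illustrative assumptions.

```python
def microseconds_per_tick(ticks_per_quarter: int, us_per_quarter: int) -> float:
    """Physical duration of one tick in microseconds."""
    return us_per_quarter / ticks_per_quarter


def ticks_to_seconds(ticks: int, ticks_per_quarter: int, us_per_quarter: int) -> float:
    """Convert a duration in ticks (musical time) to seconds (physical time)."""
    return ticks * us_per_quarter / (ticks_per_quarter * 1_000_000)


if __name__ == "__main__":
    # Assumed example: 120 ticks per quarter note at 120 quarter notes per minute.
    print(ticks_to_seconds(240, ticks_per_quarter=120, us_per_quarter=500_000))
```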

slide-38
SLIDE 38

Symbolic Representation

MIDI representation

Time (ticks)  Message   Channel  Note number  Velocity
60            NOTE ON   1        67           100
              NOTE ON   1        55           100
              NOTE ON   2        43           100
55            NOTE OFF  1        67
              NOTE OFF  1        55
              NOTE OFF  2        43
5             NOTE ON   1        67           100
              NOTE ON   1        55           100
              NOTE ON   2        43           100
55            NOTE OFF  1        67
              NOTE OFF  1        55
              NOTE OFF  2        43
5             NOTE ON   1        67           100
              NOTE ON   1        55           100
              NOTE ON   2        43           100
55            NOTE OFF  1        67
              NOTE OFF  1        55
              NOTE OFF  2        43
5             NOTE ON   1        63           100
              NOTE ON   2        51           100
              NOTE ON   2        39           100
240           NOTE OFF  1        63
              NOTE OFF  2        51
              NOTE OFF  2        39
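A minimal sketch of how such an event list can be turned into notes with onset, duration, and pitch. It assumes that the time column holds the number of ticks since the previous event, and that rows without a time or velocity value carry the value 0; both are assumptions about the table, and the `Event` and `Note` types are hypothetical.

```python
from typing import Dict, List, NamedTuple, Tuple


class Event(NamedTuple):
    delta: int     # ticks since the previous event (assumed meaning of the time column)
    kind: str      # "on" or "off"
    channel: int
    pitch: int     # MIDI note number
    velocity: int


class Note(NamedTuple):
    onset: int     # absolute onset time in ticks
    duration: int  # duration in ticks
    channel: int
    pitch: int
    velocity: int


def events_to_notes(events: List[Event]) -> List[Note]:
    """Pair each note-on with the next note-off of the same channel and pitch."""
    notes: List[Note] = []
    active: Dict[Tuple[int, int], Tuple[int, int]] = {}
    now = 0
    for ev in events:
        now += ev.delta  # accumulate relative times into absolute time
        key = (ev.channel, ev.pitch)
        if ev.kind == "on" and ev.velocity > 0:
            active[key] = (now, ev.velocity)
        elif key in active:  # note-off, or note-on with velocity 0
            onset, velocity = active.pop(key)
            notes.append(Note(onset, now - onset, ev.channel, ev.pitch, velocity))
    return sorted(notes)


if __name__ == "__main__":
    # First chord of the table, with blank cells read as 0 (assumption).
    first_chord = [
        Event(60, "on", 1, 67, 100), Event(0, "on", 1, 55, 100), Event(0, "on", 2, 43, 100),
        Event(55, "off", 1, 67, 0), Event(0, "off", 1, 55, 0), Event(0, "off", 2, 43, 0),
    ]
    for note in events_to_notes(first_chord):
        print(note)
```

Under these assumptions the first chord starts at tick 60 and lasts 55 ticks, which is the kind of onset and duration information a piano-roll plot displays.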

slide-39
SLIDE 39

Symbolic Representation

MIDI representation

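Piano-roll plot of the MIDI events: pitch axis labeled with MIDI note number/note name (36/C2, 43/G2, 48/C3, 55/G3, 60/C4, 67/G4, 71/B4), time axis in ticks (240, 480)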

slide-40
SLIDE 40

Audio Representation

Various interpretations – Beethoven’s Fifth

  • Bernstein
  • Karajan
  • Scherbakov (piano)
  • MIDI (piano)

slide-41
SLIDE 41

Audio Representation

Waveform

slide-42
SLIDE 42

Audio Representation

Waveform

slide-43
SLIDE 43

Audio Representation

Waveform

  • Audio signal encodes change of air pressure at a certain location generated by a vibrating object (e.g. string, vocal cords, membrane)
  • Waveform (pressure-time plot) is a graphical representation of an audio signal
  • Parameters: amplitude, frequency / period

slide-44
SLIDE 44

Audio Representation

Waveform

Waveform plot: air pressure deviation (around the average air pressure) over time in seconds (axis ticks at 0.25, 0.5, 0.75, 1), with period and amplitude marked

slide-45
SLIDE 45

Audio Representation

Waveform

Pure tone (harmonic sound):

  • Sinusoidal waveform
  • Prototype of an acoustic realization of a musical note

Parameters:

  • Period $p$: time between two successive high-pressure points
  • Frequency $f = 1/p$ (measured in Hz)
  • Amplitude $a$: air pressure at high-pressure points
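A pure tone and its parameters can be sketched as follows. The sinusoid model $a \cdot \sin(2\pi f t)$, with period $p = 1/f$, is the standard one; the function names and the optional `phase` parameter (given in fractions of a period) are illustrative assumptions.

```python
import math


def period(frequency: float) -> float:
    """Period p in seconds for a frequency f in Hz (p = 1 / f)."""
    return 1.0 / frequency


def pure_tone(t: float, frequency: float, amplitude: float = 1.0, phase: float = 0.0) -> float:
    """Value of a sinusoid at time t (seconds); phase is in fractions of a period."""
    return amplitude * math.sin(2 * math.pi * (frequency * t - phase))


if __name__ == "__main__":
    f = 440.0
    p = period(f)
    print(f"period of {f} Hz: {p * 1000:.3f} ms")
    # The waveform repeats after one period and peaks a quarter period after t = 0.
    print(pure_tone(0.25 * p, f), pure_tone(1.25 * p, f))
```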

slide-46
SLIDE 46

Audio Representation

Waveform

Waveform plot: amplitude over time (seconds)

slide-47
SLIDE 47

Audio Representation

Waveform

Waveform plots (amplitude over time in seconds): D2 (73.4 Hz)

slide-48
SLIDE 48

Audio Representation

Waveform

Waveform plots (amplitude over time in seconds): D2 (73.4 Hz), 37 periods within a 500 ms section
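The count of roughly 37 periods follows directly from the frequency: a tone of 73.4 Hz completes 73.4 periods per second, so about 36.7 in half a second. A small sketch of this arithmetic, with hypothetical helper names:

```python
def periods_in_section(frequency_hz: float, duration_s: float) -> float:
    """Number of periods a tone of the given frequency completes in duration_s."""
    return frequency_hz * duration_s


def estimate_frequency(period_count: float, duration_s: float) -> float:
    """Rough frequency estimate in Hz from counting periods in a section."""
    return period_count / duration_s


if __name__ == "__main__":
    print(periods_in_section(73.4, 0.5))  # close to 37
    print(estimate_frequency(37, 0.5))    # rough estimate from the counted periods
```

Counting whole periods only gives a coarse estimate, which is why 37 periods in 500 ms points to about 74 Hz rather than exactly 73.4 Hz.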

slide-49
SLIDE 49

Audio Representation

Sound

  • Sound: superposition of sinusoids
  • When realizing musical notes on an instrument, one obtains a complex superposition of pure tones (and other noise-like components)
  • Harmonics: integer multiples of the fundamental frequency
      – 1st harmonic ≙ fundamental frequency (e.g. 440 Hz)
      – 2nd harmonic ≙ first overtone (e.g. 880 Hz)
      – 3rd harmonic ≙ second overtone (e.g. 1320 Hz)
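The idea of a sound as a superposition of harmonics can be sketched as a weighted sum of sinusoids at integer multiples of the fundamental frequency. The amplitude weights below are arbitrary illustrative values, not measurements of any instrument, and the function names are hypothetical.

```python
import math
from typing import List, Sequence


def harmonic_frequencies(fundamental: float, count: int) -> List[float]:
    """Frequencies of the first `count` harmonics (the 1st is the fundamental)."""
    return [k * fundamental for k in range(1, count + 1)]


def harmonic_sound(t: float, fundamental: float, amplitudes: Sequence[float]) -> float:
    """Superposition of sinusoids; amplitudes[k - 1] weights the k-th harmonic."""
    return sum(
        a * math.sin(2 * math.pi * k * fundamental * t)
        for k, a in enumerate(amplitudes, start=1)
    )


if __name__ == "__main__":
    print(harmonic_frequencies(440.0, 3))
    # Assumed weights: each harmonic half as strong as the previous one.
    weights = [1.0, 0.5, 0.25]
    print([round(harmonic_sound(n / 8000, 440.0, weights), 3) for n in range(5)])
```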

slide-50
SLIDE 50

Audio Representation

Pitch

  • Property that correlates to the perceived frequency (≙ fundamental frequency)
  • Example: A4 (also called concert pitch) ≙ 440 Hz
  • Slight changes in frequency have no effect on perceived pitch (pitch ≙ entire range of frequencies)
  • Pitch perception: logarithmic in frequency
    Example: octave ≙ doubling of frequency

slide-51
SLIDE 51

Audio Representation

Pitch

Western music: 12-tone equal-tempered scale

  • Each octave is divided up into 12 logarithmically equal parts
  • Notes correspond to piano keys: p = 21 (A0) to p = 108 (C8)
  • Reference or standard pitch: p = 69 (A4) ≙ 440 Hz
  • Center frequency of a note with MIDI pitch p:
    $F_{\mathrm{pitch}}(p) = 2^{(p-69)/12} \cdot 440$ (Hz)

Equal-tempered scale: a system of tuning in which every pair of adjacent notes has an identical frequency ratio
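In the 12-tone equal-tempered scale with reference pitch p = 69 at 440 Hz, the center frequency of MIDI pitch p is $2^{(p-69)/12} \cdot 440$ Hz. A short sketch of this relation and its inverse; the function names are assumptions, and the inverse returns a fractional pitch for frequencies between scale steps.

```python
import math


def pitch_to_frequency(p: float, reference_pitch: int = 69,
                       reference_frequency: float = 440.0) -> float:
    """Center frequency in Hz of MIDI pitch p in 12-tone equal temperament."""
    return reference_frequency * 2 ** ((p - reference_pitch) / 12)


def frequency_to_pitch(frequency: float, reference_pitch: int = 69,
                       reference_frequency: float = 440.0) -> float:
    """Fractional MIDI pitch for a frequency in Hz (inverse of pitch_to_frequency)."""
    return reference_pitch + 12 * math.log2(frequency / reference_frequency)


if __name__ == "__main__":
    for p in (21, 60, 69, 81, 108):
        print(p, round(pitch_to_frequency(p), 2))
```

Raising p by 12 doubles the frequency, which matches the statement that an octave corresponds to a doubling of frequency.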

slide-52
SLIDE 52

Audio Representation

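Pitch

  • Semitone: difference between two subsequent scale steps
  • Ratio of frequencies one semitone apart is constant (with $F_{\mathrm{pitch}}(p)$ the center frequency of MIDI pitch $p$):
    $F_{\mathrm{pitch}}(p+1) / F_{\mathrm{pitch}}(p) = 2^{1/12} \approx 1.059463$
  • Cent:
      – 1200 cents per octave (by definition)
      – 100 cents per semitone (equivalent definition)
  • Ratio of frequencies one cent apart is constant:
    $2^{1/1200} \approx 1.000578$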

slide-53
SLIDE 53

Audio Representation

Pitch

  • Difference in cents between two frequencies $\omega_1$ and $\omega_2$:
    $\log_2(\omega_1 / \omega_2) \cdot 1200$
  • Just noticeable difference = threshold of what is perceptible
      – varies from person to person
      – depends on other aspects such as the timbre
      – 25 cents recognizable by most people
      – 10 cents recognizable only by trained listeners
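With 1200 cents per octave, the distance in cents between two frequencies is 1200 times the base-2 logarithm of their ratio. The sketch below computes this distance together with the constant semitone and cent ratios; the names are illustrative assumptions.

```python
import math

SEMITONE_RATIO = 2 ** (1 / 12)   # frequency ratio of two adjacent scale steps
CENT_RATIO = 2 ** (1 / 1200)     # frequency ratio of two frequencies one cent apart


def cents(f1: float, f2: float) -> float:
    """Signed distance in cents from frequency f2 up to frequency f1."""
    return 1200 * math.log2(f1 / f2)


if __name__ == "__main__":
    print(round(SEMITONE_RATIO, 6), round(CENT_RATIO, 6))
    print(cents(880.0, 440.0))             # one octave
    print(round(cents(445.0, 440.0), 1))   # a slightly sharp A4
```

A tone at 445 Hz lies a little under 20 cents above 440 Hz, which is below the 25-cent figure quoted for most listeners.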

slide-54
SLIDE 54

Audio Representation

Harmonics

Stereo file: harmonics vs. MIDI

  • Harmonics: frequency = integer multiples of fundamental frequency
  • MIDI: frequency = fundamental frequency of MIDI pitch
  • Mix

Deviation in cents of harmonics 1 to 16 from the MIDI pitch:

Harmonic   1   2   3   4    5   6    7   8   9   10   11   12   13   14   15  16
Deviation  0   0  +2   0  -14  +2  -31   0  +4  -14  -49   +2  +41  -31  -12   0

Marked intervals: octave (harmonic 2), fifth (harmonic 3, +2), major third (harmonic 5, -14)
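The deviations between harmonics and equal-tempered pitches can be recomputed: the k-th harmonic lies $1200 \cdot \log_2 k$ cents above the fundamental, and the nearest equal-tempered pitch lies at the closest multiple of 100 cents. The following is a sketch under the assumption that the fundamental itself sits exactly on an equal-tempered pitch; the function name is hypothetical.

```python
import math


def harmonic_deviation_cents(harmonic: int) -> float:
    """Deviation in cents of the k-th harmonic from the nearest equal-tempered pitch."""
    cents_above_fundamental = 1200 * math.log2(harmonic)
    nearest_tempered = 100 * round(cents_above_fundamental / 100)
    return cents_above_fundamental - nearest_tempered


if __name__ == "__main__":
    for k in range(1, 17):
        print(k, round(harmonic_deviation_cents(k)))
```

Harmonics that are powers of two are pure octaves and show no deviation, while the 7th, 11th, and 13th harmonics fall far between two equal-tempered pitches.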

slide-55
SLIDE 55

Audio Representation

Dynamics

  • Intensity of a sound: energy of the sound per time and area
  • Loudness: subjective (psychoacoustic) perception of intensity (depends on frequency, timbre, duration)

slide-56
SLIDE 56

Audio Representation

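Dynamics

  • Intensity: $\text{intensity} = \frac{\text{energy}}{\text{time} \cdot \text{area}} = \frac{\text{power}}{\text{area}}$, measured in $\mathrm{W/m^2}$
  • Decibel (dB): logarithmic unit to measure intensity relative to a reference level
  • Reference level: threshold of hearing (TOH) with intensity $I_{\mathrm{TOH}}$
  • Intensity $I$ measured in dB:
    $\mathrm{dB}(I) = 10 \cdot \log_{10}(I / I_{\mathrm{TOH}})$
  • Examples:
      – $I = 10 \cdot I_{\mathrm{TOH}}$ has a sound level of 10 dB
      – $I = 100 \cdot I_{\mathrm{TOH}}$ has a sound level of 20 dB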
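The decibel scale can be sketched as ten times the base-10 logarithm of the intensity relative to the threshold of hearing. The reference value of $10^{-12}$ W/m² is the one listed for the threshold of hearing in the table of sound sources; the function and constant names are assumptions.

```python
import math

I_TOH = 1e-12  # threshold of hearing in W/m^2 (reference intensity)


def intensity_to_db(intensity: float, reference: float = I_TOH) -> float:
    """Intensity level in dB of an intensity given in W/m^2."""
    return 10 * math.log10(intensity / reference)


def db_to_intensity(level_db: float, reference: float = I_TOH) -> float:
    """Intensity in W/m^2 for a given intensity level in dB."""
    return reference * 10 ** (level_db / 10)


if __name__ == "__main__":
    for factor in (1, 10, 100):
        print(factor, round(intensity_to_db(factor * I_TOH), 6))
```

Each tenfold increase in intensity adds 10 dB, so doubling the level in dB does not correspond to doubling the intensity.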

slide-57
SLIDE 57

Audio Representation

Dynamics

Source                          Intensity (W/m²)  Intensity level  × TOH
Threshold of hearing (TOH)      10^-12            0 dB             1
Whisper                         10^-10            20 dB            10^2
Pianissimo                      10^-8             40 dB            10^4
Normal conversation             10^-6             60 dB            10^6
Fortissimo                      10^-2             100 dB           10^10
Threshold of pain               10                130 dB           10^13
Jet take-off                    10^2              140 dB           10^14
Instant perforation of eardrum  10^4              160 dB           10^16

slide-58
SLIDE 58

Audio Representation

Dynamics

Waveform plot: amplitude over time

slide-59
SLIDE 59

Audio Representation

Dynamics

Waveform plot (amplitude over time) with lower envelope and upper envelope

slide-60
SLIDE 60

Audio Representation

Dynamics

ADSR model: attack (A), decay (D), sustain (S), and release (R) phase
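One simple way to picture the ADSR model is a piecewise-linear amplitude envelope. The sketch below is an idealization with hypothetical parameters (phase lengths in seconds, a sustain level between 0 and 1); real instrument envelopes are not linear and may lack some phases.

```python
def adsr_envelope(t: float, attack: float, decay: float, sustain_level: float,
                  sustain_time: float, release: float) -> float:
    """Piecewise-linear ADSR envelope value in [0, 1] at time t (seconds)."""
    if t < 0:
        return 0.0
    if t < attack:                      # attack: rise from 0 to 1
        return t / attack
    t -= attack
    if t < decay:                       # decay: fall from 1 to the sustain level
        return 1.0 - (1.0 - sustain_level) * t / decay
    t -= decay
    if t < sustain_time:                # sustain: hold the sustain level
        return sustain_level
    t -= sustain_time
    if t < release:                     # release: fall from the sustain level to 0
        return sustain_level * (1.0 - t / release)
    return 0.0


if __name__ == "__main__":
    # Assumed example: 0.1 s attack, 0.2 s decay, level 0.5 held for 1 s, 0.3 s release.
    for t in (0.0, 0.05, 0.1, 0.2, 0.5, 1.4, 2.0):
        print(t, round(adsr_envelope(t, 0.1, 0.2, 0.5, 1.0, 0.3), 3))
```

Multiplying a waveform by such an envelope shapes its amplitude over time.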

slide-61
SLIDE 61

Audio Representation

Equal-loudness contours (phon)

Loudness plot: intensity (dB, axis from 20 to 120) over frequency (Hz, axis from 20 to 10000), with the threshold of hearing marked

slide-62
SLIDE 62

Audio Representation

Equal-loudness contours (phon)

Loudness plot: intensity (dB, axis from 20 to 120) over frequency (Hz, axis from 20 to 10000), showing the contours for 0, 20, 40, 60, 80, 100, and 120 phon, the threshold of hearing, and the threshold of pain

slide-63
SLIDE 63

Audio Representation

Timbre

  • Quality of musical sound that distinguishes different types of sound production such as voices or instruments
  • Tone quality
  • Tone color
  • Depends on energy distribution in harmonics

slide-64
SLIDE 64

Audio Representation

Timbre

Piano playing note C4 (261.6 Hz)

Plot: frequency (Hz) over time (seconds), with the phases A, D, S, R marked

slide-65
SLIDE 65

Audio Representation

Timbre

Violin playing note C4 (261.6 Hz)

Plot: frequency (Hz) over time (seconds), with the phases A, S, R marked

  • Vibrato: frequency modulations
  • Tremolo: amplitude modulations

slide-66
SLIDE 66

Audio Representation

Digitization

slide-67
SLIDE 67

Audio Representation

Digitization

  • Conversion of a continuous-time (analog) signal into a discrete signal
  • Sampling (discretization of time axis)
  • Quantization (discretization of amplitudes)

Examples:

  • Audio CD: 44100 Hz sampling rate, 16 bits (65536 values) used for quantization
  • Telephone: 8000 Hz sampling rate, 8 bits (256 values) used for quantization
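The two digitization steps can be sketched separately: sampling evaluates the signal at equally spaced time points, and quantization maps each amplitude to one of $2^{\text{bits}}$ levels. The uniform quantizer below, operating on amplitudes in [-1, 1], is one simple choice made for illustration; the function names are hypothetical and no particular audio format is implied.

```python
import math
from typing import Callable, List


def sample(signal: Callable[[float], float], sampling_rate: int, duration: float) -> List[float]:
    """Sample a continuous-time signal at sampling_rate Hz for duration seconds."""
    count = int(round(sampling_rate * duration))
    return [signal(n / sampling_rate) for n in range(count)]


def quantize(x: float, bits: int) -> int:
    """Map an amplitude in [-1, 1] to a level index in 0 .. 2**bits - 1."""
    levels = 2 ** bits
    x = max(-1.0, min(1.0, x))           # clip values outside the range
    step = 2.0 / levels                  # width of one quantization interval
    return min(int((x + 1.0) / step), levels - 1)


if __name__ == "__main__":
    tone = lambda t: math.sin(2 * math.pi * 440.0 * t)
    samples = sample(tone, sampling_rate=8000, duration=0.001)
    print(len(samples), [quantize(s, bits=8) for s in samples])
```

With 8 bits there are 256 possible values and with 16 bits there are 65536, matching the telephone and audio CD examples.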

slide-68
SLIDE 68

Music Representations

Overview diagram:

  • Representations: audio representations (acoustic domain), symbolic representations, sheet music representations (visual domain)
  • Conversions between them: transcription, synthesis, performance, rendering, OMR
  • Time axes: musical time, physical time

OMR = optical music recognition: the process of transforming sheet music into a symbolic representation

slide-69
SLIDE 69

Music Representations

Original score OMR score OMR

slide-70
SLIDE 70

Music Representations

Original score

OMR errors

OMR score OMR