Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image - PowerPoint PPT Presentation

Review Pitch Pitch Tracking Phase Vocoder Summary Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image Analysis, Fall 2020

Review Pitch Pitch Tracking Phase Vocoder Summary Review: Spectrum, Fourier Series, and DFT 1 Musical Pitch 2 Pitch Tracking: the Harmonic Sieve Algorithm 3 Music Synthesis: the Phase Vocoder 4 Summary 5

Review Pitch Pitch Tracking Phase Vocoder Summary Outline Review: Spectrum, Fourier Series, and DFT 1 Musical Pitch 2 Pitch Tracking: the Harmonic Sieve Algorithm 3 Music Synthesis: the Phase Vocoder 4 Summary 5

Review Pitch Pitch Tracking Phase Vocoder Summary Two-sided spectrum The spectrum of x ( t ) is the set of frequencies, and their associated phasors, Spectrum ( x ( t )) = { ( f − N , a − N ) , . . . , ( f 0 , a 0 ) , . . . , ( f N , a N ) } such that N � a k e j 2 π f k t x ( t ) = k = − N

Review Pitch Pitch Tracking Phase Vocoder Summary Summary Fourier Analysis (finding the spectrum, given the waveform): � T 0 X k = 1 x ( t ) e − j 2 π kt / T 0 dt T 0 0 Fourier Synthesis (finding the waveform, given the spectrum): ∞ � X k e j 2 π kt / T 0 x ( t ) = k = −∞ DFT Analysis (finding the spectrum, given the waveform): N − 1 � x [ n ] e − j 2 π kn / N X [ k ] = n =0 DFT Synthesis (finding the waveform, given the spectrum): N − 1 x [ n ] = 1 � X [ k ] e j 2 π kn / N N k =0

Review Pitch Pitch Tracking Phase Vocoder Summary Pythagorean Tuning Humans have always known that f 2 = 2 f 1 (length of one string is twice the length of the other) means they are an octave apart (“same note”). A 3:2 ratio ( f 2 = 1 . 5 f 1 ) is a musical perfect fifth. Pythagoras is attributed with a system of tuning that created an 8-note scale by combining 3:2 and 2:1 ratios (“Pythagorean tuning”), used in some places until 1600.

Review Pitch Pitch Tracking Phase Vocoder Summary Equal-Tempered Tuning Equal-tempered tuning divides the octave into twelve equal ratios. Semitones: the number of semitones, s , separating two tones f 2 and f 1 is given by � f 2 � s = 12 log 2 f 1 Cents: the number of cents, n , separating two tones f 2 and f 1 is given by � f 2 � n = 1200 log 2 f 1

Review Pitch Pitch Tracking Phase Vocoder Summary Pythagorean vs. Equal-Tempered Tuning Pythagorean, Equal-Tempered, and Just Intonation

Review Pitch Pitch Tracking Phase Vocoder Summary Pythagorean vs. Equal-Tempered Tuning By SharkD, public domain image, https://commons.wikimedia.org/wiki/File: Music_intervals_frequency_ratio_equal_tempered_pythagorean_comparison.svg

Review Pitch Pitch Tracking Phase Vocoder Summary Pitch Tracking: Intended Output

Review Pitch Pitch Tracking Phase Vocoder Summary Pitch Tracking: Input Violin.arco.ff.sulG.C4Eb5.aiff

Review Pitch Pitch Tracking Phase Vocoder Summary Pitch Tracking: One frame of the input looks like this

Review Pitch Pitch Tracking Phase Vocoder Summary Pitch Tracking: Input and Output

Review Pitch Pitch Tracking Phase Vocoder Summary The Harmonic Sieve Algorithm: Overview (c) Duifhuis, Willems & Sluyter, J. Acoust. Soc. Am. 71(6):1568-1580, 1982

Review Pitch Pitch Tracking Phase Vocoder Summary The Harmonic Sieve 1 Compute the DFT, X [ k ]. 2 For any given frequency f , define the energy at that frequency to include all of the magnitude DFT within ± 5%, i.e., (1 . 05) Nf / F s � E ( f ) = | X [ k ] | k =0 . 95( Nf / F s ) 3 In order to test the Goodness of F 0 as a possible pitch frequency, add up the energy of its first 11 harmonics: 11 � G ( F 0 ) = E ( hF 0 ) h =1 4 Choose the pitch with the best goodness.

Review Pitch Pitch Tracking Phase Vocoder Summary The Harmonic Sieve Notice that the 11 harmonic frequencies are given by: � f = [ F 0 , 2 F 0 , 3 F 0 , . . . , 11 F 0 ] = F 0 × [1 , 2 , 3 , . . . , 11] Duifhuis, Willems & Sluyter had the clever idea of computing pitch on a semitone scale: S ( � f ) = 12 log 2 ( � f ) = S ( F 0 ) + � M So you can search all of the 88 keys on a piano by starting with S = 0 (the lowest note, A0), and searching all the way up to S=87 (the highest note, C8). For each one, you just add the harmonic sieve to get the frequencies of all the harmonics: � M = [12 log 2 (1) , 12 log 2 (2) , 12 log 2 (3) , . . . , 12 log 2 (11)]

Review Pitch Pitch Tracking Phase Vocoder Summary The Harmonic Sieve (c) Duifhuis, Willems & Sluyter, J. Acoust. Soc. Am. 71(6):1568-1580, 1982

Review Pitch Pitch Tracking Phase Vocoder Summary Figuring out which bins to average So to figure out which bins to average, for any given pitch F 0 : 1 Add the pitch in semitones ( S ( F 0 )) to the mask ( � M ). 2 Convert back into linear frequency: � N � 2 ( S ( F 0 )+ � M ) / 12 k = F s

Review Pitch Pitch Tracking Phase Vocoder Summary Masks for each of the notes on the piano

Review Pitch Pitch Tracking Phase Vocoder Summary Duifhuis-Willems-Sluyter Spectral Analysis Duifhuis, Willems & Sluyter also used a “peak detector” step, after their amplitude spectrum, in order to extract peaks from the spectrum before they applied the sieve. I found this step to be unnecessary when I was designing MP2. On the other hand, this kind of peak detection is used used by Shazam, Soundhound, Beatfind, Google Sound Search etc., to reduce the number of bits per song, so that they can efficiently identify the song you’re listening to. So you might find that part of the Duifhuis et al. article interesting, even though we’re not using it in MP2.

Review Pitch Pitch Tracking Phase Vocoder Summary Duifhuis-Willems Spectral Analysis (c) Duifhuis, Willems & Sluyter, J. Acoust. Soc. Am. 71(6):1568-1580, 1982

Review Pitch Pitch Tracking Phase Vocoder Summary Fourier Synthesis Suppose you know X [ k ]. How can you get x [ n ] back again? That’s right! N − 1 x [ n ] = 1 � X [ k ] e j 2 π kn / N N k =0

Review Pitch Pitch Tracking Phase Vocoder Summary Fourier Synthesis without phase Suppose you know the magnitude only , | X [ k ] | . How can you get x [ n ] back again? Pretend that ∠ X [ k ] = 0. Oops. Sounds like a click. Pretend that ∠ X [ k ] is a random number between 0 and 2 π . Oops. Sounds like noise. Be smart about the relationship between frequency and phase.

Review Pitch Pitch Tracking Phase Vocoder Summary The relationship between frequency and phase Notice that we could write A cos (2 π ft + θ ) = A cos( φ ( t )) where φ ( t ) is the instantaneous phase: at time t = 0, the instantaneous phase is just φ (0) = θ At the end of a T -second frame, φ ( T ) = φ (0) + 2 π fT

Review Pitch Pitch Tracking Phase Vocoder Summary The phase vocoder At each time t , for each of the DFT frequency bins k : 1 Decide whether | X t [ k ] | is one of the harmonics of a tone, or just noise. 2 If it’s just noise, set φ k ( t ) to a random number between 0 and 2 π . 3 If it’s a pure tone, set φ k ( t ) = φ k ( t − T ) + 2 π f k T , where f k is the center frequency ( kF s / N ), and T is the length of the frame (in seconds).

Review Pitch Pitch Tracking Phase Vocoder Summary Result: synthesized magnitudes and phases

Review Pitch Pitch Tracking Phase Vocoder Summary Summary Semitones: the number of semitones, s , separating two tones f 2 and f 1 is given by � f 2 � s = 12 log 2 f 1 The Harmonic Sieve algorithm: choose the pitch with the best goodness, defined as (1 . 05) hNF 0 / F s 11 � � G ( F 0 ) = | X [ k ] | h =1 k =0 . 95( hNF 0 / F s ) Phase Vocoder: If | X [ k ] | is just noise, set φ k ( t ) to a random number between 0 and 2 π . If | X [ k ] | is a pure tone, set φ k ( t ) = φ k ( t − T ) + 2 π f k T , where f k is the center frequency ( kF s / N ), and T is the length of the frame (in seconds).

Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image - PowerPoint PPT Presentation

Review Pitch Pitch Tracking Phase Vocoder Summary Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image Analysis, Fall 2020 Review Pitch Pitch Tracking Phase Vocoder Summary Review: Spectrum, Fourier Series, and DFT 1

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Music Tagging Ryan Curtin LUG@GT Ryan Curtin Music Tagging - p. 1 The Problem You have a

School Music Education Plan THAMES Guidance for Schools Music in Schools - Introducing School

Radium: A Music Editor Inspired by the Music Tracker Kjetil Matheussen Norwegian Center for

Music recommendation and discovery in which Web? scar Celma (Music Technology Group, UPF)

1 Music IR Music? Music IR Music? Music - Sound Music - Sound - Loudness http://

Music Composition with LISP Drew Krause LispNYC November 13, 2012 Lisp Music Environments

Workshop: Music and Pain BRAMS -- Montreal Music, Pain and Emotions Neurobiological

FREEGAL MUSIC Freegal Music offers access to nearly 3 million songs, including Sony Musics

Your home in ex-USSR countries Rights management | Production music | Music service

Early Seed Funding: The Art and Not-So-Science of Raising Money Jason St. Peter, Founder Palomar

Objectives WHY Small Businesses and Startups fail to get Funding HOW increase the odds of

TALS Hospitals, Nursing Homes & ACFs Transportation Assistance Levels for Healthcare

G Montalescot, L Bolognese, D Dudek, P Goldstein, C Hamm, JF Tanguay, JM ten Berg, DL Miller, TM

Inspiring Great Downtowns RECOVERY & RESILIENCY ON MAIN STREET Educational Series Main

Multiparty Session Types and their Applications to Large Distributed Systems Nobuko Yoshida

1 Early 1970s Phreaks Blue Boxes: Free Long Distance Calls Once trunk thinks call is over,

SHPE Sacramento State SHPE California State University, Sacramento AGENDA Announcements

Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image - PowerPoint PPT Presentation

Review Pitch Pitch Tracking Phase Vocoder Summary Lecture 6: Music Mark Hasegawa-Johnson ECE 401: Signal and Image Analysis, Fall 2020 Review Pitch Pitch Tracking Phase Vocoder Summary Review: Spectrum, Fourier Series, and DFT 1

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &amp;

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Music Tagging Ryan Curtin LUG@GT Ryan Curtin Music Tagging - p. 1 The Problem You have a

School Music Education Plan THAMES Guidance for Schools Music in Schools - Introducing School

Radium: A Music Editor Inspired by the Music Tracker Kjetil Matheussen Norwegian Center for

Music recommendation and discovery in which Web? scar Celma (Music Technology Group, UPF)

1 Music IR Music? Music IR Music? Music - Sound Music - Sound - Loudness http://

Music Composition with LISP Drew Krause LispNYC November 13, 2012 Lisp Music Environments

Workshop: Music and Pain BRAMS -- Montreal Music, Pain and Emotions Neurobiological

FREEGAL MUSIC Freegal Music offers access to nearly 3 million songs, including Sony Musics

Your home in ex-USSR countries Rights management | Production music | Music service

Early Seed Funding: The Art and Not-So-Science of Raising Money Jason St. Peter, Founder Palomar

Objectives WHY Small Businesses and Startups fail to get Funding HOW increase the odds of

TALS Hospitals, Nursing Homes &amp; ACFs Transportation Assistance Levels for Healthcare

G Montalescot, L Bolognese, D Dudek, P Goldstein, C Hamm, JF Tanguay, JM ten Berg, DL Miller, TM

Inspiring Great Downtowns RECOVERY &amp; RESILIENCY ON MAIN STREET Educational Series Main

Multiparty Session Types and their Applications to Large Distributed Systems Nobuko Yoshida

1 Early 1970s Phreaks Blue Boxes: Free Long Distance Calls Once trunk thinks call is over,

SHPE Sacramento State SHPE California State University, Sacramento AGENDA Announcements

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &

TALS Hospitals, Nursing Homes & ACFs Transportation Assistance Levels for Healthcare

Inspiring Great Downtowns RECOVERY & RESILIENCY ON MAIN STREET Educational Series Main