CTP431- Music and Audio Computing Music Information Retrieval
Graduate School of Culture Technology KAIST Juhan Nam
Introduction
Instrument: Piano
Genre: Classical
Composer: Chopin
Key: E-minor
Mood: Melancholy, Sad, …
http://www.slideshare.net/Daritsetseg/brainstem-auditory-evoked-responses-baer-or-abr-45762118
– Music identification, search and recommendation
– Interactive music performance
– Musical instrument learning
– Automatic composition and arrangement
– Singing evaluation, game
– Sound sample search in sound libraries
– Automatic segmentation and digital audio effects
Shazam
Audio Fingerprinting
(http://labrosa.ee.columbia.edu/matlab/fingerprint/)
SoundHound
Melody Extraction
Pandora
iTunes Music
[www.soribada.com]
User: Juhan, with Juhan’s latent vector $x_u$
Song: Gangnam Style, with Gangnam Style’s latent vector $y_s$
Song Preference: $x_u^T y_s$
User Similarity: $x_{u_1}^T x_{u_2}$
Song Similarity: $y_{s_1}^T y_{s_2}$
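The three inner products on this slide can be tried out numerically. The following is a minimal sketch, assuming latent vectors are already available from some matrix factorization; the 3-dimensional vectors, their values, and the names `another_user` and `another_song` are made up for illustration and are not from the lecture.

```python
import numpy as np

# Hypothetical latent vectors (illustrative values only, not learned from data).
user_vectors = {
    "Juhan": np.array([0.9, 0.1, 0.4]),
    "another_user": np.array([0.8, 0.2, 0.5]),
}
song_vectors = {
    "Gangnam Style": np.array([0.7, 0.0, 0.6]),
    "another_song": np.array([0.1, 0.9, 0.2]),
}


def inner_product(a, b):
    """Inner product a^T b of two latent vectors, returned as a Python float."""
    return float(np.dot(a, b))


# Song preference: user latent vector times song latent vector.
preference = inner_product(user_vectors["Juhan"], song_vectors["Gangnam Style"])

# User similarity: inner product of two user latent vectors.
user_similarity = inner_product(user_vectors["Juhan"], user_vectors["another_user"])

# Song similarity: inner product of two song latent vectors.
song_similarity = inner_product(
    song_vectors["Gangnam Style"], song_vectors["another_song"]
)

print(preference, user_similarity, song_similarity)
```

Because all three quantities live in the same latent space, the same function serves for preference and for both similarities; only the choice of vectors changes.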
Query word: “Female Lead Vocals”
Top 5 ranked songs:
Norah Jones – Don’t Know Why
Dido – Here with Me
Sheryl Crow – I Shall Believe
No Doubt – Simple Kind of Life
Carpenters – Rainy Days and Mondays
[Figure: audio track of “Gangnam Style” and “Gangnam Style”’s latent vector; matrix factorization (“user”, “song”) from collaborative filtering]
Collaborative Filtering only vs. Collaborative Filtering + Audio Content
Zenph’s Re-performance
Automatic Page Turner (JKU, Austria)
The Piano Music Companion (JKU, Austria)
Sonation’s Cadenza
https://www.youtube.com/watch?v=RmT6MDOD3uc
Antares Auto-tune
[Block diagram: singing voice modification]
– Inputs: Source Singing Voice, Target Singing Voice
– Feature Extraction: Note timing, Pitch, Dynamics
– Analysis blocks: DTW, Smoothing, HPSS, Envelope Detector, Pitch Detector
– Intermediate signals: stretching ratio, smoothed stretching ratio, pitch ratio, gain ratio, harmonic signal
– Processing blocks: Time-Scale Modification, Pitch Shifting, Gain
– Stages: Temporal Alignment, Pitch Alignment, Dynamics Alignment
– Output: Modified Singing Voice
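The temporal alignment stage names DTW (dynamic time warping). The following is a minimal sketch of textbook DTW on two 1-D feature sequences, assuming an absolute-difference local cost and the three standard step directions; it is not necessarily the variant used in the system on the slide, and the pitch values (MIDI note numbers) are made up for illustration.

```python
import numpy as np


def dtw_path(source, target):
    """Align two 1-D sequences with classic DTW.

    Returns the total alignment cost and the warping path as a list of
    (source_index, target_index) pairs.
    """
    n, m = len(source), len(target)
    # cost[i, j] = best cumulative cost aligning source[:i] with target[:j].
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(source[i - 1] - target[j - 1])
            cost[i, j] = local + min(
                cost[i - 1, j - 1],  # advance both sequences
                cost[i - 1, j],      # advance source only
                cost[i, j - 1],      # advance target only
            )

    # Backtrack from the end to the start to recover the warping path.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [
            (cost[i - 1, j - 1], i - 1, j - 1),
            (cost[i - 1, j], i - 1, j),
            (cost[i, j - 1], i, j - 1),
        ]
        _, i, j = min(candidates, key=lambda c: c[0])
        path.append((i - 1, j - 1))
    path.reverse()
    return float(cost[n, m]), path


# Hypothetical frame-wise pitch contours of a source and a target voice.
source_pitch = [60, 60, 62, 64]
target_pitch = [60, 62, 62, 64]
total_cost, path = dtw_path(source_pitch, target_pitch)
print(total_cost, path)
```

The warping path says which source frame corresponds to which target frame; a local stretching ratio for time-scale modification could then be derived from how fast the path advances in each sequence.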
Audio examples (source, target, all, modified source): 벚꽃엔딩, Let it go, 취중진담
Augmented Transition Networks
“Daddy’s car”: Sony CSL Lab’s Flow Machines
쿨잼(Cool Jamm) – Hum On
[Figure: “Musical” Knowledge Base and “Physical” Knowledge Base, with Process and Data elements: Performer, Composer, Instrument, Perception, Cognition, Sound Field, Source Sound, Temporal Control, Symbolic Representation, Room, Listener]