music information retrieval state of the art techniques
play

Music Information Retrieval State-of-the-art techniques Ladislav - PowerPoint PPT Presentation

Music Information Retrieval State-of-the-art techniques Ladislav Mark Charles University, Prague Music Information Retrieval (MIR) Applications Outline MIR problems (focus: audio query) with state-of-the-art techniques Categorization of


  1. Music Information Retrieval State-of-the-art techniques Ladislav Maršík Charles University, Prague

  2. Music Information Retrieval (MIR)

  3. Applications

  4. Outline MIR problems (focus: audio query) with state-of-the-art techniques Categorization of techniques

  5. MIR problems (audio query) 1. Audio Fingerprinting 2. Whistling and Humming Queries 3. Cover Song Identification 4. Audio similarity (related: music recommendation) 1. 2. 3. and 4.

  6. 1. Audio Fingerprinting INPUT: Song recording OUTPUT: The exact match

  7. 1. Audio Fingerprinting Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002) Time-Frequency spectrogram

  8. 1. Audio Fingerprinting Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002) Constellation analysis

  9. 1. Audio Fingerprinting Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002) Constellation analysis

  10. 1. Audio Fingerprinting Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002) h ( f 1 , f 2 , t 2 - t 1 ) | t 1 Combinatorially hashed

  11. 1. Audio Fingerprinting Summary & State-of-the-art Summary • Short search time: 5-500 milliseconds / query • Robust to noisy environment State-of-the-art • Various indexing techniques • Benchmarking: MIREX 2015 • Focus on commercial deployment, advertisment

  12. 2. Whistling and Humming Queries INPUT: Whistling or Humming OUTPUT: Song containing the melody

  13. 2. Whistling and Humming Queries Shen and Lee: Whistle for Music (2007) - Whistle: 700Hz-2.8KHz - Translation to MIDI (Query and DB) - String matching methods

  14. 2. Whistling and Humming Queries Summary & State-of-the-art Summary • Fast & Effective • False positives State-of-the-art • Hou et al.: Hierarchical K-means tree, dynamic progr. • MusicRadar • Benchmarking: MIREX 2015

  15. 3. Cover Song Identification INPUT: Song / Recording OUTPUT: Cover song / Performances

  16. 3. Cover Song Identification Khadkevich and Omologo: CSI Using Chord Profiles (2013)

  17. 3. Cover Song Identification Kim et al.: Music Fingerprint Extraction Use of Covariance Matrix Fingerprint, Beat synchronization

  18. 3. Cover Song Identification Cross-Similarity and Self-similarity matrices (Tzanetakis 2003, Foote 1999) Alignment using: Chromagram, Spectrogram

  19. 3. Cover Song Identification Cross-Similarity using MFCC (Traile, 2015) Alignment using: MFCC

  20. 3. Cover Song Identification Summary & State-of-the-art Summary • Many various techniques • Overall 80-90% precision of identifying covers State-of-the-art • Benchmarking: MIREX 2015 • Academia Sinica (Tsai, Wang): Melody extraction • Bordeaux (Hanna): Local alignment of chroma sequences

  21. 4. Audio Similarity INPUT: Song OUTPUT: Similar sounding song Music recommendation: OUTPUT: Song that user would like to listen to

  22. 4. Audio Similarity Seyerlehner, Schedl: Block-Level Audio Features (2009) Audio → blocks deriving features from blocks generalizing for the song Distance measures

  23. 4. Audio Similarity Summary & State-of-the-art Summary • Many various techniques • Useful for genre classification / maybe recommentation? State-of-the-art • Benchmarking: MIREX 2015

  24. Categorization of techniques Audio → Spectrogram Audio → MIDI Audio → Chromagram

  25. Categorization of techniques Audio → Spectrogram Audio → MIDI Audio → Chromagram

  26. Categorization of techniques 1. Audio Fingerprinting Audio → Spectrogram 4. Audio Similarity Audio → MIDI 2. Whistle and Humming Queries Audio → Chromagram 3. Cover song identification 4. Audio Similarity

  27. Thank you for your attention

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend