Music Information Retrieval State-of-the-art techniques Ladislav - - PowerPoint PPT Presentation

music information retrieval state of the art techniques
SMART_READER_LITE
LIVE PREVIEW

Music Information Retrieval State-of-the-art techniques Ladislav - - PowerPoint PPT Presentation

Music Information Retrieval State-of-the-art techniques Ladislav Mark Charles University, Prague Music Information Retrieval (MIR) Applications Outline MIR problems (focus: audio query) with state-of-the-art techniques Categorization of


slide-1
SLIDE 1

Music Information Retrieval State-of-the-art techniques

Ladislav Maršík Charles University, Prague

slide-2
SLIDE 2

Music Information Retrieval (MIR)

slide-3
SLIDE 3

Applications

slide-4
SLIDE 4

Outline

MIR problems (focus: audio query) with state-of-the-art techniques Categorization of techniques

slide-5
SLIDE 5

MIR problems (audio query)

  • 1. Audio Fingerprinting
  • 2. Whistling and Humming Queries
  • 3. Cover Song Identification
  • 4. Audio similarity (related: music recommendation)

1. 2.

  • 3. and 4.
slide-6
SLIDE 6
  • 1. Audio Fingerprinting

INPUT: Song recording OUTPUT: The exact match

slide-7
SLIDE 7
  • 1. Audio Fingerprinting

Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002)

Time-Frequency spectrogram

slide-8
SLIDE 8
  • 1. Audio Fingerprinting

Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002)

Constellation analysis

slide-9
SLIDE 9
  • 1. Audio Fingerprinting

Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002)

Constellation analysis

slide-10
SLIDE 10
  • 1. Audio Fingerprinting

Wang and Smith: An Industrial-Strength Audio Search Algorithm (2002)

Combinatorially hashed

h(f1,f2,t2-t1) | t1

slide-11
SLIDE 11
  • 1. Audio Fingerprinting

Summary & State-of-the-art

Summary

  • Short search time: 5-500 milliseconds / query
  • Robust to noisy environment

State-of-the-art

  • Various indexing techniques
  • Benchmarking: MIREX 2015
  • Focus on commercial deployment, advertisment
slide-12
SLIDE 12
  • 2. Whistling and Humming Queries

INPUT: Whistling or Humming OUTPUT: Song containing the melody

slide-13
SLIDE 13
  • 2. Whistling and Humming Queries

Shen and Lee: Whistle for Music (2007)

  • Whistle: 700Hz-2.8KHz
  • Translation to MIDI (Query and DB)
  • String matching methods
slide-14
SLIDE 14
  • 2. Whistling and Humming Queries

Summary & State-of-the-art

Summary

  • Fast & Effective
  • False positives

State-of-the-art

  • Hou et al.: Hierarchical K-means tree, dynamic progr.
  • MusicRadar
  • Benchmarking: MIREX 2015
slide-15
SLIDE 15
  • 3. Cover Song Identification

INPUT: Song / Recording OUTPUT: Cover song / Performances

slide-16
SLIDE 16
  • 3. Cover Song Identification

Khadkevich and Omologo: CSI Using Chord Profiles (2013)

slide-17
SLIDE 17
  • 3. Cover Song Identification

Kim et al.: Music Fingerprint Extraction Use of Covariance Matrix Fingerprint, Beat synchronization

slide-18
SLIDE 18
  • 3. Cover Song Identification

Cross-Similarity and Self-similarity matrices (Tzanetakis 2003, Foote 1999)

Alignment using: Chromagram, Spectrogram

slide-19
SLIDE 19
  • 3. Cover Song Identification

Cross-Similarity using MFCC (Traile, 2015)

Alignment using: MFCC

slide-20
SLIDE 20
  • 3. Cover Song Identification

Summary & State-of-the-art

Summary

  • Many various techniques
  • Overall 80-90% precision of identifying covers

State-of-the-art

  • Benchmarking: MIREX 2015
  • Academia Sinica (Tsai, Wang): Melody extraction
  • Bordeaux (Hanna): Local alignment of chroma sequences
slide-21
SLIDE 21
  • 4. Audio Similarity

INPUT: Song OUTPUT: Similar sounding song Music recommendation: OUTPUT: Song that user would like to listen to

slide-22
SLIDE 22
  • 4. Audio Similarity

Seyerlehner, Schedl: Block-Level Audio Features (2009)

Audio → blocks deriving features from blocks generalizing for the song Distance measures

slide-23
SLIDE 23

Summary

  • Many various techniques
  • Useful for genre classification / maybe

recommentation? State-of-the-art

  • Benchmarking: MIREX 2015
  • 4. Audio Similarity

Summary & State-of-the-art

slide-24
SLIDE 24

Categorization of techniques

Audio → Spectrogram Audio → MIDI Audio → Chromagram

slide-25
SLIDE 25

Categorization of techniques

Audio → Spectrogram Audio → MIDI Audio → Chromagram

slide-26
SLIDE 26

Categorization of techniques

Audio → Spectrogram Audio → MIDI Audio → Chromagram

  • 1. Audio Fingerprinting
  • 4. Audio Similarity
  • 2. Whistle and Humming Queries
  • 3. Cover song identification
  • 4. Audio Similarity
slide-27
SLIDE 27

Thank you for your attention