music genre recognition karin kosina kyrah@gnu.org systems in - - PowerPoint PPT Presentation

music genre recognition
SMART_READER_LITE
LIVE PREVIEW

music genre recognition karin kosina kyrah@gnu.org systems in - - PowerPoint PPT Presentation

music genre recognition karin kosina kyrah@gnu.org systems in motion music genre recognition p.1/22 [ mugrat ] music genre recognition by analysis of texture kyrahs masters thesis at fh hagenberg system for


slide-1
SLIDE 1

music genre recognition

karin kosina

kyrah@gnu.org

systems in motion

music genre recognition – p.1/22

slide-2
SLIDE 2

[ mugrat ]

  • music genre recognition by analysis of texture
  • kyrah’s masters thesis at fh hagenberg
  • system for the automatic recognition of music genres,

based only on the sound signal

  • no meta-data, no DB lookup,. . .
  • nly based on sound properties
  • question: how do you do it?

music genre recognition – p.2/22

slide-3
SLIDE 3

[ overview ]

two steps:

  • feature extraction

calculate numerical representation of audio data choosing the right features is crucial!

  • classification

use output of feature extraction as basis for classification

music genre recognition – p.3/22

slide-4
SLIDE 4

[ design principles ]

  • no toy data – use arbitrary polyphonic signals

music genre recognition – p.4/22

slide-5
SLIDE 5

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important

music genre recognition – p.4/22

slide-6
SLIDE 6

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important
  • music theory is not important

music genre recognition – p.4/22

slide-7
SLIDE 7

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important
  • music theory is not important
  • notes are not important

music genre recognition – p.4/22

slide-8
SLIDE 8

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important
  • music theory is not important
  • notes are not important
  • be aware of limitations of “perceptual” modeling

music genre recognition – p.4/22

slide-9
SLIDE 9

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important
  • music theory is not important
  • notes are not important
  • be aware of limitations of “perceptual” modeling
  • describe genres in terms of typical members

music genre recognition – p.4/22

slide-10
SLIDE 10

[ design principles ]

  • no toy data – use arbitrary polyphonic signals
  • advanced listening is not important
  • music theory is not important
  • notes are not important
  • be aware of limitations of “perceptual” modeling
  • describe genres in terms of typical members
  • use information available in sound signal

music genre recognition – p.4/22

slide-11
SLIDE 11

[ feature extraction ]

music files cannot be compared directly instead: calculate feature representation, i.e. essential information needed to differentiate classes “feature vector” – point in n-dimensional space classification based on distance features used in mugrat:

  • music texture features (short-time spectral change)
  • beat-related features (rhythm and beatedness)

music genre recognition – p.5/22

slide-12
SLIDE 12

[ music texture features ]

spectral attributes; mean and variance to capture short-time spectral change

  • spectral centroid
  • rolloff
  • flux
  • zero-crossing rate

feature set based on:

George Tzanetakis, Georg Essl, and Perry Cook. Automatic Musical Genre Classification of Audio Signals. In: Proceedings International Symposium for Audio Information Retrieval (ISMIR), Princeton, N.J., October 2001.

music genre recognition – p.6/22

slide-13
SLIDE 13

[ background :: sine superposition ::

✂ ✄✆☎ ✝ ✞ ✟ ✠✡ ☛ ☞

]

music genre recognition – p.7/22

slide-14
SLIDE 14

[ background :: sine superposition ::

✡ ✁ ✠ ✂ ✄ ☎ ✝ ✞ ✟ ✂ ✞ ✡ ☛ ☞

]

music genre recognition – p.8/22

slide-15
SLIDE 15

[ background :: sine superposition ::

  • ]

music genre recognition – p.9/22

slide-16
SLIDE 16

[ background :: fourier analysis ]

music genre recognition – p.10/22

slide-17
SLIDE 17

[ classical spectrogram ]

music genre recognition – p.11/22

slide-18
SLIDE 18

[ metal spectrogram ]

music genre recognition – p.12/22

slide-19
SLIDE 19

[ dance spectrogram ]

music genre recognition – p.13/22

slide-20
SLIDE 20

[ music texture features ]

  • spectral centroid

balancing point of spectrum measure of spectral shape associated with spectral brightness

✂ ✄ ☎ ✆ ✝ ✞✠✟ ✡☞☛ ✌✎✍ ☛ ✂ ✄ ☎ ✆ ✝ ✞✏✟ ✡☞☛ ✌

music genre recognition – p.14/22

slide-21
SLIDE 21

[ music texture features ]

  • rolloff

measure of spectral shape frequency

  • corresponding to
✁ ✂
  • f the magnitude distribution, so that
✄ ☎ ✆ ✝ ✞ ✟ ✡☞☛ ✌ ✁ ✁ ✍ ✄ ☎ ✆ ✝ ✞ ✟ ✡☞☛ ✌

in mugrat prototype

✁ ✁ ☎✆ ✂

music genre recognition – p.15/22

slide-22
SLIDE 22

[ music texture features ]

  • flux

measure of local spectral change

✄ ☎ ✆ ✝ ✁ ✂ ✟ ✡ ☛ ✌☎✄ ✂ ✟✝✆ ✝ ✡☞☛ ✌✞ ✟

music genre recognition – p.16/22

slide-23
SLIDE 23

[ music texture features ]

  • zero-crossing rate

zero-crossing: successive samples in a digital signal have different signs measure of the noisiness of a signal time domain feature!

✄ ☎ ✆ ✝ ✁✄✂ ✁✄☎ ✡☞☛ ✌✞ ✄ ✂ ✁ ☎ ✡☞☛ ✄ ✆ ✌✞ ✁

music genre recognition – p.17/22

slide-24
SLIDE 24

[ beat-related features ]

DWT, envelope extraction, autocorrelation, beat histogram generation: main beat (strength and BPM), second-strongest beat, relationship of these two, general beatedness

  • relative amplitude
  • f first and second beat histogram peak
  • ratio of amplitude second peak / first peak
  • period of the first and second peak in BPM
  • sum of the histogram

(indication of beat strength)

music genre recognition – p.18/22

slide-25
SLIDE 25

[ classification ]

genres defined in terms of typical members do i know songs that sound like this one?

music genre recognition – p.19/22

slide-26
SLIDE 26

[ classification ]

genres defined in terms of typical members do i know songs that sound like this one? feature extraction == abstraction use “standard” machine learning techniques

music genre recognition – p.19/22

slide-27
SLIDE 27

[ classification ]

genres defined in terms of typical members do i know songs that sound like this one? feature extraction == abstraction use “standard” machine learning techniques k-nearest-neighbour classification: data items == points in feature space labels of songs that are close to test instance, weighted by distance

music genre recognition – p.19/22

slide-28
SLIDE 28

[ knn ]

?

metal classical dance unclassified

music genre recognition – p.20/22

slide-29
SLIDE 29

[ results ]

3 genres:

  • metal
  • dance
  • classical

189 test songs (63, 65, 61) 88.36% accuracy

music genre recognition – p.21/22

slide-30
SLIDE 30

[ http://kyrah.net/mugrat/ ]

remember: information is not knowledge; knowledge is not wisdom; wisdom is not truth; truth is not beauty; beauty is not love; love is not music; music is the best (frank zappa)

music genre recognition – p.22/22