music genre recognition
play

music genre recognition karin kosina kyrah@gnu.org systems in - PowerPoint PPT Presentation

music genre recognition karin kosina kyrah@gnu.org systems in motion music genre recognition p.1/22 [ mugrat ] music genre recognition by analysis of texture kyrahs masters thesis at fh hagenberg system for


  1. music genre recognition karin kosina kyrah@gnu.org systems in motion music genre recognition – p.1/22

  2. � � � � � [ mugrat ] music genre recognition by analysis of texture kyrah’s masters thesis at fh hagenberg system for the automatic recognition of music genres, based only on the sound signal no meta-data, no DB lookup,. . . only based on sound properties question: how do you do it? music genre recognition – p.2/22

  3. � � [ overview ] two steps: feature extraction calculate numerical representation of audio data choosing the right features is crucial! classification use output of feature extraction as basis for classification music genre recognition – p.3/22

  4. � [ design principles ] no toy data – use arbitrary polyphonic signals music genre recognition – p.4/22

  5. � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music genre recognition – p.4/22

  6. � � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music theory is not important music genre recognition – p.4/22

  7. � � � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music theory is not important notes are not important music genre recognition – p.4/22

  8. � � � � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music theory is not important notes are not important be aware of limitations of “perceptual” modeling music genre recognition – p.4/22

  9. � � � � � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music theory is not important notes are not important be aware of limitations of “perceptual” modeling describe genres in terms of typical members music genre recognition – p.4/22

  10. � � � � � � � [ design principles ] no toy data – use arbitrary polyphonic signals advanced listening is not important music theory is not important notes are not important be aware of limitations of “perceptual” modeling describe genres in terms of typical members use information available in sound signal music genre recognition – p.4/22

  11. � � [ feature extraction ] music files cannot be compared directly instead: calculate feature representation, i.e. essential information needed to differentiate classes “feature vector” – point in n-dimensional space classification based on distance features used in mugrat: music texture features (short-time spectral change) beat-related features (rhythm and beatedness) music genre recognition – p.5/22

  12. � � � � [ music texture features ] spectral attributes; mean and variance to capture short-time spectral change spectral centroid rolloff flux zero-crossing rate feature set based on: George Tzanetakis, Georg Essl, and Perry Cook. Automatic Musical Genre Classification of Audio Signals. In: Proceedings International Symposium for Audio Information Retrieval (ISMIR), Princeton, N.J., October 2001. music genre recognition – p.6/22

  13. � ✁ ✂ ✝ ✞ ✟ ✠✡ ☛ ☞ [ background :: sine superposition :: ] ✄✆☎ music genre recognition – p.7/22

  14. ✝ ☞ ✁ ✡ ✁ ✠ ✂ ✄ ☎ � ✞ ✟ ✂ ✞ ✡ ☛ [ background :: sine superposition :: ] music genre recognition – p.8/22

  15. � ✁ � ✁ � [ background :: sine superposition :: ] music genre recognition – p.9/22

  16. [ background :: fourier analysis ] music genre recognition – p.10/22

  17. [ classical spectrogram ] music genre recognition – p.11/22

  18. [ metal spectrogram ] music genre recognition – p.12/22

  19. [ dance spectrogram ] music genre recognition – p.13/22

  20. ✝ ✌ ✆ ☎ ✄ ✂ ☛ � ✝ ✆ ☎ ✄ ✂ ✁ � [ music texture features ] spectral centroid balancing point of spectrum measure of spectral shape associated with spectral brightness ✡☞☛ ✌✎✍ ✞✠✟ ✡☞☛ ✞✏✟ music genre recognition – p.14/22

  21. ✌ ✝ ✆ ☎ ✄ ✍ ✁ ✁ � ✟ ✟ ✞ ✆ ✞ ☎ ✄ ✌ ✂ ✁ ✁ � ✁ ☎✆ ✂ ✝ [ music texture features ] rolloff measure of spectral shape frequency corresponding to of the magnitude distribution, so that ✡☞☛ ✡☞☛ in mugrat prototype music genre recognition – p.15/22

  22. ✂ ✂ ✟ ✌ ✞ � ✁ ✄ ☎ ✆ ✝ ✁ � ✟ ✡ ☛ ✝ [ music texture features ] flux measure of local spectral change ✌ ☎✄ ✡☞☛ ✟ ✝✆ music genre recognition – p.16/22

  23. ✆ ✝ ☎ ✁ ✂ ✄ ✌ ✞ ✌ ✞ ✁ � ✆ ✄ ☎ ✄ ✁ � [ music texture features ] zero-crossing rate zero-crossing: successive samples in a digital signal have different signs measure of the noisiness of a signal time domain feature! ✁✄✂ ✁ ✄☎ ✡☞☛ ✡☞☛ music genre recognition – p.17/22

  24. � � � � [ beat-related features ] DWT, envelope extraction, autocorrelation, beat histogram generation: main beat (strength and BPM), second-strongest beat, relationship of these two, general beatedness relative amplitude of first and second beat histogram peak ratio of amplitude second peak / first peak period of the first and second peak in BPM sum of the histogram (indication of beat strength) music genre recognition – p.18/22

  25. [ classification ] genres defined in terms of typical members do i know songs that sound like this one? music genre recognition – p.19/22

  26. [ classification ] genres defined in terms of typical members do i know songs that sound like this one? feature extraction == abstraction use “standard” machine learning techniques music genre recognition – p.19/22

  27. [ classification ] genres defined in terms of typical members do i know songs that sound like this one? feature extraction == abstraction use “standard” machine learning techniques k-nearest-neighbour classification: data items == points in feature space labels of songs that are close to test instance, weighted by distance music genre recognition – p.19/22

  28. [ knn ] metal classical dance unclassified ? music genre recognition – p.20/22

  29. � � � [ results ] 3 genres: metal dance classical 189 test songs (63, 65, 61) 88.36% accuracy music genre recognition – p.21/22

  30. [ http://kyrah.net/mugrat/ ] remember: information is not knowledge; knowledge is not wisdom; wisdom is not truth; truth is not beauty; beauty is not love; love is not music; music is the best (frank zappa) music genre recognition – p.22/22

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend