USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. - PowerPoint PPT Presentation

USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. Smith Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST), Japan Wednesday, August 10th 2016 * or not Oral Session #5: Structure

WHERE DO BOUNDARIES COME FROM? ➤ The music! ➤ Sudden changes ➤ Repetitions ➤ Homogenous stretches ➤ The listener! ➤ Person listens to the above, then decides on best description

MODELING “GOOD-LOOKING” DESCRIPTIONS ➤ Which is a better description of the piece L'esempio imperfetta ? Good descriptions of the signal “Good-looking” descriptions

USUAL APPROACH Single Input Algorithm Algorithm 1 Output algorithm: song Algorithm 1 Algorithm 1 Output 1 Algorithm 2 Algorithm 2 Output 2 Multiple Input song algorithms: Algorithm 3 Algorithm 3 Output 3 … … Algorithm N Algorithm N Output N

PROPOSAL 2. Use priors to predict likelihood of outputs Input priors 1. Run committee estimated of algorithms from corpus Algorithm 1 Algorithm 1 Output 1 Likelihood 1 Algorithm 2 Algorithm 2 Output 2 Likelihood 2 Choose output Input with greatest song Algorithm 3 Algorithm 3 Output 3 Likelihood 3 likelihood … … Algorithm N Algorithm N Output N Likelihood N 3. Use likelihoods to predict most accurate output

REGULARITIES ➤ Look at properties of SALAMI annotations ~ 20 seconds relative frequency in corpus log(segment length in seconds)

REGULARITIES 1:1 relative 1:2 2:1 frequency in corpus log(ratio segment length to median) 1:2 2:1 relative 1:4 4:1 frequency in corpus log(ratio of adjacent segment lengths)

BACKGROUND ➤ Some strategies to model priors are widespread. E.g.: ➤ Force segment length to fall within specific range (say, between 10 and 40 seconds) ➤ Encourage segments to be 16, 32, or 64 beats long ➤ Learning directly from annotated audio is another option: ➤ Turnbull et al. (2007) used machine learning to do binary classification of excerpts as boundaries or non- boundaries ➤ Ullrich et al. (2014) did the same with neural nets and achieved a huge increase in performance

BACKGROUND ➤ Other notable examples: ➤ Paulus and Klapuri (2009): “Defining a ‘Good’ Structural Description.” Cost function relates to description “quality”. ➤ Sargent, Bimbot and Vincent (2011): Estimate median segment length; use to regulate cost function. ➤ Rodriguez-Lopez, Volk and Bountoridis (2014): Similar approach, using corpus-estimated priors for melodic segmentation. ➤ McFee et al. (2014): Used annotations to optimise their feature representation, then used a standard approach.

PROPOSAL 2. Use priors to predict likelihood of outputs Input priors 1. Run committee estimated of algorithms from corpus Algorithm 1 Algorithm 1 Output 1 Likelihood 1 Algorithm 2 Algorithm 2 Output 2 Likelihood 2 Choose output Input with greatest song Algorithm 3 Algorithm 3 Output 3 Likelihood 3 likelihood … … Algorithm N Algorithm N Output N Likelihood N 3. Use likelihoods to predict most accurate output

1. COMMITTEE OF ALGORITHMS ➤ Foote (2000) novelty-based segmentation ➤ 40 members parameters: altogether ➤ chroma, MFCC or tempogram features ➤ Used MSAF to ➤ median kernel size run algorithms ➤ checkerboard kernel size (Nieto and Bello 2015) ➤ novelty function adaptive threshold size ➤ Serra et al. (2012) structure feature-based segmentation parameters: ➤ feature ➤ embedded feature dimension size ➤ nearest neighbour region ➤ adaptive threshold for peak picking

2. SET OF PRIORS ➤ Per-segment properties: ➤ A 1 = Segment length ( L i ) 0:00 3:22 ➤ A 2 = Fractional segment length 0.0 1.0 ( L i / song length) ➤ A 3 = Ratio of L i to median segment length ➤ A 4 = Ratio of adjacent segment lengths ( L i / L i +1 ) ➤ Per-description properties: ➤ A 5 = Median segment length 0:00 3:22 (median of L i ) ➤ A 6 = Number of segments 9 ➤ A 7 = Minimum segment length ➤ A 8 = Maximum segment length 0:00 3:22 ➤ A 9 = Standard deviation of 0:05.52 segment length

9 different priors many log-likelihood values –5.71 –5.85 –5.48 –8.75 –5.05 –6.63 –1.82 –6.27 –7.48 –5.69 –5.71 –5.76 –8.75 –4.93 –6.63 –1.82 –6.27 –7.42 –4.97 –5.09 –5.65 –7.13 –3.92 –5.34 –1.82 –4.85 –5.22 –4.72 –4.97 –5.06 –6.71 –3.68 –4.98 –1.82 –3.99 –4.17 –5.71 –5.85 –5.48 –8.75 –5.05 –6.63 –1.82 –6.27 –7.48 –5.69 –5.71 –5.76 –8.75 –4.93 –6.63 –1.82 –6.27 –7.42 –4.97 –5.09 –5.65 –7.13 –3.92 –5.34 –1.82 –4.85 –5.22 –4.72 –4.97 –5.06 –6.71 –3.68 –4.98 –1.82 –3.99 –4.17 –4.19 –4.51 –4.08 –5.47 –3.69 –4.55 –1.82 –3.76 –3.63 –4.19 –4.50 –4.07 –5.27 –3.69 –4.55 –1.82 –3.76 –3.63 –4.33 –4.76 –4.10 –5.88 –3.72 –4.72 –1.82 –3.66 –3.58 –4.33 –4.75 –3.99 –5.89 –3.76 –4.72 –1.82 –3.66 –3.60 –4.19 –4.51 –4.08 –5.47 –3.69 –4.55 –1.82 –3.76 –3.63 –4.19 –4.50 –4.07 –5.27 –3.69 –4.55 –1.82 –3.76 –3.63 –4.33 –4.76 –4.10 –5.88 –3.72 –4.72 –1.82 –3.66 –3.58 –4.33 –4.75 –3.99 –5.89 –3.76 –4.72 –1.82 –3.66 –3.60 –5.61 –6.37 –6.04 –8.75 –3.91 –6.63 –1.82 –5.67 –6.60 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –4.38 –4.71 –4.27 –5.81 –3.66 –4.72 –1.82 –3.95 –3.81 40 committee members –4.58 –4.98 –4.57 –6.09 –3.69 –4.98 –1.82 –3.99 –4.12 –5.61 –6.37 –6.04 –8.75 –3.91 –6.63 –1.82 –5.67 –6.60 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –4.38 –4.71 –4.27 –5.81 –3.66 –4.72 –1.82 –3.95 –3.81 –4.58 –4.98 –4.57 –6.09 –3.69 –4.98 –1.82 –3.99 –4.12 –4.20 –4.52 –4.22 –5.68 –3.64 –4.55 –1.82 –3.76 –3.63 –4.21 –4.51 –4.21 –5.68 –3.64 –4.55 –1.82 –3.71 –3.63 –4.33 –4.72 –4.15 –5.87 –3.72 –4.72 –1.82 –3.71 –3.60 –4.34 –4.71 –4.22 –6.10 –3.69 –4.72 –1.82 –3.74 –3.63 –4.20 –4.52 –4.22 –5.68 –3.64 –4.55 –1.82 –3.76 –3.63 –4.21 –4.51 –4.21 –5.68 –3.64 –4.55 –1.82 –3.71 –3.63 –4.33 –4.72 –4.15 –5.87 –3.72 –4.72 –1.82 –3.71 –3.60 –4.34 –4.71 –4.22 –6.10 –3.69 –4.72 –1.82 –3.74 –3.63 How to choose   –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 an output   –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 based on the priors? –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73 –6.27 –6.10 –6.32 –10.28 –5.27 –6.73 –1.82 –6.40 –8.73

3. USING PRIORS TO PREDICT BEST ANSWER ➤ Grab bag of techniques: ➤ Maximize an individual prior (A 1 through A 9 ) ➤ Maximize combination of priors: ➤ sum of the prior likelihoods ➤ minimum of A 1 through A 9 ➤ use a linear model to predict f -measure based on all likelihoods ➤ use a higher-order linear model (interactions / quadratic models)

PROPOSAL 9 likelihoods 2. Use priors to predict likelihood 40 members Input priors 14 methods 1. Run committee estimated of algorithms 3. Use likelihoods to from corpus predict most Algorithm 1 Algorithm 1 Output 1 Likelihood 1 accurate output Algorithm 2 Algorithm 2 Output 2 Likelihood 2 Input Choose output song with greatest Algorithm 3 Algorithm 3 Output 3 Likelihood 3 likelihood … … Algorithm N Algorithm N Output N Likelihood N Average committee Parameter Magically pick Compare: quality   optimization   the best one   (min) (baseline) (theoretical max)

RESULTS: FOOTE AND SERRA COMMITTEE ON PUBLIC SALAMI f-measure f-measure   System (+/-3 seconds) (+/- 0.5 seconds) A 1 - Segment length A 2 - Fractional segment 0.4230 0.1051 A 1 length 0.4156 0.0958 A 2 A 3 - Ratio to median 0.4176 0.1140 segment length A 3 A 4 - Ratio of adjacent 0.4194 0.1072 A 4 Individual   segment lengths 0.3597 0.0863 A 5 A 5 - Median segment length priors A 6 - Number of segments 0.3781 0.0991 A 6 A 7 - Minimum segment 0.0603 0.0124 A 7 length A 8 - Maximum segment 0.3907 0.0961 A 8 length 0.3956 0.0950 A 9 A 9 - Standard deviation of 0.4260 0.1093 ∑ A i segment length Multiple priors 0.4206 0.1046 min A i 0.4399 0.0845 Linear model Linear models 0.4451 0.0688 Interactions 0.4494 0.0739 Quadratic 0.2826 0.0691 Committee mean 0.4439 0.1151 Baseline 0.6015 0.2572 Theoretical max

EXPERIMENT #2: MIREX COMMITTEE ➤ Could a more diverse committee of state-of-the-art algorithms do better? ➤ Run the same experiment with new committee: ➤ Set of 23 MIREX participants, 2012–2014.

USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. - PowerPoint PPT Presentation

USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. Smith Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST), Japan Wednesday, August 10th 2016 * or not Oral Session #5: Structure WHERE DO

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &

Informative Priors for Graphical Model Structure James Cussens, University of York

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Conjugate Priors: Beta and Normal 18.05 Spring 2018 Review: Continuous priors, discrete data

P-values, Probability, Priors, Rabbits, P-values, Probability, Priors, Rabbits, Quantifauxcation,

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Music Tagging Ryan Curtin LUG@GT Ryan Curtin Music Tagging - p. 1 The Problem You have a

School Music Education Plan THAMES Guidance for Schools Music in Schools - Introducing School

TX-NM Network Gathering: August 1315, 2015 Generation X Bill Young MCC Austin Last updated:

Machine Learning Research at eBay: An Industry Lab Perspective Dennis DeCoste eBay Research Labs

Why Do People Give? An Experimental Test of Pure and Impure Altruism Lise Vesterlund Mark

Natural Language Processing Info 159/259 Lecture 3: Text classification 2 (Aug 30, 2018)

Formal Program Optimization in Nuprl Using Computational Equivalence and Partial Types Vincent

arXiv:1810.02435v2 [hep-ex] 8 Oct 2018 Department of Physics Duke University, Durham, North

Sustainability Science 1 and the Sustainability Pathway Specializing in spaceship repair

Objects Akim Demaille, Etienne Renault, Roland Levillain March 20, 2020 TYLA Objects March 20,

USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. - PowerPoint PPT Presentation

USING PRIORS TO IMPROVE* ESTIMATES OF MUSIC STRUCTURE Jordan B. L. Smith Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST), Japan Wednesday, August 10th 2016 * or not Oral Session #5: Structure WHERE DO

MUSIC THERAPY MUSIC THERAPY What is music therapy? Music therapy is simply the process of using

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

JEWISH MUSIC 101: WHAT IS JEWISH MUSIC? A PROGRAM OF THE LOWELL MILKEN FUND FOR AMERICAN JEWISH

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &amp;

Informative Priors for Graphical Model Structure James Cussens, University of York

Music and Pain: A Music Therapy Perspective Deborah Salmon, MA, MTA, CMT BRAMS, Universit de

FOLK MUSIC AT KMH A presentation of the Folk Music Department at the Royal College of Music,

Music, Language and Computation Aline Honingh LoLaCo Guestlecture 2012 Outline Music at the

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Conjugate Priors: Beta and Normal 18.05 Spring 2018 Review: Continuous priors, discrete data

P-values, Probability, Priors, Rabbits, P-values, Probability, Priors, Rabbits, Quantifauxcation,

A Musical Future Options for Studying Music at UWA Why choose Music at UWA? Music at UWA

Music Tagging Ryan Curtin LUG@GT Ryan Curtin Music Tagging - p. 1 The Problem You have a

School Music Education Plan THAMES Guidance for Schools Music in Schools - Introducing School

TX-NM Network Gathering: August 1315, 2015 Generation X Bill Young MCC Austin Last updated:

Machine Learning Research at eBay: An Industry Lab Perspective Dennis DeCoste eBay Research Labs

Why Do People Give? An Experimental Test of Pure and Impure Altruism Lise Vesterlund Mark

Natural Language Processing Info 159/259 Lecture 3: Text classification 2 (Aug 30, 2018)

Formal Program Optimization in Nuprl Using Computational Equivalence and Partial Types Vincent

arXiv:1810.02435v2 [hep-ex] 8 Oct 2018 Department of Physics Duke University, Durham, North

Sustainability Science 1 and the Sustainability Pathway Specializing in spaceship repair

Objects Akim Demaille, Etienne Renault, Roland Levillain March 20, 2020 TYLA Objects March 20,

The intriguing case of sad music Dr. Jonna Vuoskoski jonna.vuoskoski@music.ox.ac.uk Music &