high dimensional distribution testing
play

High-Dimensional Distribution Testing Constantinos Costis - PowerPoint PPT Presentation

High-Dimensional Distribution Testing Constantinos Costis Daskalakis CSAIL and EECS, MIT What properties do your BIG distributions have? e.g. 1 Testing Uniformity e.g.2: Linkage Disequilibrium Genome locus 1 locus 2 Single


  1. High-Dimensional Distribution Testing Constantinos “Costis” Daskalakis CSAIL and EECS, MIT

  2. What properties do your BIG distributions have?

  3. e.g. 1 Testing Uniformity • …

  4. e.g.2: Linkage Disequilibrium Genome locus 1 locus 2 Single Nucleotide Polymorphisms (SNPs), are they independent? 1000 samples (you patients)

  5. e.g.3: Behavior in a Social Network Q: Are nodes behaving independently or far from independently? Q’: Do adopted technologies exhibit weak or strong network effects? 1 sample

  6. Problem formulation • TV ( c.f. G’s talk)

  7. What do we really know about our BIG distributions of interest?

  8. Inspecting the LB Instance • u.a.r.

  9. Today’s Menu • Motivation • Testing Bayesian Networks • Testing Ising Models • Closing Thoughts

  10. Today’s Menu • Motivation • Testing Bayesian Networks • Testing Ising Models • Closing Thoughts

  11. Bayesian Networks •

  12. Testing Bayesian Networks

  13. Testing Bayesian Networks (cont’d)

  14. Testing Bayesian Networks (cont’d)

  15. Today’s Menu • Motivation • Testing Bayesian Networks • Testing Ising Models • Closing Thoughts

  16. Ising Model •

  17. Ising Model: Strong vs weak ties “low temperature regime” “high temperature regime” Forces

  18. Testing Ising Models • product measures

  19. Testing Ising Models • product measures

  20. Testing Ising Models • Low temperature. How about high temperature?

  21. High Temperature Ising •

  22. Ising Model: Strong vs weak ties “low temperature regime” “high temperature regime” Exponential mixing of the Glauber dynamics

  23. Testing Ising Models •

  24. Concentration of Measure •

  25. Using Concentration to Test •

  26. Testing Weak vs Strong Network Ties e.g. Who listens to the Beatles? Q: Given one sample (from last.fm dataset) of who does/doesn’t listen to a particular band, can we reject the hypothesis that this decision comes from high-temperature Ising model (lack of long range correlation)? A: we can for Taylor Swift, Britney Spears, Katy Perry, Rihanna, Lady Gaga; we cannot for Beatles and Muse

  27. Conclusions • Testing properties of high-dimensional distributions requires exponentially many samples • Making assumptions about the distribution being sampled gives leverage • [w/ Pan COLT’17]: Testing Bayes nets with linearly many samples • [w/ Dikkala, Kamath SODA’18]: Testing Ising models with polynomially many samples • [w/ Dikkala, Kamath NIPS’17]: Testing weak vs strong ties from one sample

  28. Testing from a Single Sample • Given one social network, one brain, etc., how can we test the validity of a certain generative model? • Ongoing with Aliakbarpour-Rubinfeld-Zampetakis, testing preferential attachment models

  29. Testing Markov Chains • How to quantify distance between Markov chains? Thanks!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend