SLIDE 1

Lower Bounds for Sampling

Peter Bartlett, CS and Statistics, UC Berkeley. EPFL Open Problem Session, July 2020.

SLIDE 2

How hard is sampling?

Problem:

Given oracle access to a potential f : ℝ^d → ℝ (e.g., x ↦ (f(x), ∇f(x))), generate samples from p∗(x) ∝ exp(−f(x)).
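The slides leave the potential abstract. As a concrete illustration (the Gaussian choice and the function names are assumptions, not from the talk), a minimal Python sketch of such an oracle for f(x) = ‖x‖²/2, whose target p∗ is the standard normal:

```python
import numpy as np

def f(x):
    # Smooth, strongly convex potential: the Gaussian case f(x) = ||x||^2 / 2,
    # so the target p*(x) ∝ exp(-f(x)) is the standard normal N(0, I_d).
    return 0.5 * np.dot(x, x)

def grad_f(x):
    # Gradient of the potential; for this f it is simply x.
    return x

def first_order_oracle(x):
    # Oracle access as in the problem statement: x ↦ (f(x), ∇f(x)).
    return f(x), grad_f(x)
```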

SLIDE 3

Positive results

(Dalalyan, 2014)

For smooth, strongly convex f, after n = Ω(d/ε²) gradient queries, overdamped Langevin MCMC has ‖p_n − p∗‖_TV ≤ ε.

There are results of this flavor for stochastic gradient Langevin algorithms, underdamped Langevin algorithms, Metropolis-adjusted algorithms, nonconvex f, and so on. Lower bounds?
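For reference, a minimal sketch of the overdamped Langevin iteration behind such results, x_{k+1} = x_k − η∇f(x_k) + √(2η) ξ_k with ξ_k ∼ N(0, I_d); the step size η and the grad_f interface (from the hypothetical sketch above) are illustrative choices:

```python
import numpy as np

def langevin_mcmc(grad_f, d, n, eta, rng=np.random.default_rng(0)):
    """Overdamped (unadjusted) Langevin MCMC sketch.

    Iterates x_{k+1} = x_k - eta * grad_f(x_k) + sqrt(2 * eta) * xi_k
    with xi_k ~ N(0, I_d): the Euler discretization of the Langevin
    diffusion whose stationary distribution is p*(x) ∝ exp(-f(x)).
    """
    x = np.zeros(d)
    samples = []
    for _ in range(n):
        x = x - eta * grad_f(x) + np.sqrt(2 * eta) * rng.standard_normal(d)
        samples.append(x.copy())
    return np.array(samples)

# Example: approximate samples from N(0, I_10) via f(x) = ||x||^2 / 2.
samples = langevin_mcmc(lambda x: x, d=10, n=5000, eta=0.05)
```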

SLIDE 4

Lower bound with a noisy gradient oracle

arXiv:2002.00291

Problem:

Generate samples from the density p∗(x) ∝ exp(−f(x)) on ℝ^d, with f smooth and strongly convex.

Joint work with Niladri Chatterji and Phil Long.

Information protocol

• Algorithm A is given access to a stochastic gradient oracle Q.
• When the oracle is queried at a point y, it returns z = ∇f(y) + ξ, where ξ is unbiased noise, independent of the query point y, with E‖ξ‖² ≤ dσ² (one such oracle is sketched below).
• The algorithm A is allowed to make n adaptive queries to the oracle.
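A minimal realization of such an oracle Q, assuming Gaussian noise (the protocol only requires unbiased noise meeting the variance budget; the function names are illustrative):

```python
import numpy as np

def noisy_gradient_oracle(grad_f, sigma, rng=np.random.default_rng(0)):
    """Stochastic gradient oracle: on query y, return z = ∇f(y) + ξ.

    Here ξ ~ N(0, sigma^2 I_d): unbiased, independent of the query
    point y, and E||ξ||^2 = d * sigma^2, matching the variance budget.
    """
    def query(y):
        xi = sigma * rng.standard_normal(y.shape)
        return grad_f(y) + xi
    return query
```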

SLIDE 5

An information-theoretic lower bound

Theorem

For all d, σ², n ≥ σ²d/4, and for all α ≤ σ²d/(256n),

inf_A sup_Q sup_{p∗} ‖Alg[n; Q] − p∗‖_TV = Ω(σ √(d/n)),

where the p∗ supremum is over α-log-smooth, α/2-strongly log-concave distributions over ℝ^d.

Hence, if α is constant and n = O(σ²d), then the worst-case total variation distance is larger than a constant. For α, σ constant, this matches upper bounds for stochastic gradient Langevin (Durmus, Majewski and Miasojedow, 2019).

SLIDE 6

Proof idea

• Restrict to a finite parametric class (Gaussian) and a stochastic oracle that adds Gaussian noise.
• As in a classical comparison of statistical experiments, relate the minimax TV distance to the difference in risk between two estimators: one that sees the algorithm's samples and one that sees the true distribution.
• Use Le Cam's method: relate estimation to testing (a toy two-point illustration follows).
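A toy version of the two-point (Le Cam) step, with illustrative constants and a construction not claimed to be the paper's: two Gaussian targets whose noisy-gradient transcripts carry O(1) total information after n queries, so even the likelihood-ratio test errs with constant probability, and any sampler must be far in TV from one of the targets.

```python
import numpy as np

# Two Gaussian targets p_i* = N(theta_i, I_d) with potentials
# f_i(x) = ||x - theta_i||^2 / 2, so a noisy gradient query at y returns
# (y - theta_i) + N(0, sigma^2 I_d). Each query contributes
# ||theta_1 - theta_0||^2 / (2 sigma^2) in KL, so at separation
# ~ sigma / sqrt(n) the n-query transcripts are nearly indistinguishable.
rng = np.random.default_rng(0)
d, n, sigma = 10, 100, 1.0
delta = sigma / np.sqrt(n)               # separation at the testing threshold
theta0 = np.zeros(d)
theta1 = np.full(d, delta / np.sqrt(d))  # ||theta1 - theta0|| = delta

def test_error(trials=2000):
    errors = 0
    for t in range(trials):
        theta = theta0 if t % 2 == 0 else theta1   # ground truth alternates
        ys = rng.standard_normal((n, d))           # arbitrary query points
        zs = ys - theta + sigma * rng.standard_normal((n, d))  # oracle replies
        # Likelihood-ratio test between the two transcript distributions.
        ll0 = -np.sum((zs - (ys - theta0)) ** 2) / (2 * sigma**2)
        ll1 = -np.sum((zs - (ys - theta1)) ** 2) / (2 * sigma**2)
        guess = theta0 if ll0 >= ll1 else theta1
        errors += not np.array_equal(guess, theta)
    return errors / trials

print(test_error())   # ≈ 0.3: bounded away from 0 despite n = 100 queries
```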

SLIDE 7

Open questions

• What if the noise has additional structure? For example, what if the potential function is sum-decomposable and the oracle returns a gradient over a mini-batch of component functions? (A sketch of such an oracle appears below.)
• Lower bounds for sampling with oracle access to the exact gradients?

Some lower bounds for related problems:

• Luis Rademacher and Santosh Vempala. Dispersion of mass and the complexity of randomized geometric algorithms. 2008.
• Rong Ge, Holden Lee, and Jianfeng Lu. Estimating normalizing constants for log-concave distributions: Algorithms and lower bounds. 2019.
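To make the mini-batch question concrete, a hypothetical oracle for a sum-decomposable potential f(x) = (1/m) Σᵢ fᵢ(x); the interface and batching scheme are assumptions for illustration:

```python
import numpy as np

def minibatch_oracle(grad_fs, batch_size, rng=np.random.default_rng(0)):
    """Mini-batch gradient oracle for f(x) = (1/m) * sum_i f_i(x).

    Each query averages ∇f_i over a uniformly random mini-batch. The
    resulting noise is unbiased but structured: its distribution is tied
    to the f_i, unlike the worst-case additive noise in the lower bound.
    """
    m = len(grad_fs)
    def query(y):
        batch = rng.choice(m, size=batch_size, replace=False)
        return np.mean([grad_fs[i](y) for i in batch], axis=0)
    return query
```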
