Testing Monotone Continuous Distributions on High-dimensional Real - - PowerPoint PPT Presentation

testing monotone continuous distributions on high
SMART_READER_LITE
LIVE PREVIEW

Testing Monotone Continuous Distributions on High-dimensional Real - - PowerPoint PPT Presentation

Introduction Monotone distributions Conclusions Testing Monotone Continuous Distributions on High-dimensional Real Cubes Michal Adamaszek DIMAP, University of Warwick Joint work with Artur Czumaj and Christian Sohler Michal Adamaszek


slide-1
SLIDE 1

Introduction Monotone distributions Conclusions

Testing Monotone Continuous Distributions on High-dimensional Real Cubes

Michal Adamaszek

DIMAP, University of Warwick

Joint work with Artur Czumaj and Christian Sohler

Michal Adamaszek Testing Monotone Continuous Distributions

slide-2
SLIDE 2

Introduction Monotone distributions Conclusions

Testing probability distributions

Test if a probability distribution has a given property P. Distribution is accessed by drawing random samples.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-3
SLIDE 3

Introduction Monotone distributions Conclusions

Testing probability distributions

Test if a probability distribution has a given property P. Distribution is accessed by drawing random samples. Goal: distinguish between

distributions with the property P, distributions which are far from P

Michal Adamaszek Testing Monotone Continuous Distributions

slide-4
SLIDE 4

Introduction Monotone distributions Conclusions

Testing probability distributions

Test if a probability distribution has a given property P. Distribution is accessed by drawing random samples. Goal: distinguish between

distributions with the property P, distributions which are far from P

minimizing the number of samples and with error probability ≤ 1/3.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-5
SLIDE 5

Introduction Monotone distributions Conclusions

Testing probability distributions

Test if a probability distribution has a given property P. Distribution is accessed by drawing random samples. Goal: distinguish between

distributions with the property P, distributions which are far from P

minimizing the number of samples and with error probability ≤ 1/3. Examples:

is the distribution uniform? is it equal to a fixed distr.? are two distributions identical? are they independent? estimate support size etc...

Michal Adamaszek Testing Monotone Continuous Distributions

slide-6
SLIDE 6

Introduction Monotone distributions Conclusions

Classical/typical results

Is a distribution on k points uniform ˜ O( √ k) samples. Are two distributions on k points close in L1-norm ˜ O(k2/3) samples. Is a distribution on {0, 1, . . . , k} close to monotone ˜ O( √ k) samples. Is a distribution on [k] × [k] a product of its marginals ˜ O(k) samples. Batu, Fischer, Fortnow, Kumar, Rubinfeld, Smith, White et al.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-7
SLIDE 7

Introduction Monotone distributions Conclusions

Infinite domains

Michal Adamaszek Testing Monotone Continuous Distributions

slide-8
SLIDE 8

Introduction Monotone distributions Conclusions

Infinite domains

Ω = [0, 1]n

Michal Adamaszek Testing Monotone Continuous Distributions

slide-9
SLIDE 9

Introduction Monotone distributions Conclusions

Infinite domains

Ω = [0, 1]n continuous distributions with density f so that Prf [A] =

  • A

f dµ.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-10
SLIDE 10

Introduction Monotone distributions Conclusions

Infinite domains

Ω = [0, 1]n continuous distributions with density f so that Prf [A] =

  • A

f dµ. distributions with atoms f +

  • piδxi

Michal Adamaszek Testing Monotone Continuous Distributions

slide-11
SLIDE 11

Introduction Monotone distributions Conclusions

Non-testable properties

Michal Adamaszek Testing Monotone Continuous Distributions

slide-12
SLIDE 12

Introduction Monotone distributions Conclusions

Non-testable properties

Is a distribution continuous, or purely discrete?

Michal Adamaszek Testing Monotone Continuous Distributions

slide-13
SLIDE 13

Introduction Monotone distributions Conclusions

Non-testable properties

Is a distribution continuous, or purely discrete? Is a continuous distribution uniform or is it ǫ-far from uniform in the L1 metric?

Michal Adamaszek Testing Monotone Continuous Distributions

slide-14
SLIDE 14

Introduction Monotone distributions Conclusions

Non-testable properties

Is a distribution continuous, or purely discrete? Is a continuous distribution uniform or is it ǫ-far from uniform in the L1 metric? “fatten” a discrete distribution on M random points, up to ∼ √ M draws this looks like a random distribution, but is very L1-far from uniform.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-15
SLIDE 15

Introduction Monotone distributions Conclusions

A testable property - discreteness on M points

For arbitrary Ω distinguish between f =

M

  • i=1

piδxi for some x1, . . . , xM, Prf [A] < 1 − ǫ for any set A ⊂ Ω of size M.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-16
SLIDE 16

Introduction Monotone distributions Conclusions

A testable property - discreteness on M points

Tester for discreteness on M points: Take 2M/ǫ random samples If there are ≤ M distinct values accept. If there are > M distinct values reject.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-17
SLIDE 17

Introduction Monotone distributions Conclusions

A testable property - discreteness on M points

Tester for discreteness on M points: Take 2M/ǫ random samples If there are ≤ M distinct values accept. If there are > M distinct values reject. Lower bound Ω(M1−o(1)) follows from bounds for estimating distribution support size (eg. Raskhodnikova et al’09, Valiant’08). Match these bounds?

Michal Adamaszek Testing Monotone Continuous Distributions

slide-18
SLIDE 18

Introduction Monotone distributions Conclusions

Monotone distributions and uniformity

Find a class of distributions for which being uniform is testable.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-19
SLIDE 19

Introduction Monotone distributions Conclusions

Monotone distributions and uniformity

Find a class of distributions for which being uniform is testable. Ω = [0, 1]n The density f is monotone if f (x) ≤ f (y) whenever xi ≤ yi for all i.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-20
SLIDE 20

Introduction Monotone distributions Conclusions

Monotone distributions and uniformity

Find a class of distributions for which being uniform is testable. Ω = [0, 1]n The density f is monotone if f (x) ≤ f (y) whenever xi ≤ yi for all i. Given a distribution with monotone density f , Is f the uniform distribution U? Or is it ǫ-far from U in the L1 metric d(f , g) = 1 2

|f − g|.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-21
SLIDE 21

Introduction Monotone distributions Conclusions

Discrete vs. continuous cubes

Rubinfeld, Servedio’05 Testing uniformity of monotone distributions on the boolean cube {0, 1}n with L1 distance Is possible with O(n log(n/ǫ)/ǫ2) samples. Requires Ω(n/ log2 n) samples.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-22
SLIDE 22

Introduction Monotone distributions Conclusions

Discrete vs. continuous cubes

Rubinfeld, Servedio’05 Testing uniformity of monotone distributions on the boolean cube {0, 1}n with L1 distance Is possible with O(n log(n/ǫ)/ǫ2) samples. Requires Ω(n/ log2 n) samples.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-23
SLIDE 23

Introduction Monotone distributions Conclusions

1/6 1/4 1/4 1/3

1/6 1/4 1/4 1/3

Michal Adamaszek Testing Monotone Continuous Distributions

slide-24
SLIDE 24

Introduction Monotone distributions Conclusions

1/6 1/4 1/4 1/3

1/6 1/4 1/4 1/3

lower bound → lower bound upper bound ← upper bound

Michal Adamaszek Testing Monotone Continuous Distributions

slide-25
SLIDE 25

Introduction Monotone distributions Conclusions

Discrete vs. continuous cubes

Rubinfeld, Servedio’05 Testing uniformity of monotone distributions on the boolean cube {0, 1}n with L1 distance Is possible with O(n log(n/ǫ)/ǫ2) samples. Requires Ω(n/ log2 n) samples.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-26
SLIDE 26

Introduction Monotone distributions Conclusions

Discrete vs. continuous cubes

Rubinfeld, Servedio’05 Testing uniformity of monotone distributions on the boolean cube {0, 1}n with L1 distance Is possible with O(n log(n/ǫ)/ǫ2) samples. Requires Ω(n/ log2 n) samples. Our result Testing uniformity of monotone distributions on the real cube [0, 1]n with L1 distance Is possible with O(n/ǫ2) samples.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-27
SLIDE 27

Introduction Monotone distributions Conclusions

Tester

Idea: estimate x1 = x1 + x2 + . . . + xn.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-28
SLIDE 28

Introduction Monotone distributions Conclusions

Tester

Idea: estimate x1 = x1 + x2 + . . . + xn. If U is the uniform distribution then EU[x1] = n 2.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-29
SLIDE 29

Introduction Monotone distributions Conclusions

Tester

Idea: estimate x1 = x1 + x2 + . . . + xn. If U is the uniform distribution then EU[x1] = n 2. Theorem If f is a monotone distribution, ǫ-far from uniform then Ef [x1] ≥ n 2 + ǫ 2.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-30
SLIDE 30

Introduction Monotone distributions Conclusions

Tester

Ω = [0, 1]n, f - unknown monotone distribution. Draw C samples x1, . . . , xC, ˜ E = 1 C

  • xi1.

If ˜ E > n

2 + ǫ 4 say ǫ far from uniform.

If ˜ E ≤ n

2 + ǫ 4 say uniform.

C = 40n/ǫ2 is good (use Feige’s inequality).

Michal Adamaszek Testing Monotone Continuous Distributions

slide-31
SLIDE 31

Introduction Monotone distributions Conclusions

A word on the proof

Ef [x1] ≥ n 2 + ǫ 2.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-32
SLIDE 32

Introduction Monotone distributions Conclusions

A word on the proof

Ef [x1] ≥ n 2 + ǫ 2.

  • r

x1g(x)dx ≥ 1 4

|g(x)|dx for

  • Ω g(x)dx = 0, g : [0, 1]n → R - monotone.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-33
SLIDE 33

Introduction Monotone distributions Conclusions

A word on the proof

Ef [x1] ≥ n 2 + ǫ 2.

  • r

x1g(x)dx ≥ 1 4

|g(x)|dx for

  • Ω g(x)dx = 0, g : [0, 1]n → R - monotone.

1 t · g(t)dt = 1 4

  • t,s

|g(t) − g(s)|dsdt − 1 2 1 g(t)dt

Michal Adamaszek Testing Monotone Continuous Distributions

slide-34
SLIDE 34

Introduction Monotone distributions Conclusions

A word on the proof

Ef [x1] ≥ n 2 + ǫ 2.

  • r

x1g(x)dx ≥ 1 4

|g(x)|dx for

  • Ω g(x)dx = 0, g : [0, 1]n → R - monotone.

1 t · g(t)dt = 1 4

  • t,s

|g(t) − g(s)|dsdt − 1 2 1 g(t)dt If g is a function defined on the vertices of a boolean cube

  • diagonals

|g(u) − g(v)| ≤

  • edges

|g(u) − g(v)|.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-35
SLIDE 35

Introduction Monotone distributions Conclusions

Conclusions

We can test if a monotone distribution on [0, 1]n is uniform.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-36
SLIDE 36

Introduction Monotone distributions Conclusions

Conclusions

We can test if a monotone distribution on [0, 1]n is uniform. Same for monotone distributions on {0, 1, . . . , k}n.

Michal Adamaszek Testing Monotone Continuous Distributions

slide-37
SLIDE 37

Introduction Monotone distributions Conclusions

Conclusions

We can test if a monotone distribution on [0, 1]n is uniform. Same for monotone distributions on {0, 1, . . . , k}n. Other testable classes of distributions?

Michal Adamaszek Testing Monotone Continuous Distributions

slide-38
SLIDE 38

Introduction Monotone distributions Conclusions

Conclusions

We can test if a monotone distribution on [0, 1]n is uniform. Same for monotone distributions on {0, 1, . . . , k}n. Other testable classes of distributions? Other closeness measures instead of L1? Earth-mover distance? (Ba, Nguyen, Nguyen, Rubinfeld ’09)

Michal Adamaszek Testing Monotone Continuous Distributions