Sample Complexity and Expressiveness Roi Livni and Yishay Mansour - - PowerPoint PPT Presentation

β–Ά
sample complexity and
SMART_READER_LITE
LIVE PREVIEW

Sample Complexity and Expressiveness Roi Livni and Yishay Mansour - - PowerPoint PPT Presentation

Graph Based- Discriminators Sample Complexity and Expressiveness Roi Livni and Yishay Mansour Discrimination A discriminator is provided with two data sets. 1 1 2 2 Decide if 1 and 2 are


slide-1
SLIDE 1

Graph Based- Discriminators Sample Complexity and Expressiveness

Roi Livni and Yishay Mansour

slide-2
SLIDE 2

Discrimination

  • A discriminator is provided with

two data sets.

  • 𝑇1 ∼ 𝑄

1

  • 𝑇2 ∼ 𝑄2
  • Decide if 𝑄

1 and 𝑄2 are different.

  • If not, provide a certificate.
slide-3
SLIDE 3

Motivation: Synthetic Data Generation

Goodfellow et al.’14

https://thispersondoesnotexist.com/

slide-4
SLIDE 4

Discrimination: Learning Lens

  • A learner is defined by a class 𝐼 βŠ† 0,1 π‘Œ
  • Given labelled sample from some distribution 𝑄 over π‘Œ Γ— 0,1
  • Learner returns β„Ž ∈ 𝐼 such that

𝑄(𝑦,𝑧) β„Ž 𝑦 β‰  𝑧 ≀ min

β„ŽβˆˆπΌ 𝑄(𝑦,𝑧) β„Ž 𝑦 β‰  𝑧 + πœ—

  • If sup

β„ŽβˆˆπΌ

πΉπ‘¦βˆΌπ‘„1 β„Ž 𝑦 βˆ’ πΉπ‘¦βˆΌπ‘„2 β„Ž 𝑦 > πœ—

  • Learner succeeds.
slide-5
SLIDE 5

Learning as a discrimination task

  • Discriminator is defined by a class of distinguishers 𝐼 βŠ† 0,1 π‘Œ

Integral Probability Metric: 𝐽𝑄𝑁𝐼 𝑄

1, 𝑄2 = sup β„ŽβˆˆπΌ

|πΉπ‘¦βˆΌπ‘„1 β„Ž 𝑦 βˆ’ πΉπ‘¦βˆΌπ‘„2 β„Ž 𝑦 |

  • If 𝐽𝑄𝑁𝐼 𝑄

1, 𝑄2 > πœ— -- return β„Ž ∈ 𝐼 with 𝐽𝑄𝑁𝐼 𝑄 1, 𝑄2 > πœ—/2

  • If not, may fail. (return EQUIVALENT).

(Muller’97)

slide-6
SLIDE 6

Higher order discrimination

  • Instead of considering hypotheses classes, what if we take other types
  • f statistical tests:
  • Example: Collision test
  • Estimate probability to draw the same point twice. If different–

declare distinct.

  • If not, may fail (return equivalent).
slide-7
SLIDE 7

Higher order discrimination

  • Instead of considering hypotheses classes, what if we take other types
  • f distinguishers:
  • More generally: Take a family G = {𝑕: 𝑕: π‘Œ2 β†’ 0,1 }

𝐽𝑄𝑁𝐻 𝑄

1, 𝑄2 = sup π‘•βˆˆπ»

𝐹 𝑦1,𝑦2)βˆΌπ‘„1

2 𝑕 𝑦1, 𝑦2

βˆ’ 𝐹 𝑦1,𝑦2)βˆΌπ‘„2

2 𝑕 𝑦1, 𝑦2

  • Are graph-based distinguishers stronger than classical distinguishers?
  • Sample Complexity?
slide-8
SLIDE 8

Expressive power of graph-based discriminators

THEOREM: Let X be an infinite domain. There exists a graph g such that: For every hypothesis class H with finite VC dimension and πœ— > 0, there are two distributions 𝑄

π‘‘π‘§π‘œ, π‘„π‘ π‘“π‘π‘š such that

𝐽𝑄𝑁𝐼 π‘žπ‘‘π‘§π‘œ, π‘žπ‘ π‘“π‘π‘š < πœ— and, 𝐹(𝑦1,𝑦2)βˆΌπ‘žπ‘‘π‘§π‘œ

2

[𝑕 𝑦1,𝑦2 )] βˆ’ 𝐹(𝑦1,𝑦2)βˆΌπ‘žπ‘ π‘“π‘π‘š

2

[𝑕(𝑦1, 𝑦2)] > 1 4 (L, Mansour’19)

slide-9
SLIDE 9

Finite Version

  • If |X|=N, there is a graph g such that for every class H there are two

distributions that are H-indistinguishable, g-distinguishable unless:

β—‹

π‘Šπ· 𝐼 = Ξ©(πœ—2 log 𝑂) (L, Mansour’19)

  • Optimal: For every graph-based class G with finite capacity there is a

hypothesis class H with VC dimension 𝑃(πœ—2 log 𝑂) such that

𝐽𝑄𝑁𝐷 π‘žπ‘‘π‘§π‘œ, pπ‘ π‘“π‘π‘š > 1

4 β‡’ 𝐽𝑄𝑁𝐻 π‘žπ‘‘π‘§π‘œ, pπ‘ π‘“π‘π‘š > πœ—

(Alon, L, Mansour)

β—‹

Given a graph g how many sets are needed to separate every dense set from every sparse set?

slide-10
SLIDE 10

Sample complexity of graph-based discriminators

  • For a family of graph G.
  • Given samples from two unknown distributions 𝑄

1, 𝑄2: Decide if

  • How many examples are needed?
  • Recall:

β—‹

For an hypothesis class, a discriminator can decide if 𝐽𝑄𝑁𝐼 𝑄

1, 𝑄2 > πœ—, if and

  • nly if H has finite VC dimension.

β—‹

Θ π‘Šπ· 𝐼 /πœ—2 are needed

𝐽𝑄𝑁𝐻 𝑄

1, 𝑄2 > πœ—

slide-11
SLIDE 11

The graph-VC dimension

  • The graph VC dimension is obtained by considering the projections of the

graph by fixing a vertex. Namely, for every x consider the hypothesis class

𝐼𝑦 = 𝑕 𝑦,β‹… : π‘Œ β†’ 0,1 : 𝑕 ∈ 𝐻

  • Then: π‘•π‘Šπ· 𝐷 = sup

π‘¦βˆˆπ‘Œ

π‘Šπ·(𝐼𝑦)

  • 𝑃(π‘•π‘Šπ· 𝐷 ) are sufficient.
  • Ξ©( π‘•π‘Šπ· 𝐷 ) are necessary.

(L, Mansour’19)