cumulated effects in learning
play

Cumulated Effects in Learning rik Martin-Dorel Sergei Soloviev - PowerPoint PPT Presentation

Cumulated Effects in Learning rik Martin-Dorel Sergei Soloviev ACADIE team IRIT laboratory Universit de Toulouse 30 March 2018 1 / 18 rik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning Random Boolean games Cumulative effects


  1. Cumulated Effects in Learning Érik Martin-Dorel Sergei Soloviev ACADIE team IRIT laboratory Université de Toulouse 30 March 2018 1 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  2. Random Boolean games Cumulative effects in learning Positioning of this talk Starting point: a game theory result, formally verified in Coq Methodology: probabilistic analysis of entire classes of games Objective: discuss implications of this result, possible generalizations and connection with learning 2 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  3. Random Boolean games Cumulative effects in learning Outline A Coq theory of bool. games with random formulas as payoff functions 1 Discussion on modeling cumulative effects in learning 2 3 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  4. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  5. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  6. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  7. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits � Game represented by a Boolean function F : 2 k × 2 n − k → 2 � �� � ≃ 2 n 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  8. Random Boolean games Cumulative effects in learning Boolean games A lot of literature (since 2001) Particular case of win/lose games (only 2 outcomes): Strategies of players are vectors of bits. In a 2-player setting: Alice controls k bits Bob controls n − k bits � Game represented by a Boolean function F : 2 k × 2 n − k → 2 � �� � ≃ 2 n Alice wins (with strat. a ) against Bob (with strat. b ) iff F ( a, b ) = 1 4 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  9. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  10. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  11. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  12. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) The size of the considered class (number of n -var. Boolean functions) grows very fast ( 2 2 n ) � exhaustive computation does not scale 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  13. Random Boolean games Cumulative effects in learning Methodology As part of our project FAGames (Formal analysis of games using ITP): Use probability to explore entire classes of games Assume the type of a game is known, but not its parameters in advance Randomly pick a game in the considered class Estimate the probability of various situations (A has a winning strategy, B has a winning strategy, no player has a winning strategy. . . ) The size of the considered class (number of n -var. Boolean functions) grows very fast ( 2 2 n ) � exhaustive computation does not scale Derive symbolic results (that can then be numerically evaluated) 5 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  14. Random Boolean games Cumulative effects in learning Overview of the library Coq library “ RandBoolGames ” Based on SSReflect/MathComp ( fintype , finfun , finset , bigop ) as well as on the infotheo library [Affeldt et al.] 3.1k lines of Coq code – automation: introduce a new tactic “ under ” https://sourcesup.renater.fr/coq-bool-games/ 6 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  15. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  16. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  17. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  18. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  19. Random Boolean games Cumulative effects in learning Probability setting Focus on Ω := 2 2 n and S := P (Ω) = 2 2 2 n Which probability distribution? Prior choice: assign proba. to bool. functions or to bool. formulas? First step: consider any P : S → [0 , 1] 7 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  20. Random Boolean games Cumulative effects in learning First result Theorem ( Pr_ex_winA ) For any finite probability space (Ω , S , P ) , the probability that there exists some strategy a = ( a 1 , . . . , a k ) of A that is winning satisfies: � � � 2 k � � ( − 1) m − 1 P ( ∃ a. win A ( a )) = P W a , m =1 a ∈ J J ⊆ 2 k Card J = m denoting for any a ∈ 2 k , win A ( a ) := ∀ b ∈ 2 n − k . F ( a, b ) = 1 ω a := { v ∈ 2 n | v 1 = a 1 ∧ · · · ∧ v k = a k } ∈ Ω W a := { ω ∈ Ω | ω a ⊆ ω } ∈ S . 8 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  21. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  22. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: we consider all vectors v ∈ 2 n for each v , we decide with proba. p if it belongs to the truth-set of F � 2 n independent Bernoulli trials with parameter p 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  23. Random Boolean games Cumulative effects in learning Specialization of the probability setting We define P n ; p as a finite Bernoulli process: we construct bool. functions F from the truth-set F − 1 ( { 1 } ) as follows: we consider all vectors v ∈ 2 n for each v , we decide with proba. p if it belongs to the truth-set of F � 2 n independent Bernoulli trials with parameter p Remark This setting subsumes the simpler case where all functions have the same elementary probability (just take p = 1 2 ) 9 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

  24. Random Boolean games Cumulative effects in learning Results I Theorem ( Pr_ex_winA_Bern ) For all p ∈ [0 , 1] and for all integers n , k , the probability that player A has a winning strategy is: 1 − p 2 n − k � 2 k � P n ; p ( ∃ a ∈ 2 k . win A ( a )) = 1 − . 10 / 18 Érik Martin-Dorel (IRIT/UPS) Cumulated Effects in Learning

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend