rank of tensors of l out of k functions an application in
play

Rank of tensors of l-out-of-k functions: an application in - PowerPoint PPT Presentation

Rank of tensors of l-out-of-k functions: an application in probabilistic inference Ji r Vomlel Institute of Information Theory and Automation ( UTIA) Academy of Sciences of the Czech Republic Contents The computer game of


  1. Rank of tensors of l-out-of-k functions: an application in probabilistic inference Jiˇ r´ ı Vomlel Institute of Information Theory and Automation (´ UTIA) Academy of Sciences of the Czech Republic

  2. Contents • The computer game of Minesweeper

  3. Contents • The computer game of Minesweeper • Probabilistic reasoning given evidence (using a simple example)

  4. Contents • The computer game of Minesweeper • Probabilistic reasoning given evidence (using a simple example) • Improving the computational efficiency

  5. Contents • The computer game of Minesweeper • Probabilistic reasoning given evidence (using a simple example) • Improving the computational efficiency • Rank-one decomposition of probability tables representing addition

  6. Contents • The computer game of Minesweeper • Probabilistic reasoning given evidence (using a simple example) • Improving the computational efficiency • Rank-one decomposition of probability tables representing addition • Results of experiments

  7. The game of Minesweeper

  8. Bayesian network for the game of Minesweeper ℓ ? ? ?

  9. Bayesian network for the game of Minesweeper ℓ ? Y X 1 ? ? X 3 X 2

  10. Bayesian network for the game of Minesweeper ℓ ? Y X 1 ? ? X 3 X 2 � 1 if ℓ = x 1 + x 2 + x 3 P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) = 0 otherwise.

  11. Bayesian network for the game of Minesweeper ℓ ? Y X 1 ? ? X 3 X 2 � 1 if ℓ = x 1 + x 2 + x 3 P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) = 0 otherwise. r P ( X i ) = s · t − o r is the number of mines, o is the number of observations s , t are the dimensions of the game grid.

  12. Bayes rule for updating probabilities • Assume we observe Y = ℓ .

  13. Bayes rule for updating probabilities • Assume we observe Y = ℓ . • We compute by Bayes rule P ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 | Y = ℓ )

  14. Bayes rule for updating probabilities • Assume we observe Y = ℓ . • We compute by Bayes rule P ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 | Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) · � 3 i = 1 P ( X i = x i ) = P ( Y = ℓ )

  15. Bayes rule for updating probabilities • Assume we observe Y = ℓ . • We compute by Bayes rule P ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 | Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) · � 3 i = 1 P ( X i = x i ) = P ( Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) ∝

  16. Bayes rule for updating probabilities • Assume we observe Y = ℓ . • We compute by Bayes rule P ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 | Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) · � 3 i = 1 P ( X i = x i ) = P ( Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) ∝ • This is a probability table over 3 binary variables X 1 , X 2 , X 3 : P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) � 1 if x 1 + x 2 + x 3 = ℓ = 0 otherwise.

  17. Bayes rule for updating probabilities • Assume we observe Y = ℓ . • We compute by Bayes rule P ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 | Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) · � 3 i = 1 P ( X i = x i ) = P ( Y = ℓ ) P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) ∝ • This is a probability table over 3 binary variables X 1 , X 2 , X 3 : P ( Y = ℓ | X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) � 1 if x 1 + x 2 + x 3 = ℓ = 0 otherwise. ψ ( X 1 = x 1 , X 2 = x 2 , X 3 = x 3 ) . =

  18. Tensors of ℓ -out-of- k functions We can visualize probability table ψ as a tensor (for ℓ = 1): � 0 � 1   � � 1 0   � 1 � 0    � �    0 0 In this talk all tensors are functions from { 0, 1 } k to real numbers.

  19. Tensors of ℓ -out-of- k functions We can visualize probability table ψ as a tensor (for ℓ = 1): � 0 � 1   � � 1 0   � 1 � 0    � �    0 0 In this talk all tensors are functions from { 0, 1 } k to real numbers. We are interested in tensors of ℓ -out-of- k functions f ℓ ( x 1 , . . . , x k ) , where: • ℓ is the observed state of Y and • k is the number of binary variables - parents of Y .

  20. Tensors of ℓ -out-of- k functions We can visualize probability table ψ as a tensor (for ℓ = 1): � 0 � 1   � � 1 0   � 1 � 0    � �    0 0 In this talk all tensors are functions from { 0, 1 } k to real numbers. We are interested in tensors of ℓ -out-of- k functions f ℓ ( x 1 , . . . , x k ) , where: • ℓ is the observed state of Y and • k is the number of binary variables - parents of Y . if ℓ = � k � 1 i = 1 x i f ℓ ( x 1 , . . . , x k ) = 0 otherwise.

  21. Tensors of ℓ -out-of- k functions We can visualize probability table ψ as a tensor (for ℓ = 1): � 0 � 1   � � 1 0   � 1 � 0    � �    0 0 In this talk all tensors are functions from { 0, 1 } k to real numbers. We are interested in tensors of ℓ -out-of- k functions f ℓ ( x 1 , . . . , x k ) , where: • ℓ is the observed state of Y and • k is the number of binary variables - parents of Y . if ℓ = � k � 1 i = 1 x i f ℓ ( x 1 , . . . , x k ) = 0 otherwise. In our example ℓ = 1 and k = 3.

  22. Combining information 0 ? 1 ? ? ? ? ?

  23. Combining information Y 1 X 1 Y 2 X 6 X 3 X 2 X 4 X 5

  24. Combining information X 1 X 6 X 3 X 2 X 4 X 5 ξ ( X 1 , . . . , X 6 ) = ψ ( X 1 , . . . , X 3 ) · ϕ ( X 1 , X 2 , X 4 , . . . , X 6 )

  25. Combining information X 1 X 6 X 3 X 2 X 4 X 5 ξ ( X 1 , . . . , X 6 ) = ψ ( X 1 , . . . , X 3 ) · ϕ ( X 1 , X 2 , X 4 , . . . , X 6 ) Total table size is 2 3 + 2 5 = 8 + 32 = 40.

  26. A more efficient way of combining information X 1 X 6 X 3 X 2 X 4 X 5 ξ ( X 1 , . . . , X 6 ) = ψ 1 ( X 1 ) · . . . · ψ 3 ( X 3 ) · ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 )

  27. A more efficient way of combining information X 1 X 6 X 3 X 2 X 4 X 5 ξ ( X 1 , . . . , X 6 ) = ψ 1 ( X 1 ) · . . . · ψ 3 ( X 3 ) · ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) Total table size is 3 · 2 + 2 5 = 6 + 32 = 38.

  28. An even more efficient way of combining information X 1 B 2 X 6 X 3 X 2 X 4 X 5 � ξ ( X 1 , . . . , X 6 ) = ψ 1 ( X 1 ) · . . . · ψ 3 ( X 3 ) B 2 · ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 )

  29. An even more efficient way of combining information X 1 B 2 X 6 X 3 X 2 X 4 X 5 � ξ ( X 1 , . . . , X 6 ) = ψ 1 ( X 1 ) · . . . · ψ 3 ( X 3 ) B 2 · ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 ) Since B is binary the total table size is 3 · 2 + 5 · 2 2 = 6 + 20 = 26.

  30. Tensor rank We have just seen that ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) � = ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 ) . B 2

  31. Tensor rank We have just seen that ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) � = ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 ) . B 2 But there is no way we can write ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) = ϕ 1 ( X 1 ) · ϕ 2 ( X 2 ) · ϕ 4 ( X 4 ) . . . ϕ 6 ( X 6 )

  32. Tensor rank We have just seen that ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) � = ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 ) . B 2 But there is no way we can write ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) = ϕ 1 ( X 1 ) · ϕ 2 ( X 2 ) · ϕ 4 ( X 4 ) . . . ϕ 6 ( X 6 ) What is the minimal number of states of a variable B so that it holds that k � � ψ ( X 1 , . . . , X k ) = ψ i ( B , X i ) ? B i = 1

  33. Tensor rank We have just seen that ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) � = ϕ 1 ( B 2 , X 1 ) · ϕ 2 ( B 2 , X 2 ) · ϕ 4 ( B 2 , X 4 ) . . . ϕ 6 ( B 2 , X 6 ) . B 2 But there is no way we can write ϕ 1 ( X 1 , X 2 , X 4 , . . . , X 6 ) = ϕ 1 ( X 1 ) · ϕ 2 ( X 2 ) · ϕ 4 ( X 4 ) . . . ϕ 6 ( X 6 ) What is the minimal number of states of a variable B so that it holds that k � � ψ ( X 1 , . . . , X k ) = ψ i ( B , X i ) ? B i = 1 This number is called the rank of tensor ψ .

  34. Symmetric rank of tensors of ℓ -out-of- k functions • Generally, finding the rank of a tensor is NP-hard.

  35. Symmetric rank of tensors of ℓ -out-of- k functions • Generally, finding the rank of a tensor is NP-hard. • However, tensors of ℓ -out-of- k functions define a restricted class of tensors.

  36. Symmetric rank of tensors of ℓ -out-of- k functions • Generally, finding the rank of a tensor is NP-hard. • However, tensors of ℓ -out-of- k functions define a restricted class of tensors. • These tensors are all symmetric. A tensor ψ is symmetric if ψ ( X 1 = x 1 , . . . , X k = x k ) = a x 1 + ... + x k where a = ( a 0 , . . . , a k ) is a vector of real numbers.

  37. Symmetric rank of tensors of ℓ -out-of- k functions • Generally, finding the rank of a tensor is NP-hard. • However, tensors of ℓ -out-of- k functions define a restricted class of tensors. • These tensors are all symmetric. A tensor ψ is symmetric if ψ ( X 1 = x 1 , . . . , X k = x k ) = a x 1 + ... + x k where a = ( a 0 , . . . , a k ) is a vector of real numbers. • The symmetric rank of tensor ψ is the minimum number of symmetric tensors of rank one that sum up to ψ .

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend