Matrix Factorizations
- ver
Non-Conventional Algebras for Data Mining
Pauli Miettinen 28 April 2015
Matrix Factorizations over Non-Conventional Algebras for Data - - PowerPoint PPT Presentation
Matrix Factorizations over Non-Conventional Algebras for Data Mining Pauli Miettinen 28 April 2015 Chapter 1. A Bit of Background Data long-haired well-known male Data ( )
Matrix Factorizations
Non-Conventional Algebras for Data Mining
Pauli Miettinen 28 April 2015
Chapter 1. A Bit of Background
Data
✔ ✔ ✘ ✔ ✔ ✔ ✘ ✔ ✔ long-haired well-known male
Data
long-haired well-known male 1 1 1 1 1 1 1
Factorization point of view
1 1 1 1 1 1 1
1 1 1 1
1 1 1 1
= × ○
Chapter 2. Boolean Matrix Factorization
Gian-Carlo Rota Foreword to Boolean matrix theory and applications by K. H. Kim, 1982
In the sleepy days when the provinces of France were still quietly provincial, matrices with Boolean entries were a favored occupation of aging professors at the universities of Bordeaux and Clermont-Ferrand. But one day…
Boolean products and factorizations
matrices A and B is their matrix product under the Boolean semi-ring
matrix A expresses it as a Boolean product of two binary factor matrices B and C, that is, A = B◦C
(A B)j =
Wk
=1 kbkj
Matrix ranks
rank-1 matrices whose sum is A
Comparison of ranks
⇒ Boolean factorization can achieve less
error than SVD
than the non-negative rank
1 1 1 1 1 1 1
The many names of Boolean rank
complexity)
Johnson GT18)
1 2 3
Boolean rank and bicliques
1 1 1 1
1 2 3 A B C 1 1 1 1 1 1 1
1 2 3 A B C 1 1 1 1
A B C
Boolean rank and sets
matrix A is the least number of subsets of U(A) needed to cover every set of the induced collection C(A)
the collection of subsets, have subcollection SC such that
1 3 2
S
S∈SC S = C
Approximate factorizations
rank
distance
and |A – B◦C| is minimized
The many applications of Boolean factorizations
The bad news
[Chalermsook et al. ’14]
Some algorithms
Chapter 3. Dioids Are Not Droids
Intuition of matrix multiplication
A and column j of B
Intuition of matrix multiplication
with the l-th row of B
Remember at least this slide
matrix as a sum of rank-1 matrices
matrix as an aggregate of simple matrices
depends on the algebra
Dioids are not droids
S = (A, ⊕, ⊗, ⓪, ①)
Some examples (1)
isomorphic to Bn
expresses matrix A as A ≈ B⊗BC where all matrices are Boolean
Some examples (2)
implies exact k-decomposition under Boolean algebra
Fuzzy example
B @
1 1 1 1 1 1 1 1 1 1 1
1 C A ≈ B @
1 1 1 1 1
1 C A ⊗F Å1
1 1 1 2/3 1
ã
=
B @
1 1 1 1 2/3 1 1 2/3 1 1 2/3 1
1 C A
Some examples (3)
values [Bělohlávek & Krmelova ’13]
Some examples (4)
M = (ℝ≥0, max, ×, 0, 1)
T = (ℝ∪{–∞}, max, +, –∞, 0)
Why max-times?
matters (a.k.a. the winner takes it all)
combination of movie’s features
most-liked feature
Max-times example
B @
1 1 1 1 1 1 1 1 1 1 1
1 C A ≈ B @
1 1 1 2/3 1
1 C A ⊗M Å1
1 1 1 2/3 1
ã
=
B @
1 1 1 1 2/3 1 2/3 4/9 2/3 1 2/3 1
1 C A
On max-times algebra
(but not fuzzy logic)
On tropical algebras
but needs care with the errors
in max-times, where kX e Xk α kX0 › X0k M2α M = exp(mx,j{Xj, e Xj})
More max-plus
f(x) = fT⊗x = max{fi+xi}
X⊗v = λ⊗v (maxj{xij + vj} = λ + vi for all i)
Computational complexity
implies exact k-factorization over B, then finding the K-rank of a matrix is NP-hard (even to approximate)
finite matrices
Anti-negativity and sparsity
element has additive inverse
factorizations of sparse data
Chapter 4. Even More General
Community detection
method
before they even took off?
100 200 300 400 500 600Generalized outer product
Example
1 1 . . . 1
1 · · · 1
Generalized decomposition
decomposes input matrix A into a sum of generalized outer products
⊕ o(xk, yk, θk)
Why generalize?
hardness results generalize well
Some results
is NP-hard if the outer product is hereditary
smallest exact sub-decomposition from them is NP-hard if addition is either OR, AND, or XOR
Chapter 5. The Chapter to Remember
Conclusions
complex data as an aggregate of simple parts
studied ones
more versatile language
Tiank Y