Unsupervised Learning
George Konidaris gdk@cs.brown.edu
Fall 2019
Unsupervised Learning George Konidaris gdk@cs.brown.edu Fall 2019 - - PowerPoint PPT Presentation
Unsupervised Learning George Konidaris gdk@cs.brown.edu Fall 2019 Machine Learning Subfield of AI concerned with learning from data . Broadly, using: Experience To Improve Performance On Some Task (Tom Mitchell, 1997)
George Konidaris gdk@cs.brown.edu
Fall 2019
Subfield of AI concerned with learning from data. Broadly, using:
(Tom Mitchell, 1997)
Input: X = {x1, …, xn} Try to understand the structure of the data. E.g., how many types of cars? How can they vary?
inputs
One particular type of unsupervised learning:
Formal definition Given:
Find:
One approach:
Major question:
Very simple algorithm:
{µ1, ..., µk}
f(xj) = i such that d(xj, µi) d(xj, µl)8l 6= i
µi = X
v∈Ci
xv |Ci|
Remaining questions … How to choose k? What about bad initializations? How to measure distance? Broadly:
Clustering: can answer which cluster, but not does this belong?
Estimate the distribution the data is drawn from. This allows us to evaluate the probability that a new point is drawn from the same distribution as the old data. Formal definition Given:
Find:
Simple approach:
Each Gaussian has its own mean and variance. Each has its own weight (sum to 1). Weighted sum of Gaussians still a PDF.
Algorithm - broadly as before:
{µ1, ..., µk} µi = X
v∈Ci
xv |Ci|
Ci = {xv|N(xv|µi, σ2
i ) > N(xv|µj, σ2 j ), ∀j}
σ2
i = variance(Ci)
wi = |Ci| P
j |Cj|
Major issue:
General statistical question: model selection. Several good answers for this. Simple example: Bayesian information criterion (BIC). Trades off model complexity (k) with fit (likelihood). −2 log L + k log n
likelihood # parameters in model # data points
Parametric:
Key assumptions:
What is the shape of the distribution
Nonparametric alternative:
Kernel density estimator: where:
PDF(x) = 1 nb
n
X
i=1
D ✓xi − x b ◆
Kernel:
Bandwidth:
PDF(x) = 1 nb
n
X
i=1
D ✓xi − x b ◆
(wikipedia)
X = {x1, …, xn}, each xi has m dimensions: xi = [x1, …, xm]. If m is high, data can be hard to deal with.
Dimensionality reduction:
For example, imagine if x1 and x2 are meaningful features, and x3 … xm are random noise. What happens to k-nearest neighbors? What happens to a decision tree? What happens to the perceptron algorithm? What happens if you want to do clustering?
Often can be phrased as a projection: where:
Variance captures what varies within the data. f : X → X0 |X0| << |X|
Principle Components Analysis. Project data into a new space:
Vi and eigenvalues vi of C. Each Vi is a direction, and each vi is its importance - the amount
New data points: xi = xi − X
j
xj n
<latexit sha1_base64="45F9LXNTgx6rd3eGkudsXpyHSWk=">ACHnicbZDLSgMxFIYz9VbrerSTbAIbiwzIuhGKLpxWcFeoNMOmTps0kQ5KRlqFbn8MHcKuP4E7c6hP4GmbaWdjWAwk/38OJ/n8iFGlbfvbyq2srq1v5DcLW9s7u3vF/YO6ErHEpIYFE7LpI0UY5aSmqWakGUmCQp+Rhj+8TfPGI5GKCv6gxFph6jHaUAx0sbyinDUofB6ep9BV8WhN4BuIBFORp3BJOETr1iy/a04LJwMlECWVW94o/bFTgOCdeYIaVajh3pdoKkpiRScGNFYkQHqIeaRnJUhUO5n+ZAJPjNOFgZDmcA2n7t+JBIVKjUPfdIZI9Vilpr/Za1YB1fthPIo1oTj2aIgZlALmGKBXSoJ1mxsBMKSmrdC3EeGgzbw5rb4Qgw18lVKxlnksCzq52XH6PuLUuUmY5QHR+AYnAIHXIKuANVUAMYPIEX8ArerGfr3fqwPmetOSubOQRzZX39AlGPosM=</latexit><latexit sha1_base64="45F9LXNTgx6rd3eGkudsXpyHSWk=">ACHnicbZDLSgMxFIYz9VbrerSTbAIbiwzIuhGKLpxWcFeoNMOmTps0kQ5KRlqFbn8MHcKuP4E7c6hP4GmbaWdjWAwk/38OJ/n8iFGlbfvbyq2srq1v5DcLW9s7u3vF/YO6ErHEpIYFE7LpI0UY5aSmqWakGUmCQp+Rhj+8TfPGI5GKCv6gxFph6jHaUAx0sbyinDUofB6ep9BV8WhN4BuIBFORp3BJOETr1iy/a04LJwMlECWVW94o/bFTgOCdeYIaVajh3pdoKkpiRScGNFYkQHqIeaRnJUhUO5n+ZAJPjNOFgZDmcA2n7t+JBIVKjUPfdIZI9Vilpr/Za1YB1fthPIo1oTj2aIgZlALmGKBXSoJ1mxsBMKSmrdC3EeGgzbw5rb4Qgw18lVKxlnksCzq52XH6PuLUuUmY5QHR+AYnAIHXIKuANVUAMYPIEX8ArerGfr3fqwPmetOSubOQRzZX39AlGPosM=</latexit><latexit sha1_base64="45F9LXNTgx6rd3eGkudsXpyHSWk=">ACHnicbZDLSgMxFIYz9VbrerSTbAIbiwzIuhGKLpxWcFeoNMOmTps0kQ5KRlqFbn8MHcKuP4E7c6hP4GmbaWdjWAwk/38OJ/n8iFGlbfvbyq2srq1v5DcLW9s7u3vF/YO6ErHEpIYFE7LpI0UY5aSmqWakGUmCQp+Rhj+8TfPGI5GKCv6gxFph6jHaUAx0sbyinDUofB6ep9BV8WhN4BuIBFORp3BJOETr1iy/a04LJwMlECWVW94o/bFTgOCdeYIaVajh3pdoKkpiRScGNFYkQHqIeaRnJUhUO5n+ZAJPjNOFgZDmcA2n7t+JBIVKjUPfdIZI9Vilpr/Za1YB1fthPIo1oTj2aIgZlALmGKBXSoJ1mxsBMKSmrdC3EeGgzbw5rb4Qgw18lVKxlnksCzq52XH6PuLUuUmY5QHR+AYnAIHXIKuANVUAMYPIEX8ArerGfr3fqwPmetOSubOQRzZX39AlGPosM=</latexit><latexit sha1_base64="45F9LXNTgx6rd3eGkudsXpyHSWk=">ACHnicbZDLSgMxFIYz9VbrerSTbAIbiwzIuhGKLpxWcFeoNMOmTps0kQ5KRlqFbn8MHcKuP4E7c6hP4GmbaWdjWAwk/38OJ/n8iFGlbfvbyq2srq1v5DcLW9s7u3vF/YO6ErHEpIYFE7LpI0UY5aSmqWakGUmCQp+Rhj+8TfPGI5GKCv6gxFph6jHaUAx0sbyinDUofB6ep9BV8WhN4BuIBFORp3BJOETr1iy/a04LJwMlECWVW94o/bFTgOCdeYIaVajh3pdoKkpiRScGNFYkQHqIeaRnJUhUO5n+ZAJPjNOFgZDmcA2n7t+JBIVKjUPfdIZI9Vilpr/Za1YB1fthPIo1oTj2aIgZlALmGKBXSoJ1mxsBMKSmrdC3EeGgzbw5rb4Qgw18lVKxlnksCzq52XH6PuLUuUmY5QHR+AYnAIHXIKuANVUAMYPIEX8ArerGfr3fqwPmetOSubOQRzZX39AlGPosM=</latexit>ˆ xi = [V1, ..., Vp]xi
<latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit>Let’s focus on this equation:
p x 1 m x 1 p x m compressed data point
compression matrix
ˆ xi = [V1, ..., Vp]xi
<latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit><latexit sha1_base64="znbIAXUs9yZmV6cY7na5NVHr8ug=">ACHXicbZDLSgMxFIYz9VbrerShcEiuCjDjAi6EYpuXFawF2inQyZN29DMZEjOSMvQpc/hA7jVR3AnbsUn8DVMLwvb+kPg4z/ncE7+IBZcg+N8W5mV1bX1jexmbmt7Z3cv39Q1TJRlFWoFLVA6KZ4BGrAfB6rFiJAwEqwX923G9siU5jJ6gGHMvJB0I97hlICx/Pxs0cgHYxaHF/jRtV3i9i27SKu+rGHBy3u5wuO7UyEl8GdQHNVPbzP82pEnIqCaN1wnRi8lCjgVLBRrploFhPaJ13WMBiRkGkvnXxkhE+N08YdqcyLAE/cvxMpCbUehoHpDAn09GJtbP5XayTQufJSHsUJsIhOF3USgUHicSq4zRWjIYGCFXc3IpjyhCwWQ3tyWQsg8k0COTjLuYwzJUz23X8P1FoXQzyiLjtAJOkMukQldIfKqIoekIv6BW9Wc/Wu/VhfU5bM9Zs5hDNyfr6BeKEoLI=</latexit>If you want to recover the original data point:
V is orthonormal
V = [V1, ..., Vp]
so:
¯ xi = V1ˆ xi
1 + V2ˆ
xi
2 + ... + Vpˆ
xi
p
<latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit>Reconstruction:
real valued numbers
axes
Every data point is expressed as a point in a new coordinate frame. Equivalently: weighted sum of basis (eigenvector) functions.
¯ xi = V1ˆ xi
1 + V2ˆ
xi
2 + ... + Vpˆ
xi
p
<latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit><latexit sha1_base64="MxrcCjLKYdXpzM5xfYFBuKwIZ4Y=">ACQnicbZDLSgMxFIYzXu96tJNsAiCMwUQTdCURcuFewF2jqcSVMbmpkMyRmxDH0hn8MHcFtfQHAnbl2YjlW8/RD4851zOMkfJlIY9LyRMzU9Mzs3X1hYXFpeWV0rm/UjEo141WmpNKNEAyXIuZVFCh5I9EcolDyetg/GdfrN1wboeJLHCS8HcF1LqCAVoUFE9bIejsdngl6BGtBT5t9QDtPfAt2bOk/AnKOXBdN8fJV2NyJYJiyXO9XPSv8SemRCY6D4pPrY5iacRjZBKMafpegu0MNAom+XCxlRqeAOvDNW9aG0PETvLfzukO5Z0aFdpe2KkOf0+kUFkzCAKbWcE2DO/a2P4X62ZYvewnYk4SZH7GNRN5UFR1HRztCc4ZyYA0wLexbKeuBoY24B9bQqX6CKEZ2mT83zn8NbWy61t/sV+qHE8yKpAtsk12iU8OSIWckXNSJYzckQcyIo/OvfPsvDivH61TzmRmk/yQ8/YOk0yvNQ=</latexit>0.2 x
6.1 x 0 x
(40 basis functions) (Turk and Pentland, 1991)
(40 basis functions) (Turk and Pentland, 1991)
Given data x1, …, xn, labels y1, …, yn:
Why?
f : ˆ X → Y
ˆ x1, ..., ˆ xn
<latexit sha1_base64="TCDnwG1zXOB2uGVD/KhB1/8aBNw=">ACF3icbZDLSgMxFIYz9VbrepK3ASL4KIMyLosujGZQV7gbaWTJpQzOTITkjlmHwOXwAt/oI7sStS5/A1zBtR7CtBwJf/v8cTvJ7keAaHOfLyi0tr6yu5dcLG5tb2zvF3b26lrGirEalkKrpEc0ED1kNOAjWjBQjgSdYwxtejf3GPVOay/AWRhHrBKQfcp9TAkbqFg/aAwLJQ3rnlrFt2X8ezdeybGdSeFcDMoayq3eJ3uydpHLAQqCBat1wngk5CFHAqWFpox5pFhA5Jn7UMhiRgupNMvpDiY6P0sC+VOSHgifp3IiGB1qPAM50BgYGe98bif14rBv+ik/AwioGFdLrIjwUGicd54B5XjIYGSBUcfNWTAdEQomtZktnpRDIJ5OTLufA6LUD+1XcM3Z6XKZRHh2iI3SCXHSOKugaVENUfSIntELerWerDfr3fqYtuasbGYfzZT1+QMzgZ90</latexit><latexit sha1_base64="TCDnwG1zXOB2uGVD/KhB1/8aBNw=">ACF3icbZDLSgMxFIYz9VbrepK3ASL4KIMyLosujGZQV7gbaWTJpQzOTITkjlmHwOXwAt/oI7sStS5/A1zBtR7CtBwJf/v8cTvJ7keAaHOfLyi0tr6yu5dcLG5tb2zvF3b26lrGirEalkKrpEc0ED1kNOAjWjBQjgSdYwxtejf3GPVOay/AWRhHrBKQfcp9TAkbqFg/aAwLJQ3rnlrFt2X8ezdeybGdSeFcDMoayq3eJ3uydpHLAQqCBat1wngk5CFHAqWFpox5pFhA5Jn7UMhiRgupNMvpDiY6P0sC+VOSHgifp3IiGB1qPAM50BgYGe98bif14rBv+ik/AwioGFdLrIjwUGicd54B5XjIYGSBUcfNWTAdEQomtZktnpRDIJ5OTLufA6LUD+1XcM3Z6XKZRHh2iI3SCXHSOKugaVENUfSIntELerWerDfr3fqYtuasbGYfzZT1+QMzgZ90</latexit><latexit sha1_base64="TCDnwG1zXOB2uGVD/KhB1/8aBNw=">ACF3icbZDLSgMxFIYz9VbrepK3ASL4KIMyLosujGZQV7gbaWTJpQzOTITkjlmHwOXwAt/oI7sStS5/A1zBtR7CtBwJf/v8cTvJ7keAaHOfLyi0tr6yu5dcLG5tb2zvF3b26lrGirEalkKrpEc0ED1kNOAjWjBQjgSdYwxtejf3GPVOay/AWRhHrBKQfcp9TAkbqFg/aAwLJQ3rnlrFt2X8ezdeybGdSeFcDMoayq3eJ3uydpHLAQqCBat1wngk5CFHAqWFpox5pFhA5Jn7UMhiRgupNMvpDiY6P0sC+VOSHgifp3IiGB1qPAM50BgYGe98bif14rBv+ik/AwioGFdLrIjwUGicd54B5XjIYGSBUcfNWTAdEQomtZktnpRDIJ5OTLufA6LUD+1XcM3Z6XKZRHh2iI3SCXHSOKugaVENUfSIntELerWerDfr3fqYtuasbGYfzZT1+QMzgZ90</latexit><latexit sha1_base64="TCDnwG1zXOB2uGVD/KhB1/8aBNw=">ACF3icbZDLSgMxFIYz9VbrepK3ASL4KIMyLosujGZQV7gbaWTJpQzOTITkjlmHwOXwAt/oI7sStS5/A1zBtR7CtBwJf/v8cTvJ7keAaHOfLyi0tr6yu5dcLG5tb2zvF3b26lrGirEalkKrpEc0ED1kNOAjWjBQjgSdYwxtejf3GPVOay/AWRhHrBKQfcp9TAkbqFg/aAwLJQ3rnlrFt2X8ezdeybGdSeFcDMoayq3eJ3uydpHLAQqCBat1wngk5CFHAqWFpox5pFhA5Jn7UMhiRgupNMvpDiY6P0sC+VOSHgifp3IiGB1qPAM50BgYGe98bif14rBv+ik/AwioGFdLrIjwUGicd54B5XjIYGSBUcfNWTAdEQomtZktnpRDIJ5OTLufA6LUD+1XcM3Z6XKZRHh2iI3SCXHSOKugaVENUfSIntELerWerDfr3fqYtuasbGYfzZT1+QMzgZ90</latexit>Another approach:
Core idea: distance metric locally Euclidean
Solve all-points shortest pairs:
From Tenenbaum, de Silva, and Langford, Science 290:2319-2323, December 2000.
Intrusion detection - when is a user behaving unusually? First proposed by Prof. Dorothy Denning in 1986. (1995 ACM Fellow)