

SLIDE 1

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data

Rie Kubota Ando and Tong Zhang
IBM Watson Research Center / Yahoo Research

Presented by Lei Tang, Nov. 20, 2006

SLIDE 2

Outline:
1. Introduction
2. Structural Learning Problem
3. Algorithm
4. Experiments

SLIDE 3

Semi-supervised Learning

• Large amounts of unlabeled data are available, while labeled data are very costly.
• Various methods: transductive inference, co-training (basically label propagation). These fail when noise is introduced into classification by imperfect classifiers.
• Another direction: use unlabeled data to define a good functional structure. (What is a structure? A distance, a kernel, a manifold.) But a graph structure might not be predictive.
• Can we learn a predictive structure? Yes, if we have multiple related tasks.


SLIDE 8

Learning Predictive Structures

1. Structural learning from multiple tasks.
2. Use unlabeled data to generate auxiliary (related) tasks.

SLIDE 9

A toy example

The intrinsic distance metric should force A, C, and D "close" to each other, and E and F close to each other.

SLIDE 10

Connection to Hypothesis Space

Supervised learning: find a predictor in the hypothesis space.
• Estimation error: the smaller the space, the easier it is to learn the best predictor from limited samples.
• Approximation error: caused by a restricted hypothesis space.
• These two types of errors must be traded off (model selection).

Model selection is usually done by cross-validation. We can achieve better results if we have multiple problems on the same underlying domain.


SLIDE 12

Empirical Risk Minimization (ERM)

Supervised learning: find a predictor $f$ minimizing the risk $R(f) = \mathbb{E}_{X,Y}\, L(f(X), Y)$. Empirically, we use the loss on the training data as an indicator:

$$\hat{f} = \arg\min_{f \in \mathcal{H}} \sum_{i=1}^{n} L(f(X_i), Y_i)$$

To avoid over-fitting, a regularization term $g(f)$ is usually added:

$$\hat{f} = \arg\min_{f \in \mathcal{H}} \sum_{i=1}^{n} L(f(X_i), Y_i) + g(f)$$
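To make the two objectives concrete, here is a minimal sketch (my own illustration, not from the slides) of regularized ERM for a linear predictor under squared loss, with $g(f) = \lambda \|w\|_2^2$ so the minimizer has a closed form; the toy data and $\lambda$ are hypothetical:

```python
import numpy as np

def erm_ridge(X, y, lam=0.1):
    """Regularized ERM: arg min_w sum_i (w^T x_i - y_i)^2 + lam * ||w||^2.
    With squared loss the minimizer is the ridge-regression solution."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

# hypothetical toy data
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
w_true = np.array([1.0, -2.0, 0.0, 0.5, 0.0])
y = X @ w_true + 0.1 * rng.normal(size=50)
print(np.round(erm_ridge(X, y), 2))  # close to w_true for small lam
```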

SLIDE 13

Joint Empirical Risk Minimization

In single-task learning, the hypothesis space (bias) is fixed:

$$\hat{f} = \arg\min_{f \in \mathcal{H}} \sum_{i=1}^{n} L(f(X_i), Y_i) + g(f)$$

Use a parameter $\theta$ to represent the hypothesis space; then

$$\hat{f}_\theta = \arg\min_{f \in \mathcal{H}(\theta)} \sum_{i=1}^{n} L(f(X_i), Y_i) + g(f)$$

For multiple related tasks, we want to find the hypothesis space shared by all these tasks, i.e., to determine a proper $\theta$ jointly:

$$[\{\hat{f}_l\}, \hat{\theta}] = \arg\min_{\{f_l\}, \theta} \Bigg\{ \underbrace{r(\theta)}_{\text{regularization}} + \sum_{l=1}^{m} \bigg[ g(f_{l,\theta}) + \frac{1}{n_l} \sum_{i=1}^{n_l} L\big(f_{l,\theta}(X_i^l), Y_i^l\big) \bigg] \Bigg\}$$

SLIDE 14

Structural Learning with Linear Predictors

$$f(x) = \underbrace{w^T \phi(x)}_{\text{task-specific features}} + \underbrace{v^T \psi_\theta(x)}_{\text{internal dimensions}}$$

How to represent $\theta$? As a matrix, which can be considered a transformation that finds new dimensions:

$$f_\theta(w, v; x) = w^T \phi(x) + v^T \theta\, \psi(x)$$
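A minimal sketch (mine, not the authors' code) of this predictor with $\phi(x) = \psi(x) = x$; all shapes are hypothetical:

```python
import numpy as np

def predict(w, v, theta, x):
    """f_theta(w, v; x) = w^T phi(x) + v^T theta psi(x),
    taking phi(x) = psi(x) = x for simplicity.
    w: (d,) task-specific weights; theta: (h, d) shared structure
    mapping x to h internal dimensions; v: (h,) weights on them."""
    return w @ x + v @ (theta @ x)

d, h = 10, 3                      # feature and internal dimensions (hypothetical)
rng = np.random.default_rng(1)
theta = rng.normal(size=(h, d))   # shared across all tasks
w, v, x = rng.normal(size=d), rng.normal(size=h), rng.normal(size=d)
print(predict(w, v, theta, x))
```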


SLIDE 16

Alternating structure optimization (1)

Assume $\phi(x) = \psi(x) = x$; it follows that

$$[\{\hat{w}_l, \hat{v}_l\}, \hat{\theta}] = \arg\min_{\{w_l, v_l\}, \theta} \sum_{l=1}^{m} \frac{1}{n_l} \sum_{i=1}^{n_l} L\big((w_l + \theta^T v_l)^T X_i^l,\, Y_i^l\big) + \lambda_l \|w_l\|_2^2 \quad \text{s.t. } \theta\theta^T = I$$

(the constraint is equivalent to regularization). Let $u_l = w_l + \theta^T v_l$, so that $f(x) = u_l^T x$; then

$$\min_{\{u_l, v_l\}, \theta} \sum_{l=1}^{m} \frac{1}{n_l} \sum_{i=1}^{n_l} L\big(u_l^T X_i^l,\, Y_i^l\big) + \lambda_l \|u_l - \theta^T v_l\|_2^2 \quad \text{s.t. } \theta\theta^T = I$$


SLIDE 18

Alternating structure optimization (2)

Algorithm (a code sketch follows below):

1. Fix $(\theta, v)$ and optimize with respect to $u$ (a convex optimization problem).
2. Fix $u$ and optimize with respect to $(\theta, v)$. It turns out that $\theta$ is given by the top left singular vectors from the SVD of the matrix $U = [\sqrt{\lambda_1}\, u_1, \sqrt{\lambda_2}\, u_2, \cdots, \sqrt{\lambda_m}\, u_m]$.
3. Iterate until convergence. Usually one iteration is enough.

Connection to PCA: PCA finds the "principal components" of data points. Here $u_l$ is the predictor for task $l$, so the algorithm finds the "principal components" of the predictors, each predictor being considered a point in predictor space.
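The following NumPy sketch makes the alternation concrete under squared loss (my own illustration under the slides' assumptions, with hypothetical shapes and a single shared $\lambda$): step 1 is a per-task ridge solve for $u_l$ with $(\theta, v)$ fixed, and step 2 takes the top-$h$ left singular vectors of $U$, after which $v_l = \theta u_l$ is optimal because $\theta\theta^T = I$.

```python
import numpy as np

def aso_round(tasks, h, lam=0.1, iters=2):
    """Alternating structure optimization, squared-loss sketch.
    tasks: list of (X_l, y_l) pairs; h: number of internal dimensions;
    lam: shared regularization weight (hypothetical)."""
    d = tasks[0][0].shape[1]
    theta = np.zeros((h, d))
    v = [np.zeros(h) for _ in tasks]
    for _ in range(iters):  # "usually one iteration is enough"
        # Step 1: fix (theta, v); each u_l solves a ridge problem
        #   min_u (1/n) ||X u - y||^2 + lam ||u - theta^T v_l||^2
        u = []
        for (X, y), v_l in zip(tasks, v):
            n = X.shape[0]
            A = X.T @ X / n + lam * np.eye(d)
            u.append(np.linalg.solve(A, X.T @ y / n + lam * (theta.T @ v_l)))
        # Step 2: fix u; theta = top-h left singular vectors of
        #   U = [sqrt(lam)*u_1, ..., sqrt(lam)*u_m]
        U = np.sqrt(lam) * np.stack(u, axis=1)        # (d, m)
        left, _, _ = np.linalg.svd(U, full_matrices=False)
        theta = left[:, :h].T                         # rows orthonormal
        v = [theta @ u_l for u_l in u]                # optimal v_l given theta
    return theta, u, v

# hypothetical toy tasks
rng = np.random.default_rng(2)
tasks = [(rng.normal(size=(40, 20)), rng.normal(size=40)) for _ in range(6)]
theta, u, v = aso_round(tasks, h=3)
print(theta.shape)  # (3, 20)
```

Treating the columns of $U$ as points, step 2 is exactly the PCA-style extraction of the principal directions of the predictors described above.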


SLIDE 20

Semi-supervised learning

1. Learn the structure parameter $\theta$ by joint empirical risk minimization.
2. Learn a predictor based on $\theta$.

How do we generate auxiliary problems? They should admit automatic labeling and be relevant to the target task. Two strategies: unsupervised and semi-supervised.


SLIDE 23

Auxiliary Problem Generation (unsupervised)

Two example problems: text categorization and word tagging.

• Predicting observable sub-structure: mask some features as unobserved, and learn classifiers to predict these "masked" features. For example, with W1 = {"stadium", "scientist", "stock"} and W2 = {"baseball", "basketball", "physics", "marker"}, let W1 be unobserved and predict whether "stadium" occurs more often than the other two W1 words in the document (see the sketch below).
• Predict the word at the current position given the words on the left and right.
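A tiny sketch (mine, not the paper's code) of the masking idea: treat the W1 words as unobserved, label each document by which masked word is most frequent, and use the remaining words as auxiliary features; the documents are hypothetical.

```python
from collections import Counter

masked = ["stadium", "scientist", "stock"]   # W1: treated as unobserved

docs = [
    "stadium baseball baseball stadium fans",
    "physics scientist physics lab scientist",
    "stock market stock stock broker",
]

for doc in docs:
    words = doc.split()
    counts = Counter(words)
    # auxiliary label: which masked word occurs most in this document?
    label = max(masked, key=lambda w: counts[w])
    # auxiliary features: the document with the masked words removed
    features = [w for w in words if w not in masked]
    print(label, features)
```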

SLIDE 24

Predicting the behavior of the target classifier (semi-supervised)

1. Train a classifier T1 with labeled data for the target task, using feature map φ1.
2. Propagate the labels to unlabeled data.
3. Learn the structural parameter $\theta$ by joint ERM on the auxiliary problems, using feature map φ2.
4. Train a final classifier based on $\theta$ and some appropriate feature map φ3.

Several example auxiliary problems (see the sketch below):
• Predict the prediction of classifier T1.
• Predict the top-k choices of the classifier.
• Predict the range of confidence values produced by the classifier (whether the confidence value exceeds a threshold).
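A short sketch (my own, with hypothetical data and threshold) of steps 1-2 and of two of the auxiliary labelings above, using scikit-learn's logistic regression as the target classifier T1:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)
X_lab, y_lab = rng.normal(size=(30, 10)), rng.integers(0, 2, size=30)
X_unlab = rng.normal(size=(500, 10))

# Steps 1-2: train T1 on labeled data (phi_1 = identity here)
# and propagate its labels to the unlabeled data
t1 = LogisticRegression().fit(X_lab, y_lab)
aux_pred = t1.predict(X_unlab)               # "predict the prediction of T1"

# another auxiliary problem: is T1's confidence above a threshold?
conf = t1.predict_proba(X_unlab).max(axis=1)
aux_conf = (conf > 0.9).astype(int)          # threshold 0.9 is hypothetical

# Step 3 would run joint ERM / ASO on these auxiliary labelings
print(aux_pred[:10], aux_conf[:10])
```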


SLIDE 26

Experiments

Data sets:
• Text categorization (20 Newsgroups, RCV1)
• Named entity chunking (CoNLL'03 corpora)
• Part-of-speech tagging (Brown corpus)
• Hand-written digit image classification (MNIST)

Methods compared:
• Supervised learning based on Huber's robust loss (implemented in the sketch below):

$$L(p, y) = \begin{cases} \max(0,\, 1 - py)^2 & \text{if } py \geq -1 \\ -4py & \text{otherwise} \end{cases}$$

• The semi-supervised method proposed in this work, with different auxiliary problems
• Co-training
• A manifold learning method (see "Semi-supervised learning on Riemannian manifolds")
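For reference, a direct implementation of the robust loss above (a small sketch of mine; p is the prediction and y ∈ {−1, +1}):

```python
import numpy as np

def huber_robust_loss(p, y):
    """L(p, y) = max(0, 1 - p*y)^2 if p*y >= -1, else -4*p*y."""
    py = p * y
    return np.where(py >= -1, np.maximum(0.0, 1.0 - py) ** 2, -4.0 * py)

p = np.array([-3.0, -1.0, 0.0, 0.5, 1.0, 2.0])
print(huber_robust_loss(p, np.ones_like(p)))  # [12. 4. 1. 0.25 0. 0.]
```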


SLIDE 28

Accuracy on Text

SLIDE 29

Comparison to Co-training

SLIDE 30

Comparison to Manifold learning

SLIDE 31

Sensitivity to internal dimensions

SLIDE 32

Interpretations of internal dimensions

SLIDE 33

No silver bullet

• This method seems too good to be true, but actually it's not: I tried Information Gain to select 2000 features and ran NBC on 20 Newsgroups, and it performs comparably to their method, sometimes with a significant improvement.
• I think this method is basically adding features to the original feature space. Unfortunately, there is no comparison with PCA + supervised learning.
• Why this method works is still not clear to me. The authors argue that "adding irrelevant features won't hurt, but adding relevant features will yield a huge gain." Why? Could we inject 1000 random features into the data set and still see it work?
• They provide a theory showing that MTL's performance gain is guaranteed. But we only care about the target task: what if performance improves on average while the target task's performance decreases? MTL ≠ target task!
• Does it only work on high-dimensional data? Currently, no MTL method is compared with this work.


SLIDE 40

Conclusions

Contributions:
• A framework for MTL (which seems robust to unrelated tasks)
• Automatic generation of auxiliary problems
