SLIDE 1

Learning more from end-users and teachers

Oregon State University AI and EUSES Groups

End-Users and Teachers 1

Tom Dietterich, on behalf of Alan Fern, Kshitij Judah, Saikat Roy, Joe Selman, Weng-Keen Wong, Ian Oberst, Shubumoy Das, Travis Moore, Simone Stumpf, Kevin McIntosh, Margaret Burnett

SLIDE 2

Research Space

 Supervised Learning
   • Current methods: label feedback; active learning for labels; Equivalence Queries and Membership Queries
   • Novel methods: Feature Labeling by end users [IUI 2011]; Object Queries and Pairing Queries [ECML 2011]
 Imitation Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: State Queries with responses [ICML Workshop 2010]
 Reinforcement Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: Practice & Critiques [AAAI 2010]

SLIDE 3

Label Feedback from End Users

 Setting:
   • Document classification (multi-class)
   • Features are words, n-grams, etc.
   • End user labels features as positive or negative for a class
   • Small data set; user-specific classes

SLIDE 4

Related Work

Supervised feature labeling algorithms:

1. SVM Method 1 [Raghavan and Allan 2007]
   • Scales relevant features by one factor and non-relevant features by another, smaller factor
2. SVM Method 2 [Raghavan and Allan 2007]
   • Inserts pseudo-documents into the dataset
     pseudo-document: (0, 0, ..., , ..., 0, class label)
   • Influences the position of the margin

The combined method will be called SVM-M1M2.
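Method 2 can be sketched as follows: each user-labeled feature contributes one synthetic training document that is zero everywhere except at that feature's index. This is a minimal sketch; the function name and the `strength` constant at the non-zero entry are illustrative, not from the slide.

```python
import numpy as np

def add_pseudo_documents(X, y, labeled_features, strength=1.0):
    """Append one pseudo-document per (feature, class) label.

    labeled_features: list of (feature_index, class_label) pairs
    supplied by the end user. `strength` is a hypothetical knob
    controlling how hard each pseudo-document pushes the margin.
    """
    pseudo_X, pseudo_y = [], []
    for feat, cls in labeled_features:
        doc = np.zeros(X.shape[1])
        doc[feat] = strength          # all-zero except the labeled feature
        pseudo_X.append(doc)
        pseudo_y.append(cls)
    return np.vstack([X] + [np.array(pseudo_X)]), np.concatenate([y, pseudo_y])

# toy corpus: 3 documents, 4 features; user labels feature 2 for class 0
# and feature 3 for class 1
X = np.array([[1., 0., 2., 0.],
              [0., 1., 0., 1.],
              [1., 1., 0., 0.]])
y = np.array([0, 1, 0])
X2, y2 = add_pseudo_documents(X, y, [(2, 0), (3, 1)])
print(X2.shape, y2.shape)  # (5, 4) (5,)
```

The augmented (X2, y2) would then be fed to an ordinary SVM, so the labeled features shift the margin without changing the learning algorithm itself.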

SLIDE 5

Idea: Combine local learning algorithm with feature weights

 Algorithm:
   • Locally-weighted logistic regression: given a query point, assign a weight to each training example based on its similarity to the query
   • Fit logistic regression to maximize the weighted log likelihood
 Incorporating feature labels:
   • When training the classifier for a class, if the query and a training example share a feature labeled as positive for that class, make them “more similar”
   • If they share a feature labeled as positive for some other class, make them “less similar”
 Hypothesis:
   • Local learning will prevent feature weights from over-generalizing beyond the local neighborhood
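The idea above can be sketched as follows, assuming a Gaussian kernel for the base weights and a hypothetical multiplicative `boost` applied when the query and a training example share a labeled feature (the actual weighting in the work may differ):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def lwlr_fl_predict(X, y, x_q, labeled_feats, target_class,
                    bandwidth=1.0, boost=2.0):
    """Locally-weighted logistic regression with feature labels (sketch).

    labeled_feats: {feature_index: class_label} from the end user.
    For the one-vs-rest classifier of `target_class`, sharing a feature
    labeled for that class boosts similarity to the query; sharing one
    labeled for another class reduces it. `boost` is a hypothetical knob.
    """
    d2 = ((X - x_q) ** 2).sum(axis=1)      # squared distances to the query
    w = np.exp(-d2 / bandwidth)            # base kernel weights
    shared = (X > 0) & (x_q > 0)           # features present in both
    for f, cls in labeled_feats.items():
        mask = shared[:, f]
        w[mask] *= boost if cls == target_class else 1.0 / boost
    clf = LogisticRegression().fit(X, y == target_class, sample_weight=w)
    return clf.predict_proba(x_q.reshape(1, -1))[0, 1]
```

A fresh weighted classifier is fit per query, which is what keeps the influence of a labeled feature local to the query's neighborhood.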

SLIDE 6

Experiments: Oracle Study

Oracle study: what happens if you can pick the “best” feature labels possible?

 Datasets:
   • Balanced subset of 20 Newsgroups (4 classes)
   • Balanced subset of ModApte (4 classes)
   • Balanced subset of RCV1 (5 classes)
 Oracle feature labels:
   • The 10 most informative features for each class (information gain computed over the entire dataset)
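The oracle's selection can be sketched as a one-vs-rest information-gain ranking over binary "feature present in document" indicators; this is a minimal sketch, with `per_class` defaulting to the 10 features described above:

```python
import numpy as np

def entropy(labels):
    """Shannon entropy (bits) of a label array."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -(p * np.log2(p)).sum()

def information_gain(feature_present, y):
    """IG of a binary 'feature present' indicator with respect to y."""
    h = entropy(y)
    for v in (True, False):
        mask = feature_present == v
        if mask.any():
            h -= mask.mean() * entropy(y[mask])
    return h

def oracle_labels(X, y, per_class=10):
    """Top-`per_class` features by information gain for each class
    (one-vs-rest), mimicking the oracle described above."""
    out = {}
    for cls in np.unique(y):
        gains = [information_gain(X[:, j] > 0, y == cls)
                 for j in range(X.shape[1])]
        out[cls] = np.argsort(gains)[::-1][:per_class]
    return out
```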

SLIDE 7

Results: Oracle Study

SLIDE 8

Results: Oracle Study

SLIDE 9

Results: Oracle Study

SLIDE 10

Results: Oracle Study

Summary: With oracle feature labels, LWLR-FL outperforms or matches the performance of the SVM variants.

SLIDE 11

Experiment: User Study

But what about real end users?

 How good are their feature labels?
 First user study of its kind: a statistical user study allowing end users to label any features

SLIDE 12

Results: User Study

 Presented 24 news articles from 4 newsgroups: Computers, For Sale, Medicine, Outer Space
 Collected feature labels from 43 participants:
   • 24 male, 19 female
   • Non-CS background
 Experimental setup:
   • Features are unigrams
   • Training set: 24 instances
   • Validation set: 24 instances
   • Test set: remainder of the data

SLIDE 13

User Study: Open-Ended Feature Set

 Participants were allowed to highlight any text (including words and punctuation) that they thought was predictive of the newsgroup
 Results are separated into two groups:
   • Existing: feature labels only on unigrams
   • All: feature labels on unigrams plus any additional features highlighted by end users

SLIDE 14

Results: User Study

SLIDE 15

Results: User Study

 End users introduced:
   • non-contiguous words (“cold” with “flu”)
   • contiguous phrases (“space shuttle”)
   • features with punctuation (“for sale” with “$”)
 Analysis of participants’ features vs. the oracle:
   • Lower average information gain (0.035 vs. 0.078)
   • Higher average ConceptNet relatedness (0.308 vs. 0.231)

SLIDE 16

Results: User Study

 Looked at relatedness from ConceptNet as an alternative to information gain
 End users picked features with higher average relatedness than the oracle

SLIDE 17

Results: User Study

SLIDE 18

Results: User Study

Charts: per-participant gains over the baseline (Macro-F1) for LWLR-FL and for SVM-M1M2 (participants not in the same order).

SLIDE 19

Results: User Study (Sensitivity Analysis)

Charts: variation in Macro-F1 with the parameter r for SVM-M1M2 and with the parameter k for LWLR-FL, for participants 23165, 19162, and 19094.

LWLR-FL is less sensitive to changes in its key parameter.

SLIDE 20

Results: User Study

Summary:

 With real end-user feature labels, LWLR-FL outperforms the SVM variants
 LWLR-FL is more robust to lower-quality feature labels
 End users are able to select features that have high relatedness to the class label

SLIDE 21

Research Space

 Supervised Learning
   • Current methods: label feedback; active learning for labels; Equivalence Queries and Membership Queries
   • Novel methods: Feature Labeling by end users [IUI 2011]; Object Queries and Pairing Queries [ECML 2011]
 Imitation Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: State Queries with responses [ICML Workshop 2010]
 Reinforcement Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: Practice & Critiques [AAAI 2010]

SLIDE 22

Learning First-Order Theories using Object-Based Queries

 Goal: learn a first-order Horn theory
   • A set of Horn clauses
   • No functions
   • No constants (only variables)
 A Horn theory covers a training example if it D-subsumes the example
   • The subsumption substitution is required to be a one-to-one (injective) mapping
 For example:
   • Theory: P(X,Y), P(Y,Z) ⇒ Q(X,Z)
   • D-subsumes P(1,2), P(2,3) ⇒ Q(1,3)
   • Does not D-subsume P(a,b), P(b,b) ⇒ Q(a,b)
 Every theory under normal semantics has an equivalent theory under the new semantics
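The D-subsumption check can be sketched as a brute-force search for an injective variable-to-object substitution; the tuple encoding of atoms is illustrative:

```python
from itertools import permutations

def d_subsumes(clause, example):
    """clause, example: (body, head) where body is a set of atoms and an
    atom is (predicate, arg1, arg2, ...). Clause args are variable names;
    example args are objects. Tries every injective assignment of
    variables to objects (brute force, fine for tiny examples)."""
    body, head = clause
    ex_body, ex_head = example
    vars_ = sorted({a for atom in body | {head} for a in atom[1:]})
    objs = sorted({a for atom in ex_body | {ex_head} for a in atom[1:]})
    for assignment in permutations(objs, len(vars_)):  # injective by construction
        sub = dict(zip(vars_, assignment))
        def apply(atom):
            return (atom[0],) + tuple(sub[a] for a in atom[1:])
        if apply(head) == ex_head and all(apply(a) in ex_body for a in body):
            return True
    return False

# the slide's example
clause = ({("P", "X", "Y"), ("P", "Y", "Z")}, ("Q", "X", "Z"))
pos = ({("P", "1", "2"), ("P", "2", "3")}, ("Q", "1", "3"))
neg = ({("P", "a", "b"), ("P", "b", "b")}, ("Q", "a", "b"))
print(d_subsumes(clause, pos), d_subsumes(clause, neg))  # True False
```

The negative case fails precisely because covering P(a,b), P(b,b) would require Y and Z to map to the same object b, which an injective substitution forbids.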

SLIDE 23

Previous Work

 Angluin et al. 1992: propositional Horn theories can be learned in polynomial time using Equivalence Queries and Membership Queries
   • Equivalence Query (EQ): ask the teacher whether theory T is equivalent to the correct theory; if not, the teacher returns a counter-example
   • Membership Query (MQ): ask the teacher whether example X is a positive example of the correct theory
 Reddy & Tadepalli, 1997: non-recursive, function-free first-order Horn definitions (single target predicate) can be learned in polynomial time using EQs and MQs
 Khardon, 1999: general first-order Horn theories can be learned in polynomial time using EQs and MQs (for a fixed maximum size)

SLIDE 24

Shortcoming: MQs and EQs are unrealistic

 All of the algorithms make heavy use of MQs
   • MQs can be unnatural for humans to answer
   • The Teacher’s labeling effort can be especially high
   • The examples asked about are often created by the algorithm and may not make sense in the real world
   • Each query conveys only a small amount of information

SLIDE 25

New Queries

 ROQ: Relevant Object Query
   • Given a positive example, returns a minimal set of objects such that some clause in the true theory and some D-substitution Θ map that clause into the example using only those objects

SLIDE 26

New Queries

 PQ: Pairing Query
   • Given two positive examples, returns “no” if no clause in the true theory covers both of them. Otherwise, it picks a clause that covers both of them and returns a 1-1 mapping of the objects in the two examples, where objects are mapped together if they correspond to the same variable in that clause
   • Example mapping: 15 ⟷ 32, 23 ⟷ 34, 1 ⟷ 2
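Assuming the teacher has already found D-substitutions for a clause covering both examples, the pairing answer follows mechanically: pair the images of each shared variable. The substitution values below are chosen to reproduce the surviving pairs of the slide's example; the variable names are illustrative.

```python
def pairing_from_substitutions(sub1, sub2):
    """Given the D-substitutions mapping a shared clause's variables into
    example 1 and example 2, pair the objects that correspond to the same
    variable. Both substitutions are injective, so the result is a 1-1
    mapping between (subsets of) the two object sets."""
    return {sub1[v]: sub2[v] for v in sub1 if v in sub2}

# hypothetical variables X, Y, Z matched into the two examples
sub1 = {"X": "1", "Y": "15", "Z": "23"}
sub2 = {"X": "2", "Y": "32", "Z": "34"}
print(pairing_from_substitutions(sub1, sub2))
# {'1': '2', '15': '32', '23': '34'}
```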

SLIDE 27

Results

 Result 1: By incorporating ROQs into Khardon’s algorithm, the number of Membership Queries is greatly reduced, but not eliminated.
 Result 2: First-order Horn theories can be exactly learned in polynomial time using only PQs and EQs.

SLIDE 28

Next Steps

 Experimentally test how well users can answer each of these types of queries
 Theoretical studies of imperfect oracles
   • Try to model the kinds of errors teachers are likely to make

SLIDE 29

Research Space

 Supervised Learning
   • Current methods: label feedback; active learning for labels; Equivalence Queries and Membership Queries
   • Novel methods: Feature Labeling by end users [IUI 2011]; Object Queries and Pairing Queries [ECML 2011]
 Imitation Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: State Queries with responses [ICML Workshop 2010]
 Reinforcement Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: Practice & Critiques [AAAI 2010]

SLIDE 30

Learning from Demonstrations and State Queries

 Setting:
   • The Teacher has a policy for selecting actions in a Markov Decision Problem (MDP) with states and actions
   • The Learner has access to a simulator for the dynamics of the MDP: the next-state distribution and the start-state distribution
   • The Teacher provides training trajectories (demonstrations): sequences of state-action pairs
 Learner’s Goal: learn the Teacher’s policy over a finite horizon
 Note: No reward function!

SLIDE 31

State Queries

 The Learner can ask the Teacher state queries:
   • Learner: “What action should be performed in this state?”
   • Teacher: if the Teacher’s policy visits the state with non-zero probability, return the action it would take; else return “Bad State”

SLIDE 32

Queries that result in Bad State

 Bad State responses model cases where the Teacher doesn’t know what to do
   • The Teacher is not reliable in such cases
 They also avoid unnecessary complexity in the learned policy
   • The Teacher’s policy doesn’t need to model such cases, which can make learning the Learner’s policy easier

SLIDE 33

Proposed Method: Extension of Bayesian Active Learning

 Space of hypotheses (candidate policies)
 Space of Teacher responses
 Data = demonstrations + query answers
 Posterior distribution over hypotheses
   • The posterior probability of a response to a state query is the total posterior probability of the hypotheses that would respond to that query with that answer
 Intuition: the Student should learn to predict the Teacher’s query responses (including Bad State responses)

SLIDE 34

Query Rule

 Choose the query state to maximize the expected information gained about the target policy
   • A greedy reduction in our uncertainty
 The resulting criterion is a bonus that is maximized when the posterior probability of the target policy visiting the state and the uncertainty over the action choices at that state are both high

SLIDE 35

A Practical Algorithm: Imitation Query-By-Committee (IQBC)

 Treat demonstrations and non-Bad-State responses as training examples for supervised learning
 Represent each policy as a multi-class classifier that predicts the action to take in each state
 Approximate the posterior distribution over policies by a committee learned using bagging
   • This does not make any use of Bad State responses during learning
 The query rule requires computing the probability that each policy visits each state
   • Sample-average approximation (the Pegasus approach) for stochastic MDPs
 Only query states visited by at least one committee policy
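A rough sketch of the bagged committee and a vote-entropy disagreement score over a candidate query state, assuming states are feature vectors; the choice of decision trees as committee members is an assumption, not from the slide:

```python
import numpy as np
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

def train_committee(states, actions, n_members=10):
    """Approximate the posterior over policies with a bagged committee,
    each member a multi-class state -> action classifier."""
    return BaggingClassifier(DecisionTreeClassifier(),
                             n_estimators=n_members,
                             random_state=0).fit(states, actions)

def vote_entropy(committee, state):
    """Disagreement among committee members about the action at `state`;
    candidate query states with high entropy are more informative."""
    votes = [m.predict(state.reshape(1, -1))[0] for m in committee.estimators_]
    _, counts = np.unique(votes, return_counts=True)
    p = counts / counts.sum()
    return -(p * np.log(p)).sum()

# toy demonstration data: 2-D states, teacher's action = sign of first feature
rng = np.random.default_rng(0)
S = rng.normal(size=(40, 2))
A = (S[:, 0] > 0).astype(int)
committee = train_committee(S, A)
print(vote_entropy(committee, np.array([0.01, 0.0])))  # near the boundary
```

In IQBC this disagreement score would additionally be weighted by the estimated probability that a committee policy visits the state, which is the part that accounts for the MDP dynamics.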

SLIDE 36

Experimental Test of the Method

 Domains:
   • Grid world with pits
   • Cart-pole
 Algorithms:
   • IQBC
   • Random: selects states to query uniformly at random
   • Standard QBC (SQBC): ignores the MDP dynamics
   • Passive imitation learning (Passive): executes the learned policy and asks the teacher what to do in each state
   • Confidence-Based Autonomy (CBA) (Chernova & Veloso, JAIR 2009): executes the policy until confidence falls below an automatically-adjusted threshold, then queries the Teacher

SLIDE 37

Grid World With Pits

Figure: a 30 × 30 grid world with pits and a goal state.

SLIDE 38

Cart Pole

 State:
 Actions: {Left, Right}
 Bounds on cart position:
 Bounds on pole angle:

SLIDE 39

Teacher Types

 “Generous”: always responds with an action
 “Strict”: declares states more than 2 steps away from the states visited by the Teacher’s policy to be bad states

SLIDE 40

Grid World With Pits: Generous Teacher

SLIDE 41

Grid World With Pits: Strict Teacher

SLIDE 42

Cart Pole: Generous Teacher

SLIDE 43

Cart Pole: Strict Teacher

SLIDE 44

Conclusions

 IQBC outperforms previous active learning algorithms for imitation learning
 It is important to take the MDP dynamics into consideration when choosing states to query
 The certainty threshold in CBA is very sensitive and can easily lead to premature convergence

SLIDE 45

Next Steps

 Incorporate Bad State feedback when learning policies
 Consider the “mental cost” to the Teacher of understanding the query state
   • Perhaps present short state sequences (“scenarios”) and ask the Teacher to provide correct actions and/or feedback for each state
 Conduct user studies to test the hypothesis that this kind of feedback is easier to provide

SLIDE 46

Research Space

 Supervised Learning
   • Current methods: label feedback; active learning for labels; Equivalence Queries and Membership Queries
   • Novel methods: Feature Labeling by end users [IUI 2011]; Object Queries and Pairing Queries [ECML 2011]
 Imitation Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: State Queries with responses [ICML Workshop 2010]
 Reinforcement Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: Practice & Critiques [AAAI 2010]

SLIDE 47

Reinforcement Learning from Critiquing and Practice

 Setting:
   • Standard reinforcement learning setting
   • The Learner has access to an MDP and can learn via standard exploration policies
   • From time to time, the Learner can show the Teacher a trajectory from its current best policy
   • The Teacher can choose any state or states along the trajectory and provide feedback consisting of the good actions and the bad actions in that state; either set can be empty

SLIDE 48

Application Problem: Tactical Battles in Wargus

 Wargus: an open-source version of Warcraft II
 We provide a GUI that allows the Teacher to scroll backwards/forwards in the game and find states to critique

SLIDE 49

Learning Algorithm

 Assume a space of policies parameterized by Θ
 Let
   • C = the critiquing examples
   • T = the observed (state, action, reward, next-state) tuples along the Learner’s exploratory trajectories
 Find Θ to maximize λ·V(Θ, T) + (1 − λ)·L(Θ, C), where
   • V(Θ, T) is the estimated expected return of the policy, evaluated via off-policy importance sampling [Peshkin & Shelton, 2002]
   • L(Θ, C) is the log likelihood of the Teacher’s critiques under the policy
   • λ is a parameter that trades off the two criteria
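An illustrative sketch of a combined objective of this shape under a log-linear (softmax) policy. Treating each good-action critique as a supervised example is an assumption, and the per-trajectory importance weights below are only a stand-in for the Peshkin & Shelton estimator:

```python
import numpy as np

def action_probs(theta, feats):
    """Softmax policy over per-action feature vectors (n_actions x d)."""
    z = feats @ theta
    e = np.exp(z - z.max())
    return e / e.sum()

def combined_objective(theta, trajectories, critiques, lam=0.3):
    """lam * V(theta) + (1 - lam) * L(theta).

    trajectories: list of (steps, total_return), where steps is a list of
      (feats, action_taken, behavior_prob) tuples; V reweights each
      trajectory's return by an importance weight against the behavior
      policy that generated it.
    critiques: list of (feats, good_action) pairs; L is their log likelihood.
    """
    V = 0.0
    for steps, ret in trajectories:
        w = 1.0
        for feats, a, b_prob in steps:
            w *= action_probs(theta, feats)[a] / b_prob   # importance weight
        V += w * ret
    V /= len(trajectories)
    L = sum(np.log(action_probs(theta, feats)[a]) for feats, a in critiques)
    return lam * V + (1 - lam) * L
```

Maximizing this in Θ (e.g., by gradient ascent) pulls the policy toward both high estimated return and agreement with the Teacher's critiques, with λ setting the balance.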
SLIDE 50

Experimental Setup

Figures: Map 1 and Map 2.

 Our domain: micro-management in tactical battles in the Real Time Strategy (RTS) game Wargus
 5 friendly footmen against a group of 5 enemy footmen (Wargus AI)
 Two battle maps, which differed only in the initial placement of the units
 Both maps had winning strategies for the friendly team and were of roughly the same difficulty

SLIDE 51

Experimental Details

 RL agent:
   • Log-linear model over 27 hand-coded features
   • Chooses an action for each unit every 20 game cycles
   • The same policy is applied to all units (independently)
 Each practice phase:
   • Generate 10 trajectories
   • With probability 0.8: choose the action according to the current policy
   • With probability 0.2: choose the action according to a version of the policy with shrunken weights, which makes the policy more random
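The 0.8/0.2 action selection above can be sketched as follows; the shrinkage factor of 0.5 is a hypothetical value, since the slide does not give one:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def practice_action(theta, feats, shrink=0.5):
    """Choose a unit's action during a practice phase: with probability
    0.8 sample from the current log-linear policy, with probability 0.2
    from a flattened policy whose weights are shrunk toward zero
    (a more random action choice, for exploration)."""
    weights = theta if rng.random() < 0.8 else shrink * theta
    probs = softmax(feats @ weights)
    return rng.choice(len(probs), p=probs)
```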

SLIDE 52

User Study

 Goal: evaluate three systems
   • Pure Supervised = no practice session (trade-off parameter λ = 0)
   • Pure RL = no critiques (λ = 1)
   • Combined = includes practice and critiques (λ = 0.3)
 The user study involved 10 end users
   • 6 with a CS background, 4 without
 Each user trained both the Supervised and the Combined system
   • 30 minutes total for Supervised
   • 60 minutes for Combined (30 minutes of practice)

SLIDE 53

Simulated Learning Curves

 After the user study, selected the worst- and best-performing users on each map when training the Combined system
 Total critique data: User #1: 36, User #2: 91, User #3: 115, User #4: 33
 For each user:
   • Divide the critique data into 4 segments containing 25%, 50%, 75%, and 100% of the data
   • Evaluate the Combined system varying both the amount of practice and the amount of critique data
SLIDE 54

Simulated Experiments: Benefit of Critiques from User #1

 RL alone is unable to learn a winning policy (i.e., achieve a positive value).

SLIDE 55

Simulated Experiments: Benefit of Critiques from User #1

 With more critiques, performance increases slightly.

SLIDE 56

Simulated Experiments: Benefit of Critiques from User #1

 As the amount of critique data increases, performance improves for a fixed number of practice episodes.
 RL never exceeded a health difference of 12 on any map, even after 500 trajectories.

SLIDE 57

Simulated Experiments: Benefit of Practice for User #1

 Even with no practice, the critique data was sufficient to outperform RL.
 RL never exceeded a health difference of 12.

SLIDE 58

Simulated Experiments: Benefit of Practice for User #1

 With more practice, performance increases as well.

SLIDE 59

Simulated Experiments: Benefit of Practice for User #1

 Our approach is able to leverage practice episodes to improve the effectiveness of a given amount of critique data.

SLIDE 60

Results of User Study

Chart: health difference achieved by each of the 10 users with the Supervised and Combined systems.

SLIDE 61

Results of User Study

Chart: health difference achieved by each of the 10 users with the Supervised and Combined systems.

 Pure RL never exceeded a health difference of 12 on any map, even after 500 trajectories.

SLIDE 62

Results of User Study

Chart: health difference achieved by each of the 10 users with the Supervised and Combined systems.

 Users were slightly more successful using the purely supervised method (no practice).

SLIDE 63

Conclusions

 Combining Teacher critiques with practice has the potential to speed learning
 The user study did not achieve this potential:
   • Insufficient practice
   • Users complained that the Combined system “ignored them”

SLIDE 64

Summary

 Supervised Learning
   • Current methods: label feedback; active learning for labels; Equivalence Queries and Membership Queries
   • Novel methods: Feature Labeling by end users [IUI 2011]; Object Queries and Pairing Queries [ECML 2011]
 Imitation Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: State Queries with responses [ICML Workshop 2010]
 Reinforcement Learning
   • Current methods: demonstrations; active learning via online action feedback
   • Novel methods: Practice & Critiques [AAAI 2010]

SLIDE 65

Summary

 End users can reliably label features, and these labels can be exploited by local learning algorithms to speed up learning
 Horn clause theories can be learned exactly in polynomial time using the more realistic Relevant Object and Pairing Queries
 Imitation Query-By-Committee is more effective than existing methods for learning a Teacher’s policy in an MDP\R
 Combining RL with critiquing feedback shows promise for speeding up reinforcement learning

SLIDE 66

Questions?