This Class Weighted Majority Algorithm Mul+ple experts - PowerPoint PPT Presentation

Online ¡Predic+on ¡& ¡Decision ¡Making ¡ CompSci ¡590.03 ¡ Instructor: ¡Ashwin ¡Machanavajjhala ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 1 ¡

This ¡Class ¡ • Weighted ¡Majority ¡Algorithm ¡ ¡ – Mul+ple ¡experts ¡problem ¡ • Follow ¡the ¡perturbed ¡Leader ¡ – Online ¡shortest ¡paths ¡ • Mul+-‑armed ¡bandit ¡problems ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 2 ¡

Mul+ple ¡Experts ¡Problem ¡ Will ¡it ¡rain ¡ Yes ¡ Yes ¡ Yes ¡ No ¡ today? ¡ ¡ What ¡is ¡the ¡best ¡predic5on ¡based ¡on ¡these ¡experts? ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 3 ¡

Mul+ple ¡Experts ¡Problem ¡ • Suppose ¡we ¡know ¡the ¡best ¡expert ¡(who ¡makes ¡the ¡least ¡error), ¡ then ¡we ¡can ¡just ¡return ¡that ¡expert ¡says. ¡ – This ¡is ¡the ¡best ¡we ¡can ¡hope ¡for. ¡ • We ¡don’t ¡know ¡who ¡the ¡best ¡expert ¡is. ¡ – But ¡we ¡can ¡learn ¡… ¡we ¡know ¡whether ¡it ¡rained ¡or ¡not ¡at ¡the ¡end ¡of ¡the ¡ day. ¡ • Regret ¡Minimiza+on ¡: ¡number ¡of ¡mistakes ¡made ¡by ¡our ¡ algorithms ¡should ¡be ¡close ¡to ¡the ¡number ¡of ¡mistakes ¡made ¡by ¡ the ¡best ¡expert. ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 4 ¡

Weighted ¡Majority ¡Algorithm ¡ [LiZlestone&Warmuth ¡‘94] ¡ “Experts” ¡ Algorithm ¡ W 1 ¡ ¡ W 2 ¡ ¡ W 3 ¡ ¡ W 4 ¡ ¡ Y 1 ¡ ¡ Y 2 ¡ ¡ Y 3 ¡ ¡ Y 4 ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 5 ¡

Weighted ¡Majority ¡Algorithm ¡ [LiZlestone&Warmuth ¡‘94] ¡ “Experts” ¡ Algorithm ¡ Truth ¡ 1-‑ε ¡ 1-‑ε ¡ 1-‑ε ¡ 1 ¡ 1 ¡ 1 ¡ 1 ¡ Yes ¡ ¡ Yes ¡ ¡ Yes ¡ ¡ No ¡ ¡ No ¡ Yes! ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 6 ¡

Weighted ¡Majority ¡Algorithm ¡ Maintain ¡weights ¡(or ¡probability ¡distribu+on) ¡over ¡experts. ¡ • ¡ Answering/Predic+on: ¡ ¡ Answer ¡using ¡weighted ¡majority, ¡OR ¡ • Randomly ¡pick ¡an ¡expert ¡based ¡on ¡current ¡probability ¡ • distribu+on. ¡Use ¡random ¡experts ¡answer. ¡ ¡ ¡ Update: ¡ Observe ¡truth. ¡ • Decrease ¡weight ¡(or ¡probability) ¡assigned ¡to ¡the ¡experts ¡who ¡are ¡ • wrong. ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 7 ¡

Error ¡Analysis ¡ [Arora, ¡Hazan, ¡Kale ¡‘05] ¡ Theorem: ¡ ¡ Aber ¡t ¡steps, ¡ ¡ let ¡m(t,j) ¡be ¡the ¡number ¡of ¡errors ¡made ¡by ¡expert ¡j ¡ let ¡m(t) ¡be ¡the ¡number ¡of ¡errors ¡made ¡by ¡algorithm ¡ let ¡n ¡be ¡the ¡number ¡of ¡experts, ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 8 ¡

Error ¡Analysis: ¡Proof ¡ • Let ¡ ¡φ(t) ¡= ¡Σw i . ¡Then, ¡φ(1) ¡= ¡n. • When ¡the ¡algorithm ¡makes ¡a ¡mistake, ¡ φ(t+1) ¡≤ ¡φ(t) ¡(1/2 ¡+ ¡½(1-‑ε)) ¡= ¡φ(t)(1-‑ε/2) • When ¡the ¡algorithm ¡is ¡correct,� ¡φ(t+1) ¡≤ ¡φ(t) • Therefore, ¡� Therefore, ¡� ¡φ ¡φ(t (t) ¡ ) ¡≤ ¡ ≤ ¡n(1 (1-‑ -‑ε/2 /2) m(t m(t) ¡� ¡� Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 9 ¡

Error ¡Analysis: ¡Proof ¡ • φ(t) ¡≤ ¡n(1-‑ε/2) m(t) ¡ • Also, ¡W j (t) ¡= ¡(1-‑ε) m(t,j) • φ(t) ¡≥ ¡W j (t) ¡ ¡=> ¡n(1-‑ε/2) m(t) ¡≥ ¡(1-‑ε) m(t,j) • ¡Hence, ¡m(t) ¡≥ ¡2/ε ¡ln ¡n ¡+ ¡2(1+ε)m(t,j) Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 10 ¡

Online ¡Learning ¡ • Mistake ¡bound ¡model ¡ – Algorithm ¡receives ¡an ¡unlabeled ¡example ¡ x ¡ (like ¡our ¡experts) ¡ – Algorithm ¡predicts ¡a ¡classifica+on ¡of ¡this ¡example ¡ p ¡(either ¡-‑1 ¡or ¡+1) ¡ – Environment ¡produces ¡the ¡correct ¡answer ¡ y ¡(either ¡-‑1 ¡or ¡+1) ¡ • Winnow ¡algorithm ¡ – Learn ¡a ¡weight ¡func+on ¡ w ¡such ¡that ¡ sign ( w ¡x ) ¡= ¡p ¡ – Same ¡as ¡the ¡Weighted ¡Majority ¡algorithm ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 11 ¡

Online ¡Shortest ¡Paths ¡Problem ¡ • Input: ¡A ¡directed ¡graph ¡G ¡= ¡(V,E), ¡and ¡a ¡fixed ¡pair ¡of ¡nodes ¡(u,v) ¡ ¡ • Each ¡period ¡(+me ¡t), ¡we ¡pick ¡a ¡path ¡from ¡u ¡to ¡v, ¡and ¡the ¡length ¡of ¡ the ¡path ¡is ¡revealed. ¡ ¡ • Cost ¡at ¡+me ¡t ¡= ¡length ¡of ¡chosen ¡path. ¡ ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 12 ¡

Online ¡shortest ¡paths ¡ • We ¡could ¡have ¡used ¡weighted ¡majority, ¡where ¡each ¡path ¡is ¡an ¡ expert ¡ • But, ¡number ¡of ¡paths ¡(experts) ¡is ¡exponen+al ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 13 ¡

Follow ¡the ¡perturbed ¡leader ¡(FPL) ¡ Randomized ¡variant ¡… ¡ ¡ Ini+aliza+on: ¡ ¡ • Each ¡expert ¡ j ¡is ¡assigned ¡a ¡cost ¡ c(j, ¡0) ¡= ¡0 ¡ Predic+on ¡(+me ¡t): ¡ • For ¡each ¡expert ¡j ¡select ¡p(j, ¡t) ¡ ¡>= ¡0 ¡from ¡an ¡exponen+al ¡ distribu+on ¡( ¡ ¡μ(x) ¡~ ¡εe -‑εx ¡ ¡ ) ¡ • Make ¡the ¡same ¡predic+on ¡as ¡expert ¡with ¡smallest ¡c(j, ¡t) ¡– ¡p(j, ¡t) ¡ Update: ¡ ¡ • If ¡expert ¡j’s ¡predic+on ¡is ¡correct, ¡c(j, ¡t+1) ¡= ¡c(j, ¡t) ¡ • Else, ¡c(j, ¡t+1) ¡= ¡c(j,t) ¡+ ¡1 ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 14 ¡

Error ¡Analysis ¡ Theorem: ¡ ¡ Aber ¡t ¡steps, ¡ ¡ let ¡m(t,j) ¡be ¡the ¡number ¡of ¡errors ¡made ¡by ¡expert ¡j ¡ let ¡m(t) ¡be ¡the ¡number ¡of ¡errors ¡made ¡by ¡algorithm ¡ let ¡n ¡be ¡the ¡number ¡of ¡experts, ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 15 ¡

Linear ¡Generaliza+on ¡ • FPL ¡works ¡for ¡more ¡general ¡predic+on ¡problems, ¡where ¡ ¡ – The ¡predic+on ¡and ¡states ¡are ¡ ¡in ¡R n ¡ – Total ¡cost ¡of ¡the ¡decisions ¡are ¡Σ ¡d t ¡s t ¡ – Σ ¡d t ¡s t ¡should ¡be ¡close ¡to ¡min d ¡Σ ¡d ¡s t ¡ • Mul+ple ¡experts: ¡ – ¡d ¡: ¡0/1 ¡vector ¡where ¡d[j] ¡= ¡1 ¡if ¡expert ¡j ¡is ¡picked ¡by ¡the ¡algorithm ¡ – ¡s ¡: ¡0/1 ¡vector ¡where ¡s[j] ¡= ¡0 ¡if ¡jth ¡expert ¡is ¡correct. ¡ ¡ – Total ¡cost ¡is ¡number ¡of ¡mistakes. ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 16 ¡

Online ¡Shortest ¡Paths ¡ ¡ Algorithm: ¡ ¡ Ini+alize ¡all ¡edge ¡costs ¡c(e,0) ¡= ¡0. ¡ ¡ At ¡each ¡+me ¡period: ¡ ¡ • For ¡each ¡edge, ¡pick ¡p(e, ¡t) ¡from ¡an ¡exponen+al ¡distribu+on ¡ • Use ¡the ¡shortest ¡path ¡in ¡the ¡graph ¡with ¡lengths ¡c(e,t) ¡+ ¡p(e,t) ¡on ¡ each ¡edge. ¡ ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 17 ¡

Online ¡shortest ¡paths ¡ • We ¡could ¡have ¡used ¡weighted ¡majority, ¡where ¡each ¡path ¡is ¡an ¡ expert ¡ • But, ¡number ¡of ¡paths ¡(experts) ¡is ¡exponen+al ¡ • FPL ¡allows ¡solving ¡the ¡problem ¡in ¡polynomial ¡+me. ¡ ¡ E [ cost ] ≤ (1 + ε )(best-time in hindsight) + O ( mn log n ) ε Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 18 ¡

Mul+-‑armed ¡Bandit ¡Problem ¡ • A ¡set ¡of ¡ac+ons ¡(or ¡arms) ¡ • Selec+ng ¡ac+on ¡a ¡in ¡A ¡(or ¡pulling ¡an ¡arm) ¡results ¡in ¡a ¡reward ¡from ¡ an ¡unknown ¡probability ¡distribu+on ¡P(r ¡| ¡a) ¡ • ¡At ¡+me=t, ¡agent ¡selects ¡ac+on ¡a t ¡ • Environment ¡generates ¡reward ¡r t ¡ • Goal ¡is ¡to ¡maximize ¡Σ t ¡r t ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 19 ¡

Applica+on ¡ • Web ¡adver+sing ¡ – What ¡is ¡the ¡best ¡ad/ar+cle ¡to ¡show ¡a ¡user? ¡ • Clinical ¡trials ¡ – Iden+fying ¡efficient ¡drugs ¡with ¡minimal ¡pa+ent ¡loss/side-‑effects ¡ • Web ¡search ¡ – Which ¡result ¡must ¡be ¡ranked ¡at ¡the ¡top? ¡ ¡ • … ¡ Lecture ¡10 ¡: ¡590.02 ¡Spring ¡13 ¡ 20 ¡

This Class Weighted Majority Algorithm Mul+ple experts - PowerPoint PPT Presentation

Online Predic+on & Decision Making CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 10 : 590.02 Spring 13 1 This Class Weighted Majority

Programming Abstraction in C++ Eric S. Roberts and Julie Zelenski Stanford University 2010

BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class

Curriculum on The Cadet Corps Uniform Class A Uniform Class A Uniform Agenda C1. Class A

Electing Your Membership Class Class TG, Class TH, or Class DC As a school employee who

TwissOptics Class Joschua Dilly TwissOptics Class 2 The TwissOptics Class Resonance Driving

3/14/16 Review Class/Object Type Class Keyword class class Point

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Curriculum on The Cadet Corps Uniform Wear It WIth honor Class C Uniform Class C Uniform

Classroom Assessment Scoring System (CLASS) 104 B New Report Format Interpreting your CLASS

Essential Oils Class with Jami Borlik 1 Essential Oils Class with Jami Borlik 2 Essential Oils

Inheritance II Is-a versus has-a When an object of class A has a n object of class B, use

Inheritance A class can be a sub-type of another class The inheriting class contains all

UML Class Diagrams Steven Zeil February 25, 2013 UML Class Diagrams Outline Class

Multiple inheritance Multiple inheritance Can derive a class from more than one base class

CS 135: File Systems Class Overview 1 / 11 Class Overview Todays Topics Purpose of class

Chapter 10 Object-Oriented Thinking 1 Class Abstraction and Encapsulation Class abstraction is

Multiplicative Weights Algorithms CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 13 :

Ontological Pathfinding: Mining First-Order Knowledge from Large Knowledge Bases Yang Chen, Sean

Multimedia Information Retrieval Prof Stefan Rger Multimedia and Information Systems Knowledge

Success Criteria Success Criteria Statement 1 Lorem ipsum dolor sit amet, consectetur

Conditional Gradient Methods via Stochastic Path-Integrated Differential Estimator Alp Yurtsever

Adaptive primal-dual stochastic gradient methods Yangyang Xu Mathematical Sciences, Rensselaer

The Sample-Computational Tradeoff Shai Shalev-Shwartz School of Computer Science and Engineering

Waiting for 6+ years Pete Beckman Argonne National Laboratory 2 Data from Peter