Can Machines Learn? Pascal Poupart, Associate Professor

Jan 04, 2023

Can Machines Learn?
Pascal Poupart
Associate Professor
David R. Cheriton School of Computer Science
University of Waterloo

A Desktop

A Computer Program

Machine Learning
• Arthur Samuel (1959): Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed.
• Tom Mitchell (1998): A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.

Three categories
• Supervised learning
• Reinforcement learning
• Unsupervised learning

Supervised Learning
• Example: digit recognition (postal code)
• Simplest approach: memorization

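Memorization, the simplest approach named above, amounts to a lookup table. The following is a minimal Python sketch, assuming hypothetical 3x3 binary images written as 9-character strings; the `memorize` helper and the toy data are illustrative and do not come from the slides.

```python
def memorize(training_set):
    """Return a hypothesis h that only recalls outputs seen during training."""
    table = dict(training_set)  # maps each seen input x to its output f(x)

    def h(x):
        # Unseen inputs have no entry, so the hypothesis has no answer (None).
        return table.get(x)

    return h


# Hypothetical training examples: (image as a 9-character bit string, digit).
training_set = [
    ("010010010", 1),  # a vertical stroke
    ("111101111", 0),  # a ring
]

h = memorize(training_set)
seen_answer = h("010010010")    # an image from the training set
unseen_answer = h("010010011")  # the same stroke with one extra pixel
```

The table answers correctly for an image it has seen and has no answer for a slightly different one, which is why memorization alone does not generalize.
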
Supervised Learning
• Nearest neighbour:

More Formally
• Inductive learning:
  – Given a training set of examples of the form (x, f(x))
    • x is the input, f(x) is the output
  – Return a function h that approximates f
    • h is called the hypothesis

Prediction
• Find function h that fits f at instances x

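The inductive learning setup above (training pairs (x, f(x)) in, hypothesis h out) can be sketched with a nearest-neighbour rule. This is a minimal illustration that assumes hypothetical 3x3 binary images written as 9-character strings and Hamming distance as the similarity measure; the slides do not specify a distance, and the function names are invented for the example.

```python
def hamming(a, b):
    """Count the positions where two equal-length strings differ."""
    return sum(p != q for p, q in zip(a, b))


def nearest_neighbour(training_set):
    """Return a hypothesis h that copies the output of the closest training input."""

    def h(x):
        _, label = min(training_set, key=lambda example: hamming(example[0], x))
        return label

    return h


# Hypothetical training examples of the form (x, f(x)).
training_set = [
    ("010010010", 1),  # a vertical stroke
    ("111101111", 0),  # a ring
]

h = nearest_neighbour(training_set)
noisy_one = h("010010011")   # stroke with one extra pixel
noisy_zero = h("111101101")  # ring with one missing pixel
```

Unlike a lookup table, this h returns an answer for inputs that never appeared in the training set, by assuming that nearby inputs share the same output.
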
Generalization
• Key: a good hypothesis will generalize well (i.e. predict unseen examples correctly)
• Ockham's razor: prefer the simplest hypothesis consistent with data

Reinforcement Learning
• Differs from supervised learning
  – Supervised learning: "Don't touch. You will get burnt"
  – Reinforcement learning: "Ouch!"

Animal Psychology
• Negative reinforcements:
  – Pain and hunger
• Positive reinforcements:
  – Pleasure and food
• Reinforcements used to train animals
• Let's do the same with computers!

Backgammon
• TD-Gammon:
  – Gerald Tesauro (1995)
  – Computer program
  – Best backgammon player!
• Play many games in simulation against itself
  – +1 for each win
  – -1 for each loss
• Optimization problem: find strategy that maximizes cumulative score

Helicopter Control
• Difficult to control:
  – Highly unstable
• Andrew Ng (Stanford, 2006):
  – Autonomous control by reinforcement learning
  – Step 1: learn neural net simulator based on flight data with human pilot
  – Step 2: optimize controller based on reinforcements for following a predefined trajectory

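The self-play recipe on the Backgammon slide (+1 for a win, -1 for a loss, find the strategy that maximizes the score) can be tried on a much smaller game. The sketch below is not Tesauro's TD-Gammon, which used a neural network. It assumes a toy stick-taking game (take 1 to 3 sticks, whoever takes the last stick wins) and a tabular value update; the function names and the parameters `num_games`, `alpha`, and `epsilon` are hypothetical choices for illustration.

```python
import random


def legal_moves(sticks):
    """A player may take 1, 2 or 3 sticks, but no more than remain."""
    return range(1, min(3, sticks) + 1)


def train_self_play(num_games=20000, max_sticks=12, alpha=0.5, epsilon=0.3, seed=0):
    """Learn move values by playing the game against itself."""
    rng = random.Random(seed)
    # q[(sticks, action)] estimates the final score (+1 win, -1 loss) for the mover.
    q = {(n, a): 0.0 for n in range(1, max_sticks + 1) for a in legal_moves(n)}
    for _ in range(num_games):
        sticks = rng.randint(1, max_sticks)  # random start so every state is visited
        while sticks > 0:
            moves = list(legal_moves(sticks))
            if rng.random() < epsilon:
                action = rng.choice(moves)  # explore
            else:
                action = max(moves, key=lambda a: q[(sticks, a)])  # exploit
            remaining = sticks - action
            if remaining == 0:
                target = 1.0  # the mover took the last stick and wins
            else:
                # The opponent moves next; the opponent's best outcome is the mover's worst.
                target = -max(q[(remaining, a)] for a in legal_moves(remaining))
            q[(sticks, action)] += alpha * (target - q[(sticks, action)])
            sticks = remaining  # both players share the same table
    return q


def best_move(q, sticks):
    """Greedy move under the learned values."""
    return max(legal_moves(sticks), key=lambda a: q[(sticks, a)])


if __name__ == "__main__":
    learned = train_self_play()
    for sticks in (5, 6, 7):
        print(sticks, "sticks: take", best_move(learned, sticks))
```

No one tells the program which moves are good. It only sees wins and losses, and with enough games the greedy move tends to leave the opponent a multiple of four sticks, which is the known winning strategy for this game.
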
Applications of Machine Learning
• Speech recognition
  – dictation software
• Natural Language Processing
  – Text categorization
  – Information Retrieval
• Data Mining
  – Customer profiling
• Robotic Control
  – Mobile robots
  – Soccer playing robots

Vision
• Meta-programming: program computers to learn by themselves
• Lifelong machine learning: machines that continuously learn
• Transfer learning: machines that generalize their experience to new situations
• Challenges:
  – Computational complexity
  – Sample complexity

Thank You
Questions?
