Br Bringing inging Gaming, ing, VR, and nd AR to L to Life fe - - PowerPoint PPT Presentation

br bringing inging gaming ing vr and nd ar to l to life
SMART_READER_LITE
LIVE PREVIEW

Br Bringing inging Gaming, ing, VR, and nd AR to L to Life fe - - PowerPoint PPT Presentation

Br Bringing inging Gaming, ing, VR, and nd AR to L to Life fe Wi With th D Deep L Learn rning Danny Lange Vice President of AI & ML Unity Technologies What Was Before Machine Learning? Clockwork Universe FEEDBACK


slide-1
SLIDE 1

Br Bringing inging Gaming, ing, VR, and nd AR to L to Life fe Wi With th D Deep L Learn rning

Danny Lange Vice President of AI & ML Unity Technologies

slide-2
SLIDE 2

What Was Before Machine Learning?

PROGRAM “ALL-KNOWING PROGRAMMER” DATA RESULTS FEEDBACK Clockwork Universe

slide-3
SLIDE 3

What is Machine Learning

MODEL LEARNER DATA PREDICTIONS HISTORIC DATA Indeterminism

slide-4
SLIDE 4

OODA Loop (John Boyd)

Heisenberg's Uncertainty Principle

  • There is a limit on our ability to observe reality with precision.

Gödel's Incompleteness Theorem

  • Any model of reality is incomplete and must be continuously

refined in the face of new observations.

Second Law of Thermodynamics (Entropy) - Ludwig Boltzman

  • Any given system is continuously changing even as we try to maintain order
slide-5
SLIDE 5

Multi-armed Bandit & Reinforcement Learning

Objective: Maximize winnings Exploration vs Exploitation

  • Gaining knowledge
  • Max payout with current knowledge

Reinforcement Learning

  • Actions
  • Rewards

X% Y% Z%

AG AGENT EN ENVIRONMEN ENT ACTION STATE & REWARD

slide-6
SLIDE 6

DeepMind Playing Atari

slide-7
SLIDE 7
slide-8
SLIDE 8

DeepMind Lab

Current AI Research Landscape

OpenAI Gym/Universe

slide-9
SLIDE 9

About Unity

Leading global game industry platform

  • 16 Billion Downloads in 2016 – up 31%
  • 2.6 Billion Unique Devices
  • 700 Million Gamers
  • 38% of top 1000 free mobile games
slide-10
SLIDE 10

The Unity Ecosystem

slide-11
SLIDE 11

Bringing Reinforcement Learning to Unity

slide-12
SLIDE 12

How did the Chicken Cross the Road?

Actions Rewards

  • Fatal penalty (being hit by a car)
  • Positive reward (collecting gift packet)

Exploration Exploitation

slide-13
SLIDE 13
slide-14
SLIDE 14

Navigating by reading maps

slide-15
SLIDE 15
slide-16
SLIDE 16

Network Architecture

  • Deep Recurrent Q-Network
  • Dual-stream input
  • Train network with additional

auxiliary losses Actions Inputs Auxiliary signals

slide-17
SLIDE 17
slide-18
SLIDE 18

Synthetic Data

slide-19
SLIDE 19
slide-20
SLIDE 20
slide-21
SLIDE 21

Bringing it to developers and researchers

slide-22
SLIDE 22

Unity Reinforcement Learning API (Coming Soon)

Action State & Reward

slide-23
SLIDE 23
slide-24
SLIDE 24
slide-25
SLIDE 25

Bringing it together

slide-26
SLIDE 26
slide-27
SLIDE 27

Danny Lange Vice President, AI & Machine Learning +1 425.463.5801 dlange@unity3d.com

@danny_lange dannylange