Why is Go hard for computers to play? (PowerPoint PPT Presentation)



SLIDE 1

SLIDE 2

Why is Go hard for computers to play?

Game tree complexity = b^d (branching factor b, game depth d). Brute force search is intractable:
1. The search space is huge.
2. It is "impossible" for computers to evaluate who is winning.
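To make b^d concrete, here is a quick sketch using the commonly cited rough estimates (b ≈ 35, d ≈ 80 for chess; b ≈ 250, d ≈ 150 for Go — approximations from the literature, not figures from this talk):

```python
import math

# Commonly cited rough branching factors (b) and game depths (d).
chess = 35 ** 80    # b^d for chess
go = 250 ** 150     # b^d for Go

# Python integers are arbitrary precision, so log10 of these huge values is fine.
print(f"chess ~ 10^{int(math.log10(chess))}")   # ~10^123
print(f"go    ~ 10^{int(math.log10(go))}")      # ~10^359
```

Go's game tree is hundreds of orders of magnitude larger than chess's, which is why brute force search is hopeless.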

SLIDE 3

Convolutional neural network

SLIDE 4

Value network

Position s → Evaluation v(s)
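As a hedged sketch of that interface (a single linear layer standing in for the 12-layer convolutional net; all names and sizes here are illustrative, not AlphaGo's code):

```python
import math
import random

BOARD_POINTS = 19 * 19  # a Go position flattened to 361 points

def value_network(position, weights):
    """Stand-in for v(s): maps a position to a scalar in (-1, 1),
    an estimate of the game outcome for the player to move.
    One linear layer here; AlphaGo uses a deep convolutional net."""
    score = sum(w * x for w, x in zip(weights, position))
    return math.tanh(score)

random.seed(0)
s = [random.choice([-1, 0, 1]) for _ in range(BOARD_POINTS)]  # -1 white, 0 empty, +1 black
w = [random.uniform(-0.01, 0.01) for _ in range(BOARD_POINTS)]
print(value_network(s, w))  # a single evaluation strictly between -1 and 1
```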

SLIDE 5

Policy network

Position s → Move probabilities p(a|s)
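A minimal sketch of the policy head's output stage (illustrative only: the per-move logits would come from the deep convolutional policy network, which is not shown):

```python
import math

def move_probabilities(logits):
    """Softmax over per-move logits -> the distribution p(a|s)."""
    m = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return [e / z for e in exps]

p = move_probabilities([2.0, 1.0, 0.1])
print(p)  # higher-logit moves get higher probability; the list sums to 1
```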

SLIDE 6

Neural network training pipeline

Human expert positions → Supervised Learning policy network → Reinforcement Learning policy network → Self-play data → Value network

SLIDE 7

Supervised learning of policy networks

- Policy network: 12-layer convolutional neural network
- Training data: 30M positions from human expert games (KGS 5+ dan)
- Training algorithm: maximise likelihood by stochastic gradient descent
- Training time: 4 weeks on 50 GPUs using Google Cloud
- Results: 57% accuracy on held-out test data (state of the art was 44%)
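The "maximise likelihood by stochastic gradient descent" step can be sketched on a toy three-move position (names and sizes are illustrative; the real network has millions of parameters):

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return [e / z for e in exps]

def sgd_step(logits, expert_move, lr=0.1):
    """One stochastic gradient ascent step on log p(a|s) for the
    expert's move a: the log-softmax gradient is one_hot(a) - p."""
    p = softmax(logits)
    return [x + lr * ((1.0 if i == expert_move else 0.0) - p[i])
            for i, x in enumerate(logits)]

logits = [0.0, 0.0, 0.0]
for _ in range(100):
    logits = sgd_step(logits, expert_move=1)
print(softmax(logits))  # probability of the expert's move has risen
```

Repeating this over 30M expert positions is what pushes the network toward predicting human moves.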

SLIDE 8

Reinforcement learning of policy networks

- Policy network: 12-layer convolutional neural network
- Training data: games of self-play between policy networks
- Training algorithm: maximise wins z by policy gradient reinforcement learning
- Training time: 1 week on 50 GPUs using Google Cloud
- Results: won 80% of games vs. the supervised learning network; raw network ~3 amateur dan
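The policy-gradient step ("maximise wins z") can be sketched with REINFORCE on a toy two-move game (everything here is illustrative; the win probabilities are made up, not self-play):

```python
import math
import random

def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return [e / z for e in exps]

def reinforce_step(logits, action, z, lr=0.1):
    """REINFORCE: scale the log-likelihood gradient by the game
    outcome z (+1 win, -1 loss), so winning moves are reinforced."""
    p = softmax(logits)
    return [x + lr * z * ((1.0 if i == action else 0.0) - p[i])
            for i, x in enumerate(logits)]

random.seed(1)
logits = [0.0, 0.0]
WIN_PROB = [0.9, 0.1]  # toy "game": move 0 usually leads to a win
for _ in range(500):
    p = softmax(logits)
    a = 0 if random.random() < p[0] else 1               # sample a move
    z = 1.0 if random.random() < WIN_PROB[a] else -1.0   # play out the game
    logits = reinforce_step(logits, a, z)
print(softmax(logits))  # the winning move now dominates the policy
```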

SLIDE 9

Reinforcement learning of value networks

- Value network: 12-layer convolutional neural network
- Training data: 30 million games of self-play
- Training algorithm: minimise MSE by stochastic gradient descent
- Training time: 1 week on 50 GPUs using Google Cloud
- Results: first strong position evaluation function (previously thought impossible)
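The "minimise MSE" step is ordinary regression of v(s) toward the game outcome z ∈ {-1, +1}. A toy linear version (illustrative: two features stand in for a full board):

```python
def value(w, x):
    """Toy linear v(s) = w.x; the real value network is a deep conv net."""
    return sum(wi * xi for wi, xi in zip(w, x))

def mse_sgd_step(w, x, z, lr=0.01):
    """One SGD step on (v(s) - z)^2: gradient w.r.t. w is 2(v - z)x."""
    g = 2.0 * (value(w, x) - z)
    return [wi - lr * g * xi for wi, xi in zip(w, x)]

w = [0.0, 0.0]
for _ in range(200):
    w = mse_sgd_step(w, [1.0, 0.0], +1.0)  # a position black went on to win
    w = mse_sgd_step(w, [0.0, 1.0], -1.0)  # a position white went on to win
print(w)  # converges close to [1.0, -1.0]
```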

SLIDE 10

Exhaustive search

SLIDE 11

Reducing depth with value network

SLIDE 12

Reducing breadth with policy network
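Both reductions can be sketched together in a toy depth-limited negamax (a sketch only; AlphaGo actually combines the networks inside Monte Carlo tree search): the policy network prunes breadth by expanding only the most probable moves, and the value network cuts depth by scoring leaves with v(s) instead of searching to the end of the game.

```python
def search(state, depth, policy, value, legal_moves, play, top_k=2):
    """Depth-limited negamax; the score is from the side to move."""
    moves = legal_moves(state)
    if depth == 0 or not moves:
        return value(state)          # value network replaces deeper search
    probs = policy(state, moves)     # policy network ranks the moves
    pruned = sorted(moves, key=lambda a: -probs[a])[:top_k]  # breadth cut
    return max(-search(play(state, a), depth - 1,
                       policy, value, legal_moves, play, top_k)
               for a in pruned)

# Toy game to exercise the mechanics: state is an int, a move adds its value.
legal = lambda s: [1, 2, 3] if s < 6 else []
play = lambda s, a: s + a
policy = lambda s, moves: {a: 1.0 / a for a in moves}  # "prefers" small moves
value = lambda s: float(s)

print(search(0, 2, policy, value, legal, play))  # -> 3.0
```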

SLIDE 13

Evaluating AlphaGo against computers

[Figure: Elo rating scale (500-4500) comparing Go programs Gnu Go, Fuego, Pachi, Crazy Stone, Zen, AlphaGo (Nature v13) and AlphaGo (Seoul v18), annotated with the corresponding human ranks: beginner kyu (k), amateur dan (d), professional dan (p).]

SLIDE 14

- Calibration (KGS): computer programs Crazy Stone and Zen beat amateur humans
- AlphaGo (Oct 2015) beats Crazy Stone and Zen
- Nature match: AlphaGo (Oct 2015) beats Fan Hui (2p), 3-times reigning European Champion, 5-0
- DeepMind challenge match: AlphaGo (Mar 2016) beats Lee Sedol (9p), top player of the past decade, 4-1

SLIDE 15

What’s Next?

SLIDE 16

Demis Hassabis