LeelaChessZero Open Source Community (F. Huizinga) Overview What - - PowerPoint PPT Presentation

leelachesszero
SMART_READER_LITE
LIVE PREVIEW

LeelaChessZero Open Source Community (F. Huizinga) Overview What - - PowerPoint PPT Presentation

LeelaChessZero Open Source Community (F. Huizinga) Overview What is Lc0? The GameTree and A0 in a nutshell Contribute Useful links Technical details What is Lc0? 2016 Deepminds AlphaGo 2017 AlphaZero


slide-1
SLIDE 1

LeelaChessZero

Open Source Community (F. Huizinga)

slide-2
SLIDE 2

Overview

  • What is Lc0?
  • The GameTree and A0 in a nutshell
  • Contribute
  • Useful links
  • Technical details
slide-3
SLIDE 3

What is Lc0?

  • 2016 Deepmind’s AlphaGo
  • 2017 AlphaZero
  • 2017 LeelaZero
  • 2018 LeelaChessZero
slide-4
SLIDE 4

The Game Tree

slide-5
SLIDE 5

Why care?

  • General approach, no domain knowledge

required (Go, Chess, Shogi, …)

  • Visual interpretation of the game allows for a

deep positional - and materialistic understanding obtained from selfplay

  • Fascinating gameplay, see youtube videos
  • n alphazero/leelachesszero
slide-6
SLIDE 6

LeelaChessZero

  • Initially missing details on the neural network

architecture

  • Variable compute budget
  • Obtain dedicated hardware for training
  • Always looking for contributors

○ Developers ○ Computational help ○ Testers/Elo estimators ○ Enthusiasts

slide-7
SLIDE 7

Links

  • lczero.org
  • testtraining.lczero.org
  • github.com/LeelaChessZero
  • discord.gg/pKujYxD
slide-8
SLIDE 8

Thanks to

  • DeepMind
  • Gian-Carlo Pascutto
  • Leela Developers
  • Lc0 Developers
  • Testers
  • Chess enthusiasts
slide-9
SLIDE 9

Minimax Algorithm

  • 1

+1

  • 1
  • 1

+1 +1 min(0, 0, -1) min(0, +1) max(-1, +1, -1)

slide-10
SLIDE 10

Evaluation Function

  • Minimax unable to reach terminal nodes

given time constraints

  • Approximate minimax value of subtree
  • Must evaluate non-terminal nodes
  • Centuries of human chess understanding to

properly define this function

slide-11
SLIDE 11

Minimax + Eval

8

  • 3

1 2 4

  • 3

2 2 min(8, -3, 1) min(2, 4) max(-3, 2, 0)

slide-12
SLIDE 12

AlphaZero

Main objective: Prune the gametree Learn the evaluation function (value) and most promising moves (policy) of the gametree iteratively from selfplay data.

slide-13
SLIDE 13

Neural Network

Neural Network Expected outcome: 1 Move distribution X O X O 2 4 1 2 1

slide-14
SLIDE 14

Training Data

Neural Network Expected outcome: 1 Move distribution X O X O 2 4 1 2 1 Game state Result Win +1 Loss -1 Draw 0 Policy

Obtain data through selfplay

slide-15
SLIDE 15

(MCT) Search

slide-16
SLIDE 16

(MCT) Search

slide-17
SLIDE 17

(MCT) Search

slide-18
SLIDE 18

(MCT) Search

slide-19
SLIDE 19

(MCT) Search

slide-20
SLIDE 20

(MCT) Search

slide-21
SLIDE 21

Records of data

(State1, Policy1, Result1) (State2, Policy2, Result2) ... (Staten, Policyn, Resultn) Where n is the total moves in the game played.