2018.8/7 @ High Energy Astrophysics Workshop 2018
Deep Learning:
Review of the Current Status and Possible Applications to Astrophysics and Astronomy
(Deep Learning: Review and Possible Applications)
Masato Taki
RIKEN, iTHEMS
Deep Learning ⊂ Machine Learning ⊂ AI
AI: many approaches (or any approach); Machine Learning: an approach; Deep Learning: a set of concrete methods.
Big progress over the past 6 years:
‘the time is ripe for them’
Deep Learning = a kind of Machine Learning
Machine Learning: improving a computer program's (= machine's) ability to solve tasks through experience / data.
Learning/Training?
x, y ∼ P(x, y): input–output pairs drawn from a joint distribution.
Cat = (0, 0, 0, 1, 0, 0, …)
Supervised Learning
x, y ∼ P(x, y): predict the output y from the input x.
Example pair: "This is a pen." ↔ "これはペンです。"
Model
Which model should we employ?
Brain … ??
Modeling the brain!? → the (Artificial) Neural Network
Neural Network / Deep Learning
Model = Directed Graph
[Figure: a directed graph carrying the inputs x1, x2 to the outputs y1, y2]
・We can solve various problems by designing a 'good graph'.
・Intuitive (= geometric) design of the network.
Deep Learning = Geometrization of Machine Learning !
c.f. General Relativity = Geometrization of Gravity
Details on Neural Network
McCulloch-Pitts's Artificial Neuron (1943)
A neuron collects the inputs x1, x2, … from other neurons, forms the full input u = Σ_i x_i, and sends the output a(u) to other neurons.
Example: inputs 9 and 17 give u = 9 + 17 = 26, so the neuron outputs a(26).
Activation function a(u): e.g. the ReLU a(u) = max(0, u), or a step function with a threshold.
Example (ReLU): inputs 9 and 17 give u = 26 and output a(26) = 26; inputs 9 and −17 give u = −8 and output a(−8) = 0.
With u = Σ_i x_i and a threshold activation a(u), such neurons can realize any logical circuit.
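A minimal Python sketch of a McCulloch-Pitts unit realizing logic gates; the threshold values below are illustrative choices, not from the slides:

```python
# McCulloch-Pitts neuron: sum the inputs and fire iff the sum reaches a
# threshold. Thresholds chosen here to realize AND and OR (assumptions).
def mp_neuron(inputs, threshold):
    u = sum(inputs)                    # full input u = sum_i x_i
    return 1 if u >= threshold else 0  # step activation a(u)

AND = lambda x1, x2: mp_neuron([x1, x2], threshold=2)
OR = lambda x1, x2: mp_neuron([x1, x2], threshold=1)

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, AND(x1, x2), OR(x1, x2))
```

Chaining such gates gives any logical circuit, which is the sense of the claim above.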
Rosenblatt's Perceptron (1957)
Tunable parameters: u = Σ_i w_i x_i, output a(u), where the weight w_i ~ the connection strength between a pair of neurons.
Multi-layer (Artificial) Neural Network
Stacking perceptrons gives a multi-layer neural net with weights w1, …, w6.
Fit = train these parameters (fitting = training / learning).
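A toy sketch of a Rosenblatt-style perceptron trained with the classic perceptron update rule; the AND-like dataset, learning rate, and epoch count are assumptions for illustration:

```python
# Perceptron: weighted sum u = sum_i w_i x_i + b with a step activation;
# weights are fit from labeled examples (toy AND dataset, an assumption).
def predict(w, b, x):
    u = sum(wi * xi for wi, xi in zip(w, x))
    return 1 if u + b > 0 else 0

def train(data, epochs=20, lr=0.1):
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, y in data:
            err = y - predict(w, b, x)  # classic perceptron update rule
            w = [wi + lr * err * xi for wi, xi in zip(w, x)]
            b += lr * err
    return w, b

data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w, b = train(data)
print([predict(w, b, x) for x, _ in data])  # [0, 0, 0, 1]
```

On linearly separable data like this the perceptron is guaranteed to converge in finitely many updates.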
ŷ(x; w): the model's prediction for the input x, as a function of the weights w.
Supervised Learning
Observed Data Set: (x1, y1), (x2, y2), …, (xN, yN)
Prediction Error Measure:
E(w) = (1/N) Σ_{n=1}^{N} (ŷ(x_n; w) − y_n)²   (e.g. the Mean Square Error)
w* = argmin_w E(w)
In practice a 'pseudo'-optimization is performed (regularization, modified optimization algorithms, …).
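The mean-square-error fit w* = argmin_w E(w) can be illustrated with plain gradient descent on a one-parameter model ŷ(x; w) = w·x; the toy data (true slope 2) and learning rate are assumptions:

```python
# Fit y_hat(x; w) = w * x by gradient descent on the mean square error
# E(w) = (1/N) sum_n (y_hat(x_n; w) - y_n)^2. Toy data: true slope 2.0.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]

def mse(w):
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

w = 0.0
lr = 0.05
for _ in range(200):
    # dE/dw = (1/N) sum_n 2 * (w * x_n - y_n) * x_n
    grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
    w -= lr * grad  # gradient-descent step

print(w, mse(w))  # w approaches argmin_w E(w) = 2.0
```

Real deep-learning training uses stochastic variants of exactly this update, applied to millions of weights.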
Neural Network / Deep Learning
Why such high performance? Still an open question.
Achievements in Image Recognition
ILSVRC (ImageNet Large Scale Visual Recognition Challenge)
An image-recognition competition (2010-2017) using the ImageNet dataset: 14 million images, 1000 classes.
Error Rate (Top-5 Error):
2010: 28%    (pre-DL)
2011: 26%    (pre-DL)
2012: 16%    Toronto Univ (AlexNet, 8 layers)
2013: 11.7%  Clarifai (ZF-Net, 8 layers)
2014: 6.6%   Google (GoogLeNet → Inception, 22 layers); also Oxford, 7.4% (VGG16/19)
2015: 3.57%  Microsoft (ResNet, 152 layers)
2016: 2.99%  Trimps-Soushen
2017: 2.25%  momenta.ai
For comparison, human error: 5.1%.
Well-used models for research: AlexNet, VGG16/19, GoogLeNet/Inception.
https://www.wired.com/2013/03/google_hinton/
https://clarifai.com/about
Fun experiments that demonstrate DL's high generalization ability
Caption Generator (Image2Text) [Google, 2014]
Text2Image [Mansimov et al, 2015]
Generative Adversarial Net (GAN)
GAN [Goodfellow et al., 2014]~
DCGAN [Radford-Metz-Chintala, 2015]
Analogy: a counterfeiter (the generator) vs. a policeman (the discriminator). The generator turns random noise into fake data; the discriminator judges whether data is true or fake. The two networks compete with each other:
min_G max_D V(D, G)
V(D, G) = E_{x∼P_data}[log D(x)] + E_{z∼P_noise}[log(1 − D(G(z)))]
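The value V(D, G) is defined by expectations, so it can be estimated by Monte-Carlo sampling. A toy 1-D sketch with hand-made (untrained) D and G; every function and distribution here is an illustrative assumption, not the paper's setup:

```python
import math
import random

# Monte-Carlo estimate of the GAN value
#   V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_noise}[log(1 - D(G(z)))]
# with fixed toy choices of D, G, P_data, and P_noise (all assumptions).
random.seed(0)

def D(x):
    """Discriminator: probability that x is real (a simple sigmoid)."""
    return 1.0 / (1.0 + math.exp(-x))

def G(z):
    """Generator: deterministic map from noise z to fake data."""
    return 0.5 * z - 1.0

N = 10000
real = [random.gauss(2.0, 1.0) for _ in range(N)]   # P_data = N(2, 1)
noise = [random.gauss(0.0, 1.0) for _ in range(N)]  # P_noise = N(0, 1)

V = (sum(math.log(D(x)) for x in real) / N
     + sum(math.log(1.0 - D(G(z))) for z in noise) / N)
print(V)  # D is trained to raise this value, G to lower it
```

In actual GAN training, D and G are neural networks and these two expectations become their (adversarial) loss functions.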
GAN [Goodfellow et al., 2014]~
The generator is a deterministic NN mapping noise to fake data, which can be used to lighten Monte-Carlo computations.
(Bias inherited from the training data is unavoidable.)
A simple experiment that can be run even on a laptop (using the FashionMNIST dataset).
pix2pix: image-to-image translation [Isola et al., 2017]
Example: translate a Google Maps image into a 'satellite photo' (original image → translated image), with the discriminator judging image T/F.
Experiment on a processed version of Oxford Univ's flower dataset: fewer than 1000 images, trained on 4 GPUs.
*With more careful model tuning (rather than this rough run), the results would improve further.
Neural Conversation Model
Seq2Seq = encoder + decoder
The source sentence is encoded into a context vector c, which is then decoded into the target sentence.
If you train it on conversation data, you get a "chat machine": "How are you?" → "I am fine."
Neural Conversation Model [Vinyals-Le, ’15]
Human: hello ! Machine: hello ! Human: how are you ? Machine: i ’m good . Human: what ’s your name ? Machine: i ’m julia . Human: when were you born ? Machine: july 20th . Human:what year were you born ? Machine: 1977 .
The training data was taken from OpenSubtitles, a dataset of movie scripts.
Human: where are you from ? Machine: i ’m out in the boonies. Human: are you a man or a woman ? Machine: i ’m a woman . Human: why are we here ? Machine: i ’m not sure . Human: okay, bye ! Machine: bye . Human: see you later . Machine: bye .
The machine 'introduces itself' as Julia.
Neural Conversation Model [Vinyals-Le, ’15]
A philosophical discussion with Julia:
Human: what is the purpose of life ? Machine: to serve the greater good . Human: what is the purpose of living ? Machine: to live forever . Human: what is the purpose of existence ? Machine: to find out what happens when we get to the planet earth . Human: where are you now ? Machine: i ’m in the middle of nowhere . Human: what is the purpose of dying ? Machine: to have a life . Human: what is the purpose of being intelligent ? Machine: to find out what it is . Human: what is the purpose of emotions ? Machine: i don ’t know .
Neural Style Transfer
Painting style ('neural style')
Neural Style Transfer [Gatys et al., 2015]
[Figure: high-frequency vs. low-frequency components; combining Image1 with the style (style 1, style 2) of Image2 yields the stylized Image2']
Deep Q-Network [DeepMind,`16]
DeepMind: a venture led by neuroscientist D. Hassabis et al. (acquired by Google in 2014 for roughly 50 billion yen).
https://www.youtube.com/watch?v=iqXKQf2BOSE
The Curse of Dimensionality & Lazy Learning
Lazy learning: without learning, just record the whole data set. To find patterns / classify a new sample, measure the similarity (distance) between the stored data and the new data point (e.g. k-nearest neighbors). [Figure: 'near' vs. 'far' neighbors of a new sample]
Volume of a radius-R ball in D dimensions: V_D(R) = R^D × V_D(1)
Ratio of the volume of the thin skin (0.99 ≤ r ≤ 1) in a radius-1 ball:
(V_D(1) − V_D(0.99)) / V_D(1) = 1 − (0.99)^D
1 − (0.99)^D:
D = 1:    0.01   =  1%
D = 2:    0.0199 =  2%
D = 3:    0.0297 =  3%
D = 10:   0.0956 = 10%
D = 1000: 0.9999… ≈ 99.99%
The thin skin dominates the volume!! (opposite to intuition) — the data points become maximally separated.
In higher dimensions, data is too sparse to extract a geometric structure even in a 'big data' situation.
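The shell fractions above follow directly from V_D(R) = R^D × V_D(1); a quick check in Python:

```python
# Fraction of a unit D-ball's volume lying in the thin shell 0.99 <= r <= 1.
# Since V_D(R) = R**D * V_D(1), the fraction is 1 - 0.99**D.
for D in (1, 2, 3, 10, 1000):
    print(D, 1 - 0.99 ** D)
```

At D = 1000 the shell already holds more than 99.99% of the volume, which is the 'maximally separated' regime described above.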
Representation Learning & Dim. Compression
A good information representation enables clustering in lower dimensions and anomaly detection.
How to make a good representation?
Reinforcement Learning
The agent (a program) observes a state s, takes an action a ∼ π(a|s) under a policy π, and receives a reward r (a score).
Action value function Q^π(s, a): the total reward obtained under the policy π.
Q-Learning
The agent (a program) learns the action value from (state s, action a, reward r) and derives its policy from it.
Deep Q-Learning
A deep network takes the state s and outputs the action values Q^π(s, a1), Q^π(s, a2), Q^π(s, a3), …; the action is chosen by optimizing over them. We know deep learning is a powerful learner.
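A tabular Q-learning sketch on a tiny chain environment; the environment, learning rates, and episode count are illustrative assumptions (Deep Q-Learning replaces the table Q[s][a] with a deep network):

```python
import random

# Tabular Q-learning on a toy 1-D chain: states 0..3, actions left/right,
# reward 1 only upon reaching the goal state 3 (all choices are assumptions).
random.seed(0)
N_STATES, GOAL = 4, 3
Q = [[0.0, 0.0] for _ in range(N_STATES)]  # Q[s][a]; a=0: left, a=1: right
alpha, gamma, eps = 0.5, 0.9, 0.1

def step(s, a):
    s2 = max(0, min(N_STATES - 1, s + (1 if a == 1 else -1)))
    return s2, (1.0 if s2 == GOAL else 0.0)

for _ in range(500):  # episodes
    s = 0
    while s != GOAL:
        # epsilon-greedy action selection
        if random.random() < eps:
            a = random.randrange(2)
        else:
            a = max((0, 1), key=lambda a: Q[s][a])
        s2, r = step(s, a)
        # Q-learning update toward r + gamma * max_a' Q(s', a')
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s = s2

print([max((0, 1), key=lambda a: Q[s][a]) for s in range(GOAL)])
```

After training, the greedy policy moves right in every state, i.e. straight toward the reward.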
Monte Carlo + Reinforcement Learning
Object detection: YOLO
The language is basically Python.
Many libraries for DL are Python-based. Coding is not the main purpose: analyzing data and doing machine learning is (basically) our job.
Libraries for DL
e.g. TensorFlow (by Google), PyTorch (by Facebook), Chainer (by Preferred Networks), Keras (by a Googler)
The libraries I have used include these, plus Theano etc. Which is the best? → Whichever you like.
http://www.timqian.com/star-history/#tensorflow/tensorflow&BVLC/caffe&caffe2/caffe2&Microsoft/CNTK&apache/incubator-mxnet&torch/torch7&pytorch/pytorch&deeplearning4j/deeplearning4j&Theano/Theano&amzn/amazon-dsstne&chainer/chainer
A criterion for the usual user:
https://towardsdatascience.com/battle-of-the-deep-learning-frameworks-part-i-cff0e3841750
Keras is my recommendation.
New Model for segmentation
[M.T & Murata, TBA]
Classes: Normal / Benign / In situ carcinoma / Invasive carcinoma
Segmentation = pixel-wise classification.
Application to Segmentation of Breast Cancers
Performance of our new model
Comparison: DL's predictions (our model and U-Net) vs. the doctor's diagnosis (segmentation).
Stable performance without fine tuning.
Applications of our new model: retina images, colon cancer, 3D cancer CT images, etc.
Generator of GAN architecture
Applications
Scientific data (collider, astro, material, …); simulated data (lightening Monte-Carlo computations with NNs, …); other practical domains (medical, drug design, …).
What you physicists can do: a suggestion
You physicists can handle math and code, and you have numerical sense, …
Study of DL itself:
Improve DL's algorithms. Solve the mysteries of DL.
(Don't apply physics directly…. Solve the proper problems.)