SLIDE 1

Reinforcement Learning with Neural Networks for Quantum Multiple Hypothesis Testing

Sarah Brandsen1, Kevin D. Stubbs2, Henry D. Pfister2,3

1 Department of Physics, Duke University 2 Department of Mathematics, Duke University 3 Department of Electrical Engineering, Duke University

IEEE International Symposium on Information Theory June 21-26, 2020

(Duke University) Reinforcement Learning with Neural Networks for Quantum Multiple Hypothesis Testing 1 / 27

SLIDE 2

Outline

1. Overview of Multiple State Discrimination
2. Reinforcement Learning with Neural Networks (RLNN)
3. Comparing RLNN performance to known results
   - Binary pure state discrimination
   - RLNN performance as a function of subsystem number
   - Comparison to the "Pretty Good Measurement"
4. Performance of RLNN in more general cases
   - Trine ensemble
   - Comparison to semidefinite programming upper bounds
5. Open questions

SLIDE 3

Quantum State Discrimination

Given: ρ ∈ {ρ_j}_{j=1}^m with priors q = (q_1, ..., q_m), where q_j = Pr(ρ = ρ_j).

Objective: find a quantum measurement Π̂ = {Π_j}_{j=1}^m that maximizes

P_success = Σ_{j=1}^m q_j Tr[ρ_j Π_j]

[Figure: four candidate states ρ_1, ρ_2, ρ_3, ρ_4]
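This objective is straightforward to evaluate numerically. A minimal NumPy sketch (the states, priors, and POVM below are illustrative, not from the talk):

```python
import numpy as np

def success_probability(priors, states, povm):
    """P_success = sum_j q_j Tr[rho_j Pi_j] for a candidate POVM."""
    return sum(q * np.trace(rho @ P).real
               for q, rho, P in zip(priors, states, povm))

# Illustrative example: two pure qubit states, measured in the computational basis.
rho0 = np.array([[1, 0], [0, 0]], dtype=complex)          # |0><0|
rho1 = np.array([[0.5, 0.5], [0.5, 0.5]], dtype=complex)  # |+><+|
povm = [np.diag([1.0, 0.0]).astype(complex),              # outcome 0 -> guess rho0
        np.diag([0.0, 1.0]).astype(complex)]              # outcome 1 -> guess rho1
p = success_probability([0.5, 0.5], [rho0, rho1], povm)   # 0.5*1 + 0.5*0.5 = 0.75
```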

SLIDE 4

Locally Adaptive Strategies

Locally adaptive protocols consist of measuring one subsystem at a time, then choosing the next subsystem and measurement based on previous results

[Figure: candidate states ρ_1, ..., ρ_4, each with subsystems ρ_j^(1), ρ_j^(2), ρ_j^(3), measured one subsystem at a time]

SLIDE 5

Motivation for Locally Adaptive Strategies

An analytic solution for the optimal collective measurement is generally not known when m ≥ 3. Approximately optimal solutions found via semidefinite programming [EMV03] may be experimentally impractical for large systems.

[Figure: same grid of candidate states ρ_j^(k) as the previous slide]

ρ_j = ⊗_{k=1}^n ρ_j^(k)

SLIDE 6

Reinforcement Learning

Main idea: the agent learns to maximize the expected future reward through repeated interactions with the environment.

[Figure: agent-environment loop with actions a_t ∈ A, states s_t ∈ S, and rewards r_t]

SLIDE 7

Advantage function

Agent’s policy: draw a random action a given state s according to π_θ(a|s) = Pr[A = a | S = s].

Advantage function: compares the expected reward of choosing action a given state s to the average expected reward for being in state s under policy π:

A^π(s_t, a_t) = Σ_{ℓ=t}^N γ^{ℓ−t} ( E_{π_θ}[r(s_ℓ, a_ℓ) | s_t, a_t] − E_{π_θ}[r(s_ℓ, a_ℓ) | s_t] )
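In practice the advantage is usually estimated from sampled trajectories as the discounted return minus a baseline V(s). A minimal Monte Carlo sketch (the rewards and baseline values below are made-up placeholders):

```python
def advantage_estimates(rewards, values, gamma=0.99):
    """A_t ~= (sum_{l>=t} gamma^(l-t) r_l) - V(s_t), for each timestep t."""
    returns, g = [], 0.0
    for r in reversed(rewards):       # accumulate discounted return backward
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return [g - v for g, v in zip(returns, values)]

# Illustrative trajectory: two penalties, then a terminal reward (gamma = 1 for clarity).
adv = advantage_estimates([-0.5, -0.5, 1.0], [0.0, 0.2, 0.5], gamma=1.0)
# returns = [0.0, 0.5, 1.0]  ->  adv = [0.0, 0.3, 0.5]
```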

SLIDE 8

Neural Networks for Function Approximation

Setup: we use a fully connected neural network where the input layer feeds into two parallel sets of sub-networks.

[Figure: input layer followed by two hidden layers, feeding a policy head π*(a_1|s), ..., π*(a_|A||s) and a value head V(s)]
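A forward pass through such a two-headed architecture can be sketched in plain NumPy. The layer sizes, activation, and random weights here are placeholders standing in for trained parameters, not the talk's actual hyperparameters:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_heads(s, dim_in=6, hidden=32, n_actions=4):
    """Input feeds two parallel sub-networks: a policy head pi(a|s) and a value head V(s)."""
    def layer(x, w, b):
        return np.tanh(x @ w + b)
    # Policy sub-network: two hidden layers, softmax over actions.
    h = layer(s, rng.standard_normal((dim_in, hidden)), np.zeros(hidden))
    h = layer(h, rng.standard_normal((hidden, hidden)), np.zeros(hidden))
    logits = h @ rng.standard_normal((hidden, n_actions))
    pi = np.exp(logits - logits.max())
    pi /= pi.sum()
    # Value sub-network: two hidden layers, scalar output.
    g = layer(s, rng.standard_normal((dim_in, hidden)), np.zeros(hidden))
    g = layer(g, rng.standard_normal((hidden, hidden)), np.zeros(hidden))
    v = float(g @ rng.standard_normal(hidden))
    return pi, v

pi, v = mlp_heads(np.zeros(6))  # zero input -> uniform policy, zero value
```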

SLIDE 9

Set of allowed quantum measurements

Binary Projective Measurement Set – taken to be {Π̂(ℓ)}_{ℓ=0}^{Q−1}, where each Π̂(ℓ) = {Π₊(ℓ), Π₋(ℓ)} consists of the complementary projectors (writing x = ℓ/Q)

Π₊(ℓ) = [[ x², x√(1 − x²) ], [ x√(1 − x²), 1 − x² ]]
Π₋(ℓ) = [[ 1 − x², −x√(1 − x²) ], [ −x√(1 − x²), x² ]]

and ℓ ∈ {0, 1, ..., Q − 1}. Q = 20 in our experiments.

[Figure: neighboring measurement directions Π(ℓ − 1), Π(ℓ), Π(ℓ + 1)]
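Read this way, each Π̂(ℓ) is a pair of rank-1 projectors onto the real unit vector (x, √(1 − x²)) and its orthogonal complement. The matrix form above is a reconstruction from a garbled slide, so treat this NumPy sketch as illustrative:

```python
import numpy as np

def measurement_pair(l, Q=20):
    """Projective pair Pi_hat(l) = {Pi_plus, Pi_minus} for l in {0, ..., Q-1}."""
    x = l / Q
    y = np.sqrt(1 - x**2)
    v = np.array([x, y])    # Pi_plus projects onto (x, sqrt(1 - x^2))
    w = np.array([y, -x])   # Pi_minus projects onto the orthogonal vector
    return np.outer(v, v), np.outer(w, w)

# Sanity checks: each element is a projector and the pair resolves the identity.
for l in range(20):
    P, Pc = measurement_pair(l)
    assert np.allclose(P @ P, P)
    assert np.allclose(P + Pc, np.eye(2))
```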

SLIDE 10

Applying RLNN to Multiple State Discrimination

Initialize: randomly generate ρ ∈ {ρ_j}_{j=1}^m according to q = (q_1, ..., q_m), and initialize s = (s_1, ..., s_n) to the all-zeros vector, e.g. s = [0, 0, 0] for n = 3.

SLIDE 11

Applying RLNN to Multiple State Discrimination (cont)

Step: the agent chooses an action of the form (j, Π̂). The action is implemented and an outcome is sampled according to Tr[Π_out ρ]. The prior is updated via Bayes’ theorem and s_j is set to 1. For example, measuring subsystem j = 2 gives s → [0, 1, 0].
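The measurement-and-update step can be sketched as follows: sample an outcome with probability Tr[Π_out ρ], then update each hypothesis weight by its likelihood, q_j → q_j Tr[Π_out ρ_j] / Pr(out). The states and POVM below are illustrative:

```python
import numpy as np

def measure_and_update(priors, subsystem_states, povm, true_index, rng):
    """Sample an outcome from Tr[Pi_out rho] on one subsystem, then apply Bayes' rule."""
    probs = np.array([np.trace(P @ subsystem_states[true_index]).real for P in povm])
    out = rng.choice(len(povm), p=probs / probs.sum())
    likelihood = np.array([np.trace(povm[out] @ rho).real for rho in subsystem_states])
    posterior = priors * likelihood
    return out, posterior / posterior.sum()

rng = np.random.default_rng(1)
rho0 = np.array([[1, 0], [0, 0]], dtype=complex)
rho1 = np.array([[0, 0], [0, 1]], dtype=complex)
povm = [rho0.copy(), rho1.copy()]  # computational-basis measurement
out, post = measure_and_update(np.array([0.5, 0.5]), [rho0, rho1], povm, 0, rng)
# With orthogonal states, one measurement collapses the posterior onto the truth.
```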

SLIDE 12

Reward scheme

If subsystem j has already been measured in a previous round, return a penalty of r = −0.5.

When s_j = 1 for all j, return a reward of 1 if ρ_guess = ρ and 0 otherwise.¹

¹Results are generated using the default PPO algorithm from Ray version 0.7.6.
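This reward scheme is simple enough to write down directly; a minimal sketch (the function signature is my own, not from the talk's code):

```python
def reward(s, j, done, guess_correct):
    """Reward scheme from the slide: -0.5 for re-measuring subsystem j,
    terminal reward 1 for a correct guess once every subsystem is measured."""
    if s[j] == 1:              # subsystem j was measured in a previous round
        return -0.5
    if done:                   # all s_j == 1 after this step: episode ends with a guess
        return 1.0 if guess_correct else 0.0
    return 0.0                 # intermediate step, no reward

assert reward([0, 1, 0], 1, False, False) == -0.5
```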

SLIDE 13

Binary Pure State Discrimination

Setup: in the special case where m = 2, the state set is {ρ₊, ρ₋} with prior q = Pr(ρ = ρ₊).

Optimal solution: the Helstrom measurement Π_h = {Π₊, Π₋} is optimal, where Π_± are the projectors onto the positive/negative eigenspace of M = qρ₊ − (1 − q)ρ₋.

In the special case where ρ_± are both tensor products of pure subsystems, an adaptive greedy protocol is fully optimal.
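The Helstrom measurement follows directly from the eigendecomposition of M. A NumPy sketch (the example states are illustrative):

```python
import numpy as np

def helstrom(q, rho_plus, rho_minus):
    """Projectors onto the positive/negative eigenspaces of M = q rho_+ - (1-q) rho_-,
    and the resulting success probability."""
    M = q * rho_plus - (1 - q) * rho_minus
    vals, vecs = np.linalg.eigh(M)
    pos = vecs[:, vals > 0]
    P_plus = pos @ pos.conj().T
    P_minus = np.eye(len(M)) - P_plus
    p_succ = (q * np.trace(rho_plus @ P_plus) +
              (1 - q) * np.trace(rho_minus @ P_minus)).real
    return P_plus, P_minus, p_succ

# Equiprobable |0> vs |+>: success probability (1 + 1/sqrt(2))/2 ~ 0.854
rho_p = np.array([[1, 0], [0, 0]], dtype=complex)
rho_m = np.array([[0.5, 0.5], [0.5, 0.5]], dtype=complex)
P_plus, P_minus, p = helstrom(0.5, rho_p, rho_m)
```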

SLIDE 14

RLNN Performance in the Binary Case

Setup: for each trial, we randomly select pure tensor-product quantum states with m = 2, n = 3. Results for the optimal RLNN policy are plotted after 1000 training iterations.

[Figure: P_succ for the Helstrom measurement and RLNN across 8 random trials]

SLIDE 15

RLNN Performance as Function of Training Iterations

[Figure: P_succ,Helstrom − P_succ,RLNN as a function of training iteration (200 to 1,000)]

SLIDE 16

Special known case

Given a base set {ρ0, ρ1}, consider:

S(1) = {ρ0, ρ1}
S(2) = {ρ0 ⊗ ρ0, ρ0 ⊗ ρ1, ρ1 ⊗ ρ0, ρ1 ⊗ ρ1}
S(3) = {ρ0 ⊗ ρ0 ⊗ ρ0, ρ0 ⊗ ρ0 ⊗ ρ1, ρ0 ⊗ ρ1 ⊗ ρ0, ρ0 ⊗ ρ1 ⊗ ρ1, ρ1 ⊗ ρ0 ⊗ ρ0, ρ1 ⊗ ρ0 ⊗ ρ1, ρ1 ⊗ ρ1 ⊗ ρ0, ρ1 ⊗ ρ1 ⊗ ρ1}
...

and for each state set, assume each candidate state is equally probable.
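The state set S(n) contains all 2^n tensor products of the base states, which is easy to enumerate with Kronecker products. A sketch (the base states are illustrative):

```python
import numpy as np
from itertools import product

def state_set(base, n):
    """S(n): all n-fold tensor products of states drawn from the base set."""
    out = []
    for bits in product(range(len(base)), repeat=n):
        rho = np.array([[1.0]])
        for b in bits:
            rho = np.kron(rho, base[b])   # build the tensor product factor by factor
        out.append(rho)
    return out

rho0 = np.array([[1, 0], [0, 0]], dtype=float)
rho1 = np.array([[0.5, 0.5], [0.5, 0.5]], dtype=float)
S3 = state_set([rho0, rho1], 3)
# |S(3)| = 8 candidates of dimension 8x8, each assigned uniform prior 1/8
```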

SLIDE 17

Special known case

Results: the RLNN performance starts to show a significant gap from the optimal success probability when n ≥ 5.

[Figure: P_succ versus n (1 to 6) for the optimal measurement and the neural network]

SLIDE 18

The “Pretty Good Measurement” (PGM)

The “Pretty Good Measurement” defines the POVM

Π_PGM,k = (Σ_j q_j ρ_j)^{−1/2} q_k ρ_k (Σ_j q_j ρ_j)^{−1/2}   ∀k ∈ {1, ..., m}

Motivation: the PGM is known to be optimal in several cases:
- Symmetric pure states with uniform prior, where ρ_j = |ψ_j⟩⟨ψ_j| and |ψ_j⟩ = U^{j−1}|ψ_1⟩ with U^m = I
- Linearly independent pure states where the diagonal elements of the square root of the Gram matrix are all equal
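The PGM is straightforward to construct numerically: form S = Σ_j q_j ρ_j, take its inverse square root on its support, and conjugate each weighted state. A sketch with illustrative inputs:

```python
import numpy as np

def pgm(priors, states):
    """Pretty Good Measurement: Pi_k = S^{-1/2} q_k rho_k S^{-1/2}, S = sum_j q_j rho_j.
    The inverse square root is taken on the support of S (zero eigenvalues are dropped)."""
    S = sum(q * rho for q, rho in zip(priors, states))
    vals, vecs = np.linalg.eigh(S)
    inv_sqrt = np.array([1 / np.sqrt(v) if v > 1e-12 else 0.0 for v in vals])
    S_inv_half = vecs @ np.diag(inv_sqrt) @ vecs.conj().T
    return [S_inv_half @ (q * rho) @ S_inv_half for q, rho in zip(priors, states)]

rho0 = np.array([[1, 0], [0, 0]], dtype=complex)
rho1 = np.array([[0.5, 0.5], [0.5, 0.5]], dtype=complex)
povm = pgm([0.5, 0.5], [rho0, rho1])
assert np.allclose(sum(povm), np.eye(2))  # POVM completeness on the support
```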

SLIDE 19

RLNN vs Pretty Good Measurement

Setup: we generate 10 trials of candidate states with n = 3, m = 5 and plot the difference in RLNN and PGM success probability.

[Figure: P_succ,NN − P_succ,PGM per trial]

SLIDE 20

Trine Ensemble Candidate States

The trine ensemble consists of three equally spaced real qubit states, namely

{ R(4π/3)^j |0⟩⟨0| (R(4π/3)†)^j }_{j=0}^{2}

[Figure: candidate states ρ(0) and ρ(1), each with subsystems indexed 1 and 2]
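The trine states can be constructed explicitly. Assuming R(θ) denotes the standard qubit rotation exp(−iθY/2), which acts on real amplitudes as a plane rotation by θ/2, equal spacing means every pair of distinct trine states has overlap of magnitude 1/2:

```python
import numpy as np

def R(theta):
    """Real qubit rotation exp(-i theta Y / 2): a plane rotation by theta/2."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

ket0 = np.array([1.0, 0.0])
trine = [np.linalg.matrix_power(R(4 * np.pi / 3), j) @ ket0 for j in range(3)]
# Equally spaced: |<psi_i|psi_j>| = 1/2 for every pair of distinct trine states.
```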

SLIDE 21

Conjectured optimal local method

Step 1: an “anti-trine” measurement {Π0, Π1, Π2} is implemented on subsystem 1.

[Figure: anti-trine POVM elements Π0, Π1, Π2 alongside candidate state ρ(0) and its subsystems]

SLIDE 22

Conjectured optimal local method

Step 2: the Helstrom measurement for the two remaining candidate states is implemented on subsystem 2 (shown here for outcome out = Π2).

[Figure: remaining candidate state ρ(1) with measurement elements Π0, Π1 on subsystem 2]

The success probability of this method is P_succ ≈ 0.933, whereas the success probability of a locally greedy method is P_succ,lg = 0.8.
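The anti-trine measurement of Step 1 can be sketched as a POVM whose element j is proportional to the projector orthogonal to trine state j, so outcome j rules out hypothesis j. This is a standard construction (the weight 2/3 makes the elements sum to the identity), sketched here under the same rotation convention as before:

```python
import numpy as np

def R(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

trine = [np.linalg.matrix_power(R(4 * np.pi / 3), j) @ np.array([1.0, 0.0])
         for j in range(3)]

# Anti-trine POVM: element j projects onto the state orthogonal to trine state j,
# so Tr[Pi_j rho_j] = 0 and outcome j eliminates hypothesis j.
perp = [np.array([-v[1], v[0]]) for v in trine]
anti_trine = [(2 / 3) * np.outer(w, w) for w in perp]

assert np.allclose(sum(anti_trine), np.eye(2))   # valid POVM
```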

SLIDE 23

RLNN Results for Trine Ensemble

The training curve for RLNN performance (with modified action space) indicates convergence to the conjectured optimal local success probability.

[Figure: P_succ versus training iteration (50 to 250) for RLNN, with reference lines for the conjectured optimal local and the optimal collective success probabilities]

SLIDE 24

General Results for Binary State Discrimination

Setup: we set m = 2 and n = 3, and randomly generate 7 trials with (depolarized) candidate state sets.

[Figure: P_succ per trial (1 to 7) for SDP and RLNN]

The optimal collective success probability is computed via semidefinite programming (SDP).

SLIDE 25

General Results for 3-ary State Discrimination

Setup: we set m = 3 and n = 3, and randomly generate 10 trials with (depolarized) candidate state sets.

[Figure: P_succ per trial (1 to 10) for SDP and RLNN]

SLIDE 26

Open Questions

What is the “worst-case” gap between the optimal locally adaptive protocol and optimal collective measurements, as a function of m and n?

How do these methods perform on entangled states such as graph states?

Can we use a multiscale approach for problems with large n?
SLIDE 27

Thank you!

Github Repository: https://github.com/SarahBrandsen/RLNN-QSD Related Work: arXiv:1912.05087

SLIDE 28

References I

A. Acín, E. Bagan, M. Baig, Ll. Masanes, R. Muñoz Tapia. Multiple-copy two-state discrimination with individual measurements. Phys. Rev. A, 71:032338, 2005.

Antonio Assalini, Nicola Dalla Pozza, Gianfranco Pierobon. Revisiting the Dolinar receiver through multiple-copy state discrimination theory. Phys. Rev. A, 84:022342, Aug 2011.

M. Ban. Optimum measurements for discrimination among symmetric quantum states and parameter estimation. International Journal of Theoretical Physics, 36(6):1269–1288, 1997.

Stephen M. Barnett. Minimum-error discrimination between multiply symmetric states. Phys. Rev. A, 64:030303, Aug 2001.
SLIDE 29

References II

Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, Wojciech Zaremba. OpenAI Gym, 2016.

Richard Bellman. The theory of dynamic programming. Bull. Amer. Math. Soc., 60(6):503–515, Nov 1954.

V. P. Belavkin. Optimum distinction of non-orthogonal quantum signals. Radio Engineering and Electronic Physics, 20:39–47, June 1975.

Sarah Brandsen, Mengke Lian, Kevin D. Stubbs, Narayanan Rengaswamy, Henry D. Pfister. Adaptive procedures for discrimination between arbitrary tensor-product quantum states, 2019.
SLIDE 30

References III

Sarah Brandsen, Kevin D. Stubbs, Henry D. Pfister. Reinforcement learning with neural networks for quantum multiple hypothesis testing. placeholder for arXiv version to be posted, 2020.

Y. C. Eldar, G. D. Forney. On quantum detection and the square-root measurement. IEEE Transactions on Information Theory, 47(3):858–872, March 2001.

Y. C. Eldar. A semidefinite programming approach to optimal unambiguous discrimination of quantum states. IEEE Transactions on Information Theory, 49(2):446–456, Feb 2003.
SLIDE 31

References IV

Y. C. Eldar, A. Megretski, G. C. Verghese. Designing optimal quantum detectors via semidefinite programming. IEEE Transactions on Information Theory, 49(4):1007–1012, Apr 2003.

Y. C. Eldar, A. Megretski, G. C. Verghese. Optimal detection of symmetric mixed quantum states. IEEE Transactions on Information Theory, 50(6), June 2004.

A. Ferdinand, M. DiMario, F. Becerra. Multi-state discrimination below the quantum noise limit at the single-photon level. npj Quantum Information, 3, Dec 2017.

Thomas Fösel, Petru Tighineanu, Talitha Weiss, Florian Marquardt. Reinforcement learning with neural networks for quantum feedback. Phys. Rev. X, 8:031084, Sep 2018.
SLIDE 32

References V

Geoffrey J. Gordon. Stable fitted reinforcement learning. Proceedings of the 8th International Conference on Neural Information Processing Systems, NIPS’95, pages 1052–1058, Cambridge, MA, USA, 1995. MIT Press.

Carl W. Helstrom. Quantum detection and estimation theory. Journal of Statistical Physics, 1(2):231–252, 1969.

Paul Hausladen, William K. Wootters. A ‘pretty good’ measurement for distinguishing quantum states. Journal of Modern Optics, 41(12):2385–2390, 1994.

Hari Krovi, Saikat Guha, Zachary Dutton, Marcus P. da Silva. Optimal measurements for symmetric quantum states with applications to optical communication. Physical Review A, 92(6), Dec 2015.
SLIDE 33

References VI

Alexander Holm Kiilerich, Klaus Mølmer. Multistate and multihypothesis discrimination with open quantum systems. Physical Review A, 97(5), May 2018.

Robert Koenig, Renato Renner, Christian Schaffner. The operational meaning of min- and max-entropy. IEEE Transactions on Information Theory, 55(9):4337–4347, Sep 2009.

Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, Ion Stoica. RLlib: Abstractions for distributed reinforcement learning, 2017.

Richard Liaw, Eric Liang, Robert Nishihara, Philipp Moritz, Joseph E. Gonzalez, Ion Stoica. Tune: A research platform for distributed model selection and training. arXiv preprint arXiv:1807.05118, 2018.
SLIDE 34

References VII

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. Playing Atari with deep reinforcement learning, 2013.

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei Rusu, Joel Veness, Marc Bellemare, Alex Graves, Martin Riedmiller, Andreas Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis. Human-level control through deep reinforcement learning. Nature, 518:529–533, Feb 2015.

Carlos Mochon. Family of generalized “pretty good” measurements and the minimal-error pure-state discrimination problems for which they are optimal. Phys. Rev. A, 73:032328, Mar 2006.
SLIDE 35

References VIII

Narayanan Rengaswamy, Henry D. Pfister. Quantum advantage in classical communications via belief-propagation with quantum messages. 2020.

Masahide Sasaki, Kentaro Kato, Masayuki Izutsu, Osamu Hirota. Quantum channels showing superadditivity in classical capacity. Phys. Rev. A, 58:146–158, Jul 1998.

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov. Proximal policy optimization algorithms, 2017.

Gerald Tesauro. Practical issues in temporal difference learning. Mach. Learn., 8(3–4):257–277, May 1992.

Graeme Weir, Catherine Hughes, Stephen M. Barnett, Sarah Croke. Optimal measurement strategies for the trine states with arbitrary prior probabilities, 2018.