paper optimal kronecker sum approximation of real time
play

Paper: Optimal Kronecker-Sum Approximation of Real Time Recurrent - PowerPoint PPT Presentation

Paper: Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning Poster: Online & Untruncated Gradients for RNNs Frederik Benzing*, Marcelo Matheus Gauy*, Asier Mujika, Anders Martinsson, Angelika Steger Department for Computer


  1. Paper: Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning Poster: Online & Untruncated Gradients for RNNs Frederik Benzing*, Marcelo Matheus Gauy*, Asier Mujika, Anders Martinsson, Angelika Steger Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 1 Online & Untruncated Gradients for RNNs

  2. Recurrent Neural Nets (RNNs) Model temporal and sequential data (RL, audio synthesis, language  modelling,...) One of the key research challenges:  Learn Long-Term dependencies Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 2 Online & Untruncated Gradients for RNNs

  3. Training RNNs Truncated Backprop Trough Time (TBPTT) (Williams & Peng, 1990) Output hidden h 10 ... ... h t-1 h t h t+1 state Input Introduces arbitrary Truncation Horizon → no longer term dependencies  Parameter Update Lock during forward & backward pass  Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 3 Online & Untruncated Gradients for RNNs

  4. Forward Computing Gradients It looks like you Real Time Recurrent Learning (RTRL) (Williams & Zipser, 1989) want to do RTRL. Forward compute with recurrence Untruncated Gradients  Memory is independent of sequence length  Online parameter updates (no update lock)  BUT: Need n 4 Runtime and n 3 Memory (for n hidden units) → infeasible Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 4 Online & Untruncated Gradients for RNNs

  5. Approximate RTRL to save time & space It looks like you Online Recurrent Optimization (UORO) (Tallec & Ollivier, 2017) want to do RTRL. Idea: Don't store G t precisely, but approximately  n x 1 1 x n 2 and unbiasedly approximate recurrence equation. ➢ Memory: n 2 ➢ Runtime: n 3 Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 5 Online & Untruncated Gradients for RNNs

  6. Does it work? Part I UORO (Tallec & Ollivier, 2017) and KF-RTRL (Mujika et al., 2018) Character-level PTB Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 6 Online & Untruncated Gradients for RNNs

  7. Does it work? Part II Provably optimal approximation – Optimal Kronecker-Sum (OK) (our contribution) Copy Task Character-level PTB Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 7 Online & Untruncated Gradients for RNNs

  8. What to remember It looks like you got interested in Truncated BPTT has problems (truncation, update lock)  RTRL. Have a look at RTRL as online & untruncated alternative, but too costly Poster #166.  Our OK approx of RTRL reduces costs by factor n  No performance loss  Break update lock → faster convergence  Theoretically optimal (for certain class of approx)  Still need to reduce computational costs  Department for Computer Science, D-INFK Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | | 8 Online & Untruncated Gradients for RNNs

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend