Xavier Snelgrove, CTO & Co-Founder, Whirlscape @wxswxs March 2017
Me, me, me!
Minuum: http://minuum.com
Dango: http://getdango.com
With Dango
100s of Millions of Examples
Hi prince 👒 Never mind. I forgot I’m single 😓😪 that's what I like to hear 😈❤ Highway driving in the morning 🌆👍 happy bro bro it was cool chilling with you in line for Travis gotta catch another show turn up one time🙐😝
GPUs crunch away for days
Trained Model
Let’s eat lunch later
🍵😌🍞
Emoji in semantic-space
How can we run this on device?
Let’s eat lunch later
Word Embedding → Recurrent Layers → Dense Output Layers
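The three-stage pipeline can be sketched as a bare-bones forward pass. This is an illustrative reconstruction, not Dango's actual model: the sizes (100k-word vocabulary, 512-d embeddings, 768-d state, 1,000 emoji outputs) follow the slides, but a plain tanh RNN stands in for whatever gated units the real model uses, and the weights here are random.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes from the talk; real model details are assumptions.
VOCAB, EMB, HIDDEN, N_EMOJI = 100_000, 512, 768, 1_000

E = rng.standard_normal((VOCAB, EMB), dtype=np.float32)            # word embedding
W_x = rng.standard_normal((EMB, HIDDEN), dtype=np.float32) * 0.01
W_h = rng.standard_normal((HIDDEN, HIDDEN), dtype=np.float32) * 0.01
W_out = rng.standard_normal((HIDDEN, N_EMOJI), dtype=np.float32) * 0.01

def predict_emoji(token_ids):
    h = np.zeros(HIDDEN, dtype=np.float32)
    for t in token_ids:                  # recurrent layers: fold in one word at a time
        h = np.tanh(E[t] @ W_x + h @ W_h)
    logits = h @ W_out                   # dense output layer
    p = np.exp(logits - logits.max())
    return p / p.sum()                   # softmax: distribution over emoji

probs = predict_emoji([17, 4093, 88])    # made-up token ids for a short message
print(probs.shape)                       # (1000,)
```

Note that the embedding table `E` alone is 100,000 × 512 float32 values, which is exactly the memory problem the next slides attack.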
Embedding memory
the, and, cat, yesterday, eggplant, …, alchemist, missspellling
100,000 words × 512 dimensions
Embedding memory
100,000 × 512 × 4 bytes = 200 MB
Quantize to 3 bits per value: 20 MB
Store the table on disk in SQLite
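A minimal sketch of the SQLite idea: keep embedding rows on disk and pull only the handful of words in the current message into RAM. The schema and helper names (`put`, `get`) are made up for illustration, not Dango's actual format.

```python
import sqlite3
import numpy as np

dim = 512
conn = sqlite3.connect(":memory:")       # a real app would open a file on disk
conn.execute("CREATE TABLE embedding (word TEXT PRIMARY KEY, vec BLOB)")

def put(word, vec):
    # Store one embedding row as a raw float32 blob keyed by the word.
    conn.execute("INSERT INTO embedding VALUES (?, ?)",
                 (word, vec.astype(np.float32).tobytes()))

def get(word):
    # Fetch just this word's row; the rest of the table stays on disk.
    row = conn.execute("SELECT vec FROM embedding WHERE word = ?",
                       (word,)).fetchone()
    return np.frombuffer(row[0], dtype=np.float32)

put("lunch", np.ones(dim))
v = get("lunch")
print(v.shape)    # (512,)
```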
Distribution of embedding values
Huffman coding? Depends on quantization
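A sketch of the 3-bit idea with uniform quantization: snap every embedding value to one of 2³ = 8 levels spanning the value range, store only the 3-bit codes. A small demo slice stands in for the full table; the size arithmetic uses the slides' full dimensions. Non-uniform level spacing, or Huffman-coding the codes, could shrink this further depending on how skewed the code distribution is.

```python
import numpy as np

rng = np.random.default_rng(0)
E = rng.normal(scale=0.1, size=(1_000, 512)).astype(np.float32)  # small demo slice

lo, hi = float(E.min()), float(E.max())
scale = (hi - lo) / 7                                # 8 evenly spaced levels
codes = np.round((E - lo) / scale).astype(np.uint8)  # each value -> code in 0..7
E_hat = lo + codes * scale                           # dequantized approximation
err = np.abs(E - E_hat).max()                        # at most half a level apart

full = 100_000 * 512 * 4 / 1e6       # full table at float32: ~200 MB
tiny = 100_000 * 512 * 3 / 8 / 1e6   # same table at 3 bits/value: ~20 MB
print(f"{full:.0f} MB -> {tiny:.0f} MB")
```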
Let’s eat lunch later
Word Embedding → Recurrent Layers → Dense Output Layers
Recurrent Layer Memory
[Diagram: input vector and previous state combine (+) into the next state and an output vector]
Recurrent Layer Memory
768 × 768 weight matrices
768 × 768 × 4 bytes × 3 × 2 layers = 14 MB
Quantize to float16 (2 bytes): 7 MB
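The float16 step is a straight halving of storage. A sketch with the slides' sizes (768 × 768 matrices, 3 per layer, 2 layers; the weight values themselves are random placeholders):

```python
import numpy as np

rng = np.random.default_rng(0)
weights = [rng.standard_normal((768, 768), dtype=np.float32) * 0.05
           for _ in range(3 * 2)]              # 3 matrices/layer x 2 layers

total32 = sum(w.nbytes for w in weights)       # float32: ~14 MB
halved = [w.astype(np.float16) for w in weights]
total16 = sum(w.nbytes for w in halved)        # float16: ~7 MB
print(total32 / 1e6, "->", total16 / 1e6, "MB")
```

Unlike the 3-bit embeddings, float16 keeps enough precision that the recurrent weights can typically be cast after training with little accuracy loss.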
Recurrent Layer Memory
Distribution of weight values: many near-zero values
Recurrent Layer Memory
Prune the 50% of weights closest to 0
Train the rest of the network
Repeat, pruning more each iteration
90% pruned: 7 MB × 0.1 = 700 kB
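The iterative magnitude-pruning loop above can be sketched as a masking schedule. The retraining step between rounds is elided (marked by a comment); each round here drops half of the surviving weights, which is one plausible schedule rather than the talk's exact one.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((768, 768)).astype(np.float32)
mask = np.ones_like(W, dtype=bool)

target = 0.9                             # aim for ~90% of weights pruned
while mask.mean() > 1 - target:
    alive = np.abs(W[mask])
    cutoff = np.quantile(alive, 0.5)     # median magnitude of survivors
    mask &= np.abs(W) > cutoff           # drop the half closest to zero
    W *= mask
    # ... retrain the surviving weights here before the next round ...

sparsity = 1 - mask.mean()
print(f"{sparsity:.2%} pruned")
```

With ~90% of the weights zeroed, a sparse format storing only the survivors lands around the slides' 7 MB × 0.1 = 700 kB.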
Questions?
http://getdango.com
Xavier Snelgrove, CTO & Co-Founder @wxswxs