Energy Efficient Content-Addressable Memory Advanced Seminar - - PowerPoint PPT Presentation

energy efficient
SMART_READER_LITE
LIVE PREVIEW

Energy Efficient Content-Addressable Memory Advanced Seminar - - PowerPoint PPT Presentation

Energy Efficient Content-Addressable Memory Advanced Seminar Computer Engineering Institute of Computer Engineering Heidelberg University Fabian Finkeldey 26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 1 Table of


slide-1
SLIDE 1

Institute of Computer Engineering Heidelberg University Fabian Finkeldey

Energy Efficient Content-Addressable Memory

Advanced Seminar Computer Engineering

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 1

slide-2
SLIDE 2

Table of Contents

  • Introduction
  • Standard Circuit Design
  • Energy Efficient Design
  • Simulation
  • Implementation
  • Conclusion

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 2

slide-3
SLIDE 3

Introduction

Use-Cases and basic design of content-addressable memory

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 3

slide-4
SLIDE 4

Example: Looking up a phone number

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 4

  • Problem: Find a name to a given

phone number

  • Linear Search:
  • Looking up every entry in a phone

book takes a lot of time...

  • A phonebook that can deliver a

name to a given number is needed

  • Ref. 3
slide-5
SLIDE 5

Example II: Translation Lookaside Buffer

  • TLB: Cache for address translation
  • Typical size ~1024 entries
  • Faster than page table access
  • Needs to be searched:
  • Search-key: Virtual Address
  • Search-result: Physical Address

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 5

CPU MMU TLB Memory Virtual Address Physical Address

slide-6
SLIDE 6

Basic concept of a CAM

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 6

1 1 1 1 1 1 1 1 1 1 1 0 1 Memory Search Word 1

slide-7
SLIDE 7

Standard Circuit Design

A conventional CAM

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 7

slide-8
SLIDE 8

Conventional CAM Cell

  • M is a standard SRAM-Cell
  • D is the stored Data value
  • Search-Data is applied to SL
  • ML indicates match-state

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 8

  • Ref. 1
slide-9
SLIDE 9

CAM Cell – search operation 1

  • Assume:
  • D = 1 -> D = 0

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 9

  • Ref. 1
slide-10
SLIDE 10

CAM Cell – search operation 1

  • 1. SL = 0 and SL = 0
  • > M1 and M2 are switched off

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 10

  • Ref. 1
slide-11
SLIDE 11

CAM Cell – search operation 2

  • 1. SL = 0 and SL = 0
  • 2. ML is precharged to VDD -> ML = 1

M1 and M2 are switched off

  • > No path ML to GND

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 11

  • Ref. 1
slide-12
SLIDE 12

CAM Cell – search operation 3 (match)

  • 1. SL = 0 and SL = 0
  • 2. ML is precharged to VDD -> ML = 1
  • 3. Assume SL = 1 -> SL = 0

M2 and M3 are switched off

  • > No path ML to GND

ML stays at VDD -> ML = 1, match!

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 12

  • Ref. 1
slide-13
SLIDE 13

CAM Cell – search operation 3 (mismatch)

  • 1. SL = 0 and SL = 0
  • 2. ML is precharged to VDD -> ML = 1
  • 3. Assume SL = 0 -> SL = 1

M2 and M4 are switched on

  • > Path ML to GND

ML discharges -> ML = 0, no match

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 13

  • Ref. 1
slide-14
SLIDE 14

Array of Cells

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 14

  • Ref. 1
slide-15
SLIDE 15

Power consumption

  • Matchlines:
  • Long Lines with high capacitance:
  • Wire capacitance
  • Diffusion capacitance of the pull-down

Transistors

  • Assumption: Miss in most cases
  • ML is precharged and discharged in

every cycle

  • Searchlines:
  • Long Lines with high capacitance:
  • Wire capacitance
  • Gate capacitance of the match-

Transistors

  • SL and SL are pulled to GND in every

cycle

  • Either SL or SL is charged to VDD

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 15

slide-16
SLIDE 16

Energy Efficient Design

Reducing the power consumption

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 16

slide-17
SLIDE 17
  • 1. Pipelining the match line

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 17

c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c c Non-pipelined Pipelined

Stage 1 Stage 2 Stage 3 Stage 4 Stage 5

ML0 ML1 ML2 ML3 ML4 ML0 ML1 ML2 ML3 ML4

slide-18
SLIDE 18
  • 1. Pipelining the match-line
  • Breaking up the long ML in stages
  • In case of match, the following

stage is activated

  • In most cases, the ML is only partly

precharged

  • > Reduced power consumption

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 18

c c c c c ML

x8 x34 x34 x34 x34

Flip-flop Match-line-sense-amplifier

enable

slide-19
SLIDE 19
  • 2. Hierarchical search-lines
  • Global-search-lines
  • Not directly connected to CAM-

cells

  • To reduce capacitance
  • Driven in every cycle
  • Local-search-lines
  • Short, connected to a few CAM-

cells

  • enabled, if match-line-segment is

activated

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 19

c c c c c c c c c c c c c c c c c c c c c c c c c ML0 ML1 ML2 ML3 ML4

Global-search-line (GSL) Local-search-line (LSL)

slide-20
SLIDE 20
  • 2. Hierarchical search-lines
  • SL power consumption: 𝑄

𝑡𝑚= 𝐷𝑡𝑚 ∗ 𝑊 𝑡𝑚 2

  • Usually: 𝑊

𝑡𝑚 = 𝑊 𝐸𝐸

  • Lower voltage reduces power consumption
  • Lower Gate-Overdrive -> Decreased Speed
  • Solution:
  • Lower voltage VDDLow on global-search-lines
  • Amplifier to drive local-search-lines with VDD

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 20

slide-21
SLIDE 21
  • 3. ML-Precharge low
  • ML-Precharge high requires to

precharge SL low

  • Contributes to SL power

consumption

  • Precharge low:
  • 1. Discharge all ML to GND
  • 2. Apply Data to SL
  • 3. Drive fixed current IML to all ML
  • 4. In match state there is no path

to GND

  • > Voltage will rise

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 21

  • Ref. 1
slide-22
SLIDE 22
  • 3. ML-Precharge low
  • Low Swing on match-line
  • Match-line-sense-amplifier

triggered at VTh < VDD => No need to charge ML to VDD

  • No need to precharge SL low
  • No problem with path to GND in

cells in mismatch state

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 22

  • Ref. 1
slide-23
SLIDE 23

Simulation

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 23

slide-24
SLIDE 24

Simulation Setup

  • 1024 x 144 Bit Cam
  • 1x 8Bit, 4x 34Bit Segments
  • 180nm Cmos
  • 1,8V VDD
  • Typical Workload:
  • Populated with random data
  • 1 Match per search

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 24

slide-25
SLIDE 25

Schematic vs. Waveform

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 25

  • Ref. 1
  • Ref. 1
slide-26
SLIDE 26

Simulation: Pipelined Matchlines

  • Assumption:
  • Most ML segments miss in the first 8

Bit

  • Expectation:
  • Power consumption reduced by

136/144 or 95%

  • Result:
  • 1,59/3,64 or 56%
  • Explanation:
  • Overhead of clocking the additional

ML-Flip-Flops and repeated circuitry

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 26

  • Ref. 1
slide-27
SLIDE 27

Simulation: Total power consumption

  • Adding hierarchical searchlines:
  • 63% Reduction in SL power

consumption

  • Total power consumption

reduced by 60%

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 27

  • Ref. 1
slide-28
SLIDE 28

Implementation

A real-world Test chip

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 28

slide-29
SLIDE 29

Test Chip

  • VDD: 1,8V
  • Process: 180nm
  • Size: 2,3 x 2,1mm
  • Cycle Time: 7ns
  • 256 x 144 Bit CAM
  • 1x 8Bit, 4x 34Bit Segments
  • Only two segments use hierarchical SL
  • Allows Direct Comparison of power consumption

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 29

  • Ref. 1
slide-30
SLIDE 30

Simulation vs. Implementation

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 30

  • Ref. 1
slide-31
SLIDE 31

Conclusion

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 31

slide-32
SLIDE 32

Conclusion

  • Presented Techniques to reduce power consumption:

1. Pipelined matchlines 2. Hierarchical searchlines 3. Precharge low scheme

  • Expected reduction of power consumption: ~60%
  • Slightly increased area needs: ~6%
  • Similar cycle times to conventional designs
  • Pipelined architecture introduces additional latency
  • Reduced Noise Immunity

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 32

slide-33
SLIDE 33

References

  • This Talk is based on:

1.

  • K. Pagiamtzis, A. Sheikholeslami, “A low-power content-addressable memory (CAM) using pipelined hierarchical search-scheme”,

IEEE Journal of solid-state Circuits, 2004 2.

  • K. Pagiamtzis, A. Sheikholeslami, “Content-Addressable Memory (CAM) Circuits and Architectures: A Tutorial and Survey”, IEEE

Journal of solid-state Circuits, 2006

  • Figures and graphics:
  • 3. Android, incoming call, 2016 - http://i.stack.imgur.com/5eHW2.png

26.01.2016 Fabian Finkeldey, Energy Efficient Content-Addressable Memory 33