Web Search Using Mobile Cores Quantifying and Mitigating the Price - - PowerPoint PPT Presentation

web search using mobile cores
SMART_READER_LITE
LIVE PREVIEW

Web Search Using Mobile Cores Quantifying and Mitigating the Price - - PowerPoint PPT Presentation

Web Search Using Mobile Cores Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi Benjamin Lee Trishul Chilimbi Kushagra Vaid Engineering & Applied Science Electrical Engineering Runtime Analysis & Design Global


slide-1
SLIDE 1

Web Search Using Mobile Cores

Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi

Engineering & Applied Science Harvard University

Benjamin Lee

Electrical Engineering Stanford University

Trishul Chilimbi

Runtime Analysis & Design Microsoft Research

Kushagra Vaid

Global Foundation Services Microsoft Corporation International Symposium on Computer Architecture 22 June 2010 1

slide-2
SLIDE 2

Conventional Wisdom

  • Moore’s Law provides transistors
  • Simple cores improve energy efficiency
  • Parallelism recovers lost performance

2

slide-3
SLIDE 3

Simple Cores

  • Pursue aggregate throughput, energy efficiency
  • Assume task parallelism
  • Assume latency tolerance

3

slide-4
SLIDE 4

Applications in Transition

  • Conventional Enterprise
  • Process independent requests
  • Exhibit high memory, I/O intensity
  • Ex: web, database, Java, mail, file servers
  • Emerging Cloud
  • Extract information, value from data
  • Exhibit high compute intensity
  • Ex: analytics, machine learning

4

slide-5
SLIDE 5

Computational Intensity

  • Microsoft Bing ranks pages with neural network
  • RMS foreshadows future analytic workloads

5

slide-6
SLIDE 6

Cloud Efficiency

  • Challenges
  • Migrate computation, data to cloud
  • Choose efficient components
  • Understand application, component interaction
  • Case Study
  • Mobile cores for efficiency, parallelism for performance?
  • Achieve efficiency with mobile cores (Intel Atom)
  • Quantify price of efficiency (Microsoft Bing)

6

slide-7
SLIDE 7

Efficiency

Atom is more energy, cost efficient than Xeon

Price of Efficiency

Atom limitations impact latency, relevance, flexibility

Mitigating Price of Efficiency

Atom over-provisioning should consider platform overheads

7

slide-8
SLIDE 8

Efficiency

Atom is more energy, cost efficient than Xeon

Price of Efficiency

Atom limitations impact latency, relevance, flexibility

Mitigating Price of Efficiency

Atom over-provisioning should consider platform overheads

7

slide-9
SLIDE 9

Search Architecture

  • Rank pages using neural network
  • Deploy on server (Xeon), mobile (Atom) processors

8

slide-10
SLIDE 10

Processor Activity

  • Compare Xeon (4-issue, OOO) and Atom (2-issue, IO)
  • Measure µarch activity with hardware counters

9

slide-11
SLIDE 11

Processor Power

  • Compare Xeon (15W per core) and Atom (1.5W per core)
  • Measure processor power at voltage regulator

10

slide-12
SLIDE 12

Processor Efficiency

  • Demonstrate energy, cost efficiency with Atom
  • Measure max QPS within QoS target

11

slide-13
SLIDE 13

Efficiency

Atom is more energy, cost efficient than Xeon

Price of Efficiency

Atom limitations impact latency, relevance, flexibility

Mitigating Price of Efficiency

Atom over-provisioning should consider platform overheads

12

slide-14
SLIDE 14

Price of Efficiency

  • Latency
  • Cut-off latency limits refinement opportunities
  • Per query latency impacts quality-of-service
  • Relevance
  • Search rank orders documents
  • Choice, ordering of results impact relevance
  • Flexibility
  • Query activity, complexity increase load
  • Processor resources impact flexibility

13

slide-15
SLIDE 15

Latency

  • Atom increases latency average (µ) by 3×
  • Atom increases latency variance (σ2)

14

slide-16
SLIDE 16

Relevance

  • Consider choice, ordering of top N documents
  • Atom impacts relevance under all query loads

15

slide-17
SLIDE 17

Flexibility

  • Consider activity, complexity of queries
  • Atom harms QoS for more complex queries

16

slide-18
SLIDE 18

Mitigating Price of Efficiency

Efficiency

Atom is more energy, cost efficient than Xeon

Price of Efficiency

Atom limitations impact latency, relevance, flexibility

Mitigating Price of Efficiency

Atom over-provisioning should consider platform overheads

17

slide-19
SLIDE 19

Mitigating Price of Efficiency

Mitigating Price of Efficiency

  • Addressing Latency & Relevance
  • Address µarchitectural limitations
  • Integrate application-specific accelerators
  • Manage heterogeneous servers
  • Addressing Flexibility
  • Over-provision Atoms
  • Mitigate platform overheads
  • Integrate more cores per chip

18

slide-20
SLIDE 20

Mitigating Price of Efficiency

Platform Overheads

  • Xeon: 4-core, 2-socket
  • Atom: 2-core, 1-socket ⇒ Hyp-Atom: 8-core, 2-socket

19

slide-21
SLIDE 21

Mitigating Price of Efficiency

Total Cost of Ownership (TCO)

  • Pie slice shows breakdown of TCO $
  • Pie size shows throughput per TCO $

20

slide-22
SLIDE 22

Mitigating Price of Efficiency

Case for Integration

  • Hyp-Atom attributes more per TCO $ to servers
  • Hyp-Atom achieves greater throughput per TCO $

21

slide-23
SLIDE 23

Conclusion

Efficiency

Atom is more energy, cost efficient than Xeon

Price of Efficiency

Atom limitations impact latency, relevance, flexibility

Mitigating Price of Efficiency

Atom over-provisioning should consider platform overheads

22

slide-24
SLIDE 24

Conclusion

Also in the paper ...

  • µarchitecture
  • Processor activity from hardware counters
  • µarchitectural bottlenecks
  • Search
  • Application phases in computation
  • Execution time breakdown
  • Mitigating Price of Efficiency
  • µarchitectural enhancements
  • Heterogeneous, accelerated processors

23

slide-25
SLIDE 25

Conclusion

Conclusion

  • Emerging Cloud Applications
  • Extract value from data
  • Increase compute intensity
  • Energy Efficiency
  • Improve efficiency by 5× with mobile processors
  • Exact price in latency, relevance, flexiblity
  • Future Challenges
  • Pursue efficiency given compute intensity
  • Consider heterogeneous, accelerated processors

24

slide-26
SLIDE 26

Web Search Using Mobile Cores

Quantifying and Mitigating the Price of Efficiency Vijay Janapa Reddi

Engineering & Applied Science Harvard University

Benjamin Lee

Electrical Engineering Stanford University

Trishul Chilimbi

Runtime Analysis & Design Microsoft Research

Kushagra Vaid

Global Foundation Services Microsoft Corporation International Symposium on Computer Architecture 22 June 2010 25