CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004


Bhaskaran Raman
Department of CSE, IIT Kanpur
http://web.cse.iitk.ac.in/~cs422/index.html

Lecture Outline
● Vector Processors
● Scribe for today?
Why Vector Processing
● Deep pipeline ==> more parallelism
  – But more dependences
  – Need to fetch and issue many instructions (Flynn bottleneck)
● Same issues with multiple-issue processors
● Operations on vectors:
  – No data dependences among the elements of one vector opn.
  – No control hazards (one vector instn. replaces a whole loop)
  – Single instn. ==> instn. bandwidth reduced
  – Well-defined memory access pattern
Basic Architecture
● Vector-register processors vs. memory-memory vector processors
● DLXV: vector extn. of DLX (vector-register)
● Components:
  – Vector registers (V0..V7), 64 elements each
  – Vector functional units:
    ● ADD/SUB, MUL, DIV, Integer, Logical
    ● Each is pipelined, can start a new opn. every cycle
  – Vector load/store unit: also pipelined
  – Scalar registers and scalar unit (like in DLX)
Some Vector Instructions
● ADDV V1, V2, V3
● ADDSV V1, F0, V2
● SUBV V1, V2, V3
● SUBVS V1, V2, F0
● SUBSV V1, F0, V2
● Similar for MUL and DIV
● LV V1, R1
● SV R1, V1
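The element-wise meaning of these instructions can be sketched in Python. This is only an illustrative model, not DLXV itself: the function names are hypothetical, vector registers are modelled as plain lists, and the operand order for SUBVS/SUBSV (vector minus scalar vs. scalar minus vector) follows the usual DLXV convention, which the slide does not spell out.

```python
from typing import List

Vector = List[float]  # stand-in for a 64-element vector register


def addv(v2: Vector, v3: Vector) -> Vector:
    """ADDV V1, V2, V3: V1[i] = V2[i] + V3[i]."""
    return [a + b for a, b in zip(v2, v3)]


def addsv(f0: float, v2: Vector) -> Vector:
    """ADDSV V1, F0, V2: V1[i] = F0 + V2[i]."""
    return [f0 + b for b in v2]


def subv(v2: Vector, v3: Vector) -> Vector:
    """SUBV V1, V2, V3: V1[i] = V2[i] - V3[i]."""
    return [a - b for a, b in zip(v2, v3)]


def subvs(v2: Vector, f0: float) -> Vector:
    """SUBVS V1, V2, F0: V1[i] = V2[i] - F0 (assumed operand order)."""
    return [a - f0 for a in v2]


def subsv(f0: float, v2: Vector) -> Vector:
    """SUBSV V1, F0, V2: V1[i] = F0 - V2[i] (assumed operand order)."""
    return [f0 - b for b in v2]
```

Two subtract-with-scalar forms are needed because subtraction is not commutative; addition needs only ADDSV.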
SAXPY/DAXPY Loop
● Y = aX + Y (caps ==> vector)
● DLX (scalar) code:
        LD    F0, a
        ADDI  R4, Rx, 512
  Loop: LD    F2, 0(Rx)
        MULTD F2, F0, F2
        LD    F4, 0(Ry)
        ADDD  F4, F2, F4
        SD    0(Ry), F4
        ADDI  Rx, Rx, 8
        ADDI  Ry, Ry, 8
        SUB   R20, R4, Rx
        BNEZ  R20, Loop
● DLXV (vector) code:
        LD     F0, a
        LV     V1, Rx
        MULTSV V2, F0, V1
        LV     V3, Ry
        ADDV   V4, V2, V3
        SV     Ry, V4
● Vector version:
  – Reduction in instn. bandwidth
  – Fewer pipeline interlocks
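To make the instruction-bandwidth claim concrete, the following Python sketch computes Y = aX + Y and counts dynamic instructions for the two listings. The constants are read off the code on this slide (2 setup instructions plus a 9-instruction loop body for DLX, 6 instructions for DLXV); the function names are hypothetical, and the 64-element length is an assumption based on the 512-byte bound with 8-byte doubles.

```python
from typing import List


def daxpy(a: float, x: List[float], y: List[float]) -> List[float]:
    """Reference semantics of the loop: Y = a*X + Y, element by element."""
    return [a * xi + yi for xi, yi in zip(x, y)]


# Counts taken from the two listings on the slide.
SCALAR_SETUP = 2      # LD F0, a ; ADDI R4, Rx, 512
SCALAR_LOOP_BODY = 9  # LD, MULTD, LD, ADDD, SD, ADDI, ADDI, SUB, BNEZ
VECTOR_TOTAL = 6      # LD, LV, MULTSV, LV, ADDV, SV


def scalar_instruction_count(n: int) -> int:
    """Dynamic instructions executed by the DLX loop for n elements."""
    return SCALAR_SETUP + SCALAR_LOOP_BODY * n


def vector_instruction_count(n: int, mvl: int = 64) -> int:
    """Dynamic instructions for DLXV, valid only while n fits one register."""
    if n > mvl:
        raise ValueError("n exceeds the vector register length")
    return VECTOR_TOTAL


if __name__ == "__main__":
    n = 512 // 8  # 64 doubles
    print(scalar_instruction_count(n), vector_instruction_count(n))
```

Under these assumptions the scalar loop issues 578 instructions for 64 elements against 6 for the vector version.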
Estimating Execution Time
● Convoy: set of vector instructions which can begin execution in the same cycle
  – Check for structural, data hazards
● For simplicity: a convoy must complete before the next convoy is initiated
● Chime: time taken to execute one convoy, i.e. roughly one vector opn. (about n cycles for vector length n)
● Approximations (overheads the chime count ignores):
  – Only one instn. can be initiated per cycle
  – Pipeline setup latency
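The convoy/chime estimate can be sketched as a small Python routine. This is a simplified model under stated assumptions: instructions are grouped greedily in program order, a new convoy starts on a structural hazard (same functional unit already used in the convoy) or a read-after-write hazard on a vector register, and no chaining is assumed. The `VectorInstr` class and the unit names are hypothetical.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional


@dataclass(frozen=True)
class VectorInstr:
    name: str
    unit: str                 # functional unit used, e.g. "lsu", "mul", "add"
    dest: Optional[str]       # vector register written, if any
    srcs: FrozenSet[str]      # vector registers read


def form_convoys(instrs: List[VectorInstr]) -> List[List[VectorInstr]]:
    """Greedily pack instructions into convoys, in program order."""
    convoys: List[List[VectorInstr]] = []
    current: List[VectorInstr] = []
    for ins in instrs:
        units_busy = {i.unit for i in current}
        written = {i.dest for i in current if i.dest is not None}
        structural = ins.unit in units_busy
        raw = bool(ins.srcs & written)
        if current and (structural or raw):
            convoys.append(current)   # close the convoy, start a new one
            current = []
        current.append(ins)
    if current:
        convoys.append(current)
    return convoys


def estimated_cycles(instrs: List[VectorInstr], n: int) -> int:
    """Chime approximation: (number of convoys) * (vector length)."""
    return len(form_convoys(instrs)) * n


# The vector instructions of the DAXPY sequence (scalar LD omitted).
DAXPY = [
    VectorInstr("LV V1", "lsu", "V1", frozenset()),
    VectorInstr("MULTSV", "mul", "V2", frozenset({"V1"})),
    VectorInstr("LV V3", "lsu", "V3", frozenset()),
    VectorInstr("ADDV", "add", "V4", frozenset({"V2", "V3"})),
    VectorInstr("SV", "lsu", None, frozenset({"V4"})),
]
```

With this model the DAXPY sequence splits into four convoys (LV; MULTSV and LV; ADDV; SV), giving an estimate of 4 chimes, or about 256 cycles for 64 elements, before the ignored overheads are added back.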
Adding Flexibility
● Vector-length register (VLR), Maximum vector length (MVL)
  – MOVI2S VLR, R1
  – MOVS2I R1, VLR
● Vector longer than MVL ==> use strip-mining
● Vector stride:
  – LVWS V1, (R1, R2)
  – SVWS (R1, R2), V1
● Memory-bank conflicts?
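Strip-mining and strided access can be illustrated with a short Python sketch. It assumes the common scheme in which the first strip handles the odd-sized remainder (n mod MVL) and every later strip is a full MVL; other orderings are equally valid. The function names are hypothetical, and the stride here is counted in elements of a Python list rather than in bytes as R2 would hold for LVWS.

```python
from typing import Iterator, List, Tuple


def strip_mine(n: int, mvl: int = 64) -> Iterator[Tuple[int, int]]:
    """Yield (start, vl) pieces covering n elements, each vl <= mvl.

    vl is the value that would be placed in VLR before each strip.
    """
    start = 0
    first = n % mvl
    if first:
        yield (0, first)          # short strip first
        start = first
    while start < n:
        yield (start, mvl)        # remaining strips are full length
        start += mvl


def load_with_stride(memory: List[float], base: int, stride: int,
                     vl: int) -> List[float]:
    """Model of LVWS V1, (R1, R2): gather vl elements 'stride' apart."""
    return [memory[base + i * stride] for i in range(vl)]


if __name__ == "__main__":
    for start, vl in strip_mine(200):
        print(start, vl)
```

A non-unit stride is what accesses, for example, a column of a row-major matrix, and it is also where memory-bank conflicts can appear when the stride and the number of banks share a common factor.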
Enhancing Vector Performance
● Chaining: data-forwarding
● Conditional execution:
  – Vector Mask Register
  – Some related instructions
    ● SNEV V1, V2
    ● SGTSV F0, V1
    ● CVM
● Sparse matrices: scatter-gather
  – LVI V1, (R1+V2)
  – SVI (R1+V2), V1
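Masked execution and scatter-gather can be sketched in Python as follows. This is a loose model, not the DLXV definition: the mask is a list of booleans, memory is a Python list indexed by element rather than by byte address, and all function names are hypothetical.

```python
from typing import List


def snev(v1: List[float], v2: List[float]) -> List[bool]:
    """SNEV V1, V2: set mask bit i where V1[i] != V2[i]."""
    return [a != b for a, b in zip(v1, v2)]


def cvm(vl: int) -> List[bool]:
    """CVM: reset the vector mask to all ones."""
    return [True] * vl


def masked_subv(dest: List[float], v1: List[float], v2: List[float],
                mask: List[bool]) -> List[float]:
    """SUBV under a mask: only elements with mask bit set are written."""
    return [a - b if m else d for d, a, b, m in zip(dest, v1, v2, mask)]


def lvi(memory: List[float], base: int, index: List[int]) -> List[float]:
    """LVI V1, (R1+V2): gather memory[base + V2[i]] into V1[i]."""
    return [memory[base + k] for k in index]


def svi(memory: List[float], base: int, index: List[int],
        values: List[float]) -> None:
    """SVI (R1+V2), V1: scatter V1[i] to memory[base + V2[i]]."""
    for k, v in zip(index, values):
        memory[base + k] = v
```

A typical sparse update gathers the non-zero elements with `lvi`, operates on them as a dense vector, and writes them back with `svi` using the same index vector.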
