Cg: A system for programming Cg: A system for programming graphics - PowerPoint PPT Presentation

Cg: A system for programming Cg: A system for programming graphics hardware in a graphics hardware in a C-like language C-like language William R. Mark – University of Texas at Austin R. Steven Glanville – NVIDIA Kurt Akeley – NVIDIA and Stanford University Mark Kilgard – NVIDIA

GPUs are now GPUs are now programmable programmable Framebuffer Triangle Vertex Fragment Framebuffer Assembly & Processor Processor Operations Rasterization Texture Decompression & Filtering Textures ADD R3.xy, R3.xyww, C11.z; MOV R4.y, R2.y; ADD R3.xy, R3.xyww, C11.z; MOV R4.y, R2.y; TEX H5, R3, TEX2, 2D; ADD R4.x, -R4.y, C[3].w; TEX H5, R3, TEX2, 2D; ADD R4.x, -R4.y, C[3].w; TEX H6, R3, TEX2, 2D; MAD R3.xy, R2, R3.xyww, C[2].z; TEX H6, R3, TEX2, 2D; MAD R3.xy, R2, R3.xyww, C[2].z; … … … …

Programmable units in GPUs Programmable units in GPUs are stream processors are stream processors kernel state Input stream Output stream Stream I1 I2 … In O1 O2 … On Processor • The programmable unit executes a computational kernel for each input element • Streams consist of ordered elements

Example: Example: Vertex processor Vertex processor uniform values Transformed Vertex Array Vertexes Vertex V1 V2 … Vn V1 V2 … Vn Processor Fragment processor can do more: It can read from texture memory

Previous work Previous work • Earlier programmable graphics HW – Ikonas [England 86] – Pixar FLAP [Levinthal 87] – UNC PixelFlow [Olano 98] – Multipass SIMD: SGI ISL [Peercy00] • RenderMan [Hanrahan90] • Stanford RTSL [Proudfoot01]

Design goals Design goals • Easier programming of GPUs • Ease of adoption • Portability – HW, API, OS • Complete support for HW capabilities • Performance - similar to assembly • Minimal interference with application data • Longevity -- useful for future hardware • Support for non-shading computations Non-goal: Backward compatibility

The design process The design process Concentrate on: – Overall system architecture – System interfaces: Cg language and APIs What principles guided the design? Why did we make the choices we did? What alternatives did we consider?

Major Design Decisions Major Design Decisions

Modular or monolithic Modular or monolithic architecture? architecture? • Cg system is Application modular • Provides access Cg Cg to assembly-level OpenGL Direct3D interfaces runtime runtime Cg common • GLSL made runtime different choices (compiler) Direct3D OpenGL • Compile off-line, or API API at run time GPU The Cg system manages Cg programs AND the flow of data on which they operate

Specialize for shading? Specialize for shading? • RenderMan language is domain-specific – Domain-specific types (e.g. “color”) – Domain-specific constructs (e.g. “illuminance”) – Imposes a particular model on the user • C is general purpose – A HW oriented language – Avoids assumptions about problem domain • Cg follows C’s philosophy – language follows syntax and philosophy of C – Targets GPU HW – performance transparency – Some exceptions – we were not dogmatic

Cg language example Cg language example void simpleTransform(float4 objectPosition : POSITION, void simpleTransform(float4 objectPosition : POSITION, float4 color : COLOR, float4 color : COLOR, float4 decalCoord : TEXCOORD0, float4 decalCoord : TEXCOORD0, out float4 clipPosition : POSITION, out float4 clipPosition : POSITION, out float4 Color : COLOR, out float4 Color : COLOR, out float4 oDecalCoord : TEXCOORD0, out float4 oDecalCoord : TEXCOORD0, uniform float brightness, uniform float brightness, uniform float4x4 modelViewProjection) uniform float4x4 modelViewProjection) { { clipPosition = mul(modelViewProjection, objectPosition); clipPosition = mul(modelViewProjection, objectPosition); oColor = brightness * color; oColor = brightness * color; oDecalCoord = decalCoord; oDecalCoord = decalCoord; } }

One program or two? One program or two? Framebuffer Triangle Vertex Fragment Framebuffer Assembly & Processor Processor Operations Rasterization Texture Decompression & Filtering Textures void vprog( float4 objP : POSITION , void fprog( float4 t0 : TEXCOORD0 , float4 color : COLOR , float4 t1 : TEXCOORD1 , … … Fragment program Vertex program

How should stream HW be How should stream HW be exposed in the language? exposed in the language? • How should stream inputs be accessed? – Global variables? – Parameters to “main()”? – our choice • How should stream outputs be written? – Global variables? – Output parameters from “main()”? – our choice • Should we distinguish between stream inputs and kernel state? – e.g. vertex position vs. modelview matrix – HW makes a distinction; so we expose it – State parameters marked with keyword: uniform

How should system support How should system support different levels of HW? different levels of HW? NV40 NV20 NV50 R300 … ? R400 R200 NV30 • HW capabilities change each generation – Data types – Support for branch instructions, … • We expect this problem to persist – Future GPUs will have new features • Mandate exactly one feature set? – Must strand older HW or limit newer HW

Two options for handling Two options for handling HW differences HW differences • Emulate missing features? – Too slow on GPU – Too slow on CPU, especially for fragment HW • Expose differences to programmer? – We chose this option – Differences exposed via subsetting – A profile is a named subset – Cg supports function overloading by profile

Two detailed Two detailed design decisions design decisions

How to specify a returnable How to specify a returnable function parameter? function parameter? void foobar( float *y ) { void foobar( float *y ) { • C uses pointers *y = 2*x; *y = 2*x; – But no pointers on current GPUs … … • C++ uses pass-by-reference: float &y – Preferred implementation still uses pointers • Cg uses pass-by-value-result – Implementation doesn’t void foobar( out float y ) { void foobar( out float y ) { need pointers y = 2*x; y = 2*x; …. …. • By using new syntax, we preserve C and C++ syntax for the future

General mechanism for General mechanism for surface/light separability surface/light separability LIGHT 1 SURFACE 1 LIGHT 3 SURFACE 2 LIGHT 2 • Convenient to put surface and light code in different modules… – Decouples surface selection from light selection – Proven usefulness: Classic OpenGL; RenderMan; etc. – We lost it for a while! • RenderMan uses a specialized mechanism • Cg uses a general-purpose mechanism – More flexible – Follows C-like philosophy

Light/surface example… Light/surface example… First: Declare an interface First: Declare an interface // Declare interface to lights // Declare interface to lights interface Light { interface Light { float3 direction(float3 from); float3 direction(float3 from); float4 illuminate(float3 p, out float3 lv); float4 illuminate(float3 p, out float3 lv); }; }; Mechanism adapted from Java, C#

Second: Declare a light Second: Declare a light that implements interface that implements interface // Declare object type for point lights // Declare object type for point lights struct PointLight : Light { struct PointLight : Light { float3 pos, color; float3 pos, color; float3 direction(float3 p) { return pos - p; } float3 direction(float3 p) { return pos - p; } float3 illuminate(float3 p, out float3 lv) { float3 illuminate(float3 p, out float3 lv) { lv = normalize(direction(p)); lv = normalize(direction(p)); return color; return color; } } }; }; Declare an object that implements the interface

Third: Define surface Third: Define surface that calls interface that calls interface // Main program (surface shader) // Main program (surface shader) float4 main (appin IN, out float4 COUT, float4 main (appin IN, out float4 COUT, uniform Light lights[ ] ) { uniform Light lights[ ] ) { ... ... for (int i=0; i < lights.Length; i++) { // for each light for (int i=0; i < lights.Length; i++) { // for each light Cl = lights[i].illuminate (IN.pos, L); // get dir/color Cl = lights[i].illuminate (IN.pos, L); // get dir/color color += Cl * Plastic(texcolor, L, Nn, In, 30); // apply color += Cl * Plastic(texcolor, L, Nn, In, 30); // apply } } COUT = color; COUT = color; } } Call object(s) via the interface type

Cg is closely related to Cg is closely related to other recent languages other recent languages • Microsoft HLSL – Largely compatible with Cg – NVIDIA and Microsoft collaborated • OpenGL ARB shading language • All three languages are similar – Overlapping development – Extensive cross-pollination of ideas – Designers mostly agreed on right approach • Systems are different

Summary Summary • Cg system: – A system for programming GPUs • Cg language: – Extends and restricts C as needed for GPU’s – Expresses stream kernels – HW oriented language • Designed to age well – By reintroducing missing C features

Cg: A system for programming Cg: A system for programming graphics - PowerPoint PPT Presentation

Cg: A system for programming Cg: A system for programming graphics hardware in a graphics hardware in a C-like language C-like language William R. Mark University of Texas at Austin R. Steven Glanville NVIDIA Kurt Akeley NVIDIA and

J ET B RAINS MPS MPS = Meta Programming System Implements the Language Oriented Programming

DATABASE SYSTEMS Database programming in a web environment Database System Course AGENDA FOR

Chapter 7 Programming Hardware Programming Barely Information (Low level programming

DATABASE SYSTEMS Database programming in a web environment Database System Course, 2016-2017

Multicore DSP Architecture and Programming O. Dahl 1 1 Electrical Engineering, Linkping

Alec Muffett Programming Holes PROGRAMMING GOOFS THAT WILL HOSE YOUR SYSTEM SECURITY (a purely

A Compiler and Run-1me System for Network Programming

Rust: system programming with guarantees CRI Monthly Seminar Arnaud Spiwack Pierre Guillou

Super advanced functional programming Or: dependently-typed programming in Agda Dr. Dominic

functional thinking applying the philosophy of functional programming to system design &

Operating System Review Chi Zhang czhang@cs.fiu.edu 1 About the Course Prerequisite: COP

UNIX System Programming UNIX System Programming 1. What is a Process? 1. What is a Process?

System Programming in Windows Naming in Windows: Kernel Objects and Kernel Object Handles

? P12 2 Getting Started/Lab Programming Lab Programming Program of Requirements PRELIMINARY

CS 241: Systems Programming Lecture 26. System Calls I Spring 2020 Prof. Stephen Checkoway 1

CS 423 Operating System Design: Introduction to Linux Kernel Programming (MP1 Q&A)

Programming for Performance 1 Textbook Definition of Real-time A Real-time System responds

CS2422 Assembly Language & System Programming November 30, 2006 Todays Topic

GPU programming Dr. Bernhard Kainz 1 Overview About myself Last week Motivation GPU

DEMONIC Programming Horsman Abramsky Outline Maxwells Demon and Landauers Hypothesis.

CS 5412/LECTURE 3 Ken Birman PROGRAMMING AN I O T SYSTEM Spring, 2019

Chapter 02 Programming Hardware Information Number System 2-2 Information definition

Type System for a Polymorphic Multi-Stage Programming Language Atsushi Igarashi (Kyoto Univ.)

Finite State Machines CS 3410 Computer System Organization & Programming [K. Bala, A. Bracy,

Cg: A system for programming Cg: A system for programming graphics - PowerPoint PPT Presentation

Cg: A system for programming Cg: A system for programming graphics hardware in a graphics hardware in a C-like language C-like language William R. Mark University of Texas at Austin R. Steven Glanville NVIDIA Kurt Akeley NVIDIA and

J ET B RAINS MPS MPS = Meta Programming System Implements the Language Oriented Programming

DATABASE SYSTEMS Database programming in a web environment Database System Course AGENDA FOR

Chapter 7 Programming Hardware Programming Barely Information (Low level programming

DATABASE SYSTEMS Database programming in a web environment Database System Course, 2016-2017

Multicore DSP Architecture and Programming O. Dahl 1 1 Electrical Engineering, Linkping

Alec Muffett Programming Holes PROGRAMMING GOOFS THAT WILL HOSE YOUR SYSTEM SECURITY (a purely

A Compiler and Run-1me System for Network Programming

Rust: system programming with guarantees CRI Monthly Seminar Arnaud Spiwack Pierre Guillou

Super advanced functional programming Or: dependently-typed programming in Agda Dr. Dominic

functional thinking applying the philosophy of functional programming to system design &amp;

Operating System Review Chi Zhang czhang@cs.fiu.edu 1 About the Course Prerequisite: COP

UNIX System Programming UNIX System Programming 1. What is a Process? 1. What is a Process?

System Programming in Windows Naming in Windows: Kernel Objects and Kernel Object Handles

? P12 2 Getting Started/Lab Programming Lab Programming Program of Requirements PRELIMINARY

CS 241: Systems Programming Lecture 26. System Calls I Spring 2020 Prof. Stephen Checkoway 1

CS 423 Operating System Design: Introduction to Linux Kernel Programming (MP1 Q&amp;A)

Programming for Performance 1 Textbook Definition of Real-time A Real-time System responds

CS2422 Assembly Language &amp; System Programming November 30, 2006 Todays Topic

GPU programming Dr. Bernhard Kainz 1 Overview About myself Last week Motivation GPU

DEMONIC Programming Horsman Abramsky Outline Maxwells Demon and Landauers Hypothesis.

CS 5412/LECTURE 3 Ken Birman PROGRAMMING AN I O T SYSTEM Spring, 2019

Chapter 02 Programming Hardware Information Number System 2-2 Information definition

Type System for a Polymorphic Multi-Stage Programming Language Atsushi Igarashi (Kyoto Univ.)

Finite State Machines CS 3410 Computer System Organization &amp; Programming [K. Bala, A. Bracy,

functional thinking applying the philosophy of functional programming to system design &

CS 423 Operating System Design: Introduction to Linux Kernel Programming (MP1 Q&A)

CS2422 Assembly Language & System Programming November 30, 2006 Todays Topic

Finite State Machines CS 3410 Computer System Organization & Programming [K. Bala, A. Bracy,