| 1
Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019
ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH - - PowerPoint PPT Presentation
ENHANCING SCIENTIFIC COMPUTATION USING A VARIABLE PRECISION FPU WITH A RISC-V PROCESSOR Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019 | 1 USE CASES FOR (LARGE) VARIABLE PRECISION Applications Techniques & Kernels
| 1
Y.Durand, C.Fabre, A. Bocco, T. Trevisan | IMPRENUM Project | Oct 2019
| 2 Y.Durand | Oct 2019
differences
| 3
we need
the FPU,
(𝑙)
(𝑙+1) = 1 𝑏𝑗𝑗 (𝑐𝑗 − 𝜏)
Vector update :
memory Accumulation : Requires max precision should be done inside the FPU Matrix coeffs: read-only, sparse doubles Stay in remote memory
end
| 4 Y.Durand | Oct 2019
(𝑙)
(𝑙+1) = 1 𝑏𝑗𝑗 (𝑐𝑗 − 𝜏)
Rocket tile
VP co-proc
RoCC
L&S
Risc V
FPU L&S $ L1
R A M
Scratchpad
$ L1/ L2/ L3
R A M
Input data, RO, in RAM, double format (sparse)
Internal format, for accumulation (high precision)
Intermediate vector, adjustable format (dense)
| 5
L1$
FPU VP scratchpad L1$ Distant Shared memory Standard core + specialized registers V.P Floating Point Unit (FPU) Large size registers for accumulation (eg 64 512b registers) Specific access to memory hierarchy LLC$ Large size (10s
close memory
Y.Durand | Oct 2019
| 6 Y.Durand | Oct 2018
kernel kernel
VP SOLVERS & ALGORITHMS Variable precision is contained within calls to kernel (BLAS level) and Solver (LaPack level) calls
| 7 Y.Durand | Oct 2019
memory
| 8 Y.Durand | April 2019
for scientific computing. In Conference on Next-Generation Arithmetic, March 2019
Precision Capabilities in RISC-V Processors, RISC-V Workshop Zurich (June 11 – 13, 2019)