normal basis is usin ing novel concurrent seria ial
play

Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing - PowerPoint PPT Presentation

A New Mult ltipli licative In Inverse Archit itecture in in Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing and Mult ltipli lication Amin Monfared, Hayssam El-Razouk and Arash Reyhani-Masoleh Presented by: Arash


  1. A New Mult ltipli licative In Inverse Archit itecture in in Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing and Mult ltipli lication Amin Monfared, Hayssam El-Razouk and Arash Reyhani-Masoleh Presented by: Arash Reyhani-Masoleh Department of Electrical and Computer Engineering Western University, London, Ontario, Canada 24 th IEEE Symposium on Computer Arithmetic, 2017 1

  2. Outline β€’ Motivation β€’ Arithmetic operations over 𝐻𝐺(2 𝑛 ) using Gaussian Normal Basis (GNB) β€’ Proposed digit-level square-multiply architecture β€’ It computes 𝐡 Γ— 𝐢 2 𝑓 β€’ Both digits of inputs 𝐡 and 𝐢 are entered serially β€’ Denoted by Digit-Level Fully Serial-In Square-Multiply (DL-FSISM) β€’ Proposed inversion architecture β€’ It uses the DL-FSISM β€’ ASIC implementations and comparison β€’ Conclusions and future work 2

  3. Motivation: Fin inite Fields β€’ Many applications use arithmetic operations over 𝐻𝐺(2 𝑛 ) β€’ Cryptography: Elliptic Curve, AES β€’ Error control coding β€’ Reed-Solomon code β€’ There are different bases to represent a field element. β€’ Polynomial basis, normal basis (NB), dual basis, etc. β€’ In NB, squaring is free in hardware. 3

  4. Motivation: Gaussian Normal Basis is (GNB) β€’ GNB over 𝐻𝐺 2 𝑛 is a special class of NB and exists whenever 𝑛 is not divisible by 8. β€’ GNBs have been included in IEEE and NIST standards for ECDSA. β€’ Any field element 𝐡 can be represented as π‘›βˆ’1 𝑏 𝑗 𝛾 2 𝑗 , where 𝑏 𝑗 πœ—{0,1} and 𝐡 = ෍ 𝑗=0 {𝛾, … , 𝛾 2 π‘›βˆ’1 } is a GNB over 𝐻𝐺 2 𝑛 . β€’ In this paper, we consider GNB and propose new digit-level architectures for square-multiply and inversion. 4

  5. ic Operations over 𝐻𝐺(2 𝑛 ) using GNB Arit ithmetic GNB β€’ Addition β€’ Let 𝐡 and 𝐢 be two Field elements represented in GNB. β€’ The addition operation is bit-wise XOR operation of the coordinates of the two inputs: π‘›βˆ’1 (𝑏 𝑗 +𝑐 𝑗 )𝛾 2 𝑗 𝐡 + 𝐢 = ෍ 𝑗=0 β€’ Squaring β€’ Squaring operation is performed by right cyclic shift of the coordinates of 𝐡 : π‘›βˆ’1 𝐡 2 = ෍ 𝑏 𝑗 𝛾 2 𝑗+1 𝑗=0 β€’ It is free in hardware if all coordinates are available in parallel. 5

  6. Arit ithmetic ic Operations usin ing GNB: : Mult ltip ipli lication β€’ Finite field multiplication is more complex than addition and squaring. β€’ Multiplication can be implemented in digit-level architectures, in which the digit size can be chosen based on available resources. β€’ In this paper, we have used two different types of digit-level multiplier namely: β€’ Digit-Level Parallel-In Serial-Out (DL-PISO) β€’ Digit-Level Parallel-In Parallel-Out (DL-PIPO) β€’ Also, we have proposed a new multiplier/squarer architecture β€’ Digit-Level Fully Serial-In Square-Multiply (DL-FSISM). 6

  7. Arit ithmetic ic Operations usin ing GNB: : In Inversion β€’ Based on Fermat Little Theorem, an inversion can be calculated by β€’ 𝐡 βˆ’1 = 𝐡 2 𝑛 βˆ’2 ∈ 𝐻𝐺 2 𝑛 , 𝐡 β‰  0. β€’ In Itoh and Tsujii algorithm (ITA) [4], the number of multiplications is reduced based on decomposing 2 π‘›βˆ’1 βˆ’ 1 β€’ As an example for the NIST recommended field over 𝐻𝐺(2 233 ) : 2 232 βˆ’ 1 = (1 + 2)(1 + 2 2 )(1 + 2 4 )(1 + 2 8 (1 + 2 8 )(1 + 2 16 )(1 + 2 32 (1 + 2 32 )(1 + 2 64 (1 + 2 64 )))) β€’ The inversion using ITA takes a total of 10 iterations. β€’ Each iteration consists of one single digit-level parallel-in parallel- out (DL-PIPO) multiplication and one free squaring. -------------------------------------------------------------------------------------- 7 [4] T. Itoh and S. Tsujii , β€œA fast algorithm for computing multiplicative inverses in GF(2 m ) using normal bases,” Information and computation, vol. 78, no. 3, pp. 171 – 177, 1988.

  8. Arit rithmetic ic Operatio ions usi sing GNB: In Inversio ion ( cont’d) β€’ Our inversion flow diagram (based on ITA) uses an interleaved computations of digit-level parallel-in serial-out (DL-PISO) multiplier and our new DL-FSISM architecture. β€’ It only needs a total of 5 iterations. β€’ Each iteration consists of two single multiplications (and squarings) β€’ In this paper, we propose a new digit-level fully serial-in parallel-out square-multiply (DL-FSISM) architecture which performs concurrent squaring and multiplication without introducing any delay. 8

  9. Proposed Dig igit it-Level l Fully lly Se Seri rial-In Sq Square-Mult ltip iply ly (DL-FSISM) (D β€’ Let A and B be field elements and e be an integer. β€’ The proposed scheme reads the inputs of A and B digit-by- digit serially and concurrently computes 𝐺 = 𝐡 Γ— 𝐢 2 𝑓 . β€’ The composite operations of squaring and multiplication are concurrently performed without introducing any additional delay. 𝑛 β€’ For a digit size of 𝑒 bits, it would take ⌈ 𝑒 βŒ‰ clock cycles to generate the result 𝐺 = 𝐡 Γ— 𝐢 2 𝑓 . 9

  10. Proposed DL-FSISM: Key y Formulation Proposition 1: Let 𝐡 and 𝐢 be two 𝐻𝐺(2 𝑛 ) elements that are represented in GNB {𝛾, … , 𝛾 2 π‘›βˆ’1 } . One can compute 𝐺 = 𝐡𝐢 2 𝑓 , by proceeding from 𝑗 = 0 to 𝑙 βˆ’ 1 , the result 𝐺 = 𝐺 π‘™βˆ’1 = 𝐡 (π‘™βˆ’1) (𝐢 π‘™βˆ’1 ) 2 𝑓 is obtained using the following recurrence relation 𝐺 𝑗 = 𝐺 π‘—βˆ’1 2 𝑒 + Οƒ π‘˜=0 2 𝑓 π‘’βˆ’1 πœ€ π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 + 2 π‘’βˆ’π‘“ π‘’βˆ’1 πœ€ ) 2 𝑓 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 (Οƒ π‘˜=0 π‘˜ π‘›βˆ’1 𝑀 π‘š 𝛾 2 π‘š ∈ 𝐻𝐺 2 𝑛 . π‘˜ 𝑣, π‘Š = π‘£π‘Šπ›Ύ 2 π‘˜ , u πœ— 0,1 and π‘Š = Οƒ π‘š=0 where πœ€ 10

  11. Proposed DL-FSISM: Archit itecture π‘’βˆ’1 π‘’βˆ’1 π‘—βˆ’1 2 𝑒 + ෍ 2 𝑓 2 π‘’βˆ’π‘“ ) 2 𝑓 π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 𝐺 𝑗 = 𝐺 πœ€ + (෍ πœ€ π‘˜ π‘˜=0 π‘˜=0 β€’ Three registers X, a d(k-1-i)+d- 1 in1 1 d m-d d m B (i) Β»e n Y, and Z are d- 1 m e n in2 B B B m - - i - m m 0 k 1 k 1 B (i) + 0 m-d -1 <Y> initially cleared d n d 0 m -1 a d(k-1-i)+ 0 m <Z> in1 1 + d d 𝐡𝐢 2 𝑓 β€’ Digits of inputs 0 m in2 m m d are entered to X b d(k-1-i)+d- 1 in1 1 d and Y serially ((A (i-1) Β»d)Β«e n ) m-d m d- 1 m e n d in2 m + A (i- 1 ) from MSB A A A - - - e n 0 k 1 i k 1 n b d(k-1-i)+ 0 0 m-d -1 m m in1 <X> 1 d d β€’ After ⌈ 𝑛 n 0 m 𝑒 βŒ‰ clock in2 m cycles, Z contains 𝐡𝐢 2 𝑓 11

  12. Proposed DL-FSISM: Archit itecture (cont’d) π‘’βˆ’1 π‘’βˆ’1 π‘—βˆ’1 2 𝑒 + ෍ 2 𝑓 2 π‘’βˆ’π‘“ ) 2 𝑓 π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 π‘˜ Γ— 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 𝐺 𝑗 = 𝐺 πœ€ + (෍ πœ€ π‘˜=0 π‘˜=0 a d(k-1-i)+d- 1 in1 1 d m-d d m B (i) Β»e n d- 1 m e n in2 B B B - - i - m m m k 1 0 k 1 B (i) + 0 m-d -1 <Y> d n d 0 m -1 a d(k-1-i)+ 0 m <Z> in1 1 + d d 0 m in2 m m d b d(k-1-i)+d- 1 in1 1 d ((A (i-1) Β»d)Β«e n ) m-d m d- 1 m e n in2 d m + A (i- 1 ) A A A - - - e n 0 k 1 i k 1 n b d(k-1-i)+ 0 m πœ€ 0 m-d -1 m in1 <X> 1 π‘˜ d d n 0 m in2 m 1 1 in1 in2 1  m j j 1 m m m m e 1 e 1 m m 1 X 2 -e n X 2 e n e n e n 1 X X m m 1 m m m m e v m e v m n n 12

  13. Proposed In Inversion Archit itecture β€’ The inversion core is made by serially connecting of DL-PISO and DL-FSISM β€’ The register file only stores from the multipliers β€’ 𝑒 -bits register is Ξ΅ = {2,8,32,64} 32 added between two multipliers to shorten the critical path β€’ Each iteration selects one of inputs of multiplexers and takes ⌈ 𝑛 𝑒 βŒ‰ +1 clock cycles 13

  14. In Inversion Archit itecture Comparison (Number of It Iterations) Architecture Algorithm Multiplication Number of m = 163 m = 233 m = 283 m = 409 m = 571 type Iterations [4] ITA 1 Γ— Single N 1 9 10 11 11 13 [7, 6] TIT/MTIT 1 Γ— double N 2 5 9 8 7 8 [8] Optimal-3 1 Γ— double N 3 5 7 6 7 7 chain ⌈ N 1 Proposed ITA 2 Γ— Single 5 5 6 6 7 2 βŒ‰ Interleaved β€’ Our Proposed inversion architecture reduces the required number of iterations as compared with previous works. β€’ The best performance is achieved when 𝑛 = 233. [4] T. Itoh and S. Tsujii , β€œA fast algorithm for computing multiplicative inverses in GF(2m) using normal bases,” Information and computation, vol. 78, no. 3, pp. 171 – 177, 1988. [6] J. Hu, W. Guo , J. Wei, and R. Cheung, β€œFast and Generic Inversion Architectures Over GF(2m) Using Modified Itoh– Tsujii Algorithms,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 62, pp. 367– 371, April 2015. [7] R. Azarderakhsh, K. Jarvinen, and V. Dimitrov , β€œFast Inversion in GF(2m) with Normal Basis Using Hybrid - Double Multipliers,” IEEE Trans. Comput., vol. 63, pp. 1041 – 1047, April 2014. [8] K. Jarvinen, V. Dimitrov, and R. Azarderakhsh , β€œA Generalization of Addition Chains and Fast Inversions in Binary Fields,” IEEE Trans. Comput., vol. 64, pp. 2421 – 2432, Sept. 2015. 14

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend