Short Division of Long Integers (joint work with David Harvey) Paul - PowerPoint PPT Presentation

Short Division of Long Integers (joint work with David Harvey) Paul Zimmermann October 6, 2011

The problem to be solved Divide efficiently a p -bit floating-point number by another p -bit f-p number in the 100-10000 digit range Paul Zimmermann - Short Division of Long Integers October 6, 2011- 2

From www.mpfr.org/mpfr-3.0.0/timings.html (ms): Maple Mathematica Sage GMP MPF MPFR digits 12.00 6.0.1 4.5.2 5.0.1 3.0.0 100 mult 0.0020 0.0006 0.00053 0.00011 0.00012 div 0.0029 0.0017 0.00076 0.00031 0.00032 sqrt 0.032 0.0018 0.00132 0.00055 0.00049 1000 mult 0.0200 0.007 0.0039 0.0036 0.0028 div 0.0200 0.015 0.0071 0.0058 0.0040 sqrt 0.160 0.011 0.0064 0.0049 0.0047 10000 mult 0.80 0.28 0.11 0.107 0.095 div 0.80 0.56 0.28 0.198 0.261 sqrt 3.70 0.36 0.224 0.179 0.176 Paul Zimmermann - Short Division of Long Integers October 6, 2011- 3

What is GMP (GNU MP)? ◮ the most popular library for arbitrary-precision arithmetic ◮ distributed under a free license (LGPL) from gmplib.org ◮ main developer is Torbj¨ orn Granlund ◮ contains several layers: mpn (arrays of words), mpz (integers), mpq (rationals), mpf (floating-point numbers) ◮ mpn is the low-level layer, with optimized assembly code for common hardware, and provides optimized implementations of state-of-the-art algorithms Paul Zimmermann - Short Division of Long Integers October 6, 2011- 4

Can we do better than GMP? An anonymous reviewer said: What are the paper’s weaknesses? The resulting performance, in the referee’s opinion, is only marginally better a standard exact-quotient algorithm in GMP. One can expect about 10% improvement. It seems to be a weak result for the sophisticated recursive algorithm with the big error analysis effort. Paul Zimmermann - Short Division of Long Integers October 6, 2011- 5

What is GNU MPFR? ◮ a widely used library for arbitrary-precision floating-point arithmetic ◮ distributed under a free license (LGPL) from mpfr.org ◮ main developers are Guillaume Hanrot, Vincent Lef` evre, Patrick P´ elissier, Philippe Th´ eveny and Paul Zimmermann ◮ contrary to GMP mpf , implements correct rounding and mathematical functions (exp , log , sin , ... ) ◮ implements Sections 3.7 (Extended and extendable precisions) and 9.2 (Recommended correctly rounded functions) of IEEE 754-2008 ◮ aims to be (at least) as efficient than other arbitrary-precision floating-point without correct rounding Paul Zimmermann - Short Division of Long Integers October 6, 2011- 6

The problem to be solved (binary fp division) Assume we want to divide a > 0 of p bits by b > 0 of p bits, with a quotient c of p bits. First write a = m a · 2 e a and b = m b · 2 e b such that: ◮ m b has exactly p bits ◮ 2 p − 1 ≤ m a / m b < 2 p ( m a has 2 p − 1 or 2 p bits) The problem reduces to finding the p -bit correct rounding of m a / m b with the given rounding mode. We do not assume that the divisor b is invariant, thus we do not allow precomputations involving b . Paul Zimmermann - Short Division of Long Integers October 6, 2011- 7

Division routine mpfr div in MPFR 3.0.x The MPFR division routine relies on the (GMP) low-level division with remainder mpn divrem . mpn divrem computes q and r such that m a = qm b + r with 0 ≤ r < m b . Since 2 p − 1 ≤ m a / m b < 2 p , q has exactly p bits. The correct rounding of the quotient is q or q + 1 depending on the rounding mode. For rounding to nearest, if r < m b / 2, the correct rounding is q ; if r > m b / 2, the correct rounding is q + 1. Paul Zimmermann - Short Division of Long Integers October 6, 2011- 8

What’s new with GMP 5? In GMP 5, the floating-point division ( mpf div ) calls mpn div q , which only computes the (exact) quotient, and is faster (on average) than mpn divrem or its equivalent mpn tdiv qr . This is based on an approximate Barrett’s algorithm, presented at ICMS 2006. In most cases computing one more word of the quotient is enough to decide the correct rounding: ◮ pad the dividend with two zero low words ◮ pad the divisor with one zero low word ◮ one will obtain one extra quotient low word Paul Zimmermann - Short Division of Long Integers October 6, 2011- 9

Our goal Design an approximate division routine for arrays of n words An array of n words [ a n − 1 , ..., a 1 , a 0 ] represents the integer a n − 1 β n − 1 + · · · + a 1 β + a 0 with β = 2 64 We want a rigorous error analysis and a O ( n ) error Paul Zimmermann - Short Division of Long Integers October 6, 2011- 10

Plan of the talk ◮ Mulders’ short product ◮ Mulders’ short division ◮ Barrett’s algorithm ◮ ℓ -fold Barrett’s algorithm (cf Hasenplaugh, Gaubatz, Gopal, Arith’18) Paul Zimmermann - Short Division of Long Integers October 6, 2011- 11

Mulders’ short product for polynomials (2000) Short product: compute the upper half of U · V , U and V having n terms (degree n − 1) With Karatsuba’s multiplication, can save 20% over a full product. Paul Zimmermann - Short Division of Long Integers October 6, 2011- 12

Our variant of Mulders’s algorithm for integers Algorithm ShortMul. Input: U = � n − 1 i = 0 u i β i , V = � n − 1 i = 0 v i β i , integer n Output: an integer approximation W of UV β − n 1: if n < n 0 then W ← ShortMulNaive ( U , V , n ) 2: 3: else choose a parameter k , n / 2 + 1 ≤ k < n , ℓ ← n − k 4: write U = U 1 β ℓ + U 0 , V = V 1 β ℓ + V 0 5: 1 β k + U ′ 1 β k + V ′ write U = U ′ 0 , V = V ′ 6: 0 W 11 ← Mul ( U 1 , V 1 , k ) ⊲ 2 k words 7: W 10 ← ShortMul ( U ′ 1 , V 0 , ℓ ) ⊲ ℓ most significant words 8: W 01 ← ShortMul ( U 0 , V ′ 1 , ℓ ) ⊲ ℓ most significant words 9: W ← ⌊ W 11 β 2 ℓ − n ⌋ + W 10 + W 01 10: Paul Zimmermann - Short Division of Long Integers October 6, 2011- 13

Lemma The output of Algorithm ShortMul satisfies UV β − n − ( n − 1 ) < W ≤ UV β − n . (In other words, the error is less than n ulps.) Paul Zimmermann - Short Division of Long Integers October 6, 2011- 14

Mulders’ short division (2000) U is unknown V is known W = UV is known 1. estimate U high from V high and W high , subtract U high V high from W 2. compute U ′ high V low and subtract from W 3. estimate U low from V ′ high and the remainder W Paul Zimmermann - Short Division of Long Integers October 6, 2011- 15

Our variant of Mulders’ short division for integers Algorithm ShortDiv. Input: W = � 2 n − 1 i = 0 w i β i , V = � n − 1 i = 0 v i β i , with V ≥ β n / 2 Output: an integer approximation U of Q = ⌊ W / V ⌋ 1: if n < n 1 then U ← Div ( W , V ) ⊲ Returns ⌊ W / V ⌋ 2: 3: else choose a parameter k , n / 2 < k ≤ n , ℓ ← n − k 4: write W = W 1 β 2 ℓ + W 0 , V = V 1 β ℓ + V 0 , V = V ′ 1 β k + V ′ 5: 0 ( U 1 , R 1 ) ← DivRem ( W 1 , V 1 ) 6: 1 β k − ℓ + S with 0 ≤ S < β k − ℓ write U 1 = U ′ 7: T ← ShortMul ( U ′ 1 , V 0 , ℓ ) 8: W 01 ← R 1 β ℓ + ( W 0 div β ℓ ) − T β k 9: while W 01 < 0 do ( U 1 , W 01 ) ← ( U 1 − 1 , W 01 + V ) 10: U 0 ← ShortDiv ( W 01 div β k − ℓ , V ′ 1 , ℓ ) 11: return U 1 β ℓ + U 0 12: Paul Zimmermann - Short Division of Long Integers October 6, 2011- 16

Lemma The output U of Algorithm ShortDiv satisfies, with Q = ⌊ W / V ⌋ : Q ≤ U ≤ Q + 2 ( n − 1 ) . (In other words, the error is less than 2 n ulps.) Paul Zimmermann - Short Division of Long Integers October 6, 2011- 17

The optimal cutoff k in ShortMul and ShortDiv heavily depends on n . There is no simple formula. Instead, we determine the best k ( n ) by tuning, for say n < 1000 words (about 20000 digits). For ShortMul the best k varies between 0 . 5 n and 0 . 64 n , for ShortDiv it varies between 0 . 54 n and 0 . 88 n (for a particular computer and a given version of GMP). Paul Zimmermann - Short Division of Long Integers October 6, 2011- 18

Barrett’s Algorithm (1987) Goal: given W and V , compute quotient Q and remainder R : W = QV + R 1. compute an approximation I of 1 / V 2. compute an approximation Q = WI of the quotient 3. (optional) compute the remainder R = W − QV and adjust if necessary When V is not invariant, computing 1 / V is quite expensive: ◮ ℓ -fold reduction from Hasenplaugh, Gaubatz, Gopal (Arith’18, 2007) (LSB variant) ◮ for ℓ = 2, HGG is exactly Karp-Markstein division (1997) Paul Zimmermann - Short Division of Long Integers October 6, 2011- 19

2-fold division (Karp-Markstein) 1. compute an approximation I of 1 / V to n / 2 words 2. deduce the upper n / 2 words Q 1 = ShortMul ( W , I , n / 2 ) 3. subtract Q 1 V from W , giving W ′ 4. deduce the lower n / 2 words Q 0 = ShortMul ( W ′ , I , n / 2 ) In step 3, Q 1 V is a ( n / 2 ) × n multiplication, giving a 3 n / 2 product. However, we know the upper n / 2 words match with W , and we are not interested in the lower n / 2 words. This is exactly a middle product (Hanrot, Quercia, Zimmermann, 2004): ❅ ❅ lower ❅ ❅ middle Q 1 ❅ ❅ product ❅ ❅ upper ❅ ❅ V Paul Zimmermann - Short Division of Long Integers October 6, 2011- 20

The 3-fold division algorithm Paul Zimmermann - Short Division of Long Integers October 6, 2011- 21

Short Division of Long Integers (joint work with David Harvey) Paul - PowerPoint PPT Presentation

Short Division of Long Integers (joint work with David Harvey) Paul Zimmermann October 6, 2011 The problem to be solved Divide efficiently a p -bit floating-point number by another p -bit f-p number in the 100-10000 digit range Paul

Basic Types C s basic (built-in) types: o Integer types, including long integers, short

CS 4803 If a, N are integers with N > 0 then there are unique integers r , q such that a = Nq

The Long and Short of Passwords Rich Shay November 5, 2009 1 / 34 The Long and Short of

Travelling Integers Number of players 2 (or more) Adding and subtracting integers Math

Minimal Fractional Representations of Integers mod M David Greve ACL2 Workshop May, 2020

Using lots of space to save lots of time. Our desktop PCs have 32-bit integers; there are 2 32 10 9

Integers Today ! Numeric Encodings ! Programming Implications ! Basic operations ! Programming

Week 5 Integers Discrete Math Marie Demlov http://math.feld.cvut.cz/demlova March 19, 2020

Representing Big Integers Multiple-precision integers cant be stored in a single machine word

Division Definition : Let a and b be integers. We say that a divides b , or a | b if d s.t. b =

Division Definition : Let a and b be integers. We say that a divides b , or a | b if d s.t. b =

2016 ANNUAL GENERAL MEETING Short Sea Shipping is OUR BUSINESS 2 Short Sea Shipping is OUR

GSM Short Message Service GSM Short Message Service GSM Short Message Service GSM Short Message

Lecture 15: Integers and Division Dr. Chengjiang Long Computer Vision Researcher at Kitware Inc.

1 Location Attendance 1 st Division 20 2 nd Division 34 3 rd Division 36 4 th Division 34 5

Class 15 - Long Short-Term Memory (LSTM) Class 15 - Long Short-Term Memory (LSTM) Study materials

k -times Full Traceable Ring Signature Xavier Bultel Pascal Lafourcade 31 August 2016, P .

9.520 Math Camp Probability Theory Say we have some training data S ( n ) , consisting of n

Access to community mental health services How COVID19 is shaping our approach Steve Appleton

Infrastucture and IT support for the MAGIX collaboration Stefano Caiazza Mainz, February 17 2017

All Ireland Schwartz Rounds and QI Conference People Make Change Happen | #QIreland

Earth's Layers Three Types of Rocks Early Life on Earth / Fossils Rock Strata Return to

Dante Firefighters are our heroes. Lets make them our super heroes. OCTOBER 5TH 2017 | RED B

CSCI 5417 Information Retrieval Systems Jim Martin Lecture 12 10/4/2011 Today 10/4

Short Division of Long Integers (joint work with David Harvey) Paul - PowerPoint PPT Presentation

Short Division of Long Integers (joint work with David Harvey) Paul Zimmermann October 6, 2011 The problem to be solved Divide efficiently a p -bit floating-point number by another p -bit f-p number in the 100-10000 digit range Paul

Basic Types C s basic (built-in) types: o Integer types, including long integers, short

CS 4803 If a, N are integers with N &gt; 0 then there are unique integers r , q such that a = Nq

The Long and Short of Passwords Rich Shay November 5, 2009 1 / 34 The Long and Short of

Travelling Integers Number of players 2 (or more) Adding and subtracting integers Math

Minimal Fractional Representations of Integers mod M David Greve ACL2 Workshop May, 2020

Using lots of space to save lots of time. Our desktop PCs have 32-bit integers; there are 2 32 10 9

Integers Today ! Numeric Encodings ! Programming Implications ! Basic operations ! Programming

Week 5 Integers Discrete Math Marie Demlov http://math.feld.cvut.cz/demlova March 19, 2020

Representing Big Integers Multiple-precision integers cant be stored in a single machine word

Division Definition : Let a and b be integers. We say that a divides b , or a | b if d s.t. b =

Division Definition : Let a and b be integers. We say that a divides b , or a | b if d s.t. b =

2016 ANNUAL GENERAL MEETING Short Sea Shipping is OUR BUSINESS 2 Short Sea Shipping is OUR

GSM Short Message Service GSM Short Message Service GSM Short Message Service GSM Short Message

Lecture 15: Integers and Division Dr. Chengjiang Long Computer Vision Researcher at Kitware Inc.

1 Location Attendance 1 st Division 20 2 nd Division 34 3 rd Division 36 4 th Division 34 5

Class 15 - Long Short-Term Memory (LSTM) Class 15 - Long Short-Term Memory (LSTM) Study materials

k -times Full Traceable Ring Signature Xavier Bultel Pascal Lafourcade 31 August 2016, P .

9.520 Math Camp Probability Theory Say we have some training data S ( n ) , consisting of n

Access to community mental health services How COVID19 is shaping our approach Steve Appleton

Infrastucture and IT support for the MAGIX collaboration Stefano Caiazza Mainz, February 17 2017

All Ireland Schwartz Rounds and QI Conference People Make Change Happen | #QIreland

Earth's Layers Three Types of Rocks Early Life on Earth / Fossils Rock Strata Return to

Dante Firefighters are our heroes. Lets make them our super heroes. OCTOBER 5TH 2017 | RED B

CSCI 5417 Information Retrieval Systems Jim Martin Lecture 12 10/4/2011 Today 10/4

CS 4803 If a, N are integers with N > 0 then there are unique integers r , q such that a = Nq