Appendix D: Calculation Times | Cryptography in C and C++

Team-Fly

CALCULATION TIMES FOR SEVERAL FLINT/C functions, calculated with a Pentium III processor running at 500 MHz and 64 Mbyte main memory, are given in Tables D.1 and D.2. The times for n operations were measured and then divided by n. Depending on the functions, n ranged between 100 and 5 million. An additional table shows, for comparison, calculation times that were measured for several functions in the GNU Multi Precision Arithmetic library (GMP, version 2.0.2); cf. page 420.

Table D.1: Calculation times for several C functions (without assembler support)
	Binary digits of the arguments; time in seconds
	128	256	512	768	1024	2048
add_l	3.8 × 10⁻⁷	5.5 × 10⁻⁷	8.8 × 10⁻⁷	1.2 × 10⁻⁶	1.5 × 10⁻⁶	2.2 × 10⁻⁶
mul_l	1.9 × 10⁻⁶	5.1 × 10⁻⁶	1.6 × 10⁻⁵	3.3 × 10⁻⁵	5.6 × 10⁻⁵	2.1 × 10⁻⁴
sqr_l	1.5 × 10⁻⁶	3.7 × 10⁻⁶	1.1 × 10⁻⁵	2.1 × 10⁻⁵	3.5 × 10⁻⁵	1.3 × 10⁻⁴
div_l^[a]	2.6 × 10⁻⁶	5.8 × 10⁻⁶	7.6 × 10⁻⁵	2.7 × 10⁻⁵	5.1 × 10⁻⁵	7.8 × 10⁻⁴
mmul_l	7.5 × 10⁻⁶	2.1 × 10⁻⁵	6.6 × 10⁻⁵	1.4 × 10⁻⁴	2.3 × 10⁻⁴	8.9 × 10⁻⁴
msqr_l	7.3 × 10⁻⁶	1.9 × 10⁻⁵	6.1 × 10⁻⁵	1.2 × 10⁻⁴	2.1 × 10⁻⁴	8.1 × 10⁻⁴
mexpk_l	1.2 × 10⁻³	6.4 × 10⁻³	3.8 × 10⁻²	1.2 × 10⁻¹	2.1 × 10⁻¹	1.9
mexpkm_l	5.4 × 10⁻⁴	2.9 × 10⁻³	1.7 × 10⁻²	5.2 × 10⁻²	1.2 × 10⁻¹	8.6 × 10⁻¹
^[a]For the function div_l the number of digits refers to the dividend, while the divisor has half that number of digits.

Table D.2: Calculation times for several C functions (with 80×86 assembler support)
	Binary digits of the arguments; time in seconds
	128	256	512	768	1024	2048
mul_l	4.3 × 10⁻⁶	6.9 × 10⁻⁶	1.5 × 10⁻⁵	2.9 × 10⁻⁵	4.7 × 10⁻⁵	1.6 × 10⁻⁴
sqr_l	2.5 × 10⁻⁶	4.6 × 10⁻⁶	1.0 × 10⁻⁵	1.8 × 10⁻⁵	2.9 × 10⁻⁵	9.5 × 10⁻⁵
div_l^[a]	2.7 × 10⁻⁶	4.6 × 10⁻⁶	1.0 × 10⁻⁵	1.8 × 10⁻⁵	2.9 × 10⁻⁵	9.5 × 10⁻⁵
mmul_l	8.0 × 10⁻⁶	1.3 × 10⁻⁵	3.3 × 10⁻⁵	6.4 × 10⁻⁵	1.0 × 10⁻⁴	3.8 × 10⁻⁴
msqr_l	7.2 × 10⁻⁶	1.1 × 10⁻⁵	2.9 × 10⁻⁵	5.6 × 10⁻⁵	9.0 × 10⁻⁵	3.1 × 10⁻⁴
mexpk_l	1.3 × 10⁻³	4.3 × 10⁻³	1.9 × 10⁻²	5.3 × 10⁻²	1.1 × 10⁻¹	8.7 × 10⁻¹
mexpkm_l	7.6 × 10⁻⁴	3.3 × 10⁻³	1.7 × 10⁻²	4.9 × 10⁻²	1.1 × 10⁻¹	7.8 × 10⁻¹
^[a]For the function div_l the number of digits refers to the dividend, while the divisor has half that number of digits.

One can see clearly the savings that squaring achieves over multiplication. Even the advantage realized by Montgomery exponentiation in mexpkm_l() can been seen, which requires less than half the time for the exponentiation with mexpk_l(). An RSA step with a 2048-bit key can thereby, with application of the Chinese remainder theorem (cf. page 199), be computed in one-fourth of a second.

Table D.2 demonstrates the difference in time that results from the use of assembler routines. Assembler support results in a speed advantage of about 70% for the modular functions. The gap between multiplication and squaring remains stable at about 50%.

Since the two functions mulmon_l() and sqrmon_l() do not exist as assembler routines, in this comparison the exponentiation function mexpk_l() can catch up significantly to the Montgomery exponentiation mexpm_l(). Both functions are roughly equally fast. There exists here an interesting potential for further improvement in the performance (cf. Chapter 18) by suitable assembler extensions.

In the comparison between the FLINT/C and GMP functions (see Table D.3) one may see that the GMP multiplication and division are faster by 30% and 40% than the corresponding FLINT/C functions. However, with the routine of modular exponentiation, the functions of both libraries amount more or less to the same thing, which turns out to be the same for arguments with 4096 digits as well. Since the GMP is considered the fastest of the available libraries for large-integer arithmetic, we need not be dissatisfied with this result.

Table D.3: Calculation times for several GMP functions (with 80×86 assembler support)
	Binary digits of the arguments; time in seconds
	128	256	512	768	1024	2048
mpz_add	2.4 × 10⁻⁷	3.2 × 10⁻⁷	3.6 × 10⁻⁷	4.2 × 10⁻⁷	4.5 × 10⁻⁷	6.9 × 10⁻⁷
mpz_mul	9.8 × 10⁻⁷	3.0 × 10⁻⁶	1.1 × 10⁻⁵	2.2 × 10⁻⁵	4.1 × 10⁻⁵	4.8 × 10⁻⁵
mpz_mod^[a]	5.2 × 10⁻⁷	1.8 × 10⁻⁶	5.0 × 10⁻⁶	6.4 × 10⁻⁶	1.6 × 10⁻⁵	4.0 × 10⁻⁵
mpz_powm	4.5 × 10⁻⁴	2.6 × 10⁻³	1.7 × 10⁻²	5.2 × 10⁻²	1.7 × 10⁻¹	7.8 × 10⁻¹
^[a]For the function mpz_mod the number of digits refers to the dividend, while the divisor has half that number of digits.

Team-Fly