This document is incomplete and in preliminary condition.
Proof by Partha Dasgupta, based on a proof by Zvi M. Kedem
p, q are distinct primes
N = p·q
Find a, b such that a·b = 1 mod (p-1)(q-1)
Encryption Key: e
= (b, n) Decryption Key:
d = (a, n)
Encryption of message m: Ee(m) = mb mod n = C (cipher)
Decryption of
C : Dd(C) = ya
mod n = m
So in order for
RSA to work we must have the property :
(mb)a = m mod n - [1]
In this document (7 pages, large font text), we will
· Prove [1]
· Show how to find p, q, a and b.
·
Show how to compute
mb fast
Preliminaries:
Zn = {0, 1, 2, n-1}
Z*n = {x <n-1 | x and n are relatively prime}
f(n) = number of elements of Zn that are relatively prime to n.
Hence f(n) = | Z*n|
How to find f(n)?
All those number that are not multiples of p or q are in f(n). If we count all the multiples of p and q.
0, 1
p, 2p, 3p,
.
(q-1)p q-1
q, 2q, 3q,
.
(p-1)q p-1
hence
f(n) = pq 1 (q 1) (p 1) = pq p q
+1 = (p 1)(q 1)
Example: p=3, q=5 n=15.
Now f(n) = 2*4 = 8 = {1, 2, 4, 7, 8, 11, 13, 14}
Claim 1: Z*n is closed under
multiplication mod n.
If a,b e Z*n then ab and n are relatively prime
i.e. ab shares no primes with n.
By definition of Z*n a, b do not share primes with n. Their product, ab, gets its primes from a and b and therefore does not share primes with n (if ab is less than n, else we have to do the following)
The product can be written as ab = an + g. We just need to show that g is in Z*n . But if it is not, then it shares primes with n and the right hand side is divisible by some prime that is a factor of n. But then, so is the left side, which is impossible as we showed that it is relatively prime with n.
EndClaim1
Definition:
Z*n = {b1, b2, , bf(n)}
For any a e Z*n,
Let Sa = {a·b1
mod n, a·b2
mod n,
, a·bf(n) mod n}
Claim 2: Sa Ί Z*
First, by Claim 1, all elements of Sa are in Z*n
Second, no two elements can be the same. Suppose they were, then for some bi and bj (bi < bj)
a· bi
= an
+ g
a· bj = bn + g
Subtracting, (bi bj)·a = (b-a)·n or x· a = y· n
x· a and y· n are the same product of primes. Since a and n do not share any common primes. All primes that form n has to appear in x.
Hence x>= n. That is a contradiction, as bi <m.
Since all elements of Sa are distinct, and in Z*n, then Sa and Z*n are identical.
Note that since all elements of Z*n are produced when a is multiplied by each element of Z*n, then the element 1 is also a result of such a multiplication. Hence we get the following: [Note: Not useful at this point]
Corollary: if a e Z*n then $ bk e Z*n, s.t. abk =
1 mod n
Corollary is proven in the prior paragraph.
Claim3: if a e Z*n
then af(n) = 1 mod n
Define c and A such that:
b1 · b2 · · bf(n) = c mod n
(ab1 · ab2 · · abf(n)) = A mod n [Note, A and c are less than n]
Now since ab1 mod n · ab2 mod n · · abf(n) = A mod n
By Claim 2, ab1 mod n, ab2 mod n, · abf(n) mod n is a permutation of Z*n
Hence: A = c (plain arithmetic)
Now distributing the sequence differently:
(ab1 · ab2 · · abf(n)) = af(n)· (b1 · b2 · · bf(n)) .. [1]
Let af(n) = a mod n
Taking the modulo of both sides of [1] uses rule [PD] below
A
= a · c,
Replacing A with c, c = a · c
, (plain arithmetic, not modulo).
Hence a = 1.
Thus: af(n) = 1 mod n
Rule [PD] [needs to
be stated better]
For all a, b e Zn (not Z*n)
if (ab = c mod n) and (a = x mod n) and (b = y mod n) and (xy = z mod n)
then c = z
Claim4: if a and f(n) are relatively prime then $ b, s.t. ab = 1 mod f(n)
If a and f(n) are relatively prime, then a e Z* f(n) and from Corollary of Claim 2 we know that b exists (and is a member of Z* f(n)).
Thus there exists a and b, both relatively prime to f(n), such that:
a·b = kf(n) + 1 (regular arithmetic)
Take a message m < n and choose a relatively prime to f(n) and find b such that a· b=1 mod f(n).
Now compute (ma)b using modulo n arithmetic:
(ma)b = mkf(n) +
1 = mkf(n) m = mf(n) mf(n)
mf(n) m
Take the modulo of the last term and since mf(n) = 1 mod n, then result is m.
Hence (ma)b = m mod n
Deficiency of this proof: The proof is for all messages in Z*n
If n=512 bit number, then the chance of a number being in Zn but not in Z*n is about 1025. That is negligible J
How to really find a, b?
We know that given a, b exists, but how to find them?
Find a, relatively prime to n (3, 5, 7 etc start with a small odd number and work your way up). Note that f(n) is even.
Then find b using extended Euclidean algorithm as follows
Extended
Euclidean Algorithm:
Given p and q, p>q the
algorithm finds x and y, such that
x·p
* y·q = GCD(p, q)
[note: regular arithmetic, x or y is negative]
So we use it as follows:
· We provide n and a as input and get x and y [note: GCD(a,n) = 1]
· We know a·b = α·n + 1
Or α·n
+ a·b
= 1
So b = y
Hence in modulo arithmetic,
· b = y, if y is positive and
· b = f(n) y, if y is negative
Now we need to find p, q and hence N.
p and q are large prime numbers. So the problem is to find large prime numbers. There is no good deterministic way of doing this. However we can do it with probabilistic algorithms.
Fact: There are lots of large prime numbers. The number of prime numbers below N is about N/(ln n) and hence for a random 2048 bit number, the probability of it being prime is about 0.0007(one in 1500).
Claim 5: If p is prime, for any a < p, ap-1 = 1 mod p
Since p is prime, a e Z* p and f(p) = p 1
Thus ap-1 = af(p) = 1 mod p
EndClaim
Claim
6: If p is prime, the equation x2 = 1 mod p .
has only 2 solutions, 1
and p1 (or 1 mod p)
Lets say the equation has 2 solutions, S1 and S2 .
Thus S12 = 1 + kp
Hence (S1+1) · (S1-1) = kp
This means either (S1+1) or (S1-1) or both is divisible by p.
Suppose both are divisible by p. But these two numbers are only 2 apart and unless if p=2 this is not possible.
Thus only one of them is divisible by p. If (S1+1)
is divisible, then
S1
= 1 mod p.
If (S1-1) is divisible by p then
S1
= - 1 mod p (or S1+1
= p-1 mod p)
Thus the two solutions have to be 1 and 1. (same happens for S2)
Choose a number p, randomly. This number, if large has a chance of being prime in the order of 1 in several thousand (good!).
Then choose a number a < p, randomly. We will use a to test the primality of p. First, we know that if p is prime, ap-1 is 1 (mod p). Secondly, we know that if for some x (where x is not 1 or p-1), if x2 is 1 or p-1 (mod p) then p is not prime.
Now we compute ap-1 fast. Since the computing involves squaring numbers, we can do the x2 test also.
Computing ax can be done as follows. Put 1 in a result variable. Take the binary representation of x. Then for every bit in the binary representation, working from left to right (MSB to LSB), for every 0 square the result variable. For every 1, square the result variable and multiply with a.
b[k]
b[k-1]
b[1] b[0] is the binary representation of p 1
result
= 1 // start with the value of
a0
for i = k downto 0 {
//from MSB to LSB
temp = result; // store prev result for
checking
result = (result * result) mod n //square prev
result
if
(result = 1) and (temp!=1) and (temp!=n-1)
then p is not prime; //by Claim 6
break;
if b[i]
= 1 then result = (result*a) mod n //mult by a
}
if result = 1 then
n is possibly prime // good
else p is not prime // bad
If the above test says possibly prime then the number p is not prime with probability 0.5. Hence if we run the test R times, then p is not prime, with probability (0.5)R. If R = 100 and for all the 100 tests the result was possibly prime, then the chance of the number being not prime is a one in a million.
So we select 1 large number. Test for primality about 500
times. In about a 2000 choices, we will find a prime number. Do it again for
another prime number. Now call them p and q. All this should take
about 1-2 seconds. Definitely under 10 secs.
And the rest, as they say is trivial J
Note that encryption and decryption uses the same fast exponenting algorithm as shown above.