A friend of mine asked me how OTR can guarantee that we're not MITM'ed, and I thought that maybe she was not the only one wondering about this crypto-powered black-magic, so I wrote this small blogpost.

Since not everyone is well versed into mathematics or computer science, every explanation will have two parts: an "explain me like I'm five" one, and a more mathematical one. As usual in cryptography, our protagonists will be Alice and Bob who're trying to communicate together, while Eve will try to eavesdrop on them.

To perform mutual-authentication, OTR uses something called the Social Millionaire's Protocol (SMP). It's based on Yao's Millionaires' Problem, which was proposed by Andrew C. Yao, in 1982.

Two millionaires wish to know who is richer; however, they do not want to find out inadvertently any additional information about each other’s wealth. How can they carry out such a conversation?

Diffie-hellman

The solution to this problem used by OTR is using the Diffie-Hellman key exchange (DH) as a primitive. As you may have guessed, this scheme was invented by Whitfield Diffie and Martin Hellman, in 1976. It allows two parties that have no prior knowledge of each other to establish a shared secret, over an insecure communication channel. This primitive is not authenticated, meaning that it's vulnerable to MITM.

Wikipedia has an awesome picture to explain this process with paint:

Wikipedia's picture of Diffie-Hellman

And now some mathematics:

Let p be a prime number (A prime number is greater than 1, and can only be divided by itself or by 1), and g a primitive root modulo p.

Primitive root modulo p means that every number coprime (Coprimes numbers have no common divisors except 1.) to p is congruent to a power of g modulo n. This can be summarized as: g primitive root modulo p ⇔ ∀n, n⊥p, ∃k, g^k ≡ n (mod p)

In OTR, both p and g are fixed: the first one to a 1536-bit prime, and the later is equal to 2. This is how it works:

Alice is choosing a random number a, and she sends over an insecure channel A = g^a mod p to Bob.
Bob is choosing a random number b, and he sends over an insecure channel B = g^b mod p to Alice.
Alice computes s = B^a mod p.
Bob computes s = A^b mod p.

Now Alice and Bob are sharing the same secret s.

When numbers are big enough, given g, p, g^b mod p and g^a mod p, it's really super hard to find a or b, even for super-computers. This is knows as the discrete logarithm problem.

Ok, now we have a way to create a shared secret, but we're still vulnerable to MITM.

Socialist millionaire protocol

You may wonder how we could use the SMP to establish authentication without being vulnerable to MITM. Think of the wealth amount as a shared secret : You can check if both of you are knowing it, without disclosing it. If someone is trying to MITM you, she'll have to guess this amount.

Someone else already came up with a great explanation of the SMP, and I didn't managed to find something better.

So let's jump to the mathematical side, straight from OTR 3.4 specification:

Alice
- Picks random exponents a₂ and a₃
- Sends Bob g_2a = g₁^a₂ and g_3a = g₁^a₃
Bob
- Picks random exponents b₂ and b₃
- Computes g_2b = g₁^b₂ and g_3b = g₁^b₃
- Computes g₂ = g_2a^b₂ and g₃ = g_3a^b₃
- Picks random exponent r
- Computes P_b = g₃^r and Q_b = g₁^r g₂^y
- Sends Alice g_2b, g_3b, P_b and Q_b
Alice
- Computes g₂ = g_2b^a₂ and g₃ = g_3b^a₃
- Picks random exponent s
- Computes P_a = g₃^s and Q_a = g₁^s g₂^x
- Computes R_a = (Q_a / Q_b) ^a₃
- Sends Bob P_a, Q_a and R_a
Bob
- Computes R_b = (Q_a / Q_b) ^b₃
- Computes R_ab = R_a^b₃
- Checks whether R_ab == (P_a / P_b)
- Sends Alice R_b
Alice
- Computes R_ab = R_b^a₃
- Checks whether R_ab == (P_a / P_b)

Proof

R_ab = R_a^b₃ = (Q_a / Q_b) ^a₃·b₃ = (g₁^{s - r} · g₂^{x - y}) ^a₃·^b₃ = g₁^{a₃·b₃·(s - r)} · g₂^{(x - y)·a₃·b₃}

P_a / P_b = g₃^s / g₃^r = g₃^{s - r} = g_3a^{b₃·(s - r)} = g₁^{a₃·b₃·(s - r)}

This means that

g₂^a₃·^{b₃·(x - y)} = 1

Since g₂^a₃·^b₃ is a random number different from 1, the only solution is that x - y = 0, meaning that x = y.

Thanks to the discrete logarithm problem, an attacker eavesdropping on the connection wouldn't be able to guess the secret. If she was trying to MITM with the wrong secret, the final check will fail.

The common secret

When Alice and Bob are initiating an OTR communication, they are first creating a shared secret with Diffie-Hellman, then are doing a mutual authentication. This process is named Authenticated Key Exchange (AKE), and produces the shared secret s.

Don't be fooled by the term mutual authentication, here, we're speaking of keys, not persons: the communication is secure between those two keys, but at this point, Alice and Bob can't be sure (Unless they know each other fingerprints.) that they are not MITM'ed.

Why is the secret x (or y) so complicated? Because this ensures that you're doing the authentication with the right fingerprints, in the right session.

If you have used a decent OTR client, you'll noticed that you can either use a shared secret, or ask a question. When you're choosing the later, the answer to the question is used as the shared secret, while the question is transmitted along with the first message. You can also of course manually verify the fingerprint, but this is less convenient.

Conclusion

If hope that now you understand how the magic authentication of OTR is working, and that you're eager to know more about this wonderful protocol by reading its specification, or at least, to use it for your IM communications.

By the way, if you know some C, feel free to help us improving libotr, the reference implementation of OTR ;)

Artificial truth

archives | latest | homepage

Social Millionaire's Protocol in OTR
Thu 08 January 2015 — download

Diffie-hellman

Socialist millionaire protocol

Proof

The common secret

Conclusion