TripleSec - Symmetric Encryption in the Browser combining AES, Salsa20, and Twofish

TripleSec

Nowadays, responsible programmers should consider encrypting sensitive user data in the browser before it arrives at their servers. They furthermore ought to consider the possibility of weaknesses in popular cryptography (e.g., AES and SHA-1) or cryptographic resources (e.g., system entropy sources).

TripleSec is a simple, open-source triple-paranoid symmetric encryption library for the browser and Node.js. It encrypts data with Salsa20, AES, and Twofish, so that a compromise of one or two of the ciphers will not expose the secret.

Of course, encryption is only part of the story. TripleSec also: derives keys with PBKDF2 to defend against password-cracking and rainbow tables; authenticates with HMAC to protect against (adaptive) chosen-ciphertext attacks; and supplements the native entropy sources (window.crypto.getRandomValues in the browser and crypto.rng in Node.js) for fear they are weak.

Demo

Installation

For the browser, download triplesec.js (version 1.0.0).

In Node.js, run npm install triplesec.

How to Use It

Encryption is performed by the encrypt function. It periodically yields control to not lock up your CPU. When done, it calls back with (err, buffer).

JavaScript

triplesec.encrypt ({

    data:          new triplesec.Buffer('Pssst. I believe I love you.'),
    key:           new triplesec.Buffer('top-secret-pw'),
    progress_hook: function (step, iterations, i) { /* ... */ }

}, function(err, buff) {
  
    if (! err) { 
        var ciphertext = buff.toString('hex');
    }

});

CoffeeScript

triplesec.encrypt

    data:          new triplesec.Buffer 'Pssst. I believe I love you.'
    key:           new triplesec.Buffer 'top-secret-pw'
    progress_hook: (step, iterations, i) -> # ...

, (err, buff) ->

    ciphertext = buff.toString 'hex' unless err

TripleSec's decrypt is painless.

JavaScript

triplesec.decrypt ({

    data:          new triplesec.Buffer(ciphertext, "hex"),
    key:           new triplesec.Buffer('top-secret-pw'),    
    progress_hook: function (step, iterations, i) { /* ... */ }

}, function (err, buff) {

    if (! err) {
        var plaintext = buff.toString();
        console.log(plaintext);
    }

});

CoffeeScript

triplesec.decrypt

    data:          new triplesec.Buffer ciphertext, 'hex'
    key:           new triplesec.Buffer 'top-secret-pw'
    progress_hook: (step, iterations, i) -> # ...

, (err, buff) ->

    console.log buff.toString() unless err
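
The two calls above can be chained into a single round trip. The following is a minimal sketch rather than part of the library: roundTrip is a hypothetical helper name, and lib stands for the loaded TripleSec module (for example the result of require('triplesec') in Node.js). It uses only the encrypt, decrypt, and Buffer members shown above.

```javascript
// Sketch: encrypt a message, then decrypt it again with the same password.
// `lib` is assumed to be the loaded TripleSec module.
function roundTrip(lib, password, message, done) {
  lib.encrypt({
    data: new lib.Buffer(message),
    key: new lib.Buffer(password),
    progress_hook: function () { /* ignore progress in this sketch */ }
  }, function (err, encrypted) {
    if (err) { return done(err); }
    // Hex is just one convenient way to store or transmit the ciphertext.
    var ciphertext = encrypted.toString('hex');
    lib.decrypt({
      data: new lib.Buffer(ciphertext, 'hex'),
      key: new lib.Buffer(password),
      progress_hook: function () { /* ignore progress in this sketch */ }
    }, function (err, decrypted) {
      if (err) { return done(err); }
      done(null, ciphertext, decrypted.toString());
    });
  });
}
```

Calling roundTrip(triplesec, 'top-secret-pw', 'Pssst.', callback) should hand the callback the hex ciphertext and the recovered plaintext, or an error as its first argument if either step fails.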

Anatomy of Output

Output is the xor of 4 values, shown as rows below. The columns in the diagram are aligned. For more information, read on.

Algorithm Design

The TripleSec library encrypts data in four steps:

1. Key derivation. Given a user-provided password and a random salt value, generate five separate secret keys: one for each of the three ciphers (see Step 3), and two final keys for signing the ciphertext (see Step 4). This "key stretching" is done via PBKDF2. The "PRF" passed to PBKDF2 is the XOR of HMAC-SHA-512 and HMAC-SHA3. We use the XOR composition here to preserve and boost the pseudorandom property of the underlying two HMACs, therefore adding resilience to a break of either SHA-512 or SHA3 (which are fundamentally different algorithms). The output of this step is five separate keys, used below.
2. Initial value (IV) generation. A random number generator is queried to produce an IV for each of the three ciphers: a 192-bit IV for Salsa20; a 128-bit IV for Twofish; and a 128-bit IV for AES.
3. Cascading encryption. Each of the ciphers runs with the keys generated in Step 1 and the IVs generated in Step 2.
    1. Salsa20. The innermost cipher is a Salsa20 variant called XSalsa20. Like Salsa20, XSalsa20 is a stream cipher, meaning it can encrypt input texts of arbitrary length without a block cipher mode of operation. XSalsa20 takes a 192-bit nonce rather than Salsa20's 64-bit nonce, but is provably as secure. Given a key and an IV, XSalsa20 generates a pseudorandom pad, which is then XOR'ed with the input message. This step of the algorithm outputs the concatenation of the IV and the result of the XOR operation.
    2. Twofish-CTR. The output of the previous step (call it C₁) is the input of this step, which uses Twofish running in CTR mode. Let R₂ be the IV generated for Twofish in Step 2. Twofish-CTR works by encrypting R₂, R₂+1, R₂+2,... with Twofish, and concatenating the results to yield a pad the size of C₁. Call this pad P₂. Output (R₂ || (P₂ ⊕ C₁)), where "||" denotes concatenation, and "⊕" denotes XOR.
    3. AES-256-CTR. In the final encryption step, apply AES-256 running in CTR mode to the output of the Twofish-CTR step. As above, first XOR the output of the previous step with the pad output by AES-256-CTR. Then prepend the IV used.
4. HMAC (or "sign") the ciphertext. Finally, TripleSec "signs" the ciphertext to ensure that no adversary tampers with it. The data to be signed is everything generated to date: a small header that encapsulates the version of the algorithm (now at 1); the salt used in key derivation; and the output of the AES stage of the cascading encryption above. TripleSec "macs" with a concatenation of two HMACs: HMAC-SHA-512, and HMAC-SHA3, each run with a separate key. The final output is a concatenation of: the header; the salt; the signature; and the outermost ciphertext.
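
The layering in the cascading encryption step, where each stage outputs its IV followed by (pad ⊕ input) and the next stage encrypts that whole string, can be sketched with Node's built-in crypto module. This is an illustration of the structure only, under a loud assumption: Node's crypto module offers neither XSalsa20 nor Twofish, so the sketch substitutes AES-256-CTR with an independent key (and a 16-byte IV) for all three layers. The function names are hypothetical, and the output is not compatible with TripleSec's format.

```javascript
const crypto = require('crypto');

// One cascade layer: output IV || (pad XOR input), with the pad from CTR mode.
// Stand-in cipher: AES-256-CTR for every layer (TripleSec uses three different ciphers).
function ctrLayerEncrypt(key, input) {
  const iv = crypto.randomBytes(16);
  const cipher = crypto.createCipheriv('aes-256-ctr', key, iv);
  return Buffer.concat([iv, cipher.update(input), cipher.final()]);
}

// Undo one layer: split off the IV, regenerate the pad, XOR it away.
function ctrLayerDecrypt(key, input) {
  const iv = input.subarray(0, 16);
  const decipher = crypto.createDecipheriv('aes-256-ctr', key, iv);
  return Buffer.concat([decipher.update(input.subarray(16)), decipher.final()]);
}

// keys[0] is the innermost layer, keys[keys.length - 1] the outermost.
function cascadeEncrypt(keys, plaintext) {
  return keys.reduce((data, key) => ctrLayerEncrypt(key, data), plaintext);
}

function cascadeDecrypt(keys, ciphertext) {
  return keys.slice().reverse().reduce((data, key) => ctrLayerDecrypt(key, data), ciphertext);
}

const keys = [crypto.randomBytes(32), crypto.randomBytes(32), crypto.randomBytes(32)];
const message = Buffer.from('Pssst. I believe I love you.');
const sealed = cascadeEncrypt(keys, message);

// Only the outermost IV is visible; the two inner IVs are themselves encrypted.
console.log(sealed.length - message.length); // 48, i.e. three 16-byte IVs
console.log(cascadeDecrypt(keys, sealed).toString());
```

Note that the sketch stops before the final HMAC step, so on its own it provides no protection against tampering.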

Though this is not the exact composition suggested by Schneier in Applied Cryptography (Section 15.8 in the Second Edition), it is close. TripleSec never uses the output of one block cipher as input into the next, since chaining them that way might theoretically allow a crack of one cipher to be used to crack another. Rather, by merit of CTR mode, the three ciphers run on statistically independent IVs, so a crack of one will not spread up or down the chain. The TripleSec technique takes one further step not suggested by Schneier: it protects the inner IVs with the outer encryption algorithms and exposes only the outermost IV in the clear. Though we can't prove this makes the scheme more secure, it seems like a reasonable idea: why reveal cipher inputs if we don't have to? Finally, this algorithm has the added advantage that the output ciphertext only increases by a constant additive term (i.e., the lengths of the header, the salt, the HMAC and the three IVs). Schneier's technique inflates ciphertexts by a factor of N, where N is the number of independent ciphers used.

Similarly, TripleSec protects against a break in HMAC-SHA-512 by always combining it with an HMAC based on the Keccak hash algorithm (soon to become the SHA-3 standard). Note that TripleSec combines these HMACs in two different ways. For PBKDF2 in Step 1, TripleSec XORs the results of the two HMACs for pseudorandomness. For "signing" in Step 4, TripleSec concatenates the two results to preserve collision-resistance. Unlike the suspect compositions in TLS and SSH, these simple compositions don't require either SHA-512 or SHA-3 to be strongly collision-resistant; rather, just weakly collision-resistant in line with the original construction. See Anja Lehmann's dissertation for more details on combinations of hashes.
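
The two ways of combining the HMACs can be sketched with Node's crypto module. Several things here are assumptions rather than TripleSec's actual code: the function names are hypothetical, the XOR variant is assumed to run both HMACs under the same key, and Node's 'sha3-512' is the standardized SHA-3, which may not produce the same digests as the Keccak variant the library was written against. The sketch shows the shape of the two compositions, not interoperable output.

```javascript
const crypto = require('crypto');

function hmac(algorithm, key, data) {
  return crypto.createHmac(algorithm, key).update(data).digest();
}

// XOR composition: the kind of PRF handed to PBKDF2.
// The output stays 64 bytes long.
function xorPrf(key, data) {
  const a = hmac('sha512', key, data);
  const b = hmac('sha3-512', key, data);
  const out = Buffer.alloc(a.length);
  for (let i = 0; i < a.length; i++) {
    out[i] = a[i] ^ b[i];
  }
  return out;
}

// Concatenation: the kind of tag used for the final "signature".
// Each HMAC gets its own key, and the tag is 64 + 64 = 128 bytes.
function concatMac(key512, keySha3, data) {
  return Buffer.concat([
    hmac('sha512', key512, data),
    hmac('sha3-512', keySha3, data)
  ]);
}

// Recompute the tag and compare in constant time.
function verifyMac(key512, keySha3, data, tag) {
  const expected = concatMac(key512, keySha3, data);
  return tag.length === expected.length && crypto.timingSafeEqual(tag, expected);
}
```

With the concatenated tag, a forger has to defeat both HMACs at once, while the XOR output remains pseudorandom as long as either underlying HMAC does.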

Anticipated Questions

I can understand double encryption, but triple encryption is madness!

User data uploaded to a remote cloud-hosted server is nearly impossible to delete, so any encryption scheme has to be future-proof. The amount of time spent encrypting reasonably-sized plaintexts pales in comparison to (1) PBKDF2, which is intentionally slow; and (2) how long the ciphertext will sit on the server. Why not go the extra mile?

What's triplesec.Buffer?

It is a wrapper around either Node.js's Buffer or a browser equivalent. When you generate encrypted data, you can use the output buffer however you like. In our above examples, we converted to and from hex strings.

How does TripleSec generate randomness/entropy? Can I provide my own?

TripleSec first derives a random seed from a variety of sources: from window.crypto.getRandomValues in the browser; from crypto.rng in Node.js; from the millisecond field of your system time; and finally, from more-entropy, which counts how many floating-point-heavy computations can be done in a set amount of time. This data is then stirred together and becomes the seed for HMAC_DRBG, whose HMAC is the XOR of HMAC-SHA-512 and HMAC-SHA3.
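
The seed-then-expand idea can be sketched as follows. This is a simplified illustration, not TripleSec's generator: it follows the update and generate steps of HMAC_DRBG (NIST SP 800-90A) without reseed counters or additional input, it uses plain HMAC-SHA-512 instead of the XOR of two HMACs, and it gathers entropy only from Node's crypto.randomBytes and the clock, leaving out more-entropy. HmacDrbg and gatherSeed are hypothetical names.

```javascript
const crypto = require('crypto');

class HmacDrbg {
  constructor(seed) {
    this.K = Buffer.alloc(64, 0x00); // initial key: all zero bytes
    this.V = Buffer.alloc(64, 0x01); // initial value: all 0x01 bytes
    this._update(seed);
  }

  _hmac(key, ...parts) {
    const h = crypto.createHmac('sha512', key);
    parts.forEach((p) => h.update(p));
    return h.digest();
  }

  // Mix (optional) data into the K and V state.
  _update(data) {
    const hasData = Boolean(data && data.length);
    const extra = hasData ? data : Buffer.alloc(0);
    this.K = this._hmac(this.K, this.V, Buffer.from([0x00]), extra);
    this.V = this._hmac(this.K, this.V);
    if (hasData) {
      this.K = this._hmac(this.K, this.V, Buffer.from([0x01]), extra);
      this.V = this._hmac(this.K, this.V);
    }
  }

  // Produce nBytes of output, then advance the state.
  generate(nBytes) {
    const blocks = [];
    let produced = 0;
    while (produced < nBytes) {
      this.V = this._hmac(this.K, this.V);
      blocks.push(this.V);
      produced += this.V.length;
    }
    this._update(null);
    return Buffer.concat(blocks).subarray(0, nBytes);
  }
}

// Stir several sources together into one seed (only two sources in this sketch).
function gatherSeed() {
  const clock = Buffer.from(String(Date.now()));
  return crypto.createHash('sha512')
    .update(crypto.randomBytes(64))
    .update(clock)
    .digest();
}

const drbg = new HmacDrbg(gatherSeed());
console.log(drbg.generate(24).toString('hex')); // e.g. 24 bytes for a 192-bit IV
```

The point of the design is that a weak individual source does not hurt as long as at least one source contributes enough unpredictability to the seed.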

You may alternatively provide your own random number generator for encryption. Pass an rng function along with your other data. This function should take two arguments: the number of bytes needed, and a callback that you fire with a triplesec.WordArray containing the random data. You can create a WordArray from a triplesec.Buffer by simply calling WordArray.from_buffer(buffer).
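
A custom generator of the shape described above might look like the following sketch for Node.js. It relies only on what this page states (a function of the byte count and a callback, triplesec.Buffer, and WordArray.from_buffer); the helper name makeNodeRng is hypothetical, lib stands for the loaded TripleSec module, and the option name rng in the usage note is inferred from the text rather than confirmed.

```javascript
const crypto = require('crypto');

// Build an rng function of the form (nBytes, callback).
// `lib` is assumed to be the loaded TripleSec module.
function makeNodeRng(lib) {
  return function rng(nBytes, callback) {
    // Draw the requested number of bytes from Node's CSPRNG.
    const bytes = crypto.randomBytes(nBytes);
    // Hand them back as a WordArray, converting via a triplesec.Buffer.
    callback(lib.WordArray.from_buffer(new lib.Buffer(bytes)));
  };
}
```

Under those assumptions, the generator would be passed alongside the other options, as in triplesec.encrypt({ data: ..., key: ..., rng: makeNodeRng(triplesec) }, callback). Supplying your own generator replaces the library's entropy gathering, so it should only be done with a source you trust at least as much.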

How are passphrases salted?

PBKDF2 takes as input a salt in addition to a secret passphrase, to prevent an adversary from cracking many TripleSec-encrypted ciphertexts in parallel. TripleSec salts passphrases with a random 8-byte sequence that's included with the ciphertext. By default, TripleSec's triplesec.Encryptor object uses the same salt until you call triplesec.Encryptor.resalt. The advantage of salt reuse is that it's faster, since it avoids the intentionally slow PBKDF2 step. On the other hand, an adversary can tell if two different ciphertexts were encrypted in the same session if the salt is not reset.

Can I encrypt files with it, in the browser?

Yes, using HTML5 features you can access file data without uploading it to a server. We're also likely to add an additional interface to the encrypt and decrypt functions, where you provide a data function instead of a single Buffer, for large data performance. TripleSec was planned with this feature in mind, and it'll be easy to use.

If you implement a file-hosting service using TripleSec, let us know, and we'll link to it in the "Who's Using It" section below.

Why isn't library X good enough (for X in Clipperz, Forge, SJCL, CryptoJS, etc.)?

There are lots of great JS Crypto libraries out there, and we've borrowed from some to build TripleSec. But combining cryptographic primitives to achieve IND-CCA2 security involves many fussy decisions and much avoidance of implementation pitfalls. We want all to have access to higher-level primitives that can be applied with little thought. Hence TripleSec!

Is this provably secure?

We don't have any exact proof of security for a cascade of block ciphers in CTR mode. But we're pretty sure TripleSec's encryption can only be broken if all three algorithms are broken. We furthermore think that TripleSec is non-malleable (and hence IND-CCA2 secure) due to the HMAC step. Let us know if there's a simple proof (or a citation) that we missed.

If the input message size is n, how big is the ciphertext?

n + 200 bytes. The additive term is broken down as:

8 bytes for the header (which is [0x1c94d7de, 0x1]).
8 bytes for PBKDF2 salt
64 bytes for the HMAC-SHA512 signature
64 bytes for the HMAC-SHA3 signature
16 bytes for AES-256 IV
16 bytes for Twofish IV
24 bytes for Salsa20 IV
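
The arithmetic and the outer layout can be checked with a small parser. This is a sketch under assumptions: the field order follows the description in the algorithm section (header, salt, signature, outermost ciphertext), the two HMACs are assumed to appear in the order HMAC-SHA-512 then HMAC-SHA3, and the header words are read as big-endian. The names LAYOUT, OVERHEAD, and splitCiphertext are hypothetical. The Twofish and Salsa20 IVs are not separated out because they sit inside the encrypted body.

```javascript
// Outer fields that are visible without any key, with sizes in bytes.
const LAYOUT = [
  ['header', 8],
  ['salt', 8],
  ['hmacSha512', 64],
  ['hmacSha3', 64],
  ['aesIv', 16]
];

// Total overhead, including the two inner IVs hidden in the body.
const OVERHEAD = 8 + 8 + 64 + 64 + 16 + 16 + 24; // = 200

function splitCiphertext(buf) {
  if (buf.length < OVERHEAD) {
    throw new Error('ciphertext is shorter than the fixed overhead');
  }
  const fields = {};
  let offset = 0;
  for (const [name, size] of LAYOUT) {
    fields[name] = buf.subarray(offset, offset + size);
    offset += size;
  }
  fields.magic = fields.header.readUInt32BE(0);
  fields.version = fields.header.readUInt32BE(4);
  // Body = encrypted (Twofish IV || Salsa20 IV || message), i.e. n + 40 bytes.
  fields.body = buf.subarray(offset);
  fields.plaintextLength = buf.length - OVERHEAD;
  return fields;
}

// Synthetic example: a real header followed by zero bytes standing in
// for the remaining fields of a 28-byte message.
const sample = Buffer.concat([
  Buffer.from('1c94d7de00000001', 'hex'),
  Buffer.alloc(192 + 28)
]);
const parts = splitCiphertext(sample);
console.log(parts.magic.toString(16), parts.version, parts.plaintextLength);
// prints: 1c94d7de 1 28
```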

How do I verify the implementation against known test vectors?

In the browser, you can visit our browser-based test page. If you have Node.js on your system, you can clone the GitHub repo and run make test. We've checked all algorithms against known test vectors, with the exception of the XSalsa20 extension to Salsa20, which doesn't have published test vectors. For the XSalsa20 extension, we check outputs against the official Go Language Crypto library. We still check the underlying Salsa20 core against published test vectors.

I read someplace that it's impossible to write real crypto in JavaScript.

There are well-read articles on this topic, but we don't agree with a lot of the rhetoric. Of course you should deliver your Crypto libraries over TLS, and nowadays, that's accepted and common. And maybe JavaScript isn't the most convenient language to write Crypto code in, but it still can express all the necessary primitives. Browsers have good CSPRNGs now, and even if you don't trust Apple and/or Linux and/or Chrome, we have some good workarounds (see above). True, one needs to take care not to overflow 32 bits, but with a robust testing suite against known test vectors, one can rule out this class of bugs. Of course one shouldn't allow untrusted libraries to trample one's trusted primitives, but that's true of any language (see LD_PRELOAD attacks against libraries written in C). A shortcoming we encountered in writing TripleSec is that JavaScript doesn't offer destructors, so it's inconvenient to scrub buffers properly. TripleSec has taken care to do this job manually. If you spot some unscrubbed buffers, please let us know.

We are as worried as anyone else about XSS attacks, CSRF attacks and the ability of third party code to tamper with vetted Crypto code. But these attacks and the quality of Crypto libraries are orthogonal concerns. Those sites with high quality JS libraries should feel confident encrypting data with TripleSec. Those with lots of unvetted third party JS code won't gain much.

Is there a streaming interface?

Not yet, it's in progress. The current interface requires the file to be fully loaded into memory before it's encrypted, but the current file format is compatible with streaming (with a single seek to write the HMACs).

Implementations Outside JavaScript

We welcome ports, and we'll list such projects here. The TripleSec checkout has test vectors which your implementation should match.

Who's Using It?

For starters, we are (Max Krohn & Chris Coyne), co-founders of OkCupid. We're working on an unrelated site now, and TripleSec will be used to encrypt our users' keys.

If you use TripleSec for something public, please contact us. We'll mention you here.

Here are some ideas, in case you're feeling ambitious:

a TripleSec browser extension for highlighting and encrypting/decrypting text on any page
diary/journal
key storage
password manager
Bitcoin wallet

Can I help?

Please! Above all else, we encourage review of both our algorithm and the source code.

How do I reach you?

Our email addresses are right here. Please enter the password peppermint patty in the demo box, and this as the ciphertext:

1c94d7de000000018bcc834fefa40a8a8bb16adb750ebc68a443e2144b79a270d546a5040da5b68d73bc36feb2e36f015ba5c165e71947153e9ad41be84bf049433582e2e2da250d81e35dd65988a01eae8a15bf0d693a0c332835134d6d2e7474e4fd0d424e544e3329901c040490deca520077d19da5bd8842ba2a35ffc2fa9ec7cbac16a09011a22dc588cc5ba473c464fcdbdc21894360a5943ac7131bfcfafedbf6306575aaee684bd1356a78323e84f782b3272e1789c4f1b9d00029ebd7dd81099d72d9ab9efb2f60deb790c51df5cea7049a59e26210830f8cf5d37b17f156e9f6d4c87c9c5b81b30f59a55d0e201ef202cc4d5bfba948d94a2c626e864337c1fe9980e41646751d853c83923269f2579f376cfd30

Or see the discussion on Hacker News.