
Re: [tor-dev] Error-Correcting Onions with Bech32

To: tor-dev@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-dev] Error-Correcting Onions with Bech32
From: nullius <nullius@xxxxxxxx>
Date: Sun, 31 Dec 2017 02:46:00 +0000
In-reply-to: <CAFWeb9KhgkGzQro4M1kJL4H5oay6+c3jquAYtPV9oMQEfcAHjg@mail.gmail.com>
References: <d7da66c02c838afd73e3384d09e8f59b@nym.zone> <CAFWeb9KhgkGzQro4M1kJL4H5oay6+c3jquAYtPV9oMQEfcAHjg@mail.gmail.com>

On 2017-12-31 at 00:57:49 +0000, Alec Muffett <alec.muffett@xxxxxxxxx> wrote:

> Thanks! That's very interesting!  TIL :-)

Why, if it isn’t instant feedback from the RFC 7686 co-author! In response to what you said, in brief: I will propose that any subdomain data (which is presumably human-readable) be transmitted in a separate or affixed string, leaving Bech32 to deal with the pseudorandom blobs. Technical details follow.

> What would you propose to do with subdomains, like www.facebookcorewwwi.onion? Or is that outside the scope of your proposal?

Good question. That had briefly occurred to me; but I couldn’t figure out any feasible means to stuff subdomains into the Bech32 string, for the following reasons:

(0) RFC 1034 DNS names may be up to 255 octets in length. But Bech32 strings are more length-limited. After subtracting an HRP of “onion” (5 chars), the required separator of “1”, and the 6 characters of ECC checksum in the data part, the 90-character total length limit can only spare up to 78 characters for the onion address data. For both v2 and v3 onions, that’s more than sufficient. But even if the length limit could be raised, an excessively long string would destroy the human-friendliness which is the raison d’être for Bech32.

(I *infer* that this last may be one reason for the length limit. Although of course I can’t say for certain, I’ve read Greg Maxwell discussing some of the user testing involved in the standard’s development; and 90 chars seems to me the extreme of what a mortal flesh-and-blood creature could handle with such a string.)

(1) Bech32 is a base-32 encoding, only with a different alphabet than RFC 4648. Thus, it would be necessary to design another layer of encoding to most efficiently represent subdomain labels and the dot-separator with an alphabet of 38 characters [-0-9a-z.]. Worse, depending on which standards an implementation follows or ignores, that is not really a strict limitation on names seen in the wild. How should the Bech32 transformation deal with names containing an underscore “_”? Or other characters? I think it would only be safe to go with full octets. This would severely exacerbate the problem of (0) above.

(Aside: The special alphabet is bound to raise some eyebrows; so I will here quote its rationale from BIP 173: “The character set is chosen to minimize ambiguity according to [this](https://hissa.nist.gov/~black/GTLD/) visual similarity data, and the ordering is chosen to minimize the number of pairs of similar characters (according to the same data) that differ in more than 1 bit. As the checksum is chosen to maximize detection capabilities for low numbers of bit errors, this choice improves its performance under some error models.” From what I understand, a large amount of CPU time was spent crunching over the data in search of the most error-resistant alphabet.)

(2) Most subdomains are human-memorable: in your example, “www”. Coding them with Bech32 would decrease human-friendliness, which is the precise opposite of my objective in making this suggestion. Bech32 is great for helping humans deal with pseudorandom blobs; for those, it improves upon RFC 4648 Base32, Base64, hexadecimal, or in Bitcoin’s case, the old base58-based address encoding. But it is absolutely inappropriate as a coding format for text which humans can easily read, type, and remember.
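
As a rough check of the arithmetic in points (0) and (1), the following Python sketch computes the length budget for an HRP of “onion” and how many whole octets of subdomain data would still fit beside an onion address. The function names and the packing (subdomain octets appended directly to the address bits, with no length prefix or delimiter) are assumptions for illustration only, not part of any proposal.

```python
# Length budget for a Bech32 string with HRP "onion" (limits from BIP 173).
MAX_TOTAL = 90      # maximum length of a whole Bech32 string
SEPARATOR = 1       # the mandatory "1" between HRP and data part
CHECKSUM = 6        # checksum characters at the end of the data part
BITS_PER_CHAR = 5   # Bech32 is a base-32 encoding


def payload_chars(hrp: str = "onion") -> int:
    """Characters left for address data after HRP, separator and checksum."""
    return MAX_TOTAL - len(hrp) - SEPARATOR - CHECKSUM


def chars_for_bits(bits: int) -> int:
    """Bech32 characters needed to carry `bits` bits, rounded up."""
    return -(-bits // BITS_PER_CHAR)


def spare_octets(address_bits: int, hrp: str = "onion") -> int:
    """Whole octets of extra data that still fit beside the address.

    Hypothetical packing: extra octets are appended to the address bits
    with no length prefix or delimiter, which a real design would need.
    """
    spare_bits = payload_chars(hrp) * BITS_PER_CHAR - address_bits
    return spare_bits // 8


V2_BITS = 16 * 5    # v2 onion label: 16 base32 characters
V3_BITS = 56 * 5    # v3 onion label: 56 base32 characters

if __name__ == "__main__":
    print("payload characters:", payload_chars())
    for name, bits in (("v2", V2_BITS), ("v3", V3_BITS)):
        print(name, "needs", chars_for_bits(bits), "chars; room for",
              spare_octets(bits), "subdomain octets")
```

Under that packing, a v3 onion would leave room for only 13 octets of subdomain data, against the 255 octets that RFC 1034 allows for a full name.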

It is also important to consider relative impact in common usage. I observe that most .onions do not use subdomains. I do think that it’s important to support this use case; but if tradeoffs must be made, then I would optimize more for making that pseudorandom blob less brittle in human hands.

For the foregoing reasons, I will propose that subdomain data, if any, be kept separate from the Bech32 coding. It may be kept either in a separate string, or somehow affixed with a special delimiter before or after the Bech32 representation of the onion. Off-the-cuff, which of these looks best to you?


	www:onion19qzypww2zw3ykkkglr4tu9

	onion19qzypww2zw3ykkkglr4tu9:www

	another-level.www:onion19qzypww2zw3ykkkglr4tu9

(My choice of a delimiter here may be wrong, if we want the browser’s address bar to translate it. I should think more about this.)
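
To show that either affix position can be parsed unambiguously, here is a minimal Python sketch. The function name `split_onion_name`, the “:” delimiter, and the rule that the Bech32 part is whichever side starts with “onion1” are assumptions taken from the off-the-cuff examples above, not a settled format. The sketch does not verify the Bech32 checksum.

```python
from typing import List, Tuple

HRP_PREFIX = "onion1"   # HRP "onion" followed by the Bech32 separator "1"
DELIMITER = ":"         # tentative delimiter from the examples above


def split_onion_name(name: str) -> Tuple[List[str], str]:
    """Split a name into (subdomain labels, Bech32 onion string).

    Accepts the subdomain before or after the Bech32 part, or no
    subdomain at all. Raises ValueError on anything else.
    """
    parts = name.split(DELIMITER)
    if len(parts) > 2:
        raise ValueError("more than one delimiter")
    bech32_parts = [p for p in parts if p.startswith(HRP_PREFIX)]
    if len(bech32_parts) != 1:
        raise ValueError("expected exactly one Bech32 part")
    others = [p for p in parts if not p.startswith(HRP_PREFIX)]
    labels = others[0].split(".") if others else []
    if any(not label for label in labels):
        raise ValueError("empty subdomain label")
    return labels, bech32_parts[0]


if __name__ == "__main__":
    for example in ("www:onion19qzypww2zw3ykkkglr4tu9",
                    "onion19qzypww2zw3ykkkglr4tu9:www",
                    "another-level.www:onion19qzypww2zw3ykkkglr4tu9"):
        print(example, "->", split_onion_name(example))
```

One consequence of this rule is that a subdomain which itself begins with “onion1” would be rejected as ambiguous; a real design would have to decide how to handle that case.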

Finally, I think I should mention: Yes, “onion19qzypww2zw3ykkkglr4tu9” is not as pretty as “facebookcorewwwi.onion”. But few .onion sites have the compute power available to Facebook! Moreover, my proposal should apply to v3 onions, where nobody on Earth will be able to fully brute-force a human-memorable string.

I would advise users to stick to the DNS-style coding for facebookcorewwwi.onion, and take advantage of Bech32 as an alternative representation for http://yz7lpwfhhzcdyc5y.onion/, http://5nca3wxl33tzlzj5.onion/, and other such strings. Those are pure pain for users now, and it will only get worse when v3 onions get uptake. Error-correcting codes do not make the names any easier to read; but they certainly do help with the inevitable mistakes in all the use cases which involve voice, handwriting, manual typing, carrier pigeons, etc.
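
The re-encoding itself is small. The Python sketch below follows the reference algorithm published in BIP 173 (the character set and checksum constants come from there). The helper names `onion_to_bech32` and `bech32_verify`, the HRP “onion”, and the character-for-character mapping from the RFC 4648 alphabet are assumptions for illustration. The mapping works because both alphabets carry 5 bits per character. The sketch only detects errors; locating and correcting them needs more machinery.

```python
CHARSET = "qpzry9x8gf2tvdw0s3jn54khce6mua7l"    # Bech32 alphabet (BIP 173)
RFC4648 = "abcdefghijklmnopqrstuvwxyz234567"    # alphabet of .onion labels
GENERATOR = [0x3b6a57b2, 0x26508e6d, 0x1ea119fa, 0x3d4233dd, 0x2a1462b3]


def bech32_polymod(values):
    """BCH checksum over a sequence of 5-bit values (BIP 173)."""
    chk = 1
    for value in values:
        top = chk >> 25
        chk = (chk & 0x1ffffff) << 5 ^ value
        for i in range(5):
            chk ^= GENERATOR[i] if ((top >> i) & 1) else 0
    return chk


def bech32_hrp_expand(hrp):
    """Spread the human-readable part into 5-bit values for the checksum."""
    return [ord(c) >> 5 for c in hrp] + [0] + [ord(c) & 31 for c in hrp]


def bech32_encode(hrp, data):
    """Encode 5-bit values as HRP + "1" + data + 6 checksum characters."""
    values = bech32_hrp_expand(hrp) + data
    polymod = bech32_polymod(values + [0] * 6) ^ 1
    checksum = [(polymod >> 5 * (5 - i)) & 31 for i in range(6)]
    return hrp + "1" + "".join(CHARSET[d] for d in data + checksum)


def bech32_verify(text):
    """Return True if the checksum of a lowercase Bech32 string is valid."""
    hrp, _, data_part = text.rpartition("1")
    if not hrp or len(data_part) < 6:
        return False
    if any(c not in CHARSET for c in data_part):
        return False
    values = [CHARSET.index(c) for c in data_part]
    return bech32_polymod(bech32_hrp_expand(hrp) + values) == 1


def onion_to_bech32(label, hrp="onion"):
    """Hypothetical helper: re-encode a base32 onion label (without the
    ".onion" suffix) by mapping each character to its 5-bit value."""
    data = [RFC4648.index(c) for c in label.lower()]
    return bech32_encode(hrp, data)


if __name__ == "__main__":
    encoded = onion_to_bech32("facebookcorewwwi")
    print(encoded, bech32_verify(encoded))
    # Simulate a single mistyped character in the final position.
    typo = encoded[:-1] + ("q" if encoded[-1] != "q" else "p")
    print(typo, bech32_verify(typo))
```

For “facebookcorewwwi”, the 16 data characters come out as “9qzypww2zw3ykkkg”, the same data part as in the example string earlier in this message.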


--
nullius@xxxxxxxx | PGP ECC: 0xC2E91CD74A4C57A105F6C21B5A00591B2F307E0C
Bitcoin: bc1qcash96s5jqppzsp8hy8swkggf7f6agex98an7h | (Segwit nested:
3NULL3ZCUXr7RDLxXeLPDMZDZYxuaYkCnG)  (PGP RSA: 0x36EBB4AB699A10EE)
“‘If you’re not doing anything wrong, you have nothing to hide.’
No!  Because I do nothing wrong, I have nothing to show.” — nullius
