[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"

To: tor-dev@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
From: teor <teor2345@xxxxxxxxx>
Date: Wed, 14 Feb 2018 11:17:50 +1100
Delivered-to: archiver@xxxxxxxx
Delivery-date: Tue, 13 Feb 2018 19:18:27 -0500
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:mime-version:subject:date:references:to:in-reply-to:message-id; bh=74yAyCMSsY5RWgaOTRRz6T7dIrw16CwwqoRjyS1/PXU=; b=f/drxGkb1onFG6rR4mKdmEQvnlbowgGumBimu+cc8pMSJkNvhGa5W2JqDKAmxybd76 gYEdqv293twWN8tI1X/a91z1Ug5YTV/puQrdC3nNXlEjNpjTpYyZRyJiXDImILj3HS13 n1jts+2qqu2Vxq8yODWB0JL6j4IFV6sfQ3u5FBMEwTtBkfIGiciuzGF2Ow+Gb3bwD+Xl MTHyu/GLOrPCBkPsVGPvUVsfMrtauVsBgprsf2Kn/JGf4jxJSkH5MNSscpfjBTruyqsb AjwY3lNWulY6cKDNYOImZZKNbJNmV53lCvOvRlyxLV3sSMA7JBfssggQtCQAVIiYql6Q fflQ==
In-reply-to: <CAJdkzEPbYRqSSx-=Ukao10NiOxDVkwEkq4hkdjZh5y7BqyD+BQ@mail.gmail.com>
List-archive: <http://lists.torproject.org/pipermail/tor-dev/>
List-help: <mailto:tor-dev-request@lists.torproject.org?subject=help>
List-id: discussion regarding Tor development <tor-dev.lists.torproject.org>
List-post: <mailto:tor-dev@lists.torproject.org>
List-subscribe: <https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=subscribe>
List-unsubscribe: <https://lists.torproject.org/cgi-bin/mailman/options/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=unsubscribe>
References: <20171209011708.GG1550@patternsinthevoid.net> <20180129200717.GC1368@patternsinthevoid.net> <20180129203631.GE1368@patternsinthevoid.net> <20180205174300.GK28008@patternsinthevoid.net> <20180205201643.GM28008@patternsinthevoid.net> <20180212235522.GA28876@patternsinthevoid.net> <d1d1f6a1-3c72-7a8a-8fd8-bad8d3d101e9@torproject.org> <CAJdkzEPbYRqSSx-=Ukao10NiOxDVkwEkq4hkdjZh5y7BqyD+BQ@mail.gmail.com>
Reply-to: tor-dev@xxxxxxxxxxxxxxxxxxxx
Sender: "tor-dev" <tor-dev-bounces@xxxxxxxxxxxxxxxxxxxx>


> On 14 Feb 2018, at 11:03, Damian Johnson <atagar@xxxxxxxxxxxxxx> wrote:
> 
>> For the metrics tools there are some guidelines on this we can follow:
>> https://docs.oracle.com/javase/tutorial/i18n/text/design.html. The other
>> language would be Python (for stem), but Python developers have probably
>> got a good understanding of unicode/str/bytes by now. (In Python 3: when
>> using UTF-8, BOM will not be stripped and will be interpreted as data,
>> and you can have a NUL in a str).
> 
> Hi Iain. Actually, for Stem I'm really looking forward to this too.
> Stem has special handling for the contact and platform fields (iirc
> the only spot non-ascii content can presently appear). Stem's parsers
> and API will be simplified once everything is uniformly utf-8. :P
> 
> Possibly a stupid question but any reason not to require the whole
> descriptor document to be printable characters?

Requiring printable ASCII throughout the document means that people
can't spell their names and email addresses correctly in contact lines.

Requiring printable unicode introduces a dependency on a particular
unicode version, because we don't know if unallocated blocks will be
printable or not.

I think we could make platform lines printable ASCII without losing
much. Unless there are platforms that have non-ASCII names?

T

--
Tim Wilson-Brown (teor)

teor2345 at gmail dot com
PGP C855 6CED 5D90 A0C5 29F6 4D43 450C BA7F 968F 094B
ricochet:ekmygaiu4rzgsk6n
------------------------------------------------------------------------

Attachment: signature.asc
Description: Message signed with OpenPGP

_______________________________________________
tor-dev mailing list
tor-dev@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev

References:
- Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
  - From: isis agora lovecruft
- Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
  - From: isis agora lovecruft
- Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
  - From: isis agora lovecruft
- Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
  - From: Iain Learmonth
- Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
  - From: Damian Johnson

Prev by Author: Re: [tor-dev] Enhancement for Tor 0.3.4.x
Next by Author: Re: [tor-dev] Enhancement for Tor 0.3.4.x
Previous by thread: Re: [tor-dev] [prop-meeting] [prop#285] "Directory documents should be standardized as UTF-8"
Next by thread: [tor-dev] [release] Onionoo 5.0-1.10.0
Index(es):
- Author
- Thread