[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]
Re: [tor-bugs] #8182 [EFF-HTTPS Everywhere]: Explicitly figure out handling of internationalized domain names
#8182: Explicitly figure out handling of internationalized domain names
----------------------------------+-----------------------------------------
Reporter: schoen | Owner: pde
Type: task | Status: new
Priority: major | Milestone:
Component: EFF-HTTPS Everywhere | Version:
Keywords: | Parent:
Points: | Actualpoints:
----------------------------------+-----------------------------------------
Comment(by mikkoharhanen):
I created rulesets for unused domain 'Ã.fi'. In these rulesets, the letter
a-umlaut (Ã) was created with the following methods:
- html entities
- punycodes
- UTF-8 characters
- ISO-8859-15 characters
The test URL reveals the rules 'from' and 'to' fields. For example, with
the URL 'http://Ã.fi/entity-to-puny/' the from field uses html entities
and to field uses punycodes to indicate a-umlaut. If the rule works, the
address should be redirected to https.
a-uni.xml (file encoding: UTF-8)
{{{
[[Typed URL]] [[Resulted URL]]
http://Ã.fi/entity-to-entity/ --> [OK] https://Ã.fi/entity-to-entity/
http://Ã.fi/entity-to-puny/ --> [OK] https://Ã.fi/entity-to-puny/
http://Ã.fi/entity-to-uni/ --> [FAIL] https://ÃÂ.fi/entity-to-uni/
http://Ã.fi/puny-to-puny/ --> [FAIL] http://www.Ã.fi/puny-to-puny/
http://Ã.fi/puny-to-entity/ --> [FAIL] http://www.Ã.fi/puny-to-entity/
http://Ã.fi/puny-to-uni/ --> [FAIL] http://www.Ã.fi/puny-to-uni/
http://Ã.fi/uni-to-uni/ --> [FAIL] http://www.Ã.fi/uni-to-uni/
http://Ã.fi/uni-to-entity/ --> [FAIL] http://www.Ã.fi/uni-to-entity/
http://Ã.fi/uni-to-puny/ --> [FAIL] http://www.Ã.fi/uni-to-puny/
}}}
{{{
[[Typed URL]] [[Resulted URL]]
http://ä.fi/entity-to-entity/ --> [FAIL]
http://www.&.com/#228;.fi/entity-to-entity/
http://ä.fi/entity-to-puny/ --> [FAIL]
http://www.&.com/#228;.fi/entity-to-puny/
http://ä.fi/entity-to-uni/ --> [FAIL]
http://www.&.com/#228;.fi/entity-to-uni/
http://xn--4ca.fi/entity-to-entity/ --> [OK] https://Ã.fi/entity-to-
entity/
http://xn--4ca.fi/entity-to-puny/ --> [OK] https://Ã.fi/entity-to-
puny/
http://xn--4ca.fi/entity-to-uni/ --> [FAIL] https://ÃÂ.fi/entity-
to-uni/
http://xn--4ca.fi/puny-to-puny/ --> [FAIL] http://www.Ã.fi/puny-
to-puny/
http://xn--4ca.fi/puny-to-entity/ --> [FAIL] http://www.Ã.fi/puny-
to-entity/
http://xn--4ca.fi/puny-to-uni/ --> [FAIL] http://www.Ã.fi/puny-
to-uni/
http://ÃÂ.fi/uni-to-uni/ --> [FAIL] http://www.ÃÂ.fi/uni-
to-uni/
http://ÃÂ.fi/uni-to-entity/ --> [FAIL] http://www.ÃÂ.fi/uni-
to-entity/
http://ÃÂ.fi/uni-to-puny/ --> [FAIL] http://www.ÃÂ.fi/uni-
to-puny/
}}}
***
a-latin.xml (file encoding: ISO-8859-15)
{{{
[[Typed URL]] [[Resulted URL]]
http://Ã.fi/latin-to-latin/ --> [OK] https://Ã.fi/latin-to-latin/
http://Ã.fi/latin-to-entity/ --> [OK] https://Ã.fi/latin-to-entity/
http://Ã.fi/latin-to-puny/ --> [OK] https://Ã.fi/latin-to-puny/
http://Ã.fi/entity-to-latin/ --> [OK] https://Ã.fi/entity-to-latin/
http://Ã.fi/puny-to-latin/ --> [FAIL] http://www.Ã.fi/puny-to-latin/
http://xn--4ca.fi/latin-to-latin/ --> [OK] https://Ã.fi/latin-to-
latin/
http://xn--4ca.fi/entity-to-latin/ --> [OK] https://Ã.fi/entity-to-
latin/
http://xn--4ca.fi/puny-to-latin/ --> [FAIL] http://www.Ã.fi/puny-
to-latin/
}}}
Conclusions:
- HTML entities always work
- Latin1 characters always work
- Unicode characters never work
- Puny-codes work in output ('to') fields but not in input ('from') fields
- Firefox converts punycodes before HTTPS Everywhere has the opportunity
to redirect them
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/8182#comment:1>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs