Re: [tor-bugs] #17786 [Metrics]: "France, Metropolitan" is unused?
#17786: "France, Metropolitan" is unused?
---------------------+--------------------------
Reporter: arma | Owner: karsten
Type: defect | Status: assigned
Priority: Low | Milestone:
Component: Metrics | Version:
Severity: Normal | Resolution:
Keywords: easy | Actual Points:
Parent ID: | Points:
Sponsor: |
---------------------+--------------------------
Changes (by karsten):
* owner: => karsten
* status: new => assigned
Comment:
''tl;wr: I'm going to remove country code FX (France, Metropolitan) and
add country codes BQ (Bonaire, Sint Eustatius and Saba), CW (Curaçao),
SX (Sint Maarten), and XK (Kosovo) to Metrics by January 25, 2016.''
Great question. Sorry for taking so long to respond, but it turns out the
answer was even more difficult than I anticipated. I had to compare six
different sources of country codes to answer this question, in particular
the part about other country codes. Here are the six different country
code lists I looked at:
1. Metrics: The [https://gitweb.torproject.org/metrics-
web.git/tree/website/rserve/graphs.R R file used by Metrics] to include
country names in graphs;
2. clients: The [https://metrics.torproject.org/clients-data.html user
number estimates] file produced by Metrics from looking at extra-info
descriptors written by Tor relays;
3. geoip: The latest
[https://gitweb.torproject.org/tor.git/tree/src/config geoip and geoip6
files] shipped with little-t-tor;
4. MaxMind: MaxMind's list of
[http://dev.maxmind.com/geoip/legacy/codes/iso3166/ ISO 3166 Country
Codes] used in GeoIP legacy databases that we used in little-t-tor until
February 2014;
5. MaxMind2: MaxMind's country codes used in
[http://dev.maxmind.com/geoip/geoip2/whats-new-in-geoip2/ GeoIP2
databases] used by little-t-tor from February 2014 on; and
6. Wikipedia: Wikipedia's
[https://en.wikipedia.org/wiki/ISO_3166-1_alpha-2 ISO 3166-1 alpha-2]
page, in particular the decoding table.
In the following I'm going through all country codes that are missing in
at least one of these lists, skipping most of the user-assigned country
codes listed in Wikipedia. The numbers in parentheses after each code
refer to the lists above that contain it. I'm starting with country codes
contained in Metrics, which include the France, Metropolitan code
mentioned above, and from those I'm starting with the ones I think we can
drop, which turns out to be just one:
- FX (1, 6; France, Metropolitan): Wikipedia says this country code was
reserved on request of France, but it's neither used by MaxMind nor by
little-t-tor. We can safely drop this country code from Metrics.
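A minimal sketch of this kind of comparison, assuming each country code is mapped to the set of list numbers that contain it. The membership data below is copied from a few bullets in this comment; the names `MEMBERSHIP`, `missing_from`, and `only_in_metrics_and_wikipedia` are made up for illustration and are not part of Metrics.

```python
# List numbers as used in this comment:
# 1 Metrics, 2 clients, 3 geoip, 4 MaxMind, 5 MaxMind2, 6 Wikipedia.
ALL_LISTS = frozenset(range(1, 7))
METRICS = 1

# Membership copied from a few bullets in this comment; not the full table.
MEMBERSHIP = {
    "FX": {1, 6},
    "AN": {1, 2, 6},
    "BV": {1, 2, 4, 6},
    "BQ": {2, 3, 4, 5, 6},
    "XK": {2, 3, 5},
}


def missing_from(code):
    """Return the sorted list numbers that do not contain this code."""
    return sorted(ALL_LISTS - MEMBERSHIP[code])


def only_in_metrics_and_wikipedia(code):
    """True if no list other than Metrics (1) and Wikipedia (6) has the code."""
    return MEMBERSHIP[code] == {1, 6}


for code in sorted(MEMBERSHIP):
    status = "in Metrics" if METRICS in MEMBERSHIP[code] else "not in Metrics"
    print(code, status, "missing from", missing_from(code))
```

A code like FX that shows up only in lists 1 and 6 is a candidate for removal, because no data source actually reports it.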
Here are more country codes in Metrics that are not contained in all other
lists, but I think we should keep them all:
- AN (1, 2, 6; Netherlands Antilles): Wikipedia says this country code
was assigned until 2012, and there are still relays reporting users from
this country code. I'd say we should keep this, because we're not only
graphing current user numbers but also user numbers from a few years ago.
- BV (1, 2, 4, 6; Bouvet Island): This country code is still assigned, and
there have been Tor users coming from it in the past; there are just no IP
address ranges for it in the current geoip file. We should keep this.
- EH (1, 2, 4, 6; Western Sahara): Same as BV, keep it.
- HM (1, 4, 6; Heard Island and McDonald Islands): There are currently no
IP address ranges using this country code, but there might be in the
future. We should keep this.
And here are country codes that are not contained in Metrics, starting
with the ones we can safely ignore in the future:
- ?? (2; Unknown): Tor uses this country code whenever it cannot resolve
an IP address. I think there's no need to draw a graph with users coming
from unknown countries, because there may be plenty of reasons for that,
and the graph won't reveal what they are.
- A1 (2, 4; Anonymous Proxy): MaxMind uses this country code for
anonymous proxies, which is not a specific country. We filter out these
IP address ranges before putting their database into little-t-tor, so
these reported users come from relays using their own database file. We
can safely ignore these, for the same reason as ignoring unknown countries
mentioned before.
- A2 (2, 4; Satellite Provider): Same as A1, keep ignoring.
- AA (2; User-assigned): Wikipedia says this country code is free for
assignment at the disposal of users, so we can safely ignore it.
- AP (2, 4, 6; Asia/Pacific Region): MaxMind uses this code for
Asia/Pacific Region when a specific country code has not been designated.
We're ignoring this country code in Metrics and little-t-tor, which I
think makes sense.
- CS (2, 6; Serbia and Montenegro): Wikipedia says this country code was
assigned to Serbia and Montenegro, which have been distinct countries
since 2006. This predates the user number estimates on Metrics, so I'd say
we'd better stay away from this political minefield by leaving this
country code out of Metrics.
- EU (2, 4, 6; Europe): See AP, ignore.
- O1 (4; Other Country): MaxMind used this country code in the past for
other countries, but there have not been any Tor users reported from that
country code. We can ignore this.
- RI (2, 6; Indonesia): Wikipedia lists this country code under
indeterminate reservations, and it's neither used by Metrics nor by
little-t-tor. Let's ignore this.
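To make the "ignore" decisions above concrete, here is a small sketch that drops those codes before graphing. It assumes the user estimates are available as `(country_code, users)` pairs; that row shape, the names `IGNORED_CODES` and `graphable`, and the sample numbers are all made up for illustration and do not reflect the actual Metrics code or data.

```python
# Codes this comment proposes to keep ignoring in Metrics.
IGNORED_CODES = {"??", "A1", "A2", "AA", "AP", "CS", "EU", "O1", "RI"}


def graphable(rows):
    """Keep only rows whose country code is not in the ignore set.

    Codes are compared case-insensitively, since the case used in the
    input is an assumption here.
    """
    return [(cc, users) for cc, users in rows
            if cc.upper() not in IGNORED_CODES]


# Made-up sample rows, only to show the filtering.
sample = [("DE", 100), ("a1", 5), ("??", 7), ("XK", 3)]
print(graphable(sample))
```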
And finally, here are country codes that are not yet in Metrics but which
should be there:
- BQ (2, 3, 4, 5, 6; Bonaire, Sint Eustatius and Saba): This country code
was assigned in 2010, and Metrics is missing it. We should add it.
- CW (2, 3, 4, 5, 6; Curaçao): See BQ, add it.
- SX (2, 3, 4, 5, 6; Sint Maarten): See BQ, add it.
- XK (2, 3, 5; Kosovo): Wikipedia says that XK is a user-assigned code
that is being used by the European Commission, Switzerland, the Deutsche
Bundesbank, SWIFT, and other organizations as a temporary country code for
Kosovo. MaxMind adds this country code in its GeoIP2 format, and there
are actual Tor users from that country code. We should add it to Metrics,
too.
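The planned changes boil down to one removal and four additions. A sketch of that, assuming the country list is a simple mapping from code to name; the real list lives in the R file linked above, and the `countries` dictionary and `apply_changes` function here are hypothetical stand-ins, not the actual Metrics code.

```python
# Hypothetical stand-in for the country list in Metrics; the real list
# contains many more entries.
countries = {
    "FR": "France",
    "FX": "France, Metropolitan",
}

REMOVE = {"FX"}
ADD = {
    "BQ": "Bonaire, Sint Eustatius and Saba",
    "CW": "Curaçao",
    "SX": "Sint Maarten",
    "XK": "Kosovo",
}


def apply_changes(current):
    """Return a new mapping with REMOVE dropped and ADD merged in."""
    updated = {cc: name for cc, name in current.items() if cc not in REMOVE}
    updated.update(ADD)
    return updated


updated = apply_changes(countries)
print(sorted(updated))
```

Returning a new mapping rather than editing in place keeps the old list around for comparison, which helps when checking that nothing else changed.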
I'll leave this ticket in assigned state for a week and then make the
changes stated above.
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/17786#comment:1>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online