[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

[tor-bugs] #21255 [Metrics/Metrics website]: fraction value computation for clients.csv and its descriptions don't seem to match



#21255: fraction value computation for clients.csv and its descriptions don't seem
to match
-----------------------------------------+--------------------------
     Reporter:  iwakeh                   |      Owner:  metrics-team
         Type:  defect                   |     Status:  new
     Priority:  Medium                   |  Milestone:
    Component:  Metrics/Metrics website  |    Version:
     Severity:  Normal                   |   Keywords:
Actual Points:                           |  Parent ID:
       Points:                           |   Reviewer:
      Sponsor:                           |
-----------------------------------------+--------------------------
 Maybe, I'm missing something obvious, but the calculation for `frac` in
 `clients.csv` doesn't seem to calculate what is stated in the web-site
 description (nor in the sql comment),
 [https://metrics.torproject.org/stats.html#clients web-site]:
   "frac: Fraction of relays or bridges in percent that the estimate is
 based on."


 [https://gitweb.torproject.org/metrics-web.git/tree/modules/clients/init-
 userstats.sql?id=14840ed2db075bbc1d0991b974becc3826a50969#n553 SQL
 excerpt]:
 {{{
 ...
   -- Estimated fraction of nodes reporting directory requests, which is
   -- used to extrapolate observed requests to estimated total requests in
   -- the network.  The closer this fraction is to 1.0, the more precise
   -- the estimation.
   CAST(a.frac * 100 AS INTEGER) AS frac,

   -- Finally, the estimate number of users.
   CAST(a.rrx / (a.frac * 10) AS INTEGER) AS users

   -- Implement the estimation method in a subquery, so that the ugly
   -- formula only has to be written once.
   FROM (
     SELECT date, node, country, transport, version, rrx, nrx,
            (hrh * nh + hh * nrh) / (hh * nn) AS frac
 <--------------------<<<
     FROM aggregated WHERE hh * nn > 0.0) a

   -- Only include estimates with at least 10\% of nodes reporting
 directory
   -- request statistics.
   WHERE a.frac BETWEEN 0.1 AND 1.0
 ...
 }}}

 The arrow points at the **fraction of reported directory requests** (or
 responses for bridges) of the total (estimated) sum of directory requests
 (responses for bridges), but not the **fraction of nodes reporting
 directory requests** of the total number of nodes.

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/21255>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs