[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

[tor-commits] [metrics-web/release] Include client numbers with fractions up to 110%.



commit 3b5a2fb6c87a98508ab1633792f5c5176f6ffc31
Author: Karsten Loesing <karsten.loesing@xxxxxxx>
Date:   Fri Nov 30 10:22:35 2018 +0100

    Include client numbers with fractions up to 110%.
    
    Turns out that almost all relays report directory-request statistics
    these days, including a small number of relays that temporarily drop
    out of the consensus. We're now accepting up to 10% of those
    additional statistics.
    
    See #28305 for more details.
---
 src/main/sql/clients/init-userstats.sql | 9 +++++++--
 1 file changed, 7 insertions(+), 2 deletions(-)

diff --git a/src/main/sql/clients/init-userstats.sql b/src/main/sql/clients/init-userstats.sql
index 38521f2..cf2b620 100644
--- a/src/main/sql/clients/init-userstats.sql
+++ b/src/main/sql/clients/init-userstats.sql
@@ -678,8 +678,13 @@ CREATE OR REPLACE VIEW estimated AS SELECT
     FROM aggregated WHERE hh * nn > 0.0) a
 
   -- Only include estimates with at least 10% of nodes reporting directory
-  -- request statistics.
-  WHERE a.frac BETWEEN 0.1 AND 1.0
+  -- request statistics, and exclude estimates with fractions higher than 110%.
+  -- The upper bound is 110% and not 100%, because there can be relays reporting
+  -- statistics that temporarily didn't make it into the consensus, and we
+  -- accept up to 10% of those additional statistics. However, there needs to be
+  -- some upper bound to exclude obvious outliers with fractions of 120%, 150%,
+  -- or even 200%. See #28305 for more details.
+  WHERE a.frac BETWEEN 0.1 AND 1.1
 
   -- Skip estimates that are as recent as yesterday or newer.
   AND a.date < current_date - 1



_______________________________________________
tor-commits mailing list
tor-commits@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-commits