[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-bugs] #2680 [Metrics]: present bridge usage data so researchers can focus on the math



#2680: present bridge usage data so researchers can focus on the math
---------------------+------------------------------------------------------
 Reporter:  arma     |          Owner:  karsten 
     Type:  task     |         Status:  assigned
 Priority:  normal   |      Milestone:          
Component:  Metrics  |        Version:          
 Keywords:           |         Parent:          
   Points:           |   Actualpoints:          
---------------------+------------------------------------------------------

Comment(by arma):

 The "fingerprint" and "descriptor" in statuses.csv are always the same. I
 think you're printing "fingerprint" for both of them?

 I think the next step is to write a short overview of how to reconstruct
 these files to answer some research question. For example, say I want to
 get a list of all the countries that a given bridge has seen over time. I
 guess I want to iterate over all bridge fingerprints -- should I use the
 list of all fingerprints I find in statuses.csv or in descriptors.csv --
 should they be the same?

 So step zero, given a fingerprint, is to look it up in relays.csv and make
 sure it's not there. If it is, either ignore it or if we want to get
 fancier, ignore data from it close to the time it's in the relay list.

 Step one is to look it up in statuses.csv, get a set of descriptor hashes,
 discard all the ones whose third-to-last value is not TRUE, and skip
 duplicate hashes.

 Then step two is to take those remaining descriptor hashes and look them
 up in descriptors.csv, at which point I can learn which countries they saw
 unless the countries are all NA in which case we don't have data?

 And the optional step three is to take the timestamp from the status file
 and look up the fingerprint in assignments.csv to decide if it's http,
 email, or unassigned?

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2680#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs