[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]
Re: [or-cvs] [metrics-utils/master 2/4] Match full Torbutton user agents.
On 11/25/10 10:22 PM, Robert Ransom wrote:
> On Thu, 25 Nov 2010 18:47:07 +0000 (UTC)
> karsten@xxxxxxxxxxxxxx wrote:
>
>> Author: Karsten Loesing <karsten.loesing@xxxxxxx>
>> Date: Thu, 25 Nov 2010 19:42:49 +0100
>> Subject: Match full Torbutton user agents.
>> Commit: 157c0dfe0722113bef50cea73be74600bde9414e
>>
>> ---
>> visitor/visitor.py | 16 ++++++++--------
>> 1 files changed, 8 insertions(+), 8 deletions(-)
>>
>> diff --git a/visitor/visitor.py b/visitor/visitor.py
>> index 06288b1..466bd52 100644
>> --- a/visitor/visitor.py
>> +++ b/visitor/visitor.py
>> @@ -16,18 +16,18 @@ from cStringIO import StringIO
>> # regexes used in the script
>> IP_RE = re.compile(r'(\d+\.){3}\d+')
>> APACHE_DATETIME = re.compile(r'\[(\d{2}/\w{3}/\d{4}:\d{2}:\d{2}:\d{2}) ([+-]\d{4})\]')
>> -TOR_USERAGENTS = [('torbutton1_2_0', re.compile(r'Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> +TOR_USERAGENTS = [('torbutton1_2_0', re.compile(r'^"Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> r'[a-z]{2}-[A-Z]{2}; rv\:1\.8\.1\.16\) '
>> - r'Gecko/20080702 Firefox/2\.0\.0\.16')),
>> - ('torbutton1_2_0rc1', re.compile(r'Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> + r'Gecko/20080702 Firefox/2\.0\.0\.16"$')),
>> + ('torbutton1_2_0rc1', re.compile(r'^"Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> r'en-US; rv\:1\.8\.1\.14\) '
>> - r'Gecko/20080404 Firefox/2\.0\.0\.14')),
>> - ('torbutton1_2_1', re.compile(r'Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> + r'Gecko/20080404 Firefox/2\.0\.0\.14"$')),
>> + ('torbutton1_2_1', re.compile(r'^"Mozilla/5\.0 \(Windows; U; Windows NT 5\.1; '
>> r'en-US; rv\:1\.9\.0\.7\) '
>> - r'Gecko/2009021910 Firefox/3\.0\.7')),
>> - ('torbutton1_2_5', re.compile(r'Mozilla/5\.0 \(Windows; U; Windows NT 6\.1; '
>> + r'Gecko/2009021910 Firefox/3\.0\.7"$')),
>> + ('torbutton1_2_5', re.compile(r'^"Mozilla/5\.0 \(Windows; U; Windows NT 6\.1; '
>> r'[a-z]{2}-[A-Z]{2}; rv:1\.9\.2\.3\) '
>> - r'Gecko/20100401 Firefox/3\.6\.3'))
>> + r'Gecko/20100401 Firefox/3\.6\.3"$'))
>> ]
>>
>>
>
> This list is not complete -- TAILS 0.5, and presumably other
> installations of Torbutton 1.2.5, produce the following User-Agent
> string:
>
> Mozilla/5.0 (Windows; U; Windows NT 6.1; chrome://global/locale/intl.properties; rv:1.9.2.3) Gecko/20100401 Firefox/3.6.3
>
> See <https://amnesia.boum.org/security/Iceweasel_exposes_a_rare_User-Agent/>.
>
> (Nobody should still be using TAILS 0.5, due to this and other security
> issues, but your script is intended to be useful for analyzing older
> logs as well as new ones.)
I just added this user-agent string:
https://gitweb.torproject.org/metrics-utils.git/commitdiff/e3e9ec8
Thanks!
Karsten