[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-dev] Guardiness: Yet another external dirauth script

To: tor-dev@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-dev] Guardiness: Yet another external dirauth script
From: George Kadianakis <desnacked@xxxxxxxxxx>
Date: Wed, 17 Sep 2014 16:54:38 +0300
Cc: Damian Johnson <atagar@xxxxxxxxxxxxxx>
Delivered-to: archiver@xxxxxxxx
Delivery-date: Wed, 17 Sep 2014 09:55:25 -0400
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/simple; d=riseup.net; s=squak; t=1410962107; bh=KlHCL52ZimA8p/J9HrphGjR+Dx0kDKEQNADqNebE6Ek=; h=From:To:Cc:Subject:In-Reply-To:References:Date:From; b=b8sRGuFwGVDtAHRsdm25V19g2aK3fmHChghQoAoBT4WM0wU95OGDaJhOsYHXidzZk IbPUL0aEONJnqvOQifjtj5HmZm4FRUAtNqoPKayajHe1+daITMl8zQeEqFKdJYjfZj TzNJPgk9iqQhXecblO5uN4szr304xV4gSUKL2B70=
In-reply-to: <878ulimqu5.fsf@xxxxxxxxxx> (George Kadianakis's message of "Wed, 17 Sep 2014 14:25:22 +0300")
List-archive: <http://lists.torproject.org/pipermail/tor-dev/>
List-help: <mailto:tor-dev-request@lists.torproject.org?subject=help>
List-id: discussion regarding Tor development <tor-dev.lists.torproject.org>
List-post: <mailto:tor-dev@lists.torproject.org>
List-subscribe: <https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=subscribe>
List-unsubscribe: <https://lists.torproject.org/cgi-bin/mailman/options/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=unsubscribe>
References: <87d2avps68.fsf@xxxxxxxxxx> <CAJdkzEPONn5yYOu7tyWQax3NwKCzwOEqMvxfvep0w9obb5_c6w@xxxxxxxxxxxxxx> <878ulimqu5.fsf@xxxxxxxxxx>
Reply-to: tor-dev@xxxxxxxxxxxxxxxxxxxx
Sender: "tor-dev" <tor-dev-bounces@xxxxxxxxxxxxxxxxxxxx>
User-agent: Microsoft Outlook Express 6.00.2900.5843

George Kadianakis <desnacked@xxxxxxxxxx> writes:

> Damian Johnson <atagar@xxxxxxxxxxxxxx> writes:
>
>>> - Q: Why do you slow stem instead of parsing consensuses with Python on your own?
>>>
>>> This is another part where I might have taken the wrong design
>>> decision, but I decided to not get into the consensus parsing business
>>> and just rely on stem.
>>>
>>> This is also because I was hoping to use stem to verify consensus
>>> signatures. However, now that we might use Daniel's patch to populate
>>> our consensus database, maybe we don't need to treat consensuses as
>>> untrusted anymore.
>>>
>>> If you think that I should try to parse the consensuses on my own,
>>> please tell me and I will give it a try. Maybe it will be
>>> fast. Definitely not as fast as summary files, but maybe we can parse
>>> 3 months worth of consesuses in 15 to 40 seconds.
>>
>> I'm not sure why you think it was the wrong choice. If Stem isn't
>> providing you the performance you want then seems like speeding it up
>> is the right option rather than writing your own parser. That is, of
>> course, unless you're looking for something highly specialized in
>> which case have fun.
>>
>> Nick improved parsing performance by around 30% in response to this...
>>
>>   https://trac.torproject.org/projects/tor/ticket/12859
>>
>> Between that and turning off validation I'd be a little curious where
>> the time is going if it's still too slow for you.
>
> Indeed, our use case is quite specialized. The only thing the
> guardiness script cares about is whether relays have the guard
> flag. No other consensus parsing actually needs to happen.
>
> However, you have a point that stem performance could be improved and
> I will look a bit more into stem parsing and see what I can do.
>
> That said, currently stem parses (with validation enabled) 24
> consensuses in 25 seconds. That's one consensus per second.
> If we are aiming for 7000 consenuses in less than a minute, we need to
> parse 120~ consensuses a second. That will probably require quite some
> optimization in stem, I think.

FWIW, turning off validation helps a bit but not too much.  For
example, my laptop parsing 24 consensuses with validation takes 25
seconds, and if we disable validation it takes 22 seconds.

This means that to reach the rate of 120~ consensuses a second with
parse_file(), we need to make it 100 times faster or so. This sounds
much harder than 30% performance increase :/ 
_______________________________________________
tor-dev mailing list
tor-dev@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev

Follow-Ups:
- Re: [tor-dev] Guardiness: Yet another external dirauth script
  - From: Damian Johnson

References:
- [tor-dev] Guardiness: Yet another external dirauth script
  - From: George Kadianakis
- Re: [tor-dev] Guardiness: Yet another external dirauth script
  - From: Damian Johnson
- Re: [tor-dev] Guardiness: Yet another external dirauth script
  - From: George Kadianakis

Prev by Author: Re: [tor-dev] [RFC] Proposal draft: The move to a single guard node
Next by Author: Re: [tor-dev] Guardiness: Yet another external dirauth script
Previous by thread: Re: [tor-dev] Guardiness: Yet another external dirauth script
Next by thread: Re: [tor-dev] Guardiness: Yet another external dirauth script
Index(es):
- Author
- Thread