[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-dev] Get Stem and zoossh to talk to each other

To: tor-dev@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-dev] Get Stem and zoossh to talk to each other
From: tordev123@xxxxxxxxxxxxx
Date: Mon, 17 Aug 2015 08:49:08 -0400
Delivered-to: archiver@xxxxxxxx
Delivery-date: Mon, 17 Aug 2015 08:49:23 -0400
Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=N1-0105; d=Safe-mail.net; b=hs3Uj6sxLsncNPIinEkLrewd5MBN2TFQt4bwM/XG/OfQHFcYomxy+lU3QiqTCBIS 0gi3g6+Js5Jl9+9xQE2dDRvzJ0g9+Xo67SebgpwRySbNVhotxG05gEp4TDFS9tyu Ou3R90AJJNGsio778GovTSfk0I5g21FT3XkBNO0Hxdc=;
List-archive: <http://lists.torproject.org/pipermail/tor-dev/>
List-help: <mailto:tor-dev-request@lists.torproject.org?subject=help>
List-id: discussion regarding Tor development <tor-dev.lists.torproject.org>
List-post: <mailto:tor-dev@lists.torproject.org>
List-subscribe: <https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=subscribe>
List-unsubscribe: <https://lists.torproject.org/cgi-bin/mailman/options/tor-dev>, <mailto:tor-dev-request@lists.torproject.org?subject=unsubscribe>
Reply-to: tor-dev@xxxxxxxxxxxxxxxxxxxx
Sender: "tor-dev" <tor-dev-bounces@xxxxxxxxxxxxxxxxxxxx>

> > zoossh's test framework says that it takes 36364357 nanoseconds to
> > lazily parse a consensus that is cached in memory (to eliminate the I/O
> > bottleneck).  That amounts to approximately 27 consensuses a second.
> >
> > I used the following simple Python script to get a similar number for
> > [...]
> > This script manages to parse 24 consensus files in ~13 seconds, which
> > amounts to 1.8 consensuses a second.  Let me know if there's a more
> > efficient way to do this in Stem.
> 
> Interesting! First thought is 'wonder if zoossh is even reading the
> file content'. Couple quick things to try are...
> 
> with open(file_name) as consensus_file:
>   consensus_file.read()
> 
> ... to see how much time is disk IO verses parsing. Second is to try
> doing something practical (say, count the number of relays with the
> exit flag). Stem does some bytes => unicode normalization which might
> account for some difference but other than that I'm at a loss for what
> would be taking the time.

Why are you surprised? Python is a very, very slow language. My very simple unoptimized C++ parsing code is more than 10x faster than Stem (parsing from memory). Go has some nice string parsing features, so zoossh should have an easy time to get fast and simple parsing code.
_______________________________________________
tor-dev mailing list
tor-dev@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev

Follow-Ups:
- Re: [tor-dev] Get Stem and zoossh to talk to each other
  - From: Damian Johnson

Prev by Author: [tor-dev] Alpha/Beta/Release cycle
Next by Author: [tor-dev] Number of directory connections
Previous by thread: Re: [tor-dev] Get Stem and zoossh to talk to each other
Next by thread: Re: [tor-dev] Get Stem and zoossh to talk to each other
Index(es):
- Author
- Thread