[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: latest svn rev 11720 become cpu hungry

To: or-talk@xxxxxxxxxxxxx
Subject: Re: latest svn rev 11720 become cpu hungry
From: Nick Mathewson <nickm@xxxxxxxxxxxxx>
Date: Tue, 2 Oct 2007 13:30:35 -0400
Delivered-to: archiver@xxxxxxxx
Delivered-to: or-talk-outgoing@xxxxxxxx
Delivered-to: or-talk@xxxxxxxx
Delivery-date: Tue, 02 Oct 2007 13:30:56 -0400
In-reply-to: <20071002173430.9217.LHZHOU@xxxxxxxx>
Mail-followup-to: Nick Mathewson <nickm@xxxxxxxxxxxxx>, or-talk@xxxxxxxxxxxxx
References: <20070930214000.5E28.LHZHOU@xxxxxxxx> <20071002012409.GA30402@xxxxxxxxxxxxxxxxxx> <20071002173430.9217.LHZHOU@xxxxxxxx>
Reply-to: or-talk@xxxxxxxxxxxxx
Sender: owner-or-talk@xxxxxxxxxxxxx
User-agent: Mutt/1.4.2.1i

On Tue, Oct 02, 2007 at 05:36:02PM +0800, Li-Hui Zhou wrote:
> I've noticed the bugfix and yeah, it's been a LOT better.
> 
> Old bug? Why it not been triggered until recent svn?

Short version: There was an optimization that worked around it, but
that optimization no longer applied.

Long version: The format of a "router list," as we parsed it used to
be:
     (ExtraInfo | RouterDesc)*
In other words, it had any number of extrainfo and routerdesc
documents, possibly scrambled up.

The old code did:
   - If we start with the word extra-info, it's an extra-info and
     we're done.
   - If we start with the word router, it's a routerdesc and we're
     done.
   - Otherwise, scan for the first instance of the word router and
     scan for the first instance of the word extra-info.  Whichever
     comes first is the next document.

This was fine until we added annotations around r11680.  The format
became:
    (Annotation* (ExtraInfo | RouterDesc))*
Now, the first two cases no longer applied when there were
annotations, since the point where we were looking never started with
the word router or the word extra-info, so we always did case 3.
But in a list like the cached-descriptors list that has no extra-info
documents, we wound up scanning the entire list looking for an
extra-info that never existed, and we did this scan for every router
in the cache.  That's O(n^2) in cache size, and that's no good.

For the fix, see the patch. :)

yrs,
-- 
Nick Mathewson

Attachment: pgpMIRmleQdHf.pgp
Description: PGP signature

References:
- Re: latest svn rev 11720 become cpu hungry
  - From: Nick Mathewson
- Re: latest svn rev 11720 become cpu hungry
  - From: Li-Hui Zhou

Prev by Author: Re: latest svn rev 11720 become cpu hungry
Next by Author: Re: Bug: main.c:344: connection_stop_writing
Previous by thread: Re: latest svn rev 11720 become cpu hungry
Next by thread: Torbutton 1.1.8-alpha (Usability improvements)
Index(es):
- Author
- Thread