
Re: [tor-dev] prop224: HSDir caches question with OOM

To: tor-dev@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-dev] prop224: HSDir caches question with OOM
From: s7r <s7r@xxxxxxxxxx>
Date: Sat, 16 Apr 2016 20:30:29 +0300

Hello,

On 4/16/2016 4:11 PM, David Goulet wrote:
[snip]
>> 
>>
>> A third alternative is that we can iterate through each time period:
>> Set K to the oldest expected descriptor age in hours, minus 1 hour
>> Deallocate all entries from Cache A that are older than K hours
>> Deallocate all entries from Cache B that are older than K hours
>> Set K to K - 1 and repeat this process
>>
>> This algorithm is O(Kn), which is ok as long as K is small.
>> This carries a slight risk of over-deallocating cache entries, which is OK at OOM time.
>> I like this one, because it's simple, performant, and doesn't need any extra memory allocations.
> 
> I do also like this one. It's pretty simple and efficient.
> 
> Now there is a fourth alternative that Yawning proposed in #tor-dev yesterday,
> which is to always prioritize the v2 cache in the OOM handling, that is, clean
> the v2 cache first and only then, if we have to, go to the v3 cache. It would
> be a "v3 is much more important than v2" kind of incentive.
> 
> As he describes it, it's a bit like our tap vs ntor situation under pressure:
> we prioritize ntor and drop tap if needed.
> 
> I'm still quite _unsure_ about this. The v3 will bring more memory pressure
> with this second HSDir cache. And my intuition is that most users won't switch
> directly to v3 but will probably have a migration path from v2 to v3 like
> having the v2 onion on for X months before discontinuing it.
> 
> So losing reachability because we decide to drop v2 first might not be
> desirable. But then also, how often is an HSDir OOM triggered... ?
> 
> Anyway, right now I'm leaning towards your approach, teor, of just using the
> time-period.
> 
> More eyes on this would be great :).
> 
> Cheers!
> David
> 

I agree that teor's O(Kn) approach is the best one in terms of performance
(no additional memory allocations), simplicity and efficacy. The O(Kn)
algorithm clears entries based only on their age, so it does not favor
either the v2 or the v3 cache. That is good, given that we do not know how
long HS operators will need to upgrade their services to prop 224.
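
To make the age-based pass concrete, here is a minimal sketch of how I read
teor's steps. It is not Tor code: the function name, the dict-based caches,
the (timestamp, size) entry layout and the bytes_to_free stop condition are
all assumptions made for illustration.

```python
def oom_evict(cache_a, cache_b, now, max_age_hours, bytes_to_free):
    """Evict by age from both caches until enough bytes are freed.

    Hypothetical layout: each cache maps a key to a
    (created_timestamp_seconds, size_bytes) tuple.
    """
    freed = 0
    k = max_age_hours - 1  # oldest expected descriptor age, minus 1 hour
    while k >= 0 and freed < bytes_to_free:
        cutoff = now - k * 3600
        # Both caches are swept with the same K, so neither is favored.
        for cache in (cache_a, cache_b):
            stale = [key for key, (ts, _size) in cache.items() if ts < cutoff]
            for key in stale:
                _ts, size = cache.pop(key)
                freed += size
        # The target is only checked between passes, so one pass may free
        # more than was asked for (the "over-deallocating" risk).
        k -= 1
    return freed
```

Each pass walks all n entries and there are at most K passes, hence O(Kn),
and the only extra memory is the short-lived list of keys to drop.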

The tap vs ntor situation was a good measure, but the threat model was
different: we were trying to ensure that new clients using ntor got
resources from relays with priority, as opposed to non-updated botnet
zombies using tap. In the current situation we care about the v2 and v3 HS
caches exactly the same, for an unknown period of time which might not be
short, so we shouldn't penalize v2 in any way.

This needs to be covered regardless of how often an HSDir has its OOM
handler triggered. I don't think we should assume it's hard to flood HSDirs
with descriptors until the memory is full.

Now that HSDirs will need to handle two caches, is allocating 20% of the
total memory to HS descriptors still a good value? What harm would
increasing it to, let's say, 25% do?
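
For a sense of scale, a small sketch of the arithmetic. The 1000 MB total
and the function name are hypothetical; only the 20% and 25% figures come
from the question above.

```python
def split_budget(total_mb, hs_percent):
    """Return (HS descriptor share, remainder for everything else)."""
    hs_mb = total_mb * hs_percent // 100
    return hs_mb, total_mb - hs_mb

total_mb = 1000  # hypothetical total memory budget
hs20, rest20 = split_budget(total_mb, 20)  # 200 MB for HS, 800 MB for the rest
hs25, rest25 = split_budget(total_mb, 25)  # 250 MB for HS, 750 MB for the rest
```

Going from 20% to 25% grows the HS share by a quarter while shrinking
everything else by 6.25%, whatever the total is.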


Follow-Ups:
- Re: [tor-dev] prop224: HSDir caches question with OOM
  - From: Yawning Angel

References:
- [tor-dev] prop224: HSDir caches question with OOM
  - From: David Goulet
- Re: [tor-dev] prop224: HSDir caches question with OOM
  - From: Tim Wilson-Brown - teor
- Re: [tor-dev] prop224: HSDir caches question with OOM
  - From: David Goulet
