
Re: [tor-relays] max / burst speed

To: tor-relays@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-relays] max / burst speed
From: Rick Huebner <rhuebner@xxxxxxxxxx>
Date: Tue, 27 Sep 2011 20:43:22 -0700

On 9/27/2011 1:37 PM, "Steve Snyder" <swsnyder@xxxxxxxxxxxxx> wrote:

Either there is simply not enough traffic to saturate all available middle nodes or Tor's node selection algorithm is, um, sub-optimal.

I just started my relay a month ago, so I've done some research, and it seems to be pretty complicated. Please excuse what turned into an over-long post, but I think this is info that new relay operators don't generally know, and should.

At its base, the traffic allocation scheme is elegantly simple: each client randomly selects circuit participants in proportion to each relay's share of the sum of all the relays' bandwidth values from the most recent consensus (not counting complications like starting with a guard node, ending with an exit, etc.). So ideally, a relay with twice the bandwidth of another will (on average) be selected for circuits twice as often, and (again, on average) end up processing twice as much traffic. And also, if the total bandwidth demand from all clients is enough to consume, say, 60% of all Tor relays' available bandwidth, each relay will (on average) be kept operating at about 60% of capacity.
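
As a rough illustration of that selection rule, here is a minimal Python sketch. It is not Tor's actual path selection code: the relay names and bandwidth values are made up, and guard/exit constraints are ignored entirely. It only shows that picking relays with probability proportional to their consensus bandwidth gives a relay with twice the bandwidth about twice the selections.

```python
import random
from collections import Counter

# Hypothetical consensus bandwidth values (arbitrary units); not real relays.
consensus_bw = {"relayA": 100, "relayB": 200, "relayC": 700}

def pick_relay(bandwidths, rng):
    """Pick one relay with probability proportional to its bandwidth value."""
    names = list(bandwidths)
    weights = [bandwidths[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]

def selection_shares(bandwidths, trials=100_000, seed=1):
    """Simulate many independent picks and return each relay's share of them."""
    rng = random.Random(seed)
    counts = Counter(pick_relay(bandwidths, rng) for _ in range(trials))
    return {name: counts[name] / trials for name in bandwidths}

shares = selection_shares(consensus_bw)
for name, share in shares.items():
    print(f"{name}: {share:.1%} of selections")
```

In this toy setup relayB should land near twice relayA's share, and relayC near 70% of all selections, matching its 700 out of 1000 total.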

But... the bandwidth figure in the consensus that clients use to select relays is not a simple figure. Apparently, it used to be based on the rates reported by each relay in their uploaded server descriptors, but people tried to game the system and fed it bogus data, so that was replaced by the authoritative servers going out and measuring the actual bandwidth of each relay by downloading a test file through it to see how fast it really went. So the values your relay uploads in the descriptor are ignored now, and your set Rate and Burst speeds aren't relevant for traffic allocation except inasmuch as they affect the speed that the official bandwidth scanner can download the test file through you. The only bandwidth figure that matters for driving traffic to your relay is the one reported in the consensus, which you can see at https://metrics.torproject.org/networkstatus.html, or from your own relay's DirPort at http://<relay>:<dirport>/tor/status-vote/current/consensus if you're a directory mirror. Note that these are not the values from any of the TorStatus servers; they're displaying something else entirely (and not always the same thing as each other, either). So anyway, don't expect a change in your torrc file to immediately bring you more traffic. Your relay has to get rescanned first, which probably only happens once a day.
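
If you fetch the consensus document yourself, the figure in question sits on the "w Bandwidth=" line that follows each relay's "r" line. The Python sketch below pulls those values out of consensus text. The sample text is fabricated and heavily simplified (placeholder digests, documentation-range IP addresses), and the function name is just an illustration, so treat it as a starting point rather than a complete parser.

```python
# Fabricated, simplified consensus excerpt for illustration only.
SAMPLE = """\
r ExampleRelay AAAA BBBB 2011-09-27 12:00:00 192.0.2.1 9001 9030
s Fast Running Valid
w Bandwidth=200
r OtherRelay CCCC DDDD 2011-09-27 12:00:00 192.0.2.2 443 0
s Exit Fast Running Valid
w Bandwidth=1500
"""

def consensus_bandwidths(text):
    """Map relay nickname -> Bandwidth value from 'r' / 'w' line pairs."""
    result = {}
    nickname = None
    for line in text.splitlines():
        if line.startswith("r "):
            # The nickname is the first field after the 'r' keyword.
            nickname = line.split()[1]
        elif line.startswith("w ") and nickname is not None:
            for field in line.split()[1:]:
                key, _, value = field.partition("=")
                if key == "Bandwidth":
                    result[nickname] = int(value)
    return result

bandwidths = consensus_bandwidths(SAMPLE)
print(bandwidths)
```

Nicknames are not guaranteed to be unique, so a real tool would key on the relay's identity instead; the nickname is used here only to keep the example short.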

OK, fair enough. But those bandwidth numbers aren't just the simple speed which was seen when the scanner last downloaded a test file through your relay. To smooth out random speed changes as your relay is remeasured day to day, the bandwidth scanners use some kind of fairly slow exponential moving average of your download speeds. So it takes considerable time for changes in your relay's speed to slowly seep into the bandwidth number in the consensus. And for new relays, it seems that the initial value which the new measurements are slowly averaged into starts out pretty low, so that your reported bandwidth also starts out pretty low, and then gradually rises over time to somewhere around your actual Rate limit. And by slow, I mean weeks; my new relay has been running for a month, and over that time, I've seen the reported bandwidth slowly, with many fits and starts and temporary setbacks, go from about 20-30 to ~200 (about half my actual Rate limit), and it's still rising. Which kind of has the effect of putting new relays on probation, and slowly feeding them more and more traffic over time to see how they do, which is not a bad thing at all. But new relay operators are usually excited and anxious to see stuff happen, and need to be aware of this slow starting ramp-up period and not get too discouraged or give up because they're not seeing much traffic at first.
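
To get a feel for why an exponential moving average ramps up so slowly, here is a minimal Python sketch. The smoothing factor, the starting value, and the once-a-day measurement of a constant 400 are all assumptions chosen for illustration; the scanners' real parameters are not given in this post and are not known to me.

```python
def ema_series(measurements, start, alpha):
    """Return the running exponential moving average after each measurement.

    Each new value is alpha * measurement + (1 - alpha) * previous average.
    """
    value = start
    history = []
    for measurement in measurements:
        value = alpha * measurement + (1 - alpha) * value
        history.append(value)
    return history

# Assumed numbers: the relay really measures 400 every day, the average
# starts at 20, and each daily scan contributes 5% to the new average.
series = ema_series([400] * 30, start=20, alpha=0.05)

for day in (1, 7, 14, 30):
    print(f"day {day:2d}: reported bandwidth ~{series[day - 1]:.0f}")
```

With a small smoothing factor like this, the reported figure is still well short of the true speed after a month, even though every single measurement was at full speed. Real measurements are noisy, which would add the fits and starts on top of the slow climb.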

OK, so the bandwidth rating that matters is measured and slowly averaged... but there's yet another layer. To improve the overall performance of the Tor network, and to help clients generally create faster circuits, there's another bias factor thrown in. I don't know the details, but faster relays have their measured bandwidth figures artificially boosted to drive even more traffic through them than their high bandwidth would naturally attract, and/or slower relays have their bandwidth figures artificially lowered to drive less traffic through them (not sure which or both, but the effect is the same regardless). So if the overall client bandwidth demand is, again, 60% of the total Tor network bandwidth available, instead of each relay being at ~60% capacity, the fastest relays will be more fully utilized, and, unavoidably, that means that the slower relays will be correspondingly less utilized.

This might dismay slower relay operators who feel that they're being prevented from contributing as much as they'd like, but objectively, it's generally better for a Tor client to have a 300 KB/s circuit hop than a 30 KB/s one. The faster relays are just nicer for clients to use, and it's better overall for the Tor network to make sure they get used as much as possible. And if those fast relays are getting more than their prorated "fair" share of usage based on their actual speeds, that unavoidably means that slower relays are getting less usage than their speed would normally merit. But that doesn't mean the slower relays are useless! Simply by existing, those extra relays greatly increase the difficulty of various attacks on Tor, just because they *might* have been used for any given circuit. Also, the whole guard node system for making certain nasty attacks infeasible relies on having lots of potential guard nodes to choose between, even relatively slow ones. And of course, all exit nodes are especially precious, almost regardless of speed. And finally, even if they're not used all that much while client demand on the Tor network is low to moderate, they provide an important spare reserve of bandwidth to make sure that some relay somewhere will always be ready to handle a new circuit even if the network becomes very busy and manages to max out the high speed "backbone" relays, or Tor is subjected to some kind of DoS attack.

So anyway, for all the relay operators asking "why isn't my relay being used more?", there's your infodump. If it's a new relay, or you recently upgraded/raised your speed limits, keep an eye on the official bandwidth figure for your relay in the consensus, especially the nice graph displayed in the Router Detail page that you reach by clicking your router link on the https://metrics.torproject.org/networkstatus.html page. If that graph shows any upwards trend, the effects of your change are still slowly percolating into your official bandwidth figure, and more traffic will appear as it rises. If it's plateaued out, you're getting your share of the overall Tor traffic based on your relay's overall performance and the total client demand on the Tor network.

For slow nodes who've limited their overall Rate to avoid hitting bandwidth caps, you might consider using AccountingMax to cap the usage to a safe level, and increase your speeds; you may find it more rewarding to relay significant traffic for 6 hours per day and then hibernate for 18 than to stay on "inactive reserve" status all the time. From the overall viewpoint of the network, is it better to have 1000 new relays at good speeds up 1/4 of the time (effectively adding 250 new fast relays), or to have them at slow speeds all of the time, not being used much? I'm not really sure, but I've noticed that the AccountingMax hibernation feature is hardly used at all from what I see on TorStatus, and I wonder why.
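
The trade-off above is simple arithmetic, sketched here in Python. The daily byte budget and the 300 KB/s rate are made-up numbers, KB is taken to mean 1024 bytes, and the sketch assumes the relay runs flat out at its rate until the budget is gone, which real traffic will not do exactly.

```python
SECONDS_PER_DAY = 24 * 3600
KB = 1024  # assumption: KB means 1024 bytes here

def active_hours_per_day(daily_cap_bytes, rate_bytes_per_sec):
    """Hours per day a relay can run at full rate before hitting the cap."""
    seconds = daily_cap_bytes / rate_bytes_per_sec
    return min(seconds / 3600, 24.0)

def always_on_rate(daily_cap_bytes):
    """Rate (bytes/sec) that spreads the same daily cap over all 24 hours."""
    return daily_cap_bytes / SECONDS_PER_DAY

# Hypothetical budget: enough for 6 hours at 300 KB/s.
fast_rate = 300 * KB
daily_cap = fast_rate * 6 * 3600

hours = active_hours_per_day(daily_cap, fast_rate)
slow_rate = always_on_rate(daily_cap)

print(f"At {fast_rate / KB:.0f} KB/s the cap lasts {hours:.1f} hours per day")
print(f"Spread over 24 hours, the same cap allows {slow_rate / KB:.0f} KB/s")
```

So the same daily budget buys either 6 hours at 300 KB/s followed by 18 hours of hibernation, or a constant 75 KB/s around the clock.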

OK, enough already, this turned out way longer than I was expecting. Hope it helps.


