Re: [tor-talk] Norse Darklist, for blocking Tor

On 06/11/2014 12:10 PM, grarpamp wrote:
> On Wed, Jun 11, 2014 at 5:20 AM, Mirimir <mirimir@xxxxxxxxxx> wrote:
>>>> Well, we can create a blacklist of sites that block Tor ;)
>> No. I just mean a list of sites that block Tor in some way.
> There's a wiki page for all this, and both of these forms...
>>> You mean an exitpolicy blacklist of blocklists (list providers),
> https://trac.torproject.org/projects/tor/wiki/org/doc/ListOfServicesBlockingTor
> Under "List Of RBL Lists"...
>>> not of subscribing (or homegrown blocking) end sites.
> https://trac.torproject.org/projects/tor/wiki/org/doc/ListOfServicesBlockingTor
> Under "List Of Services Blocking Tor"...

OK, thanks. I'll submit results there.

>> I'm finding, by the way, that craigslist takes about six minutes to load
> 'slow' blocking rough maximum, avg exit is much less.
>> via Tor, from a VM where most of the other top 50 sites load in 5-20
> normal.
>> seconds. Another ten sites refuse to load. I haven't determined why yet.
> probably blocking.

Some sites detect that I'm taking console-level snapshots, and throw
errors. Some of the errors may be CAPTCHAs. Imgur invites me to apply
for a job, given that I use the console ;)

> If your scripts are anything like before, you're going to need to
> use the controller to manage exits, otherwise your data will
> be left to the randomness of tor. That's fine to find random things,
> but not very narrow to discover/test specific repeatable anomalies.
> Lock down to a single exit (possibly chosen at random) for each
> test run, or cycle a site through all exits (for which you will need
> at least 7 tors to finish one 'slow' site in 4 hours).
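
For reference, the single-exit lockdown suggested above could be sketched roughly as follows. This is only an illustration, not anyone's actual script: it builds the control-port command strings (SETCONF with the ExitNodes and StrictNodes options, then SIGNAL NEWNYM) and leaves out authentication and the socket handling. The function name pin_exit_commands is made up for this example.

```python
import re

# A relay fingerprint is 40 hexadecimal digits.
_FINGERPRINT = re.compile(r"^[0-9A-Fa-f]{40}$")


def pin_exit_commands(fingerprint):
    """Return control-port commands that restrict tor to a single exit.

    Hypothetical helper: it only builds the command strings. Sending them
    (after AUTHENTICATE) over the control port is left to the caller.
    """
    if not _FINGERPRINT.match(fingerprint):
        raise ValueError("expected a 40-hex-digit relay fingerprint")
    fp = fingerprint.upper()
    return [
        # Use only this exit, and do not fall back to other exits.
        "SETCONF ExitNodes=$%s StrictNodes=1" % fp,
        # Ask tor to use fresh circuits for new connections.
        "SIGNAL NEWNYM",
    ]
```

Each test run would send these commands once for its chosen exit before loading any sites.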

I prefer using random exits with random timing, because that looks more
like normal user behavior. What I need is a normal-seeming snapshot
method, and a way of explicitly flagging CAPTCHA challenges. Once I
finish working out the kinks, I'll post the scripts somewhere.
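
As a rough sketch of what I mean by random timing and explicit CAPTCHA flagging (this is not the script itself, and the hint strings, delay bounds, and function names are all assumptions made up for illustration):

```python
import random

# Hypothetical marker strings; a real list would come from inspecting
# the pages that actually show challenges.
CAPTCHA_HINTS = ("captcha", "are you a human", "unusual traffic")


def flag_captcha(page_text):
    """Return True if the page text contains any of the hint strings."""
    text = page_text.lower()
    return any(hint in text for hint in CAPTCHA_HINTS)


def jittered_delays(count, low=30.0, high=300.0, seed=None):
    """Return `count` random waits in seconds, uniform between low and high.

    The bounds are placeholders; pass a seed only to make a run repeatable.
    """
    rng = random.Random(seed)
    return [rng.uniform(low, high) for _ in range(count)]
```

Keyword matching like this will miss image-only challenges and may flag pages that merely mention CAPTCHAs, so flagged results would still need a manual look.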

The server that I'm using can handle 10-20 VMs testing in parallel, and
they can run 24/7 as long as it takes. I could do it faster, but that
would stand out too much as testing.
