[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-talk] Hidden Services (about tor2web)

To: tor-talk@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [tor-talk] Hidden Services (about tor2web)
From: "Fabio Pietrosanti (naif)" <lists@xxxxxxxxxxxxxxx>
Date: Wed, 19 Sep 2012 23:41:46 +0200
Delivered-to: archiver@xxxxxxxx
Delivery-date: Wed, 19 Sep 2012 17:41:59 -0400
In-reply-to: <E1TEFPt-0002EC-Th@xxxxxxxxxxxxxxxxxxx>
List-archive: <http://lists.torproject.org/pipermail/tor-talk>
List-help: <mailto:tor-talk-request@lists.torproject.org?subject=help>
List-id: "all discussion about theory, design, and development of Onion Routing" <tor-talk.lists.torproject.org>
List-post: <mailto:tor-talk@lists.torproject.org>
List-subscribe: <https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-talk>, <mailto:tor-talk-request@lists.torproject.org?subject=subscribe>
List-unsubscribe: <https://lists.torproject.org/cgi-bin/mailman/options/tor-talk>, <mailto:tor-talk-request@lists.torproject.org?subject=unsubscribe>
References: <CALyVa3W4T3YOKmAwRbAqBSJ3nN30rqCC=_=U=bjfDtwqmt01qw@xxxxxxxxxxxxxx> <C9D521E0-1C9D-44F5-AD88-14ACB3918316@xxxxxxxxxxxxxxxxx> <CALyVa3Wef+k6PJFbUqLDvD34TzmeaL3CqttMsPOJ1EJ+mayBQg@xxxxxxxxxxxxxx> <20120917112254.GA2603@xxxxxxxxxxxxxxxx> <50587578.7060106@xxxxxxxxxxxxxxx> <E1TDxzi-0005HK-9d@xxxxxxxxxxxxxxxxxxx> <CAD2Ti28C=m0hVKt0iWmRQGcDdLcdzLzyuCDCLG0KnoLwZqSDCw@xxxxxxxxxxxxxx> <E1TE2W4-0006iz-Gb@xxxxxxxxxxxxxxxxxxx> <CAD2Ti2_h1OwRMmVSkGXUnFyoStQmGGr=xwY+K=1zXj2PtCkZcw@xxxxxxxxxxxxxx> <E1TEFPt-0002EC-Th@xxxxxxxxxxxxxxxxxxx>
Reply-to: tor-talk@xxxxxxxxxxxxxxxxxxxx
Sender: tor-talk-bounces@xxxxxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:15.0) Gecko/20120907 Thunderbird/15.0.1

Apologise for subject/thread hijacking.

On 9/19/12 10:13 AM, tor@xxxxxxxxxxxxxxxxxx wrote:
> On 19/09/12 06:36, grarpamp wrote:
>
> >> People use robots.txt to indicate that they don't want their site
> >> to be added to indexes.
>
> > They use it to indicate that they don't want their site to be
> > crawled.
>
> In almost all cases (99% or higher), robots.txt is used to indicate
> that a site shouldn't be crawled, *because* they don't want it to be
> indexed. The intention is painfully clear...

The point has been integrated in the appropriate ticket there:
https://github.com/globaleaks/Tor2web-3.0/issues/19

Please integrate here any idea or suggestion about the topic.

However you should also know that already today is possible for a TorHs
to block access from Tor2web.

Tor2web send an X-Tor2web header to announce to the TorHS that
connection come from Tor2web.

We added up a wiki documentation section explaining how to do it:
https://github.com/globaleaks/Tor2web-3.0/wiki/Blocking-access-from-tor2web

Regarding the topic of "robots.txt", in the new tor2web 3.0 robots.txt
are "hijacked" in order to prevent Tor2web crawling by public search
engine. Also a list of user agent of internet spyder has been blocked by
default.
Both blocks settings can be disabled from config file:
https://github.com/globaleaks/Tor2web-3.0/wiki/Configuring-tor2web

Those blocks will be probably less annoying when the behavior regarding
spidering will be configurable directly from TorHs sites (for example by
providing specific tor2web related config strings in robots.txt).

Fabio

p.s. There's a new tor2web domain using Tor2web 3
http://eqt5g4fuenphqinx.tor2web.blutmagie.de :-)
_______________________________________________
tor-talk mailing list
tor-talk@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-talk

References:
- [tor-talk] Hidden Services
  - From: Scurvy Scott
- Re: [tor-talk] Hidden Services
  - From: Sebastian Hahn
- Re: [tor-talk] Hidden Services
  - From: Scurvy Scott
- Re: [tor-talk] Hidden Services
  - From: andrew
- Re: [tor-talk] Hidden Services
  - From: Fabio Pietrosanti (naif)
- Re: [tor-talk] Hidden Services
  - From: tor
- Re: [tor-talk] Hidden Services
  - From: grarpamp
- Re: [tor-talk] Hidden Services
  - From: tor
- Re: [tor-talk] Hidden Services
  - From: grarpamp
- Re: [tor-talk] Hidden Services
  - From: tor

Prev by Author: Re: [tor-talk] Italy - third highest number users
Next by Author: Re: [tor-talk] Does tor browser bundle can goes on Mac App Store?
Previous by thread: Re: [tor-talk] Hidden Services
Next by thread: Re: [tor-talk] Hidden Services
Index(es):
- Author
- Thread