[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Abuse statistics



Hi,

I extended my statistics a bit:

http://ob44yuhbyysk5xft.onion

Now you can also see, how many connections are used to search for profiles via an email-address. The detection is done by the following regular expression:
/GET .*se?a?rch.*=[^%& ]+%40[^%& ]+/
No data beside the host name is saved, especially not the email addresses.

It shows that at the moment between 4 and 20 % of the connections are used for these requests, mainly at flickr, what explains their high ranking in the general connections. It can be assumed that most of searches are done automatically by profilers, that scan "web 2.0" sites for existing profiles with their email-address-databases, in order to build up a relation database. (Examples: http://www.rapleaf.com/ , http://www.spock.com/, http://www.peekyou.com/) By using unencrypted connections over Tor they violate their privacy policies, I guess.

Also it shows how interesting it is for email-address harvesters to run an exit node on their own. Each 100'000 connections you will have collected at least 5'000 email addresses! So the profilers feed the spammers? :-)