Apologies for breaking the thread; I didn't have the original message.

From the mentioned paper (which I've only skimmed): "The deployed model considers a time interval of seven (7) days to model connection rates (i.e. $t_i - t_{i-1} = 7$ days)." If I understand correctly, this means trends occurring on a week-to-week basis (or over longer periods) are what the model considers, and higher-frequency trends are undesirable? In that case, perhaps pre-processing the data with a low-pass filter would be useful.

Attached (1.png) is an example (in red) of filtering out all frequencies higher than the one corresponding to a one-week period, compared to the original data (green). This is the entire data for Switzerland, with the abscissa in seconds. The result is a little less noise, which might help with your algorithm. The same filter applied to the Egypt and Iran data (2.png and 3.png respectively) doesn't harm the signal for those two censorship events, at least not by visual inspection. (You'd probably want to use a Hanning window or something to avoid the artifacts at the extreme ends of the red graphs.)

But filtering like this would also mean that the signal of an event which begins and ends in less than a week, like this week's, is lost...

--
Mansour
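P.S. In case it helps, here is a minimal sketch (Python/NumPy) of the kind of brick-wall low-pass described above, assuming uniformly spaced samples with the spacing given in seconds; the function name and defaults are mine, not from the paper:

    import numpy as np

    def lowpass_one_week(x, dt=86400.0, cutoff_period=7 * 86400.0):
        """Brick-wall low-pass: zero out every Fourier component whose
        period is shorter than one week.

        x  -- connection-count series (assumed uniformly sampled)
        dt -- sample spacing in seconds (daily samples assumed here)
        """
        freqs = np.fft.rfftfreq(len(x), d=dt)    # frequency of each bin, Hz
        spec = np.fft.rfft(x)
        spec[freqs > 1.0 / cutoff_period] = 0.0  # drop everything above 1/(7 days)
        return np.fft.irfft(spec, n=len(x))

    # To tame the ringing at the extreme ends (as in the red graphs),
    # taper the data first, e.g. with a Hanning window:
    #   smoothed = lowpass_one_week(x * np.hanning(len(x)))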
Attachments: 1.png, 2.png, 3.png (PNG images)