[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-bugs] #6180 [Ooni]: Detecting censorship in HTTP pages



#6180: Detecting censorship in HTTP pages
----------------------------+-----------------------------------------------
 Reporter:  hellais         |          Owner:  hellais     
     Type:  task            |         Status:  needs_review
 Priority:  normal          |      Milestone:              
Component:  Ooni            |        Version:              
 Keywords:  SponsorH201206  |         Parent:              
   Points:                  |   Actualpoints:              
----------------------------+-----------------------------------------------

Comment(by hellais):

 We discussed a lot about this during the OONI hackfest with mct and this
 is what we talked about:

 We can divide the possible approaches for distinguishing a blocked page
 from the correct result page into two categories: Statistical and
 Heuristics.

 # Statistical approach

 - Bayesian

 - SVM

 - Fuzzy signature matching

 - Other classifiers

 What we want to do is run these classifiers on the corpus of block pages
 and real website.

 # Heuristics

 - List of regexp that apply to censorship pages


 We also talked about having clients tell the backend what it got as a
 response and having the backend figure out if such a page should be a
 block page or the correct result.

-- 
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/6180#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs