[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [tor-bugs] #15844 [Onionoo]: Develop database schema to support Onionoo's search parameter efficiently

Subject: Re: [tor-bugs] #15844 [Onionoo]: Develop database schema to support Onionoo's search parameter efficiently
From: "Tor Bug Tracker & Wiki" <blackhole@xxxxxxxxxxxxxx>
Date: Wed, 06 May 2015 19:03:36 -0000
Auto-submitted: auto-generated
Delivered-to: archiver@xxxxxxxx
Delivery-date: Wed, 06 May 2015 15:03:45 -0400
In-reply-to: <047.748fb994345a7c15a18a06680c27003a@xxxxxxxxxxxxxx>
List-archive: <http://lists.torproject.org/pipermail/tor-bugs/>
List-help: <mailto:tor-bugs-request@lists.torproject.org?subject=help>
List-id: "auto: Tor bug tracker status mails" <tor-bugs.lists.torproject.org>
List-post: <mailto:tor-bugs@lists.torproject.org>
List-subscribe: <https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs>, <mailto:tor-bugs-request@lists.torproject.org?subject=subscribe>
List-unsubscribe: <https://lists.torproject.org/cgi-bin/mailman/options/tor-bugs>, <mailto:tor-bugs-request@lists.torproject.org?subject=unsubscribe>
References: <047.748fb994345a7c15a18a06680c27003a@xxxxxxxxxxxxxx>
Reply-to: tor-assistants@xxxxxxxxxxxxxx
Sender: "tor-bugs" <tor-bugs-bounces@xxxxxxxxxxxxxxxxxxxx>

#15844: Develop database schema to support Onionoo's search parameter efficiently
-----------------------------+-----------------
     Reporter:  karsten      |      Owner:
         Type:  enhancement  |     Status:  new
     Priority:  normal       |  Milestone:
    Component:  Onionoo      |    Version:
   Resolution:               |   Keywords:
Actual Points:               |  Parent ID:
       Points:               |
-----------------------------+-----------------

Comment (by teor):

 How important is disk/memory space usage?
 Can you afford to denormalise and blow out the usage slightly, say one row
 per address, rather than one row per server?

 For example, you could have one table `server_address` with:
 * one row per relay or bridge address,
 * duplicated server information against each address, and
 * a column for the original fingerprint and another column for the hashed
 fingerprint

 Then you could search in the one table, but still have the advantages of
 searching separate columns by the start of a string (this works well with
 indexes).

 One assumption that I'm making here is that you're focusing on the speed
 of a single-string search. Doing an intersection is then O(n) by the
 number of terms, or perhaps better if you feed the results of the previous
 search into your next search.

 Oh, do you have a unique indexed / primary key / integer identity column
 on the table?
 That will help with creating unions and looking up rows, and other
 indexes.

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/15844#comment:4>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs

Prev by Author: Re: [tor-bugs] #15939 [Tor Browser]: Could not connect to Tor control port (28/4/2015 to present) (was: Could not connect to Tor control Port (28/4/2015 to present))
Next by Author: [tor-bugs] #15940 [Tor]: Make transition plans for current and future obsolete client versions
Previous by thread: Re: [tor-bugs] #15844 [Onionoo]: Develop database schema to support Onionoo's search parameter efficiently
Next by thread: Re: [tor-bugs] #15844 [Onionoo]: Develop database schema to support Onionoo's search parameter efficiently
Index(es):
- Author
- Thread