
Re: [tor-dev] Metrics Plans

To: Kostas Jakeliunas <kostas@xxxxxxxxxxxxxx>
Subject: Re: [tor-dev] Metrics Plans
From: Damian Johnson <atagar@xxxxxxxxxxxxxx>
Date: Tue, 11 Jun 2013 08:02:54 -0700
Cc: tor-dev@xxxxxxxxxxxxxxxxxxxx

> I can try experimenting with this later on (when we have the full / needed
> importer working, e.g.), but it might be difficult to scale indeed (not
> sure, of course). Do you have any specific use cases in mind? (actually
> curious, could be interesting to hear.)

The advantage of being able to reconstruct Descriptor instances is
simpler usage (and hence more maintainable code). That is, usage could
be as simple as...

========================================

from tor.metrics import descriptor_db

# Fetches all of the server descriptors for a given date. These are provided as
# instances of...
#
#   stem.descriptor.server_descriptor.RelayDescriptor

for desc in descriptor_db.get_server_descriptors(2013, 1, 1):
    # print the addresses of only the exits

    if desc.exit_policy.is_exiting_allowed():
        print(desc.address)

========================================

Obviously we'd still want to do raw SQL queries for high-traffic
applications. However, for applications where maintainability trumps
speed, this could be a nice feature to have.
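
For what it's worth, here is a rough sketch of how such a lookup might work
on top of the database. Everything in it is an assumption for illustration:
the server_descriptor table, its raw and published columns (published stored
as ISO 'YYYY-MM-DD HH:MM:SS' text), and the parse argument, which stands in
for whatever turns raw descriptor text into a Descriptor instance (with stem
that would presumably be RelayDescriptor).

```python
import sqlite3
from datetime import date, timedelta


def get_server_descriptors(conn, year, month, day, parse):
    """Yield one parsed descriptor per row published on the given date.

    conn  - sqlite3 connection holding a hypothetical server_descriptor table
    parse - callable that turns raw descriptor text into an object
    """
    start = date(year, month, day)
    end = start + timedelta(days=1)

    # ISO formatted timestamps sort lexicographically, so a plain string
    # comparison is enough to select a single day.
    rows = conn.execute(
        "SELECT raw FROM server_descriptor "
        "WHERE published >= ? AND published < ? ORDER BY published",
        (start.isoformat(), end.isoformat()),
    )

    for (raw,) in rows:
        yield parse(raw)
```

The caller never sees SQL, only whatever objects parse returns, which is
where the maintainability win would come from.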

>> * After making the schema update the importer could then run over this
>> raw data table, constructing Descriptor instances from it and
>> performing updates for any missing attributes.
>
> I can't say I can easily see the specifics of how all this would work, but
> if we had an always-up-to-date data model (mediated by Stem Relay Descriptor
> class, but not necessarily), this might work.. (The ORM <-> Stem Descriptor
> object mapping itself is trivial, so all is well in that regard.)

I'm not sure if I entirely follow. As I understand it the importer...

* Reads raw rsynced descriptor data.
* Uses it to construct stem Descriptor instances.
* Persists those to the database.

My suggestion is that for the first step it could read the rsynced
descriptors *or* the raw descriptor content from the database itself.
This means that the importer could be used to not only populate new
descriptors, but also back-fill after a schema update.

That is to say, adding a new column would simply be...

* Perform the schema update.
* Run the importer, which...
  * Reads raw descriptor data from the database.
  * Uses it to construct stem Descriptor instances.
  * Performs an UPDATE for anything that's out of sync or missing from
    the database.
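
The back-fill steps above could be sketched roughly as follows. This is only
an illustration under stated assumptions: the server_descriptor table with
id and raw columns is hypothetical, and parse_router_line is a toy stand-in
for constructing a stem Descriptor (it reads just the nickname, address and
ORPort from the "router" line).

```python
import sqlite3


def parse_router_line(raw):
    """Toy stand-in for stem parsing: reads only the 'router' line."""
    for line in raw.splitlines():
        if line.startswith("router "):
            _, nickname, address, or_port = line.split()[:4]
            return {"nickname": nickname, "address": address,
                    "or_port": int(or_port)}
    raise ValueError("no router line found")


def backfill(conn, columns, parse=parse_router_line):
    """Re-derive the given columns from the stored raw text and UPDATE
    any row whose stored values are missing or out of sync.

    Column names are interpolated into the SQL, so they must come from
    trusted code rather than user input. Returns the number of rows updated.
    """
    select = "SELECT id, raw, %s FROM server_descriptor" % ", ".join(columns)
    assignments = ", ".join("%s = ?" % column for column in columns)
    update = "UPDATE server_descriptor SET %s WHERE id = ?" % assignments

    updated = 0
    for row in conn.execute(select).fetchall():
        desc_id, raw, stored = row[0], row[1], row[2:]
        parsed = parse(raw)
        fresh = tuple(parsed[column] for column in columns)

        if fresh != stored:
            conn.execute(update, fresh + (desc_id,))
            updated += 1

    conn.commit()
    return updated
```

After an ALTER TABLE that adds, say, an address column, running
backfill(conn, ["nickname", "address"]) would fill it in for existing rows,
and a second run would find nothing left to change.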

Cheers! -Damian