[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]
[tor-bugs] #2923 [Metrics]: Improve materialized views in the metrics database
#2923: Improve materialized views in the metrics database
-------------------------+--------------------------------------------------
Reporter: karsten | Owner: karsten
Type: enhancement | Status: new
Priority: normal | Milestone:
Component: Metrics | Version:
Keywords: | Parent:
Points: | Actualpoints:
-------------------------+--------------------------------------------------
The [https://gitweb.torproject.org/metrics-
web.git/blob/HEAD:/db/tordir.sql metrics database schema] uses
periodically updated tables similar to materialized views for aggregating
statistics. When inserting data into the database, we write the dates
that have changed to a separate updates table. Every three hours, we
delete the aggregates for these days and recompute them, which takes a few
minutes.
The recompute step that takes most of the time is `refresh_user_stats()`,
which is no surprise given the complexity of that function. We should try
to simplify this function, possibly by pre-computing partial results that
can be reused for other statistics. Ideally, recomputing aggregates
should run in under one minute, given that we want to add more
materialized views for more aggregate statistics in the future. In
particular, I'd like to know which particular SQL parts slow us down in
order to avoid them in the future.
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2923>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
_______________________________________________
tor-bugs mailing list
tor-bugs@xxxxxxxxxxxxxxxxxxxx
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-bugs