Telemetry/Reboot

From MozillaWiki
Revision as of 21:39, 20 August 2013

Goal: A fast, robust server and frontend, operational by Sep 30th, able to accept and graph telemetry data. This means that as of Oct 1, http://metrics.mozilla.com/ will stop updating. Metrics should be able to retire the telemetry portion of the mango cluster in October.

Telemetry Backend

Server (https://github.com/mreid-moz/telemetry-server/) requirements:

  • Ability to process 10x incoming packet rates of metrics telemetry infrastructure on a single AWS instance: 2400req/s with 30K HTTP POST packets. Fall
  • Server should be bandwidth-limited, not CPU.
  • Server should make data available for map/reduce immediately. Fallback goal: 5 min latency. In Q4 we'd like to use something like Heka so that dashboards use live data (0 min lag).
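
To get a feel for what "bandwidth-limited, not CPU" means at the stated load, here is a rough back-of-the-envelope sketch. It assumes that "30K" means roughly 30 kB per POST body and that 2400 req/s is the target rate to sustain; neither reading is confirmed by the page, and the function name is purely illustrative.

```python
# Back-of-the-envelope inbound bandwidth for the stated load target.
REQUESTS_PER_SECOND = 2400   # target rate taken from the requirements list
PAYLOAD_BYTES = 30 * 1000    # assumption: "30K" is read as about 30 kB per POST


def required_bandwidth_mbit(requests_per_second: int, payload_bytes: int) -> float:
    """Return the inbound payload bandwidth in megabits per second."""
    bytes_per_second = requests_per_second * payload_bytes
    return bytes_per_second * 8 / 1_000_000


if __name__ == "__main__":
    mbit = required_bandwidth_mbit(REQUESTS_PER_SECOND, PAYLOAD_BYTES)
    print(f"{mbit:.0f} Mbit/s of payload, before HTTP and TCP overhead")
```

Under these assumptions the payload alone is a bit over half a gigabit per second, so the network link of a single instance is a plausible bottleneck as long as per-request CPU work stays small.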

Server Milestones:

  • Sep 2: Ability to temporarily (1 hour?) point telemetry DNS at AWS, forwarding telemetry to the metrics cluster. Ideally we'd be able to do this at the bouncer level so no changes are needed to accomplish forwarding.
  • Sep 16: The AWS setup from the Sep 2 milestone should be feeding both servers (metrics and AWS) until the cutover date.
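
The dual-feed milestone amounts to delivering each incoming submission to two destinations without letting a failure in one block the other. The following is a minimal sketch of that idea only; `tee_submission` and the sink callables are hypothetical names, and this is not how telemetry-server or the bouncer actually implements forwarding.

```python
from typing import Callable, Dict

# A sink is any callable that accepts the raw submission body.
# Hypothetical: real sinks would write to AWS storage or POST to the metrics cluster.
Sink = Callable[[bytes], None]


def tee_submission(payload: bytes, sinks: Dict[str, Sink]) -> Dict[str, str]:
    """Deliver one payload to every sink.

    Returns a mapping of sink name to error message for sinks that failed,
    so a failure in one destination does not prevent delivery to the others.
    """
    failures: Dict[str, str] = {}
    for name, send in sinks.items():
        try:
            send(payload)
        except Exception as exc:  # record the failure and continue with other sinks
            failures[name] = str(exc)
    return failures


if __name__ == "__main__":
    received = []
    failures = tee_submission(
        b'{"ver": 1}',
        {"aws": received.append, "metrics": received.append},
    )
    print(len(received), failures)
```

Returning the failures rather than raising lets the caller decide whether a missed delivery to one side should be retried, logged, or ignored during the transition period.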

Telemetry Dashboard

Dashboard (https://github.com/mozilla/telemetry-dashboard/) Milestones


Notes