Community Ops/Monitoring

	THIS PAGE IS A WORKING DRAFT
	The page may be difficult to navigate, and some information on its subject might be incomplete and/or evolving rapidly. If you have any questions or ideas, please add them as a new topic on the discussion page.

Monitoring Setup

Monitoring is decentralized. Incident Response is distributed based on timezone, availability and project knowledge.

We use a number of services to maintain effective monitoring:

Pingdom - Checks that our servers are up. Screams if they aren't.
VictorOps - Incident Response Management. Dispatches alerts to sysadmins and compiles a nice timeline for us to manage incidents.

TBD

TBD

Tool	Usage	Primary Contact	Secondary Contacts
Pingdom	Uptime and latency monitoring	mrz
VictorOps	Incident Escalation and notifications	tanner	mrz, logan, yousef
Cloudwatch	Top Level Monitoring of AWS	Same as AWS
StatusHub	Dashboard	mrz