Devops/monitoring-alerting: Difference between revisions

Line 8: Line 8:
* For accounts, questions, or suggestions, email jp at mozillafoundation.org
* For accounts, questions, or suggestions, email jp at mozillafoundation.org


"MONITORING TOOLS, SYSTEMS, AND LINKS "
'''MONITORING TOOLS, SYSTEMS, AND LINKS '''
''Mozilla Foundation applications are monitored and measured in a number of systems:
''
* '''Opsview, a Nagios clone with a much friendlier interface.'''
* '''Opsview, a Nagios clone with a much friendlier interface.'''
:: * Monitors and alerts when servers in load balancers are unhealthy
:: * Monitors and alerts when servers in load balancers are unhealthy
:: * Monitors and alerts on uptime/downtime of overall endpoints, such as https://webmaker.org
:: * Monitors and alerts on uptime/downtime of overall endpoints, such as https://webmaker.org
:: * Monitors and alerts on database utilization and downtime.
:: * Monitors and alerts on database utilization and downtime.
 
::  '''Important Opsview Links'''
::  "Important Opsview Links'
:: [http://opsview.mofoprod.net:3000/viewport Public Status Page]
:: [http://opsview.mofoprod.net:3000/viewport Public Status Page]
:: [http://opsview.mofoprod.net:3000/status/service?filter=unhandled&order=state_desc&order=host&order=service&includeunhandledhosts=1 Current Unhandled Alerts (Login required)]
:: [http://opsview.mofoprod.net:3000/status/service?filter=unhandled&order=state_desc&order=host&order=service&includeunhandledhosts=1 Current Unhandled Alerts (Login required)]
Line 33: Line 30:
:: * Marks and compares new/old deployed versions of software
:: * Marks and compares new/old deployed versions of software
:: !!!TODO : Add the guide for notifications & contact settings
:: !!!TODO : Add the guide for notifications & contact settings
::  '''Important New Relic Links'''
::  '''Important New Relic Links'''
:: [https://rpm.newrelic.com/accounts/255689/custom_dashboards/1695/pages New Relic Dashboards ]
:: [https://rpm.newrelic.com/accounts/255689/custom_dashboards/1695/pages New Relic Dashboards ]
Line 43: Line 39:
* '''Log monitoring with [https://loggins.mofoprod.net Loggins (Kibana) (Login Required)]'''
* '''Log monitoring with [https://loggins.mofoprod.net Loggins (Kibana) (Login Required)]'''


* "AWS Infrastructure and Autoscaling Monitoring/Alerting"
* '''AWS Infrastructure and Autoscaling Monitoring/Alerting'''
:: * An email group exists to be notified of any autoscaling activities (up or down).  Contact jp at mozillafoundation.org to be added to this list.
:: * An email group exists to be notified of any autoscaling activities (up or down).  Contact jp at mozillafoundation.org to be added to this list.
:: * Cloudwatch in the AWS console is capable of monitoring many metrics and utilization metrics, including CPU usage or network usage for a group, database, server, or ELB.  Not many alarms are triggered from this outside of to trigger scaling.
:: * Cloudwatch in the AWS console is capable of monitoring many metrics and utilization metrics, including CPU usage or network usage for a group, database, server, or ELB.  Not many alarms are triggered from this outside of to trigger scaling.
:: Most AWS infrastructure is monitored via New Relic.  See the side menu options in New Relic for RDS, ELB, EC2, Elasticache, etc...
:: Most AWS infrastructure is monitored via New Relic.  See the side menu options in New Relic for RDS, ELB, EC2, Elasticache, etc...
Confirmed users
106

edits