TestEngineering/Services/TSVerifierSyncTestEnvironments: Difference between revisions
< TestEngineering | Services
Jump to navigation
Jump to search
| Line 253: | Line 253: | ||
** https://app.stackdriver.com/groups/4388/stage-services-tag-sync15 | ** https://app.stackdriver.com/groups/4388/stage-services-tag-sync15 | ||
** https://app.stackdriver.com/instances/<AWS instance> | ** https://app.stackdriver.com/instances/<AWS instance> | ||
** Also, make use of the Services, Groups, and Dashboards for more specific links | |||
*** Example: Groups > https://app.stackdriver.com/groups/4390/stage-services-tag-sync15/stage-services-sync15/tokenserver | |||
*** Example: Dashboards > https://app.stackdriver.com/monitoring/3827/tokenserver | |||
* Graphite: | * Graphite: | ||
Revision as of 17:35, 12 August 2014
Tokenserver, Verifier, and Sync Production Environments
- TokenServer: https://token.services.mozilla.com
- Verifier: https://verifier.accounts.firefox.com
- Sync 1.5 Nodes (for now in AWS):
- sync-1-us-east-2.sync.services.mozilla.com
- sync-2-us-east-2.sync.services.mozilla.com
- There are many more than this now...
- QA Access: not allowed
- Pointing Fx Nightly to Production FxA/Sync:
- Set/Verify the following Firefox configs:
- services.sync.clusterURL = (should get automatically set by the TokenServer)
- services.sync.tokenServerURI = https://token.services.mozilla.com/1.0/sync/1.5
- services.sync.fxaccounts.enabled=true (NEW FOR NIGHTLY FF 29 - SHOULD BE SET BY DEFAULT)
- services.sync.log.appender.file.logOnError = Yes
- services.sync.log.appender.file.logOnSuccess = Yes
- services.sync.log.appender.file.level = Trace
- identity.fxaccounts.auth.uri = https://api.accounts.firefox.com/v1
- identity.fxaccounts.remote.uri = https://accounts.firefox.com/?service=sync&context=...
- Set/Verify the following Firefox configs:
TokenServer Stage Environment
- URLs
- TokenServer: https://token.stage.mozaws.net
- Version:
TokenServer: rpm -qa | grep token Example: tokenserver-svcops <latest version>
- AWS
- Shared:
- shared-elasticsearch
- shared-elasticsearch
- shared-elasticsearch
- shared-elasticsearch
- shared-rabbitmq
- shared-rabbitmq
- shared-bastion
- shared-heka
- Shared:
- tokenserver app server (2 m3.medium instances behind a CF stack and ELB)
- tokenserver db (1 large DB instance behind RDS)
- Files
- /data/tokenserver/*
- Processes
- tokenserver app server:
- Search for token, circus, nginx, gunicorn
- tokenserver app server:
- Logs
- /media/ephemeral0/logs/
- /media/ephemeral0/logs/nginx/access.log
- /media/ephemeral0/logs/nginx/error.log
- /media/ephemeral0/logs/tokenserver/token.error.log
- /media/ephemeral0/logs/tokenserver/token.log.*
- /media/ephemeral0/logs/tokenserver/process_account_deletions.error.log
- /media/ephemeral0/logs/tokenserver/process_account_deletions.log
- /media/ephemeral0/squid/access.log
- /var/log/hekad/tokenserver.stdout.log
- /var/log/hekad/tokenserver.stderr.log
- Hekad
- /etc/puppet/modules/hekad
- QA Access via a Bastion Host
- SSH with AWS keys to the Stage bastion host in US East 1. From there SSH directly into any instance.
- Firefox Configs
- services.sync.clusterURL should get automatically set by the TokenServer
- services.sync.tokenServerURI = https://token.stage.mozaws.net/1.0/sync/1.5
Verifier Stage Environment
- URLs
- Version:
Verifier: rpm -qa | grep verifier Example: fxa-browserid-verifier-svcops <latest version>
- AWS
- Shared:
- shared-elasticsearch
- shared-elasticsearch
- shared-elasticsearch
- shared-elasticsearch
- shared-rabbitmq
- shared-rabbitmq
- shared-bastion
- shared-heka
- Shared:
- fxa-browserid_verifier (usually a single instance)
- Files
- /data/fxa-browserid-verifier
- /data/hekad
- /etc/puppet
- /etc/heka.d
- /media/ephemeral0/*
- /opt/aws
- /opt/ec2
- /opt/openresty
- /opt/rh
- /opt/stackdriver
- Processes
- Search for node, heka, nginx, circus
- Logs
- /media/ephemeral0/fxa-browserid-verifier/verifier_err.log
- /media/ephemeral0/fxa-browserid-verifier/verifier_out.log
- /media/ephemeral0/nginx/logs/fxa-browserid-verifier.access.log
- /media/ephemeral0/nginx/logs/fxa-browserid-verifier.access.log
- /media/ephemeral0/nginx/logs/default.access.log (not in use)
- /media/ephemeral0/nginx/logs/default.error.log (not in use)
- /media/ephemeral0/squid/access.log
- /var/log/circus.log
- /var/log/hekad/fxa-browserid_verifier.stderr.log
- /var/log/hekad/fxa-browserid_verifier.stdout.log
- Hekad
- /etc/puppet/modules/hekad
- /data/hekad
- QA Access via a Bastion Host
- SSH with AWS keys to the Stage bastion host in US East 1. From there SSH directly into any instance.
- Firefox Configs
- services.sync.clusterURL should get automatically set by the TokenServer
- services.sync.tokenServerURI = https://token.stage.mozaws.net/1.0/sync/1.5
- Quick verifications
Browser: https://verifier.stage.mozaws.net/ returns "Method Not Allowed" $ curl https://verifier.stage.mozaws.net returns "Method Not Allowed" $ curl -I https://verifier.stage.mozaws.net HTTP/1.1 405 Method Not Allowed Cache-Control: no-cache, max-age=0 Content-length: 18 Content-Type: text/plain Date: Wed, 23 Jul 2014 20:19:40 GMT Connection: keep-alive
Sync 1.5 Stage Environment
- This is a work in progress. Right now, there are only a small number of sync nodes (instances) in AWS.
- URLs
- Versions
Server-Syncstorage (sync node): rpm -qa | grep syncstorage Example: server-syncstorage <latest version>
- AWS
- Search for sync node instances in US East: "stage-sync-node-X"
- Each node is a specific large instance (mixed m3 and c3)
- Files
- /data/server-syncstorage/*
- /var/log/nginx
- Processes
- Search for sync, mysql, circusd, hekad, nginx, memcached
- Logs
- /media/ephemeral0/logs (most important)
- nginx/access.log
- (what about error.log?)
- sync/sync.err
- sync/sync.log
- /var/log/nginx
- /var/log/circus.log
- /media/ephemeral0/logs (most important)
- Hekad
- /etc/heka.d/sync_1_5.toml
- QA Access via a Bastion Host
- SSH with AWS keys to the Stage bastion host in US East 1. From there SSH directly into any instance.
- Firefox Configs
- services.sync.clusterURL should get automatically set by the TokenServer
- Note: There is no longer a Sync 1.1 Stage environment.
Loads Services Cluster Environment
- URLs
- Versions
Loads Cluster/Broker/Agents: $ cd /home/ubuntu/loads/bin $ ./loads-runner --version
- AWS in US West
- loads-master (broker and agent processes)
- loads-slave-1 (agent processes)
- loads-slave-2 (agent processes)
- NOTE: there is no stack or ELB for this cluster
- Files
- /home/ubuntu
- loads
- loads-aws
- loads-web
- /home/ubuntu
- Processes
- Search for processes owned by ubuntu, loads, nginx, circus
- Logs
- /var/log/redis
- /var/log/nginx
- QA access
- You need special access to be able to SSH into these devices
- You need to make some changes to your .ssh/config file
- Monitoring
- Web Interface: http://loads.services.mozilla.com/
- StackDriver: https://app.stackdriver.com/groups/6664/stage-loads-cluster
- Cluster status
- Check directly from the Loads Cluster dashboard:
Agents statuses Launch a health check on all agents
- Links
Monitoring the Stage Environment
- Kibana
- https://kibana.shared.us-east-1.stage.mozaws.net/
- https://kibana.shared.us-east-1.stage.mozaws.net/#/dashboard/file/weblogs.json
- https://kibana.shared.us-east-1.stage.mozaws.net/index.html#/dashboard/elasticsearch/Token%20App%20Logs%20POC
- https://kibana.shared.us-east-1.stage.mozaws.net/index.html#/dashboard/file/sync_http_status.json
- https://kibana.shared.us-east-1.stage.mozaws.net/index.html#/dashboard/elasticsearch/Sync%20Nginx%20Errors
- https://kibana.shared.us-east-1.stage.mozaws.net/index.html#/dashboard/elasticsearch/Sync%20App%20Logs
- https://kibana.shared.us-east-1.stage.mozaws.net/index.html#/dashboard/file/sync_mysql_slow_queries.json
- Heka
- https://heka.shared.us-east-1.stage.mozaws.net/
- https://heka.shared.us-east-1.stage.mozaws.net/#health
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes/Sync-1_5-SlowQueries/outputs/Sync-1_5-SlowQueries.Statistics.cbuf
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes/Sync-1_5-HTTPStatus/outputs/Sync-1_5-HTTPStatus.HTTPStatus.cbuf
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes/Sync-1_5-ResponseTime/outputs/Sync-1_5-ResponseTime.storagehistory.cbuf
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes/Sync-1_5-ResponseTime/outputs/Sync-1_5-ResponseTime.storagebookmarks.cbuf
- https://heka.shared.us-east-1.stage.mozaws.net/#sandboxes/Sync-1_5-ResponseTime/outputs/Sync-1_5-ResponseTime.storageforms.cbuf
- Stackdriver:
- Stage main: https://app.stackdriver.com
- https://app.stackdriver.com/groups/4388/stage-services-tag-sync15
- https://app.stackdriver.com/instances/<AWS instance>
- Also, make use of the Services, Groups, and Dashboards for more specific links
- Example: Groups > https://app.stackdriver.com/groups/4390/stage-services-tag-sync15/stage-services-sync15/tokenserver
- Example: Dashboards > https://app.stackdriver.com/monitoring/3827/tokenserver
Monitoring the Production Environment
- Heka shared:
- Main: https://heka.shared.us-west-2.prod.mozaws.net/
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/Sync-1_5-FrequentUsersAggregator
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/Sync-1_5-ResponseTime
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/Sync-1_5-SlowQueries
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/Sync-1_5-ActiveDailyUsers
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/Sync-1_5-HTTPStatus
- https://heka.shared.us-west-2.prod.mozaws.net/#plugins/filters/TokenServerHTTPStatus
- StackDriver:
TokenServer and Sync 1.5 Dev Environments
- Versions
TokenServer: rpm -qa | grep token Example: tokenserver-svcops <latest version> Verifier: rpm -qa | grep verifier Example: fxa-browserid-verifier-svcops <latest version> Server-Syncstorage (sync node): rpm -qa | grep syncstorage Example: server-syncstorage <latest version>
- AWS
- This is part of the dev-lcip-org CloudFormation stack
- TokenServer: Search for the fxa-tokenserver instance
- SyncServer2: Search for the fxa-syncstorage instance
- SyncServer1: Search for the fxa-syncstorage instance
- LogServer: Search for the fxa-logbox instance
- This is part of the dev-lcip-org CloudFormation stack
- Files
- TBD
- Processes
- TBD
- Logs
- TBD
- QA Access
- SSH with AWS keys to the various instances
- Firefox configs:
- services.sync.clusterURL = https://sync1.dev.lcip.org/ (should get automatically set by the TokenServer)
- services.sync.tokenServerURI = https://token.dev.lcip.org/1.0/sync/1.5
OPs Mana and GitHub Pages
- NOTE: Talk to OPs for the links to Mana.
- Puppet Config: https://github.com/mozilla-services/puppet-config
- Cloud Formation: https://github.com/mozilla-services/svcops