Breakpad/Status Meetings/2016-08-17
< Breakpad | Status Meetings
Meeting Info
Breakpad status meetings occur on Wed at 10:00am Pacific Time.
Conference numbers:
Vidyo: Stability 650-903-0800 x92 conf 98200# 800-707-2533 (pin 369) conf 98200#
IRC backchannel: #breakpad
Mountain View: Dancing Baby (3rd floor)
Operations Updates
- S3 woes. We figured it out in the end. IAM is hard.
- Problem was that rhelmer used *his* key to configure access to a bucket
- We need a more formal cleanup of IAMs and policies.
- Lesson learned: When going to prod TEST the bucket access FIRST!!!
- Stage submitter PR (https://github.com/mozilla/socorro-infra/pull/249) is almost ready to go
- plan is to build it so it's a NON-admin node, that upgrades on deployments
- Ready to do a prod deploy now.
- Python Upgrade?
- tried over the weekend
- right python on built machines, but things didn't start
- peterbe is volunteering to debug python errors in stage with jp
- Pingdom
- peterbe, mbrandt, jp receving
- let's not worry about adding more admins. for now.
- adding more admins costs more money
- ElasticSearch monitoring
- jp talked with "our local ES expert" about things to alert on
- will do alerting with Datadog (this is what we already do with stage submitter)
- Status of NewRelic
- Owned by IT, still
- will soon be possibly owned by Travis's team
- Possibly looking at "Synthetics Transactions" as an alternative
- Still not working,
- will resume debugging after python 2.7.11 upgrade
- Owned by IT, still
Project Updates
- intel.com and adobe.com emails can now BOTH upload private symbols
- We're still waiting for word from MacAfee that they're ok with the *output* of symbolication is made public.
- We need to expose that question/awareness to people at Intel.
Deployment Triage
PR Triage
Major Projects
Migrating off of Persona
- :njn can sign in. But there might be bugs related to Nightly and Google Sign-In.
Sending public data spark/presto
Signature generation across crash reporters
- on hold
- crash ping will not need signatures iff we can tie to crashid. meeting with legal today
Splitting out collector
On hold. Will is implementing some metrics gathering code in the collector to track crash report sizes. This is needed for the collector architecture doc because the size of crash reports that we need to handle is one of the requirements for our collector.
Collecting client-side JavaScript errors
Handling more PII data in crashes
Sending stacks for all crashes from the client
Replacing FTPscraper
other business
- Changing the meeting time while Adrian is abroad?
- one that might work is 2pm PST | 5pm EST | 9am AST (Adrian Special Time)
- though we do not _have_ to do that
- https://www.timeanddate.com/worldclock/converted.html?iso=20160817T09&p1=22&p2=333&p3=224&p4=179
Travel, etc
- Adrian out next week
- then working the following week
- then out the week after that
- then working from far away for ~3 months