ReleaseEngineering/IT Resource Job Description: Difference between revisions

no edit summary
No edit summary
No edit summary
Line 3: Line 3:
* remote power control/remote console solutions
* remote power control/remote console solutions
** eliminates *most* of the need for NOC monkeys if we can handle remote reboots ourselves
** eliminates *most* of the need for NOC monkeys if we can handle remote reboots ourselves
** reboot hung/stuck slaves for anything that is not configured for remote reboot (currently all minis, nokias, tegras)
* reimage corrupted slaves (IX, VMs, minis, nokias, tegras)
* nagios improvements
* nagios improvements
* munin improvements
* munin improvements
Line 10: Line 12:
* configuration sanity script development and maintenance
* configuration sanity script development and maintenance
** based on a knowledge of how the releng systems work and what they need to connect to, develop a suite of tools that can verify that IT changes won't affect builds *before* builds actually fail
** based on a knowledge of how the releng systems work and what they need to connect to, develop a suite of tools that can verify that IT changes won't affect builds *before* builds actually fail
 
* Asset tracking: low confidence that https://build.inventory.mozilla.org/build/ matches reality.


Comments from the IT side: (Zandr)
Comments from the IT side: (Zandr)
Confirmed users
2,679

edits