ReleaseEngineering/IT Resource Job Description

  • remote power control/remote console solutions
    • eliminates *most* of the need for NOC monkeys if we can handle remote reboots ourselves
  • nagios improvements
  • munin improvements
    • switch to ganglia?
  • puppet support/maintenance
  • need to run our configs (e.g. Build VPN) to know how/when things are broken
  • configuration sanity script development and maintenance
    • based on a knowledge of how the releng systems work and what they need to connect to, develop a suite of tools that can verify that IT changes won't affect builds *before* builds actually fail