Buildbot/OutageReports

< Buildbot
Revision as of 19:59, 30 August 2007 by ChrisCooper (talk | contribs)

We started collecting Outage Reports for Tinderbox last year as a means of determining what intermittent failures we were hitting on each platform. This allowed us to track failure patterns over time and helped us figure out where the highest value fixes were.

Many of the errors are difficult to fix or perhaps even unfixable (e.g. toolchain hangs on Windows), but having a history of outage reports with sufficient diagnostic information allows others (e.g. IT) to restart a hung system with outside intervention.