Revision as of 19:57, 14 June 2011

Goals

The problem is that right now our end to end time from checkin to all tests done is around 6-8 hours. That's wasting a lot of developer time and productivity. The Goal of this project is to increase developer productivity by both minimizing time spent waiting for results and by making the "waiting for results" less of a time consuming task in itself.

Most of the bugs are concerned with performance optimization. Productivity bugs that do not affect performance optimization are noted with (loosely related) in their descriptions.

The priority of the tasks below will be determined by the data we have on the gains associated with completing them. We'll evaluate all proposals in terms of that data. If you have a proposal, add it to the proposals section below (hopefully with links to data)

Specifically The Goals:

The overall target we're shooting for is 2 hours from checkin to all tests done.
Get time from build-available to all tests done to <= 30 minutes
Get per-checkin build time to under 2 hours for all platforms
There are other small advances in related bugs that can help free-up developer time.
Measure and track our progress

Non-goals:

Do not boil the ocean - we need to be selective
Ensure amount of work is worth the performance payoff
Buying more minis - we cannot buy more minis at this time (our rev is discontinued)

Current activities

Set up metrics to quantify our current end-to-end time, and be able to track it
Do non-PGO opt builds for linux and windows bug 658313
Investigate if we can do 64-bit only 10.6 builds per-checkin. Need to determine if there are many 10.5 only test failures.
Determine which suites to be merged with others - bug 659328
- Improve set-up time
Gather data for devs to tackle test jobs that take long
Speeding up the build system itself:
- (universal-singlepass) less painful universal builds on mac (build universal as a single pass) - bug 417044
- makefiles: split of config/rules.mk into a hierarchy of makefiles - bug 662833

Bug	Description	Owner	Status
??	Set up metrics to quantify our current end to end time for tracking	catlee	In Design
bug 658313	Make on-change builds into non PGO, only do PGO on an as available basis	UNOWNED	Needs owner
??	Investigate if we can do 64 bit only 10.6 builds per-checkin (how many 10.5 failures are there?)	UNOWNED	Needs Owner
bug 659328	Determine which suites can be merged to amortize setup cost	armenzg	Has Patches
n/a	Profile Test suites to find individual tests are higher than average time to complete (so devs can fix them)	jmaher	First analysis of mochitest-plain complete
bug 417044	Perform Universal binary builds as single-pass	:joey	In Progress
bug 662833	Split config/rules.mk into hierarchy of makefiles	:joey	Waiting on Review
bug 659222	Branch prioritization broken for test jobs	catlee	In Progress
bug 661656	Improving network between colos	zandr	In Progress
bug 586418	Selective unpacking of unittests	:salbiz	Selective unpacking landed, cannot finish until bug 594415 completed
bug 594415	Change TBPL to report on individual test suites not buildbot jobs	Swatinem	Patches complete, needs to be unstuck, ctalbert driving
bug 661649	Deploy talos pageset to all slaves to avoid download/unpack	UNOWNED	Needs Owner
bug 561235	Use server to deal with getting stacks for crashes (no download of symbols onto slave)	ted	Needs next steps defined
bug 561754	Create server for processing minidumps	ted	Complete?
bug 659118	Reuse win7 64bit machines for supporting other OS's	armenzg	In Progress
bug 660124	Turn off txul and ts in favor of their -paint varieties	jmaher	Blocked on reaching consensus, ctalbert helping drive
??	Use sdwilsh's notification bot to help devs more easily watch the tree	sdwilsh?	Needs next steps, ctalbert to follow up
bug 558180	Check mozconfigs into source to avoid buildconfig checkout	catlee	In Progress
??	Profile mochitest plain	UNOWNED	Needs Owner
n/a	Get data on test run time for each of the moth individual tests	ctalbert	In Progress
bug 528231	(loosely related) Get debug symbols for unit test reference platform into breakpad	nthomas	Blocked on nthomas's availability
bug 571886	(loosely related) Create calendar tool to aid Firefox sheriffs	harth and/or webdev	Blocked on harth's availability
??	Run debug builds on an "as available" basis once we hit capacity limits	catlee	Proposed/In design
??	Use a packaged, release build for httpd.js when running tests (improve httpd startup cost)	ctalbert	Proposed/Investigation
??	Purchase new windows and linux machines for test slaves that are not tied to aggressive obsolescence schedule	zandr	Underway, but issues getting machines from 3rd party provider

Long Term Ideas

bug 657738 - Automatically determine when oranges happen and auto-star them (no owner/decision on steps yet)
bug 630534 - (related) Host tree status outside of tinderbox
- This would help us move away from tinderbox and would free up many of the constraints we currently operate under
Analyze using VMs for "dial-up" capacity
- Would need engineering resources to debug oranges that occur only on vms
- Would need to run some low numbers of vms on an ongoing basis so that we continue to ensure they are providing results we can trust
- We may need to solve bug 617763 so that these machines can be kept up to date (whether they live in the build VPN or not, i.e. if we use EC2 or something like that...). Either way we must address the problem that they could be out of date between one dial-in and another dial-in.
Analyze methods of understanding and optimizing what we run on a per-checkin basis
- Test code related to the current patch?
- Run full tests only every x pushes or y time (whichever occurs first). Between those only run a set of tests related to the patch at hand. (Would need developer override and on-demand testing ability)

Data we need

Profiling on all major harnesses
How many 10.5-only test failures have there been? Do we need to run 10.5 tests on-checkin?
What is the test timing for the suites contained in the moth harness? (ctalbert owns)
Where are we spending our time during build slave setup/teardown (catlee owns)
Gather data on specific long-running tests on mochitest & mochitest other
- Jmaher reported to newsgroup some data on mochitest-plain. Moth is still TBD

Data we have

Test Setup versus Run Time

People

Chris AtLee (:catlee)
Armen Zambrano (:armenzg)
Joey Armstrong (:joey)
Clint Talbert (:ctalbert)
Some as-yet undefined set of the A-team (probably Ted, Jmaher and Harth)

Definitions

end-to-end time: The time between a change being pushed to HG and all builds and test results being available
per-checkin build: Opt and debug builds that are triggered by a new checkin
on-change build: same as per-checkin
as-available run: Run the given test/build task only as fast as you can complete the previous run. Run the previous run, complete, then start the next task with the latest checkin.

Proposals for consideration

Please add proposals here...

@@ Line 1: / Line 1: @@
 = Goals =
+The problem is that right now our end to end time from checkin to all tests done is around 6-8 hours.  That's wasting a lot of developer time and productivity.  The Goal of this project is to increase developer productivity by both minimizing time spent waiting for results and by making the "waiting for results" less of a time consuming task in itself.
+Most of the bugs are concerned with performance optimization.  Productivity bugs that do not affect performance optimization are noted with (loosely related) in their descriptions.
+The priority of the tasks below will be determined by the data we have on the gains associated with completing them.  We'll evaluate all proposals in terms of that data.  If you have a proposal, add it to the proposals section below (hopefully with links to data)
+Specifically The Goals:
+* The overall target we're shooting for is 2 hours from checkin to all tests done.
 * Get time from build-available to all tests done to <= 30 minutes
 * Get per-checkin build time to under 2 hours for all platforms
+* There are other small advances in related bugs that can help free-up developer time.
+* Measure and track our progress
+Non-goals:
+* Do not boil the ocean - we need to be selective
+* Ensure amount of work is worth the performance payoff
+* Buying more minis - we cannot buy more minis at this time (our rev is discontinued)
 = Current activities =
@@ Line 16: / Line 31: @@
 {| style="width: 100%" class="fullwidth-table sortable"
 |-
-| style="background:#DDD;" width="25%" | '''Bug'''
+| style="background:#DDD;" width=15%" | '''Bug'''
 | style="background:#DDD" | '''Description'''
 | style="background:#DDD" | '''Owner'''
@@ Line 70: / Line 85: @@
 | :salbiz
 | {{StatusBlocked|status=Selective unpacking landed, cannot finish until {{bug|594415}} completed}}
+|-
+| {{bug|594415}}
+| Change TBPL to report on individual test suites not buildbot jobs
+| Swatinem
+| {{StatusAtRisk|status=Patches complete, needs to be unstuck, ctalbert driving}}
+|-
+| {{bug|661649}}
+| Deploy talos pageset to all slaves to avoid download/unpack
+| '''UNOWNED'''
+| {{StatusAtRisk|status=Needs Owner}}
+|-
+| {{bug|561235}}
+| Use server to deal with getting stacks for crashes (no download of symbols onto slave)
+| ted
+| {{StatusHealthy|status=Needs next steps defined}}
+|-
+| {{bug|561754}}
+| Create server for processing minidumps
+| ted
+| {{StatusHealthy|status=Complete?}}
+|-
+| {{bug|659118}}
+| Reuse win7 64bit machines for supporting other OS's
+| armenzg
+| {{StatusHealthy|status=In Progress}}
+|-
+| {{bug|660124}}
+| Turn off txul and ts in favor of their -paint varieties
+| jmaher
+| {{StatusBlocked|status=Blocked on reaching consensus, ctalbert helping drive}}
+|-
+| ??
+| Use sdwilsh's notification bot to help devs more easily watch the tree
+| sdwilsh?
+| {{StatusHealthy|status=Needs next steps, ctalbert to follow up}}
+|-
+| {{bug|558180}}
+| Check mozconfigs into source to avoid buildconfig checkout
+| catlee
+| {{StatusHealthy|status=In Progress}}
+|-
+| ??
+| Profile mochitest plain
+| '''UNOWNED'''
+| {{StatusAtRisk|status=Needs Owner}}
+|-
+| n/a
+| Get data on test run time for each of the moth individual tests
+| ctalbert
+| {{StatusHealthy|status=In Progress}}
+|-
+| {{bug|528231}}
+| (loosely related) Get debug symbols for unit test reference platform into breakpad
+| nthomas
+| {{StatusBlocked|status=Blocked on nthomas's availability}}
+|-
+| {{bug|571886}}
+| (loosely related) Create calendar tool to aid Firefox sheriffs
+| harth and/or webdev
+| {{StatusBlocked|status=Blocked on harth's availability}}
+|-
+| ??
+| Run debug builds on an "as available" basis once we hit capacity limits
+| catlee
+| {{StatusHealthy|status=Proposed/In design}}
+|-
+| ??
+| Use a packaged, release build for httpd.js when running tests (improve httpd startup cost)
+| ctalbert
+| {{StatusHealthy|status=Proposed/Investigation}}
+|-
+| ??
+| Purchase new windows and linux machines for test slaves that are not tied to aggressive obsolescence schedule
+| zandr
+| {{StatusBlocked|status=Underway, but issues getting machines from 3rd party provider}}
 |}
+== Long Term Ideas ==
+* {{bug|657738}} - Automatically determine when oranges happen and auto-star them (no owner/decision on steps yet)
+* {{bug|630534}} - (related) Host tree status outside of tinderbox
+** This would help us move away from tinderbox and would free up many of the constraints we currently operate under
+* Analyze using VMs for "dial-up" capacity
+** Would need engineering resources to debug oranges that occur only on vms
+** Would need to run some low numbers of vms on an ongoing basis so that we continue to ensure they are providing results we can trust
+** We may need to solve {{bug|617763}} so that these machines can be kept up to date (whether they live in the build VPN or not, i.e. if we use EC2 or something like that...).  Either way we must address the problem that they could be out of date between one dial-in and another dial-in.
+* Analyze methods of understanding and optimizing what we run on a per-checkin basis
+** Test code related to the current patch?
+** Run full tests only every x pushes or y time (whichever occurs first).  Between those only run a set of tests related to the patch at hand. (Would need developer override and on-demand testing ability)
+= Data we need =
+* Profiling on all major harnesses
+* How many 10.5-only test failures have there been?  Do we need to run 10.5 tests on-checkin?
+* What is the test timing for the suites contained in the moth harness? (ctalbert owns)
+* Where are we spending our time during build slave setup/teardown (catlee owns)
+* Gather data on specific long-running tests on mochitest & mochitest other
+** Jmaher reported to newsgroup some data on mochitest-plain.  Moth is still TBD
+= Data we have =
+* [https://spreadsheets.google.com/spreadsheet/ccc?key=tNfPnuOZvLcbnItGP6gCYSQ&authkey=COzKr8wL#gid=0 Test Setup versus Run Time]
 = People =
@@ Line 76: / Line 189: @@
 * Armen Zambrano (:armenzg)
 * Joey Armstrong (:joey)
+* Clint Talbert (:ctalbert)
+* Some as-yet undefined set of the A-team (probably Ted, Jmaher and Harth)
 = Definitions =
 ; end-to-end time: The time between a change being pushed to HG and all builds and test results being available
 ; per-checkin build: Opt and debug builds that are triggered by a new checkin
+; on-change build: same as per-checkin
+; as-available run: Run the given test/build task only as fast as you can complete the previous run. Run the previous run, complete, then start the next task with the latest checkin.
+= Proposals for consideration =
+Please add proposals here...

ReleaseEngineering/BuildFaster: Difference between revisions

Revision as of 19:57, 14 June 2011

Contents

Goals

Current activities

Long Term Ideas

Data we need

Data we have

People

Definitions

Proposals for consideration

Navigation menu

ReleaseEngineering/BuildFaster: Difference between revisions

Revision as of 19:57, 14 June 2011

Goals

Current activities

Long Term Ideas

Data we need

Data we have

People

Definitions

Proposals for consideration

Navigation menu

Search