ReleaseEngineering/BuildFaster
Revision as of 20:21, 11 July 2011
Goals
The problem is that right now our end-to-end time from checkin to all tests done is around 6-8 hours. That's wasting a lot of developer time and productivity. The goal of this project is to increase developer productivity both by minimizing time spent waiting for results and by making the "waiting for results" less of a time-consuming task in itself.
Most of the bugs are concerned with performance optimization. Productivity bugs that do not affect performance optimization are noted with (loosely related) in their descriptions.
The priority of the tasks below will be determined by the data we have on the gains from completing them. We'll evaluate all proposals in terms of that data. If you have a proposal, add it to the proposals section below (ideally with links to data).
The Goals:
- The overall target we're shooting for is 2 hours from checkin to all tests done.
- Get time from build-available to all tests done to <= 30 minutes
- Get per-checkin build time to under 1.5 hours for all platforms
- There are other small advances in related bugs that can help free up developer time.
- Measure and track our progress
Non-goals:
- Do not boil the ocean - we need to be selective
- Ensure amount of work is worth the performance payoff
- Buying more minis - we cannot buy more minis at this time (the hardware revision we use has been discontinued)
Current activities
Ensure any bugs filed for this effort have "buildfaster:pN" in the whiteboard, where N is a number from 1 to 5: p1 is the highest priority, p5 the lowest.
Our current list of active bugs can be found [https://bugzilla.mozilla.org/buglist.cgi?list_id=743687&resolution=---&status_whiteboard_type=allwordssubstr&query_format=advanced&status_whiteboard=buildfaster:p here].
If you'd like to file a bug for something you'd like to see included in this effort, use "buildfaster:?" in the whiteboard. These bugs are listed [https://bugzilla.mozilla.org/buglist.cgi?list_id=743751&resolution=---&resolution=DUPLICATE&status_whiteboard_type=allwordssubstr&query_format=advanced&status_whiteboard=buildfaster%3A%3F here].
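The whiteboard convention above is easy to process mechanically. Here is a minimal sketch (not part of any Mozilla tooling; the function name is hypothetical) of extracting a bug's buildfaster priority from its whiteboard string:

```python
import re

# "buildfaster:pN" marks a prioritized bug (1 = highest, 5 = lowest);
# "buildfaster:?" marks a proposal awaiting triage.
TAG = re.compile(r"buildfaster:(?:p([1-5])|\?)")

def buildfaster_priority(whiteboard):
    """Return 1-5 for a prioritized bug, '?' for a proposal, None otherwise."""
    m = TAG.search(whiteboard or "")
    if not m:
        return None
    return int(m.group(1)) if m.group(1) else "?"
```

A script like this could sort a Bugzilla query's results by priority without relying on the buglist URL parameters.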
The table below lists current tasks that are not tracked in Bugzilla.
| Bug | Description | Owner | Status |
| ?? | Use sdwilsh's notification bot to help devs more easily watch the tree | sdwilsh? | Needs next steps, ctalbert to follow up |
| ?? | Profile mochitest plain | jgriffin | In Progress |
| n/a | Get data on test run time for each of the individual mochitest-other (moth) tests | ctalbert | COMPLETE |
| ?? | Purchase new machines for win/linux builders that are not tied to aggressive obsolescence schedule | zandr | Underway, getting gear for lab testing |
Long Term Ideas
- bug 657738 - Automatically determine when oranges happen and auto-star them (no owner/decision on steps yet)
- bug 630534 - (related) Host tree status outside of tinderbox
- This would help us move away from tinderbox and would free up many of the constraints we currently operate under
- Analyze using VMs for "dial-up" capacity
- Would need engineering resources to debug oranges that occur only on vms
- Would need to run some low numbers of vms on an ongoing basis so that we continue to ensure they are providing results we can trust
- We may need to solve bug 617763 so that these machines can be kept up to date, whether they live in the build VPN or not (e.g. if we use EC2). Either way we must address the risk that they are out of date between one dial-in and the next.
- Analyze methods of understanding and optimizing what we run on a per-checkin basis
- Test code related to the current patch?
- Run full tests only every x pushes or y time (whichever occurs first). Between those only run a set of tests related to the patch at hand. (Would need developer override and on-demand testing ability)
- Less frequent debug builds
- Use an optimized xpcshell + httpd.js to run debug tests (depends on bug 669953)
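The "full tests only every x pushes or y time" idea above reduces to a simple scheduling decision per push. A sketch, with hypothetical names and made-up default thresholds, including the developer-override hook the proposal calls for:

```python
from datetime import datetime, timedelta

def should_run_full_suite(pushes_since_full, last_full_time, now,
                          max_pushes=10, max_age=timedelta(hours=2),
                          forced=False):
    """Return True if this push should trigger the full test suite.

    Between full runs, only tests related to the patch at hand would run.
    Thresholds are illustrative, not tuned values.
    """
    if forced:  # developer override / on-demand testing
        return True
    if pushes_since_full >= max_pushes:  # every x pushes...
        return True
    return now - last_full_time >= max_age  # ...or y time, whichever first
```

The real policy would also need the "tests related to the patch" selection, which is a separate, harder problem noted in the bullet above.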
Data we need
- Profiling on all major harnesses
- How many 10.5-only test failures have there been? Do we need to run 10.5 tests on-checkin?
- Where are we spending our time during build slave setup/teardown (catlee owns)
- Gather data on specific long-running tests on mochitest & mochitest other
- jmaher reported some data on mochitest-plain to the newsgroup; moth is still TBD
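Once per-test timing data like the above is gathered, picking optimization candidates is straightforward. A minimal sketch (function name and data are made up for demonstration) that reports the longest-running tests:

```python
def slowest_tests(timings, n=3):
    """Return the n longest-running (name, seconds) pairs, slowest first."""
    return sorted(timings, key=lambda t: t[1], reverse=True)[:n]

# Illustrative records only, not real measurements.
timings = [("test_browser_chrome.js", 412.0),
           ("test_ipcplugin.js", 187.5),
           ("test_forms.html", 3.2),
           ("test_cookies.html", 9.8)]
print(slowest_tests(timings, 2))
```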
Data we have
- Test Setup versus Run Time (thanks to armenzg)
- Data from Orange factor detailing 31,000 Mochitest other test runs. (Thanks to jgriffin)
- Here is the easy-to-parse version, showing that something is seriously wrong with mochitest-browser-chrome debug (opt is no lightweight either). Also, the ipcplugin test varies wildly on Mac and could possibly be improved for some minor wins.
People
- Chris AtLee (:catlee)
- Armen Zambrano (:armenzg)
- Joey Armstrong (:joey)
- Clint Talbert (:ctalbert)
- Some as-yet undefined set of the A-team (probably Ted, Jmaher and Harth, but I'll put out the call today)
Definitions
- end-to-end time
- The time between a change being pushed to HG and all builds and test results being available
- per-checkin build
- Opt and debug builds that are triggered by a new checkin
- on-change build
- same as per-checkin
- as-available run
- Run the given test/build task only as fast as capacity allows: complete the previous run, then start the next run against the latest checkin.
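The as-available definition above amounts to coalescing pending checkins: a worker never queues every push, it just picks up whatever is newest when it finishes. A sketch under that reading (class and method names are hypothetical):

```python
import threading

class AsAvailableRunner:
    """Track only the latest pending changeset for an as-available worker."""

    def __init__(self):
        self._lock = threading.Lock()
        self._latest = None

    def push(self, changeset):
        # Each new checkin simply replaces the pending one.
        with self._lock:
            self._latest = changeset

    def next_run(self):
        # Called when the previous run completes; returns the newest
        # pending changeset (or None), coalescing any skipped ones.
        with self._lock:
            changeset, self._latest = self._latest, None
            return changeset
```

Intermediate pushes are intentionally skipped, which is the tradeoff that distinguishes as-available runs from per-checkin builds.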
Proposals for consideration
Please add proposals here...