Litmus:Requirements

From MozillaWiki
Jump to navigation Jump to search

Introduction

Purpose

The purpose of this document is to capture in one place all the various requirements for the Litmus quality assurance (henceforth, QA) tool. In the past, many of the Netscape/Mozilla webtools have grown organically without much supporting documentation. While this document does not necessarily preclude this from happening with Litmus, it will at least give us an initial point of reference from which we can start design/development.

Document conventions

MediaWiki markup conventions apply.

Intended audience

This document is intended for QA staff, developers, build/release personnel, and sysadmins from the Mozilla Foundation, as well as community members interested in helping to improve the QA process for Mozilla products.

Additional information

Contact Info

Chris Cooper

References

Existing Testrunner documentaion:

Existing Mozilla web tools:

Le Vie, Jr., Donn. "Writing Software Requirements Specifications" TECHWR-L 7 July 2002. <http://www.techwr-l.com/techwhirl/magazine/writing/softwarerequirementspecs.html>.

Overall Description

Perspective

Mozilla testing resources are spread pretty thin. Even with some community support, the turnaround time for smoke testing and basic functional testing (BFT) for release candidates can take several days (the smoketests and BFTs are not currently automated). If regressions or new bugs are found during the testing process, the cycle can be even longer.

An existing tool, Testrunner, helps with the administration of this process, but the tool is somewhat limited. Testrunner has the concept of a "test run" as a single instance of testing, but these test runs must be manually cloned for each new testing cycle on a per-platform basis, and tests cannot be re-ordered within test runs. Testrunner also does not let multiple users combine their efforts to work on a single test run; each user must have a separate test run, or have their results collated by a single "superuser."

Testrunner does not store executable test cases or scripts, but instead stores test cases as a set of manual instructions for executing a test. Test runs/lists are then made up of lists of test cases. If the tests invovled do require external scripts, these test lists must me be kept in sync with external test repositories manually. This has made it impossible for any kind of automation to be built into Testrunner.

There is also no way to do any meaningful querying or reporting on historical test results using Testrunner. On top of all this, Testrunner is tied intimately to specific versions of Bugzilla; small changes to Bugzilla can cause Testrunner to stop working.

Bob Clary has a XUL-based test harness, called Spider, which he has used to automate the testing of many Document Object Model (DOM) and Javascript (JS) engine tests, but there has never been a central repository for test results, so his results have been posted to his personal testing website.

Developers often would like to have testing done to verify a given change or patch. Historically, this has not often been possible due to the constant demands on the QA team.

Addressing these shortcomings in the current tools (or the lack of tools, in general) will do much to streamline the QA process for Mozilla. This should have the desirable side effect of freeing up QA staff to work on more interesting things, e.g. harder edge-case testing, incoming bug verification and triage, community interaction, etc.

Whatever system we design, it must also take better advantage of the large, varyingly-skilled tester base that is the Mozilla community. The new tool should be able to package many small, discreet lists of automated test cases, and farm those mini-lists out to as many people as possible. It must also be able to cope with an influx of results when those tests report back.

Functions

The new QA tool, Litmus, is meant to address these problems by:

  • serving as a repository for test cases, with all the inherent management abilities that implies;
  • serving as a repository for test results, carrying over the best features of Testrunner, e.g. test lists, division of labor, etc.;
  • providing a query interface for viewing, reporting on, and comparing test results;
  • providing a request interface whereby developers can queue testing requests for patches, fixes, and regressions;
  • managing the automation of testing requests — one-time, and recurring (e.g. from tinderbox) — on a new group of dedicated testing servers, managing request priorities appropriately;
  • exposing an API to allow developers to work with the various tools easily outside of a graphical environment;
  • making it easier for casual testers (the large, varyingly-skilled tester base mentioned before) to assist with testing Mozilla products.

User classes and characteristics

Litmus will attract the following types of users:

  • Sysadmins
    These power users will be responsible for the maintenance of the underlying machines, and will likely be doing so from the command line. They will be primarily interested in how easy Litmus is to setup and install, CPU/disk space/network usage/database usage by the Litmus daemon and web tool, and any security implications that Litmus exposes.
  • Litmus Maintainers
    This is a class of sysadmins who are solely responsible for the upkeep of the Litmus tool itself. They will likely have intimate knowledge of its inner working and will be responsible for fixing bugs in Litmus itself.
  • Build/Release Engineers
    Given their role, these users will be primarily interested in the status of automated testing for builds/release candidates, with the ability to compare test results between two different release candidates. They will also want the ability to pre-empt tests in progress if release testing is needed immediately. These users will have a history of using various existing web tools, e.g. tinderbox, bonsai, LXR, so they can be expected to adapt to a new web tool quickly.
  • QA Staff
    Existing QA staff will already be familiar with Testrunner, which should ease the transition to a new web tool. This user class will have experience running tests both by hand and using the automated Spider tool. Because of this, most of these users will have developed an intuitive feel for what constitutes a valid testing result. These users will expect to be able to do the same things that they can do currently with Testrunner.
  • Core Mozilla Developers
    Core developers will already be familiar with web tools such as Bugzilla and tinderbox. Due to their familiarity with Bugzilla, they will expect to see the same Product and Component categories in Litmus. This group might correspond to the set of developers with superreview and/or review status in Bugzilla. These users might expect to receive higher priority for testing requests that they submit.
  • Mozilla Developers (including add-ons and extensions), Localizers
    These developers will already be familiar with web tools such as Bugzilla and tinderbox. Due to their familiarity with Bugzilla, they will expect to see the same Product and Component categories in Litmus.
  • Testers
    This user class will be familiar with using a web browser, but may not necessarily be familiar with the suite of Mozilla web tools used by developers. With proper instruction, they can be expected to submit testing results automatically if the process is not too complicated. These users might be interested in seeing test results that they themselves have contributed, and comparisons of the test runs that those results belong to.
  • Community-at-large
    Anyone with a web browser could find Litmus on the web. Some of these people will want to see quality reports (partners, journalists, competitors), others may just want to poke around. Like Bugzilla, basic querying will be open to all, but users will need to register with the system in order to do much else.

Operating environment

The main Litmus daemon and web tool will reside on an as-of-yet unpurchased machine. This machine will likely be running Linux (RHEL3?) to facilitate remote administration. The daemon and web tool will need to be designed to use the existing Linux Virtual Server (LVS) cluster.

User environment

The primary human interface for the Litmus tools will be web-based: QA staff, developers, and testers will access the web interface to report manual test results, check automated test results, schedule testing requests, and report on past test runs.

There will also be a command-line interface to the daemon/tools. This interface will be used by the automation processes for submitting results remotely, but can also be used by testers to do the same.

Design/implementation constraints

The following constraints exist:

  • despite its limitations, Testrunner is being actively used by the Mozilla QA team on a day-to-day basis. Litmus must replicate the useful functionality of Testrunner, and make it easier to accomplish the same tasks the team is doing today. If it does not, then Litmus will have failed.
  • Mozilla web services current reside behind an LVS cluster. Litmus must be designed to work with and take advantage of this setup.
  • Litmus must be Bugzilla-aware, i.e. component/product lists must match, bug numbers should be marked up appropriately, etc.
  • documentation for Litmus must be written and maintained, in order to avoid the documentation void that exists for other Mozilla web tools.

Assumptions and dependencies

The following assumptions and dependencies are known to exist:

  • the Spider tool can be successfully changed to run smoketests and BFTs in an automated manner;
  • machines for the new test farm will be bought and installed in the colo, as has already been decided;
  • Mozilla sysadmins have enough time to setup and manage these new machines. Note: some of the management responsibility for these machines will be shared by the Litmus maintainers;

External Interface Requirements

User interfaces

The primary human interface for the Litmus tools will be web-based: QA staff, developers, and testers will access the web interface to report manual test results, check automated test results, schedule testing requests, and report on past test runs.

We want the Litmus web front-end to be easy to use and the user experience to be positive. This is a tool that we expect the Mozilla QA staff to be using every day. The QA staff has some experience with the limitations of Testrunner, and we will be mining than experience to avoid making the same mistakes again.

In general, we want to design the web tool so that:

  • the default display or report provides the most useful set of basic information for the user;
  • common tasks are easily access from the default display;
  • the path to more complicated tasks is easy discovered
  • some degree of customization is possible, so that users are able to streamline their own experience.

There will also be a command-line interface to the daemon/tools. This interface will be used by the automation processes for submitting results remotely, but can also be used by testers to do the same.

We will want the remote APIs for the command-line interface to be fully documented (with examples) so it can be easily used by developers and QA staff.

Hardware interfaces

At the recent Mozilla QA summit meeting (2005/06/21), it was decided to invest in a small cluster of machines that would serve as a test farm, similar in concept to the collection of machines that currently perform builds for Tinderbox.

The test farm will be made up of the following machines:

  • Head Node (likely running Linux);
    • Linux Boxes (#?);
    • Mac XServes (#?);
    • Windows Boxes (#?).

Software interfaces

Adding more machines won't do anything to aid the testing burden in and of itself. Indeed, in the short-term, it will simply add more system administration overhead.

This is where we hope to see the biggest payoff in terms of automation. A main Litmus daemon will live on the head node. This daemon will be responsible for coordinating automated testing on the test farm machines, and collating results as they come in.

Communication protocols and interfaces

Since Litmus is designed primarily as a web tool, the main protocol of record will be HTTP.

The command-line interface will need to accept remote procedure calls in order to manage automation. Both XML-RPC and SOAP have been proposed for this.

System Features

Replicate Testrunner functionality

Description

Testrunner is a test run management system that works as an add-on over Bugzilla. More information can be found at Testrunner web site.

Note: Testrunner's concept of test case management is somewhat limited, which is why I have referred to it instead as 'test run management' above. Litmus will have a somewhat some holistic concept of test case management. See below.

Priority

Testrunner is currently being used by Mozilla QA staff to track smoketest and BFT results. The QA team can continue to use Testrunner in this capacity until the replacement is ready. It should be possible to implement some of the test case management and automation pieces before it is necessary to build the Testrunner functionality.

Functional requirements

Testrunner currently performs the following functions:

  • displays lists of existing test runs;
  • for each test run, displays a list of the component test cases, sortable by group or status;
  • individual test cases in a test run can be marked as PASSED, FAILED, or NOT RUN. Test cases can also be marked with a bug number;
  • test cases can be added or removed from test runs;
  • testers and watchers can be associated with test runs;
  • test cases can be assigned to components;
  • test cases can be assigned to functional groups;
  • functional groups can be added, modified and deleted;
  • test cases can added, modified, deleted, and cloned;
  • test runs can added, modified, deleted, and cloned;
  • each test run can include a test plan document;
  • a rudimentary testing request interface is present.

As noted previously, Testrunner is not perfect. In order to address these shortcomings, the following functionality is also required:

  • maintain synchronized versions of products, components, and users with Bugzilla;
  • maintain a single copy of each test case, and create test runs as lists of test cases rather than strictly cloning cases for new runs. Note: test cases can still be "cloned" to create genuinely new test cases;
  • allow for review/certification flags for individual test results and runs, e.g. to aid in localization testing;
  • robust permissions system, with distinctions between who can:
    • view test cases/runs;
    • run test cases/runs;
    • create test cases/runs, groups, components;
    • edit test cases/runs, groups, components;
    • all of the above, but for security-related test cases;
  • allow for re-ordering of test cases within a test run;
  • ability to change status of completed test results and runs as new information becomes available;
  • allow comments to be track for test cases and runs;
  • track changes of test cases and runs;
  • integrate with Talkback: tracking build ids, test results, crash bugs, etc.
  • user documentation, including a tutorial.

Test Case Management

Description

Testrunner currently contains some metadata about test cases, and tracks results based on that metadata. However, Testrunner does not contain a copy (or even a link to) the test case itself. Updating test cases is a two-step procedure with no guarantee that both steps will be executed.

As much as possible, Litmus will act as a repository for test cases. This will allow for metadata to be associated directly with test cases. For external tests that cannot be brought into the repository, there will be sufficient information given to acquire the test case(s) from the remote source, e.g. download URL.

Priority

Since there does not currently exist a central repository for test cases, this feature has the highest priority. If we can get a test case management interface up quickly, we can ensure that all testers are running the exact same set of tests, and implement automation from there.

Functional requirements

Test case management requires the following:

  • test case storage:
    • for as many tests as possible, this will hopefully mean storing fully automated test cases in whatever syntax is appropriate for use with the Spider tool;
    • links to externals test cases when they cannot be stored by the system, including access information;
    • full instructions for running tests that cannot be automated. Note: this is similar to the existing test case functionality in Testrunner;
  • version control for test cases, with trackable history and commentary;
  • linking of test results to individual test cases with version information;
  • ability to check out/download groups of test cases, based on test runs, or functionality group, or platform;
  • full access control restrictions for the test case repository. Security-related test cases should only be visible/downloadable to those with sufficient privileges;
  • web-based administration (there will be some overlap with the Testrunner functionality outlined above):
    • add/modify/delete test cases;
    • add/modify/delete test runs;
    • add/modify/remove privileges for users;
    • ability to view recent test case activity (additions/updates/deletions);

Automated Testing

Description

Some automated testing is already occuring using Bob Clary's Spider tool. Our goal with automated testing is two-fold:

  1. automate the automation: get the automated testing running continuously in an environment where it can be monitored, queried, and updated;
  2. automate as much regular testing as possible: this includes both smoketests and BFTs. Tests that cannot be run automatically should require as little interaction as possible, and this interaction must be standardized.

Note: this document does not cover the necessary changes to Spider or the test cases themselves to allow for automation.

Priority

Once we have a central repository for test cases, we can begin designing automation tools to draw on that repository.

I understand that efforts to convert the existing smoketests and BFTs into a Spider-ready format are already under way.

Functional requirements

There are two facets here. The first are the test automation processes/daemons that will run on the individual testing machines in the test farm. The second is the test result collating process/daemon that will live on the main Litmus server.

The automation processes must be:

  • able to run all our tier 1 platforms: Windows, Mac, Linux.
  • written to be as platform-agnostic as possible to minimize maintenance;
  • able to respond to remote queries for:
    • current status;
    • start/stop/restart/pause;
    • self-update;
    • automatic installation of new product builds;
    • process a specific test request;
  • maintain current state locally to allow for stop/restart/pause without affecting and testing-in-progress. This also means maintaining a list of testing requests that have already been run on the local testing machine to avoid duplication;
  • able to fail gracefully, e.g. during network interruptions. (Perhaps we want some default local test run to proceed in the case?)
  • able to send back testing results to the main processing/database server;
  • able to query the main server to get the latest testing requests off the request queue;

The main test result collating process/daemon must be able to:

  • process incoming results (perhaps in parallel?);
  • weed out common errors at a pre-processing stage:
    • incomplete results;
    • invalid formatting of results (easy with a DTD);
  • automatically append information to test results that match certain criteria, e.g. known bugs;
  • send notifications of breakages (test, system, and network failures) as appropriate, and make this configurable.

Reporting (Result Querying)

Description

Testing automation will generate an ongoing stream of test results. These results will be useless unless the proper tools are in place to query and compare them. This will address a current void in Testrunner, wherein there is no way to perform a head-to-head comparison between the results from two separate test runs. This makes it harder to spot regressions.

We also have the opportunity as we move forward to begin collecting (and reporting on) performance and defect data. This will allow us to create meaningful trend data.

Priority

Only some of the required reports are known at the time of writing. The various reports share a core set of functionality which can be put in place initially, and new or more complicated reports can be added over time.

The test run comparison reports will likely be the first to be implemented.

Functional requirements

The reporting interface will require the following features:

  • proper limiting for the number of results returned on a single page. This should also be configurable with some appropriate upper bound. The user should be able to navigate through result sets that span more that a single page;
  • ability to limit results based on certain criteria;
  • ability to sort/reverse results based on certain criteria;

The following specific reports are needed:

  • single test case: results from a single test case are marked-up for viewing;
  • test run: test case results from a single test run are marked-up and presented in synopsis form;
  • test case comparison: head-to-head comparison between two test case results, with differences highlighted;
  • test run comparison: synopsis views for two test runs are compared head-to-head, with differences highlighted;

Testing Requests

Description

Build/release engineers need to be able to run (and re-run) specific lists of tests against certain builds/release candidates. Testrunner currently allows users to make testing requests for certain products/components, but this is a simple list of requests, i.e. the tests are not automated in any way.

Priority

Testing requests are not in the short-term critical path for Litmus. Once basic testing automation is running, the request interface can be developed and integrated with the rest of the tool.

Functional requirements

Testing requests need to have the following information associated with them:

  • product and version required, possibly specified via links for downloading;
  • lists of test cases sorted in the order in which they should be run;
  • submitter info (email, etc.);
  • submission time;
  • priority;
  • time after which the results are meaningless, i.e. if request has not been run by time X, don't bother running it: mark it as "sunset" or some such, and move on;

The testing request system needs the following general functionality:

  • restricted access to a small subset of maintainers, QA staff, build/release engineers, and developers;
  • priority system for submitted requests, i.e. requests from maintainers trump requests from QA staff trump build...;
  • ability for maintainers to re-prioritize requests that are already in the queue, to force requests to run immediately, or to cancel requests;
  • allow users to modify or delete requests that they have already submitted, provided they have not yet been run;

Other Nonfunctional Requirements

Performance requirements

There are several performance-related aspects to be considered.

The first aspect is the performance of the Litmus web front-end. This concern is partially addressed by the existing LVS cluster. If Litmus is designed with LVS in mind, one initial performance bottleneck will be pushed back. We also don't expect a very high degree of concurrent access for the system.

As we accumulate test results, we may reach a point in the future when the size of the results database becomes a limiting factor, and query speed becomes bogged down. To mitigate this, we should come up with a suitable data retention policy and consider backing up historical test data offline when it is not longer useful.

Another aspect to consider is the performance of the automation daemon with regards to turnaround time for test runs. Depending on the speed of the test machines, there will be a little bit of trial-and-error involved here in order to get test runs designed that can complete in a given amount under normal circumstances. We can tweak these test lists/runs based on the testing loads we end up seeing. It may also be necessary to tweak these lists on a per-platform basis.

Test cases should be run and monitored on the test machines with a suitable time limit. This time limit should be based on historical performance, and will serve to "time out" tests under abnormal circumstances. Of course, we won't actually have historical performance data to begin with, so again there will be an initial period of trial-and-error.

Safety requirements

Due to the sensitive nature of some of the security-related test cases, there may be liability issues surrounding access control. See Security requirements below.

Security requirements

Proper access control is essential, especially due to the presence of security-related test cases in the test case repository. The Bugzilla authentication model should be extensible for use with Litmus. Security-related testcases and results can be invisible (or stubbed) for users with inadequate permissions.

Software quality attributes

Just like the software it is testing, Litmus is itself a software tool, subject to the same flaws and limitations.

Bugs can be filed against Litmus in Bugzilla using the product Webtools and the component Litmus.

It would also be nice to track some basic Litmus usage statistics, e.g. type and frequency of queries, but this is not a high priority.

Project documentation

All Litmus project documentation will reside under the Litmus hierarchy on the Mozilla wiki: http://wiki.mozilla.org/Litmus

Note: this may be migrated to DevMo in the future.

User documentation

All Litmus user documentation will also reside under the Litmus hierarchy on the Mozilla wiki: http://wiki.mozilla.org/Litmus

Note: this may be migrated to DevMo in the future.


--coop 08:00, 14 Jul 2005 (PDT)