Closed Bug 1779468 Opened 2 years ago Closed 2 years ago

Move bytecode cache testing into its own test

Categories

(Testing :: Raptor, defect, P1)

defect

Tracking

(firefox104 fixed)

RESOLVED FIXED
104 Branch
Tracking Status
firefox104 --- fixed

People

(Reporter: sparky, Assigned: sparky)

References

Details

(Keywords: perf-alert)

Attachments

(1 file)

We recently enabled a pref that would prepopulate the bytecode cache before the warm pageload. This has caused an increase in variance for a few of our tests, and has increased task run-time by at most 8.3 minutes in all of our pageload tests. This bug is for moving the test outside of the main pageload tests and running them in their own browsertime-tp6-bytecode tests. We'll only run essential tests there at tier 2.

This patch adds a new set of tests (essential only) to run in CI at tier 2. A command line argument is added to turn bytecode cache testing on in warm pageloads. This also disables the testing in our primary pageload tests to fix the variance and increases in runtime.

Pushed by gmierz2@outlook.com: https://hg.mozilla.org/integration/autoland/rev/373afd9177bf Add `browsertime-tp6-bytecode` tests for bytecode cache. r=perftest-reviewers,AlexandruIonescu,aglavic
Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 104 Branch

== Change summary for alert #34833 (as of Fri, 15 Jul 2022 05:29:15 GMT) ==

Regressions:

Ratio Test Platform Options Absolute values (old vs new)
20% nytimes fcp linux1804-64-shippable-qr fission warm webrender 77.33 -> 93.04
4% nytimes loadtime linux1804-64-shippable-qr fission warm webrender 1,069.94 -> 1,113.17

Improvements:

Ratio Test Platform Options Absolute values (old vs new)
67% nytimes FirstVisualChange linux1804-64-shippable-qr fission warm webrender 140.00 -> 46.67
50% nytimes ContentfulSpeedIndex linux1804-64-shippable-qr fission warm webrender 353.12 -> 175.50
12% nytimes PerceptualSpeedIndex linux1804-64-shippable-qr fission warm webrender 414.08 -> 363.25

For up to date results, see: https://treeherder.mozilla.org/perfherder/alerts?id=34833

Regressions: 1780238

This change caused a ~30% regression, not just increased variance, on some Android warm page load tests on AWFY:

https://treeherder.mozilla.org/perfherder/graphs?series=autoland,4366742,1,13&timerange=2592000

Is that expected? Are the "regressed" results the new baselines? (I understand this is a test change, not a real regression.)

Flags: needinfo?(gmierz2)

Yes, this is expected so this is a new (or previous) baseline. It brought the metrics back to what they were before this change was made: https://phabricator.services.mozilla.com/D145803

For the A51, we don't have enough history to see what it was before the change.

Flags: needinfo?(gmierz2)
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: