Closed Bug 1727571 Opened 3 years ago Closed 3 years ago

Introduce a test to track memory while reloading page with devtools opened

Tracking

(firefox93 fixed)

Status:

RESOLVED FIXED

Milestone:

93 Branch

Tracking Flags:

Tracking

Status

firefox93

---

fixed

People

(Reporter: ochameau, Assigned: ochameau)

References

(Blocks 1 open bug)

Details

(Whiteboard: dt-perf-stability-mvp)

Attachments

(3 files)

Bug 1727571 - [devtools] Record allocations for toolbox opening+closing and page reloads. 3 years ago Alexandre Poirot [:ochameau] (deleted), text/x-phabricator-request		Details
Bug 1727571 - [devtools] Disable devtools testing flag in allocation tests. 3 years ago Alexandre Poirot [:ochameau] (deleted), text/x-phabricator-request		Details
Bug 1727571 - [devtools] Prevent the allocation tracker to record its own modules. 3 years ago Alexandre Poirot [:ochameau] (deleted), text/x-phabricator-request		Details

Alexandre Poirot [:ochameau]

Assignee

Description

•

3 years ago

We are having many report of leaks while reloading pages with devtools being opened.

We know that bug 1702715 had a significant positive impact of these type of leaks, while not trigerring any positive report from any of our DAMP tests.

bug 1682212 comment 44 reports that we use 10x less memory after 20 reload of CNN webpage.

We should have a test that would have notified us about this improvement so that:

we do not regress this back
we could possibly even improve our memory usage

After some investigation in DAMP, it makes sense that it doesn't highlight improvements on long sessions with DevTools being kept opened because with reload only once per toolbox.

https://searchfox.org/mozilla-central/rev/c8ea016b87997574e1ca9127a4c370032aa6ee79/testing/talos/talos/tests/devtools/addon/content/tests/head.js#164-166
Here, we only reload once.

  dump(`Reload page on '${name}'\n`);
  let test = runTest(`${name}.reload.DAMP`);
  await damp.reloadPage(onReload);
  test.done();

And in each panel test, we reload once per toolbox:
https://searchfox.org/mozilla-central/rev/c8ea016b87997574e1ca9127a4c370032aa6ee79/testing/talos/talos/tests/devtools/addon/content/tests/inspector/complicated.js#21-23

  let toolbox = await openToolboxAndLog("complicated.inspector", "inspector");
  await reloadInspectorAndLog("complicated", toolbox);
  await closeToolboxAndLog("complicated.inspector", toolbox);

We should be reloading multiple times for a given toolbox to see if memory is freed after each reload or not. If not, the following test would probably be slower and the many reload will probably be slower as well.

Unfortunately, I quickly tried doing that, and it did not highlight any significant improvement localy when testing against custom.inspector. On top of that, DAMP was crashing the inspector when trying to reload more than once against complicated.inspector...

Alexandre Poirot [:ochameau]

Assignee

Comment 1

•

3 years ago

I ended up reusing the logic behind this existing test:
https://searchfox.org/mozilla-central/source/devtools/client/framework/test/allocations/browser_allocations_target.js
This test is tracking the number of allocated JS objects before and after running a test script.
That test cover a very small subset of DevTools. It only track the memory usage of spawning a Commands object and fetching the first top level target.

In the upcoming patches, I'm adding two new similar test but covering:

opening and closing of a toolbox
reloading the webpage 10 times while having a toolbox opened

Then, I tried to compare the output of these tests without and with server targets (ie. bug 1702715):
https://treeherder.mozilla.org/perfherder/compare?originalProject=try&originalRevision=22bcbe80aeea10a9a98585dde40ad6b6f9254a0a&newProject=try&newRevision=2f1df5354adc5bb97d747fc6562c0cdd770dd291&framework=12&page=1
It provides a clear and better view on regressions and improvements provided by server targets. It especially highlight the significant fix in content processes when doing page reloads.

These tests highlighted:
-95% improvement in the content process when doing 10 reloads (this wasn't reported by DAMP), but a 62% regression in the parent process (this was reported by DAMP)
-26% improvement in the parent process when instantiating a commands with the top target (=existing test)
-2% improvement when opening and closing a toolbox (=new test, testing just this one usecase)

These tests are great because:

it is just plain mochitest, so it is much easier to run compared to DAMP
its output seem stable enough, so that it can be run on virtual machines
it can display the allocation sites of the objects still being allocated in order to debug and improve memory usage
the results goes into perfdata, like DAMP and can be tracked similarly

But they also have some limitations:

in many cases, allocation sites aren't available and so the promise of debugging isn't always true
it record the number of allocated javascript objects, so it doesn't scale with the actual amount of memory really used. If you are leaking one JS objects holding tons of data, it will regress undercovered.
perfdata tracking can be time consuming and requires an owner dedicated to to the triage

Alexandre Poirot [:ochameau]

Assignee

Comment 2

•

3 years ago

Attached file Bug 1727571 - [devtools] Record allocations for toolbox opening+closing and page reloads. (deleted) — Details

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1539518

Alexandre Poirot [:ochameau]

Assignee

Comment 3

•

3 years ago

Attached file Bug 1727571 - [devtools] Disable devtools testing flag in allocation tests. (deleted) — Details

This is important to disable flags.testing for many reasons:

it helps using the production code that is being used by end users. So we are closer to a real usage of our tools.
it prevents enabling debug code which are leaking or are explicitely doing stuff that hit performance and might allocate more objects.

We especially want to disable redux's store history feature which record all the actions,
and leads to leak tons of objects.

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1727838

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1727839

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Whiteboard: dt-perf-stability-triage

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728069

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728070

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728071

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728072

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728075

Alexandre Poirot [:ochameau]

Assignee

Comment 4

•

3 years ago

Perfherder results for these two patches:
https://treeherder.mozilla.org/perfherder/compare?originalProject=try&originalRevision=2046c80777ca0c5f8fc4e0ee9f8dd20d6706b52a&newProject=try&newRevision=2540326933a9f34fac7bd67f059cf3ec71614a9c&framework=12&page=1
I imagine the devtools.testing is reducing the allocations.
We go from 3,804.5 down to 2,565.

Jan Honza Odvarko [:Honza] (always need-info? me)

Updated

•

3 years ago

Whiteboard: dt-perf-stability-triage → dt-perf-stability-mvp

Alexandre Poirot [:ochameau]

Assignee

Comment 5

•

3 years ago

To better understand the tests and what they record, here is a summary of the impact of each leak fix to the output of these tests:
https://docs.google.com/document/d/1Jl4Qwbwi_mdWBVmNeReX-_XX_xQvhQfUntqLsi8gflU/edit#heading=h.3s7pgrvsizca
Note that these fixes are focusing on leaks happening when running the "reload" test (=browser_allocation_reload.js)

tl;dr; The leak fixes only impact "objects with stacks" and recording the "resident unique" memory doesn't highlight any improvement by any of these patches.

The resident unique is the actual amount of memory allocated by the parent process.
I wasn't expecting each individual fix to have a significant/visible impact on this.
But this is still surprising that the whole patch queue isn't having a positive impact on this.
I do fix leaking target fronts, as well as MarkupView, on each page reload.
This should be a subtantial amount of objects and memory, which should have, after a couple of reload a visible impact.

Alexandre Poirot [:ochameau]

Assignee

Comment 6

•

3 years ago

Attached file Bug 1727571 - [devtools] Prevent the allocation tracker to record its own modules. (deleted) — Details

It was tracking the special sandbox we spawn for builtin-modules.js
as well as its internal sandbox used to fetch platform globals.

Pulsebot

Comment 7

•

3 years ago

Pushed by apoirot@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/b1098bda4a50 [devtools] Record allocations for toolbox opening+closing and page reloads. r=jdescottes https://hg.mozilla.org/integration/autoland/rev/c9b8b52fa85b [devtools] Prevent the allocation tracker to record its own modules. r=jdescottes https://hg.mozilla.org/integration/autoland/rev/2064d5f7e669 [devtools] Disable devtools testing flag in allocation tests. r=jdescottes

Alexandre Poirot [:ochameau]

Assignee

Comment 8

•

3 years ago

I tried some variations of GCs, but none were really great.
The to-be-landed version is:

  const numCycles = 3;
  for (let i = 0; i < numCycles; i++) {
    Cu.forceGC();
    Cu.forceCC();
    await new Promise(resolve => Cu.schedulePreciseShrinkingGC(resolve));

    // eslint-disable-next-line mozilla/no-arbitrary-setTimeout
    await new Promise(resolve => setTimeout(resolve, 1000));
  }

That's what we have been using in DAMP so far, and AFAIK, the best approach.

Here is a comparison with the following version:

  const memSrv = Cc["@mozilla.org/memory-reporter-manager;1"]
    .getService(Ci.nsIMemoryReporterManager);
 await new Promise(resolve => {
   memSrv.minimizeMemoryUsage(resolve);
  });
 
  // eslint-disable-next-line mozilla/no-arbitrary-setTimeout
  await new Promise(resolve => setTimeout(resolve, 1000));

https://treeherder.mozilla.org/perfherder/compare?originalProject=try&originalRevision=017219cde5221fd68acb5b30f52569c1e57705ef&newProject=try&newRevision=5c935d3812e5acb7e8422c5863eb59beea31459d&framework=12&page=1&filter=memory
We can see that the record are fine in the content process, where we have a low variance of only 7% in both revisions.
But the variance for parent process records are inacceptable in this version -17197% for the reload tests!

And another once with the following version:

      const memSrv = Cc["@mozilla.org/memory-reporter-manager;1"]
     .getService(Ci.nsIMemoryReporterManager);

       // In order to get stable results, we really have to do 3 GC attempts
       // *and* do wait for 1s between each GC.
       const numCycles = 3;
       for (let i = 0; i < numCycles; i++) {
        await new Promise(resolve => {
          memSrv.minimizeMemoryUsage(resolve);
        });
 
         // eslint-disable-next-line mozilla/no-arbitrary-setTimeout
         await new Promise(resolve => setTimeout(resolve, 1000));
       }

https://treeherder.mozilla.org/perfherder/compare?originalProject=try&originalRevision=017219cde5221fd68acb5b30f52569c1e57705ef&newProject=try&newRevision=1c878eb3bb6d1d8d97a54d548562542dab8e6359&framework=12&page=1&filter=memory
This is better than the previous version, but still much worse than the to-be-landed version.
422% and 724% variance on target and reload tests.

And a last flavor:

      const memSrv = Cc["@mozilla.org/memory-reporter-manager;1"]
     .getService(Ci.nsIMemoryReporterManager);

       // In order to get stable results, we really have to do 3 GC attempts
       // *and* do wait for 1s between each GC.
       const numCycles = 3;
       for (let i = 0; i < numCycles; i++) {
         Cu.forceGC();
         Cu.forceCC();
        await new Promise(resolve => {
          memSrv.minimizeMemoryUsage(resolve);
        });
 
         // eslint-disable-next-line mozilla/no-arbitrary-setTimeout
         await new Promise(resolve => setTimeout(resolve, 1000));
       }

https://treeherder.mozilla.org/perfherder/compare?originalProject=try&originalRevision=017219cde5221fd68acb5b30f52569c1e57705ef&newProject=try&newRevision=c9b0dc84421809337da8fa1bae3b22954c1328d5&framework=12&page=1&filter=memory
isn't much better with 248% and 262% on reload and target tests.

Iulian Moraru

Comment 9

•

3 years ago

Backed out for causing devtools failures on browser_allocation_tracker.js.

Push with failures

Failure log

Backout link

Flags: needinfo?(poirot.alex)

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1728737

Alexandre Poirot [:ochameau]

Assignee

Comment 10

•

3 years ago

I missed tweaking an existing to match the new API...

It should be fixed now:
https://treeherder.mozilla.org/jobs?repo=try&revision=b51e11ac94ed83d04ea3bbf2f3a3064bd5cd2d02

Flags: needinfo?(poirot.alex)

Pulsebot

Comment 11

•

3 years ago

Pushed by apoirot@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/2abfa9bef307 [devtools] Record allocations for toolbox opening+closing and page reloads. r=jdescottes https://hg.mozilla.org/integration/autoland/rev/b63d2379b135 [devtools] Prevent the allocation tracker to record its own modules. r=jdescottes https://hg.mozilla.org/integration/autoland/rev/9588cf22563a [devtools] Disable devtools testing flag in allocation tests. r=jdescottes

Alexandru Michis [:malexandru]

Comment 12

•

3 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/2abfa9bef307
https://hg.mozilla.org/mozilla-central/rev/b63d2379b135
https://hg.mozilla.org/mozilla-central/rev/9588cf22563a

Status: NEW → RESOLVED

Closed: 3 years ago

status-firefox93: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 93 Branch

Julian Descottes [:jdescottes]

Updated

•

3 years ago

Blocks: 1729671

Alexandre Poirot [:ochameau]

Assignee

Updated

•

3 years ago

Blocks: 1731630

You need to log in before you can comment on or make changes to this bug.