1787436 - Investigate performance issues when recording HAR with Firefox

Reporter

Description

•

2 years ago

As a follow up to the prototype built for HAR generation in WebDriver BiDi (Bug 1770496) we want to check how the new implementation behaves in terms of performance.

We regularly received the feedback that recording HAR with Firefox DevTools created incorrect timings, much slower than how it would happen without DevTools.

The goal here is to collect such scenarios, where DevTools HAR recording has a visible impact on the page load performance. Then run those scenarios against the BiDi prototype to see if we have the same issue. And if we do, attempt to identify which part of the network observer logic is actually responsible for the performance hit.

Based on this information we should be able to create a HAR generation solution which has less performance impact than the DevTools one.

Julian Descottes [:jdescottes]

Reporter

Comment 1

•

2 years ago

Hi Dave,

It was mentioned that the performance team might have some scenarios where using DevTools to record a HAR file would produce noticeably slower timings than a real navigation (ie without DevTools). Do you know of any scenario like that, could you share it here if yes?

Thanks!

Flags: needinfo?(dave.hunt)

Peter Hedenskog

Comment 2

•

2 years ago

Hi Julian,
sorry for jumping in, I've seen the performance difference, the more assets/requests a page do it's easier to see. If you use Browsertime to measure, we use https://github.com/firefox-devtools/har-export-trigger to get the devtools HAR. In Mozillas tests you use --skipHar that disables the har export trigger.

I just tried with https://en.wikipedia.org/wiki/COVID-19 and I could see the difference in the numbers, so running something like browsertime https://en.wikipedia.org/wiki/COVID-19 -b firefox --video --visualMetrics -c cable -n 5 --skipHar vs browsertime https://en.wikipedia.org/wiki/COVID-19 -b firefox --video --visualMetrics -c cable -n 5 I see a difference in 20-30% on firstPaint/first contentful paint/firstVisualChange, meaning it's 20-30% faster without devtools.

Maybe I can help making a branch of Browsertime where we hack triggering bidi to get the HAR so its easier to compare?

Dave Hunt [:davehunt] [he/him] ⌚BST

Comment 3

•

2 years ago

Thanks for jumping in Peter. Also adding a needinfo for :acreskey as I recall he may have some experience here too.

Flags: needinfo?(dave.hunt) → needinfo?(acreskey)

Julian Descottes [:jdescottes]

Reporter

Comment 4

•

2 years ago

Thanks a lot for the input Peter, this is super helpful!

Maybe I can help making a branch of Browsertime where we hack triggering bidi to get the HAR so its easier to compare?

I think that's a good next step!
To make that work I think we also need a binary where BiDi's HAR recording would start automatically based on a pref for instance.
Would it be OK if it doesn't actually save the HAR to a file? I haven't added that part yet, but what's important is the fact that it is recording anyway, right ?

I think I should be able to tweak browsertime to set a preference (eg remote.bidi.har-experiment=true) in the --skipHar code path, and that should already give me some idea about the impact?

I'll get started on a way to record HAR from BiDi based on a pref, in the meantime let me know if something sounds terribly wrong with my plan :)

Peter Hedenskog

Comment 5

•

2 years ago

Would it be OK if it doesn't actually save the HAR to a file? I haven't added that part yet, but what's important is the fact that it is recording anyway, right ?

Yes sounds good. Even though that the HAR is not saved, is there a way to still verify that the preferences is set correctly so I know that I triggered the right path?

Andrew Creskey [:acreskey]

Comment 6

•

2 years ago

This sounds great and will be very useful.
One can pass arbitrary preferences into browsertime, e.g. --firefox.preference gfx.webrender.enabled:true so the prototyping should work well.

Julian Descottes [:jdescottes]

Reporter

Comment 7

•

2 years ago

Thanks all for the input, I have a few early results, but I will still need more time to analyze the results and confirm them. For now what I can see is that BiDi HAR recording has an impact, but much smaller than the devtools HAR.

Here are my results for 5 runs on https://en.wikipedia.org/wiki/COVID-19

Metric	skipHAR	DevTools HAR	BiDi HAR
TTFB	64ms	65ms	64ms
firstPaint	524ms	786ms (-50%)	617ms (-18%)
firstVisualChange	933ms	933ms	933ms
FCP	517ms	784ms (-52%)	600ms (-16%)
DOMContentLoaded	482ms	803ms (-67%)	596ms (-24%)
Load	541ms	891ms (-65%)	674ms (-24%)
speedIndex	1.21s	1.32s	1.25s
visualComplete85	1.53s	1.80s	1.63s
lastVisualChange	9.87s	10.13s	9.97s

The BiDi implementation is a bit simplified compared to the devtools one (we don't monitor network information in content processes for instance) so those results are probably not very accurate. But simply running the scenario, I can see that devtools are started on the inspector which has a lot of impact on content processes, especially during the load. I will try to replace it with a blank tool, to see if that helps.

And from there I'll start bisecting a bit the BiDi and DevTools implementations to see what impacts performance.

Julian Descottes [:jdescottes]

Reporter

Comment 8

•

2 years ago

Attached file Bug 1787436 - [bidi] Record HAR automatically if remote.bidi.har-experiment is true (deleted) — Details

Additionally, if remote.bidi.har-experiment.path is set to a folder, the browser
will attempt to save the har recording to a file before quitting.

Phabricator Automation

Updated

•

2 years ago

Assignee: nobody → jdescottes

Status: NEW → ASSIGNED

Julian Descottes [:jdescottes]

Reporter

Comment 9

•

2 years ago

The patch at https://phabricator.services.mozilla.com/D155837 should allow anyone to test in an automation setting.
Just pass remote.bidi.har-experiment=true to start recording HAR on the first tab which will be loaded.
And you can pass remote.bidi.har-experiment.path="/some/valid/path/" if you want the HAR file to be recorded under /some/valid/path/bidi-har-${timestamp}.har on exit.

I will add a link to a try build with binaries in case anyone wants to test this. Otherwise I will do more tests later this week.

Henrik Skupin [:whimboo][⌚️UTC+2]

Comment 10

•

2 years ago

Julian, should this bug be in M4 or shall we put it as P2 into the backlog?

Flags: needinfo?(jdescottes)

Julian Descottes [:jdescottes]

Reporter

Comment 11

•

2 years ago

(In reply to Henrik Skupin (:whimboo) [⌚️UTC+2] from comment #10)

Julian, should this bug be in M4 or shall we put it as P2 into the backlog?

I would set it as P2. I was really looking forward to having a test case, so I wanted to do some investigation. But it's true that more than that will take some time now, and should be correctly planned.

I will unassign and move this to the backlog for now.

Flags: needinfo?(jdescottes)

Julian Descottes [:jdescottes]

Reporter

Updated

•

2 years ago

Assignee: jdescottes → nobody

Status: ASSIGNED → NEW

Priority: -- → P2

Whiteboard: [webdriver:backlog]

Julian Descottes [:jdescottes]

Reporter

Comment 12

•

2 years ago

I will add a link to a try build with binaries in case anyone wants to test this.

https://treeherder.mozilla.org/jobs?repo=try&revision=11a990a81cb90c1aa75721a459a3d3144a01e1c1

Julian Descottes [:jdescottes]

Reporter

Updated

•

2 years ago

Blocks: 1788874

Andrew Creskey [:acreskey]

Updated

•

2 years ago

Flags: needinfo?(acreskey)

Henrik Skupin [:whimboo][⌚️UTC+2]

Updated

•

2 years ago

Blocks: 1790362

Points: --- → 3

Priority: P2 → P1

Whiteboard: [webdriver:backlog] → [webdriver:m5]

Henrik Skupin [:whimboo][⌚️UTC+2]

Updated

•

2 years ago

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1790380

Marco Mucci [:MarcoM]

Updated

•

2 years ago

Priority: P1 → P2

Whiteboard: [webdriver:m5] → [webdriver:backlog]

Henrik Skupin [:whimboo][⌚️UTC+2]

Comment 13

•

2 years ago

Via bug 1806833 we will create a demo script to record HAR files. I assume that we will be able to use that one to get started with performance checks and comparisons.

Depends on: 1806833

Henrik Skupin [:whimboo][⌚️UTC+2]

Updated

•

2 years ago

No longer blocks: 1790362

Julian Descottes [:jdescottes]

Reporter

Comment 14

•

2 years ago

Here's a performance profile while recording network events on a recent Nightly: https://share.firefox.dev/3zZ0iOU

One thing that stands out, the _log method from WebSocketConnection shows up relatively high, considering that trace logs are disabled. We should probably avoid building the strings when trace logs are disabled.

Edit: Second profile with many more requests, but with a fix for the _log from WebSocketConnection: https://share.firefox.dev/3MAiI09 . We see the security info parsing taking a long time here. Which is also interesting because this is one of the things which BiDi is not reading at the moment.