Closed
Bug 1249209
(e10s-crashes)
Opened 9 years ago
Closed 8 years ago
[meta] tracking: e10s top crashes
Categories
(Firefox :: General, defect)
Firefox
General
Tracking
()
RESOLVED
INCOMPLETE
People
(Reporter: poiru, Assigned: benjamin)
References
(Depends on 1 open bug)
Details
(Keywords: meta)
Tracking bugs for top crashes in e10s beta experiment 2, part 2 (i.e. e10s-beta45-withoutaddons@experiments.mozilla.org).
Reporter | ||
Updated•9 years ago
|
Reporter | ||
Updated•9 years ago
|
Comment 1•9 years ago
|
||
Bug 1247380 wasn't e10s specific but likely due to ease of restore crashed tab, gained twice the bug submissions than non-e10s. (Patch landing in b7, hopefully works as bug without steps.)
Reporter | ||
Comment 2•9 years ago
|
||
Top content process crash list available here: https://crash-stats.mozilla.org/search/?ActiveExperiment=%3De10s-beta45-withoutaddons%40experiments.mozilla.org&process_type=content&_facets=signature&_columns=date&_columns=signature&_columns=product&_columns=version&_columns=build_id&_columns=platform#facet-signature
Comment 3•9 years ago
|
||
FWIW, that query only looks at content crashes, and not at potential browser crashes that e10s may help make worse. I took a look at comparing the percentages of crash signatures in searches for all non-e10s and all e10s crashes, take a look here: https://crash-analysis.mozilla.com/rkaiser/datil/searchcompare/?common=product%3DFirefox%26version%3D45.0b6%26process_type%3Dbrowser%26process_type%3Dcontent&p1=dom_ipc_enabled%3D__null__&p2=dom_ipc_enabled%3D!__null__
Anything with red on the right-most column means those crashes have a higher percentage in e10s-enabled crashes, green means lower percentage (numbers are percent points difference). The top line about OOM|small is actually not reason to celebrate (yet), it's fallout of e10s bug 1236108
Comment 4•9 years ago
|
||
The loss of big OOM signiture isn't too bad as it just switches to counting under other signitures.
shutdownhang | WaitForSingleObjectEx | WaitForSingleObject | PR_WaitCondVar | nsThread::ProcessNextEvent | NS_ProcessNextEvent | mozilla::SharedThreadPool::SpinUntilEmpty
a/v failings. Being uncounted, bug 1241106.
Have to rely on telemetry of ignore shutdownhangs too.
Updated•9 years ago
|
Alias: e10s-crashes
Comment 5•9 years ago
|
||
Note that kairo's comparison does not take into account differing total crash rates. Including a rate adjustment gives a slight reordering of some signatures. (A full +/- column currently sums to 0 with rate adjusted it would sum to be the change e.g. 10% rise in crashes or whatever rate is used.)
Comment 6•9 years ago
|
||
It is a bit more than slight.
Open this (7 day)
https://crash-analysis.mozilla.com/rkaiser/datil/searchcompare/?common=process_type%3Dbrowser%26process_type%3Dcontent%26ActiveExperiment%3D%253De10s-beta45-withoutaddons%2540experiments.mozilla.org%26date%3D%3E%253D2016-02-14%26date%3D%3C2016-02-21&p1=ActiveExperimentBranch%3D%253Dcontrol-no-addons&p2=ActiveExperimentBranch%3D%253Dexperiment-no-addons
Add a breakpoint at scomp.js:99 gSigData[result2[i].term].pct2 = 100 * result2[i].count / total2;
Reload
Change total2 from 20913 to 15749
[20913/(20913/17324*1.1) accessibility adjustment just rough estimate.]
Clear break and continue.
Comment 7•9 years ago
|
||
(In reply to Jonathan Howard from comment #5)
> Note that kairo's comparison does not take into account differing total
> crash rates.
Yes, intentionally. We wouldn't even have the necessary data to respect rates in the place this is looking at.
Comment 8•9 years ago
|
||
(In reply to Robert Kaiser (:kairo@mozilla.com) from comment #7)
> (In reply to Jonathan Howard from comment #5)
> > Note that kairo's comparison does not take into account differing total
> > crash rates.
>
> Yes, intentionally. We wouldn't even have the necessary data to respect
> rates in the place this is looking at.
Automatically it is not possible but entering the right value manually can lead to better results. (Even potentially wrong value might highlight an overlooked change. Off bug topic: looking at 44b9 vs 45b9 I wonder what happens when I rate adjust to have "OOM | small" the same.)
Comment 9•9 years ago
|
||
The OOM|small drop with e10s (and probably some increases in other signature the other way) is bug 1236108, the mechanism that detects "OOM | ..." and puts that into signature shas not yet been implemented for content processes. That probably makes a number of signatures related to OOM differ needlessly.
Comment 10•9 years ago
|
||
morphing this tracker
new experiment, part 1:
e10s-beta46-noapz@experiments.mozilla.org
experiment-no-addons (test group)
control-no-addons (control group)
control:
https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=control-no-addons&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
test:
https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=experiment-no-addons&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
Summary: [meta] tracking: e10s beta experiment 2, part 2 (without addons) top crashes → [meta] tracking: e10s beta 46 experiment top crashes
Comment 11•9 years ago
|
||
http://www.unbiased.name/ff/e10scrashes/
Looks like ipc corruption fix has solved a couple top signatures since 45. b1 is en-US only, anyone's guess if other locales will have impact.
mozilla::ipc::TransferHandleToProcess probably bug 1140115
Assignee | ||
Updated•9 years ago
|
Assignee: nobody → benjamin
Status: NEW → ASSIGNED
Assignee | ||
Comment 13•9 years ago
|
||
The analysis pass (reviewed and final) on the experiment data shows things are worse than they were:
https://github.com/vitillo/e10s_analyses/blob/master/beta46-noapz/e10s-stability-analysis.ipynb
App stability regression is now 62% (17.1 to 27.8)
Plugin stability regression is 50%: (8.1 to 12.1)
Here are date-controlled links to the relevant topcrash queries for the experiment:
APP CRASH LINKS:
Experiment (e10s) group topcrashes, content process only: https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=experiment-no-addons&process_type=content&date=%3E2016-03-09&date=%3E%3C2016-03-22&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
Control group topcrash, no plugins: https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=control-no-addons&date=%3E2016-03-09&date=%3C2016-03-22&process_type=!plugin&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
I am going to focus only on content crashes because the main process is much better.
PLUGIN CRASH LINKS:
Experiment/e10s top plugin crashes: https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=experiment-no-addons&process_type=plugin&date=%3E2016-03-09&date=%3C2016-03-22&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
Control/non-e10s top plugin crashes: https://crash-stats.mozilla.org/search/?ActiveExperiment=e10s-beta46-noapz%40experiments.mozilla.org&ActiveExperimentBranch=control-no-addons&process_type=plugin&date=%3E2016-03-09&date=%3C2016-03-22&_facets=signature&_columns=signature&_columns=product&_columns=build_id&_columns=platform#facet-signature
I'm writing a script to highlight the differences for prioritization now: some of the bugs currently blocking this one seem to be "generic topcrashers" and are not the most likely place to look.
Comment 14•9 years ago
|
||
:bsmedberg the 'Experiment (e10s) group topcrashes, content process only' App Crash link is not working for me.
Flags: needinfo?(benjamin)
Assignee | ||
Comment 15•9 years ago
|
||
Flags: needinfo?(benjamin)
Comment 16•9 years ago
|
||
beta 45:
non-e10s e10s
usage hours 5382 4742
chrome crashes 72659 33591
content crashes 11731 75757
plugin crashes 41897 55809
main crash rate 13.50 7.08
main+content crash rate 15.68 23.06
plugin crash rate 7.78 11.77
beta 46:
non-e10s e10s
usage hours 3312 2547
chrome crashes 50409 20203
content crashes 6320 50546
plugin crashes 26843 30741
main crash rate 15.22 7.93
main+content crash rate 17.13 27.77
plugin crash rate 8.10 12.07
Interesting that beta 46 regressed generally, and that e10s seems to be adversely affected by that.
Comment 17•9 years ago
|
||
Track some e10s crash bugs from Beta 46 experiment:
Bug 1235633
Bug 1258312
Bug 1258331
Bug 1145613
Bug 1259183
Bug 1258317
Bug 1259192
Comment 18•9 years ago
|
||
top crash list for the apz part of the 46 experiment -
https://crash-stats.mozilla.com/search/?product=Firefox&addons=~e10srollout%40mozilla.org&version=46.0b4&version=46.0b5&version=46.0b6&dom_ipc_enabled=!__null__&_facets=signature&_facets=ActiveExperiment&_columns=date&_columns=signature&_columns=product&_columns=version&_columns=build_id&_columns=platform#facet-signature
Updated•9 years ago
|
Depends on: e10s-plugincrashes
Updated•9 years ago
|
Depends on: shutdownkill
Updated•9 years ago
|
No longer depends on: shutdownkill
Comment 20•9 years ago
|
||
Jim,
you mentioned this bug as the follow up bug of bug 899758...
(https://bugzilla.mozilla.org/show_bug.cgi?id=899758#c2)
...and the alias of this bug is "e10s-crashes"...
...so I set bug 1219672 ("ShutDownKills-Win") as a block of this bug, like it was before with bug 899758 ("crash-e10s")...
...but you removed here the dependency again...
So is this bug now a the follow up bug, or not ???
Flags: needinfo?(jmathies)
Comment 21•9 years ago
|
||
(In reply to Tobias B. Besemer [:BesTo] (QA) from comment #20)
> Jim,
>
> you mentioned this bug as the follow up bug of bug 899758...
> (https://bugzilla.mozilla.org/show_bug.cgi?id=899758#c2)
>
> ...and the alias of this bug is "e10s-crashes"...
>
> ...so I set bug 1219672 ("ShutDownKills-Win") as a block of this bug, like
> it was before with bug 899758 ("crash-e10s")...
>
> ...but you removed here the dependency again...
>
> So is this bug now a the follow up bug, or not ???
We're tracking a specific set of actionable crash bugs related to beta experiments we have running here. The ShutDownKills-Win is too general to be hooked up here.
Flags: needinfo?(jmathies)
Comment 22•9 years ago
|
||
I want to suggest as dependencies bug 627706 and bug 1268559.
Comment 23•9 years ago
|
||
Also the sig [@ ntdll.dll@0x4be7a ]. (No bug, yet.)
Example:
https://crash-stats.mozilla.com/report/index/7f2b48af-a03c-47fe-8ed0-4497a2160503
Stats:
https://crash-stats.mozilla.com/signature/?product=Firefox&signature=ntdll.dll%400x4be7a
Comment 24•9 years ago
|
||
Can someone plz have a look on bug 1265812?
Updated•8 years ago
|
Keywords: meta
Summary: [meta] tracking: e10s beta 46 experiment top crashes → [meta] tracking: e10s top crashes
Assignee | ||
Comment 25•8 years ago
|
||
No longer tracking this separately from normal stability work.
Status: ASSIGNED → RESOLVED
Closed: 8 years ago
Resolution: --- → INCOMPLETE
You need to log in
before you can comment on or make changes to this bug.
Description
•