small number of libxul / omni.ja build ID mismatches
Categories
(Firefox :: General, defect, P3)
Tracking
()
People
(Reporter: heycam, Assigned: rhelmer)
References
Details
The telemetry in bug 1608308 has been in place for a few days now. I'm seeing a very small number of libxul / omni.ja build ID mismatches (4 out of 215K):
Not really sure where to go from here.
Assignee | ||
Comment 1•5 years ago
|
||
Huh, interesting. It's quite a bit lower than "omnijar corrupted" (which is a signature check of both omni JARs):
I'm curious to see the relationship between the two.
I don't see the "omnijar mismatch" scalar in BigQuery yet, following up on that with data folks. Once that's available, we can see if there are any other interesting correlations (presence/type of anti-virus, OS, number of crashes, etc.)
Assignee | ||
Comment 2•5 years ago
|
||
I'm also interested to see if/how this changes as they ride the trains, the release population is pretty different from nightly.
Comment hidden (obsolete) |
Assignee | ||
Comment 4•5 years ago
|
||
OK so it's payload.processes.parent.scalars.corroborate_omnijar_mismatch
in telemetry.main
(I was looking in telemetry.main_summary
), I'll see what I can dig up today.
Updated•5 years ago
|
Assignee | ||
Comment 5•5 years ago
|
||
I took a quick stab at using BigQuery's correlation function (with some help from tdsmith) to make it easier to try to correlate problems like broken omni JAR vs. wrong build ID in omni JAR. If this is correct then looks like an unsigned omni JAR is negatively correlated with an incorrect build ID, so they are likely different phenomena: https://sql.telemetry.mozilla.org/queries/68218/source
I went through the hassle of doing it this way so we can make it easier to now plug in different sorts of correlates:
(In reply to Robert Helmer [:rhelmer] from comment #1)
I don't see the "omnijar mismatch" scalar in BigQuery yet, following up on that with data folks. Once that's available, we can see if there are any other interesting correlations (presence/type of anti-virus, OS, number of crashes, etc.)
The theories I've been able to dream up so far around the mismatched build ID (setting aside the signature problem for now):
- our updater has a bug (or something is broken on the users machine) and sometimes the omni JAR doesn't get updated
- users (or malware) are copying an old omni JAR into place on purpose
However before trying to really figure out the cause it probably makes sense to measure the impact of this to see how much it's worth digging into. I think this is what we know so far:
- This is related to at least one crash: https://bugzilla.mozilla.org/show_bug.cgi?id=1607716
- We know there are a small number of Nightly users reporting this, but we won't really know the full impact until this collection rolls out to release
Reporter | ||
Comment 6•5 years ago
|
||
Given the small number of crashes, and small numbers being reported by the telemetry on Nightly, I think it would be fine to wait until the telemetry hits Release before sinking a lot of time into investigating this.
Comment 7•4 years ago
|
||
Bugbug thinks this bug should belong to this component, but please revert this change in case of error.
Updated•4 years ago
|
Updated•2 years ago
|
Description
•