Closed Bug 1159297 Opened 9 years ago Closed 9 years ago

Some sessions have multiple fragments with the same subsessionCounter and/or missing fragments

Categories

(Toolkit :: Telemetry, defect)

Type: defect
Priority: Not set
Severity: normal

RESOLVED FIXED

People

(Reporter: rvitillo, Unassigned)

References

Details

(Whiteboard: [unifiedTelemetry] [data-validation])

I spotted some sessions that seem to have missing and/or duplicated fragment numberings, e.g.:

list of subsessionCounter for fragments of session X of client Y: [1, 1, 1, 1]
list of subsessionCounter for fragments of session U of client K: [1, 2, 4, 5]

Of all the sessions that have more than one fragment, about 15% have an incorrect sequence of subsessionCounter values (duplicates and/or gaps).
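For reference, the check described above can be sketched roughly as follows. This is not the notebook's code; it assumes that a well-formed session numbers its fragments 1, 2, 3, ... with no repeats, and `check_counters` is a hypothetical helper name.

```python
from collections import Counter


def check_counters(counters):
    """Return (duplicates, missing) for one session's subsessionCounter values.

    Assumption: a valid session has counters 1..N, each exactly once.
    """
    counts = Counter(counters)
    # Counter values that appear on more than one fragment.
    duplicates = sorted(c for c, n in counts.items() if n > 1)
    # Counter values between 1 and the highest one seen that never appear.
    highest = max(counters) if counters else 0
    missing = [c for c in range(1, highest + 1) if c not in counts]
    return duplicates, missing


print(check_counters([1, 1, 1, 1]))  # ([1], [])
print(check_counters([1, 2, 4, 5]))  # ([], [3])
```

Note that a check like this cannot tell whether fragments are missing from the end of a session, since it only looks at gaps below the highest counter received.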

See http://nbviewer.ipython.org/gist/vitillo/9645b5f6849bd2051ea8.
Blocks: 1147395
Depends on: 1159312
George suggested limiting the data to build IDs after the 5th of April, to exclude some known bugs that have already been fixed.
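A minimal sketch of such a filter, assuming the build ID is the usual 14-digit timestamp string (YYYYMMDDHHMMSS) and that the year in question is 2015; `built_after` and the sample IDs are made up for illustration.

```python
def built_after(build_id, cutoff="20150405"):
    """True if the build date (first 8 digits of the build ID) is after the cutoff.

    Assumption: build_id is a YYYYMMDDHHMMSS string, so comparing the date
    prefixes as strings orders them chronologically.
    """
    return build_id[:8] > cutoff


# Hypothetical build IDs, for illustration only.
build_ids = ["20150401030204", "20150405030201", "20150506030206"]
print([b for b in build_ids if built_after(b)])  # ['20150506030206']
```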
No longer depends on: 1159312
Depends on: 1159312
Alessio is currently investigating this.
Assignee: nobody → alessio.placitelli
I have updated the notebook. The issue persists with submissions from newer build IDs.

As the example at the bottom of the notebook shows, some fragments with the same subsessionCounter are not exact duplicates but the same fragment captured at different times: once with reason="shutdown" and once with reason="aborted-session".
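One way to surface these cases is to group fragments by client, session and counter, then report the groups that carry more than one reason. The sketch below assumes each fragment has been flattened into a dict with `clientId`, `sessionId`, `subsessionCounter` and `reason` keys; that layout and the sample records are illustrative, not the actual ping format.

```python
from collections import defaultdict


def conflicting_fragments(fragments):
    """Map (client, session, counter) to its reasons when more than one was seen."""
    reasons = defaultdict(set)
    for f in fragments:
        key = (f["clientId"], f["sessionId"], f["subsessionCounter"])
        reasons[key].add(f["reason"])
    # Keep only the fragments that were submitted with differing reasons.
    return {k: sorted(v) for k, v in reasons.items() if len(v) > 1}


# Made-up sample data mirroring the case described above.
fragments = [
    {"clientId": "Y", "sessionId": "X", "subsessionCounter": 1, "reason": "shutdown"},
    {"clientId": "Y", "sessionId": "X", "subsessionCounter": 1, "reason": "aborted-session"},
    {"clientId": "K", "sessionId": "U", "subsessionCounter": 1, "reason": "shutdown"},
]
print(conflicting_fragments(fragments))
# {('Y', 'X', 1): ['aborted-session', 'shutdown']}
```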
... I meant to assign bug 1157359.
Assignee: alessio.placitelli → nobody
(In reply to Roberto Agostino Vitillo (:rvitillo) from comment #3)
> I have updated the notebook. The issue persists with submission from newer
> build-ids. 
> 
> As you can see in the example at the bottom, it seems that some fragments
> with the same subsessionCounter are not perfect dupes but just the same
> fragment taken at different times; once with reason="shutdown" and the other
> with reason="aborted-session".

Thanks, that is really helpful.
It looks like we are (always or sometimes) failing to remove the "aborted-session" ping after saving the "shutdown" ping.
Blocks: 1120356
In trying to understand if and how many fragments we receive out of order, I noticed that some fragments are sent multiple times, e.g. one particular fragment with reason "shutdown" has been submitted once on the 23rd and twice on the 26th for a session that ended on the 23rd.
Those are identical pings being resent repeatedly?
Exactly, their payloads match and they have identical docids.
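Resends of this kind can be counted by tallying how often each docid shows up. A rough sketch, assuming the submissions are available as (docid, submission date) pairs; the pair layout, `resent_docids` and the sample values are hypothetical.

```python
from collections import Counter


def resent_docids(submissions):
    """Return {docid: times received} for docids received more than once."""
    counts = Counter(doc_id for doc_id, _date in submissions)
    return {doc_id: n for doc_id, n in counts.items() if n > 1}


# Made-up sample: one ping received once on the 23rd and twice on the 26th.
submissions = [
    ("doc-a", "23rd"),
    ("doc-a", "26th"),
    ("doc-a", "26th"),
    ("doc-b", "23rd"),
]
print(resent_docids(submissions))  # {'doc-a': 3}
```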
Nightly 2015-05-06 has the changes from bug 1157359, which affect this; we should recheck from that build on.
With bug 1157359 etc. landed, we seem to be doing much better now. Roberto, can we close this bug in favor of the other open ones, or do you want to recheck here?
Flags: needinfo?(rvitillo)
Let's close this.
Status: NEW → RESOLVED
Closed: 9 years ago
Flags: needinfo?(rvitillo)
Resolution: --- → FIXED