Closed Bug 1731667 Opened 3 years ago Closed 2 years ago

Document sources of non-deterministic data collection in FOG

Categories

(Toolkit :: Telemetry, task, P3)

task

Tracking

()

RESOLVED FIXED

People

(Reporter: chutten, Assigned: chutten)

References

(Blocks 1 open bug)

Details

Attachments

(1 file)

Due to its design, FOG has some non-deterministic behaviour that may influence how much and what kinds of data will be reported.

  • IPC flushes on idle, meaning it might miss data from sessions that have fewer idle periods.
    • IPC should also be triggered when a process is being taken down in an orderly fashion. File a bug for this, or fix it here.
  • Ping-lifetime data is held in-memory (since bug 1729723) and is persisted to the db on idle and on shutdown, meaning a crash in a session with fewer idle periods would result in lost data.

To our best ability we should instrument and document these and other sources of data unreliability to build confidence in our data collection system in Firefox Desktop.

Assignee: nobody → chutten
Status: NEW → ASSIGNED
Pushed by chutten@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/1890b6a3b3d2 Flush FOG IPC from content children on process shutdown r=janerik

Marking leave-open as the landing patch takes care of

IPC should also be triggered when a process is being taken down in an orderly fashion. File a bug for this, or fix it here.

But doesn't instrument or document non-determinism otherwise.

Keywords: leave-open
Flags: needinfo?(chutten)

Dang. Looks like my "while we're here" addendum to the patch to ensure delayed ping lifetime io is persisted after the IPC data has flushed in has found another way to hit Bug 1731595 (still pending a release and a vendor before it'll be fixed in m-c).

I can take it out and file a follow-up for putting it back in. Though if it's truly being caused by persisting after shutdown, that means some of the IPC data we're flushing on idle is going to miss the bus. Which means we may wish to rethink how we schedule these at-shutdown flushes.

Flags: needinfo?(chutten)
Blocks: 1732989
Pushed by chutten@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/14987b584ee7 Flush FOG IPC from content children on process shutdown r=janerik

Florian learned that some instrumentation on content children comes in too late to be recorded. We should document exactly how late is too late for our subprocess support and what we do when it's too late (which is, for the most part, nothing).

The leave-open keyword is there and there is no activity for 6 months.
:chutten, maybe it's time to close this bug?
For more information, please visit auto_nag documentation.

Flags: needinfo?(chutten)

Alrighty, auto-nag, you win this one. I do still hope to document this in the dev docs, but the non-deterministicity (not a word) of data collection is set to be improved by bug 1641989 in the not-too-distant future and we haven't bemoaned the lack of documentation yet... so maybe this is work that isn't needed after all.

Status: ASSIGNED → RESOLVED
Closed: 2 years ago
Flags: needinfo?(chutten)
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: