968923 - Implement some equivalent of Chrome's use counters (on top of telemetry?)

Reporter

Description

•

11 years ago

If I understand their use counter setup correctly, it allows them to answer the following question: What fraction of pages that our users loaded encountered condition X? This fundamentally requires keeping track of two values: total number of pages loaded by all users and number of pages that encountered condition X. The question is how to implement that on our end. On the DOM side it's pretty simple to just have an enum and be able to call into a document with a value of that enum, have the first call do something and the rest no-op. We have a similar setup for WarnOnceAbout already. What should the "something" be? The goal here is that a developer should be able to add a value to the enum, wait for it to ride the trains, then load some web page where he's told what fraction of documents had that enum value triggered... Does telemetry let us do that in a sane way?

Boris Zbarsky [:bzbarsky]

Reporter

Comment 1

•

11 years ago

One other note. A common request for this sort of data is to have some idea of how many distinct _sites_ are involved, or which sites they are. The former would be a bit harder than the above proposal while the latter is a privacy no-go.

(dormant account)

Comment 2

•

11 years ago

(In reply to Boris Zbarsky [:bz] from comment #1) > One other note. A common request for this sort of data is to have some idea > of how many distinct _sites_ are involved, or which sites they are. The > former would be a bit harder than the above proposal while the latter is a > privacy no-go. I'm still not clear as to how to measures 'number of pages loaded' and how to tell if a feature is used by multiple times per 'page' or not. Nobody I spoke to has a good understanding as to how to track 'pages'. Telemetry is just a vehicle to carry payloads. If you can capture a value locally in a way that makes sense to you, we can send it out. So my request is: please implement dom/network needed to measure 'x per page' so we know what we are asked to hook up to telemetry.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 3

•

11 years ago

> I'm still not clear as to how to measures 'number of pages loaded' We can measure the number of non-chrome inner window objects created, say. This part is simple. The hard part is figuring out when to reset the counter and when to report it and such. > and how to tell if a feature is used by multiple times per 'page' See comment 0. It covers this part. > Telemetry is just a vehicle to carry payloads. Sort of. It's also a UI for viewing them, etc. So what sorts of payloads can telemetry send and allow me to view? Can I do something where I have a bunch of counters, every time we go to send data we send the current values and zero them out, and on the server end we just total up the values of all the counters and have an easy way to see those totals for a particular release?

Cameron McCormack (:heycam)

Comment 4

•

11 years ago

In bug 968572 I have a patch to do some measurements of certain kinds of CSS property usage where we store that information on the nsPresContext, only report it at nsPresContext destruction time, and only if the nsPresContext is for a document that is non-chrome and not about:* or chrome:*. (Perhaps that last check is redundant with "the document is non-chrome".) That counts things like background-image documents, iframes, etc. It wouldn't be hard to modify to store this information only for the top level document. I agree with Boris that it would be great if we could get this information per page rather than per loaded document, but that doesn't seem feasible at the moment.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 5

•

11 years ago

I'm fine with per loaded document. I'd just like us to have something developers can use turnkey without the pain I went through in bug 909656 trying to get _some_ sort of useful data....

Cameron McCormack (:heycam)

Updated

•

11 years ago

Assignee: nobody → cam

Status: NEW → ASSIGNED

(dormant account)

Comment 6

•

11 years ago

(In reply to Boris Zbarsky [:bz] from comment #3) > > So what sorts of payloads can telemetry send and allow me to view? Can I do > something where I have a bunch of counters, every time we go to send data we > send the current values and zero them out, and on the server end we just > total up the values of all the counters and have an easy way to see those > totals for a particular release? Yes. just have an api for getAndResetCounter() that you call to assign to a simpleMeasure in TelemetryPing.js.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 7

•

11 years ago

Hmm. What does the current simpleMeasure setup do? I would have thought it resets counters already, if it does counters. Are there existing simpleMeasure providers in C++ we could crib? The stuff in Telemetry.h does histograms, and we don't really need or want a histogram here.... And how do we get the server to just add up the numbers instead of producing histograms?

Cameron McCormack (:heycam)

Comment 8

•

11 years ago

In my WIP patch I'm defining a histogram per use counter of kind "boolean", where I call Telemetry::Accumulate each time a Document is destroyed with true if script ever called a particular DOM method in that document, or false if it didn't. Am I right in thinking that each time I call Telemetry::Accumulate(), one of two values (for the "true" nad "false" cases) will get incremented locally on the histogram object, and then when the daily telemetry ping is performed, those two numbers will be sent to the server, and then those two values are reset back to zero? And then when you view the data on telemetry.mozilla.org, this shows the total number of false/true values that were sent to the server, and not anything like "number of false/true values per user"? I'm assuming this is how it worked with the XMLHTTPREQUEST_ASYNC_OR_SYNC histogram.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 9

•

11 years ago

FWIW, I have had little luck getting actual _numbers_ out of a two-value histogram on the telemetry server. Approximate percentages, yes, but approximate to like the nearest %. Better server UI is part of what we need for this bug. :(

Cameron McCormack (:heycam)

Comment 10

•

11 years ago

If I go to http://telemetry.mozilla.org/#nightly/29/XMLHTTPREQUEST_ASYNC_OR_SYNC and click "download as csv" I can get exact numbers, but it is difficult to use that date range selector on the graph to a get a particular range of interest.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 11

•

11 years ago

For this, I don't think we need graph ranges. Just having the raw counts per-release is great. The ideal UI for these use counters, I suspect, is a single page that just lists them all and lets you see the percentages of pages using them, on a per-release basis.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 12

•

11 years ago

Oh, and the point is we can grab the CSV, via XHR or server-side, and munge it to produce that UI, I would think.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 13

•

11 years ago

Also, if we're going to make use of the "didn't use" number, then we need to restrict this to exclude various XBL bits, XHR result documents, etc, etc... Doable; just have to be a bit careful.

Cameron McCormack (:heycam)

Comment 14

•

11 years ago

Attached patch WIP (v0.1) (obsolete) (deleted) — Details — Splinter Review

This is what I've got so far. You can only put [UseCounter] on a regular operation (static or not) for the moment. The generated histogram JSON file is objdir/dom/bindings/UseCounterHistograms.json and is created from Codegen.py. I'm not really sure about the dom/bindings/Makefile.in change -- is there a different way I should write "dom/bindings/UseCounterHistograms.json is generated by Codegen.py running"? The telemetry calls in ~nsDocument are #if-ed out at the moment; instead it prints the use counter booleans to the console.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 15

•

11 years ago

You'll probably want gps to look over the build bits.

Gregory Szorc [:gps]

Comment 16

•

11 years ago

I'm glad you CC'd a build peer early in the design of this feature! Telemetry and their histograms are one part of the build system I wish were designed better. I believe there are bugs on this already. But in summary: 1) Generating a massive .h/.cpp with all the histograms/probes means that any time you change a histogram, you've invalidated every object file including said .h file. In other words, you are effectively introducing a clobber build. This is similar to how anytime we add a new AC_SUBST/AC_DEFINE to configure, mozilla-config.h changes and forces a rebuild of everything. I believe a solution with multiple, domain-specific .h/.cpp files for logically grouped entries would be a better design since it would cull dependency hell. If you are familiar with Protocol Buffers, they employ a similar solution with each .proto file deriving to a .h/.cpp pair. 2) Having all the histograms/probes defined in a single file is so pre-moz.build. We now have the ability to aggregate whole world build config info as part of config.status. We should consider separate JSON files for defining counters/histograms, each living in the source tree next to the component it measures. We could even have this data in moz.build files themselves if we didn't want to use JSON. Not recommending it - just throwing it out there. 3) We should consider processing the JSON files / probe data as part of config.status. We've historically had dependency issues with changes to Telemetry that require clobbers. I'm not sure what the underlying problems are. Moving more things to config.status can form a nice band aid. But it can be annoying to wait for config.status to run. So, we need to be cognizant of the effect on developer workflows.

Cameron McCormack (:heycam)

Comment 17

•

11 years ago

That all sounds fine, although not something I myself would want to have to try doing. :-) I guess I just want to make sure that the changes I'm making in toolkit/components/telemetry/Makefile.in , dom/bindings/Makefile.in and dom/bindings/mozwebidlcodegen/__init__.py are correct and aren't making builds any worse -- for example if touching an arbitrary .webidl file but not adding/removing [UseCounter] caused the whole world to rebuild that would be bad.

Cameron McCormack (:heycam)

Updated

•

11 years ago

Attachment #8372751 - Flags: feedback?(gps)

(dormant account)

Comment 18

•

11 years ago

(In reply to Boris Zbarsky [:bz] from comment #12) > Oh, and the point is we can grab the CSV, via XHR or server-side, and munge > it to produce that UI, I would think. Yes, see telemetry.js bits, http://jonasfj.dk/blog/2014/01/custom-telemetry-dashboards/ it's quite easy to get raw data Patches welcome re better UI :) > Am I right in thinking that each time I call Telemetry::Accumulate(), one of two values (for the "true" nad "false" cases) will get incremented locally on the histogram object, and then when the daily telemetry ping is performed, those two numbers will be sent to the server, and then those two values are reset back to zero? Sort of. Telemetry for the same session is designed to overwrite the previous submission. Each browser session starts with counters reset(eg exit to reset counters).

Cameron McCormack (:heycam)

Comment 19

•

11 years ago

(In reply to Taras Glek (:taras) from comment #18) > Sort of. Telemetry for the same session is designed to overwrite the > previous submission. Each browser session starts with counters reset(eg exit > to reset counters). OK, so same net effect. Great. I think using Histograms here rather than writing up something custom with the simple measurements is going to be simpler.

Cameron McCormack (:heycam)

Comment 20

•

11 years ago

So one question here is whether we are happy with measuring "per loaded content document (excluding things like XHR result documents)", or if we should aggregate all of the content documents loaded within a single window. If I have a couple of <iframe>s in a page that use (or not) a counted feature, should we report true/false for each of those <iframe>s, plus the top level document, or just one true/false for the top level document that is a logical or of itself and the <iframe>s?

Boris Zbarsky [:bzbarsky]

Reporter

Comment 21

•

11 years ago

Can we get both? ;)

Cameron McCormack (:heycam)

Comment 22

•

11 years ago

Er yeah, that should be easy enough. :-)

Cameron McCormack (:heycam)

Comment 23

•

11 years ago

Apart from documents which are in the docshell tree, and SVG documents loaded as images, are there any other documents we'd want to track?

Boris Zbarsky [:bzbarsky]

Reporter

Comment 24

•

11 years ago

Hrm. It sort of depends on the feature. For DOM APIs, we might care about XHR result documents...

Cameron McCormack (:heycam)

Comment 25

•

11 years ago

Maybe we can account feature usage in XHR result documents and documents created with DOMImplementation.createDocument() to the document in the window that we created the XHR/Document from.

Cameron McCormack (:heycam)

Comment 26

•

11 years ago

(In reply to Cameron McCormack (:heycam) from comment #25) > Maybe we can account feature usage in XHR result documents and documents > created with DOMImplementation.createDocument() to the document in the > window that we created the XHR/Document from. Turns out this will happen automatically, since the code in the bindings will grab the document from the global rather than the |this| value (which also is a document).

Cameron McCormack (:heycam)

Comment 27

•

11 years ago

Attached patch WIP (v0.2) (obsolete) (deleted) — Details — Splinter Review

This patch now supports use counters for DOM methods, attributes and for CSS properties. That's probably enough types of use counters to begin with. We record both per-document usage and per-page document. Per-page use counters represent whether a feature was used in the document itself, any of its children (like via <iframe>s) or resource documents (SVG-referenced things) or image documents that are SVG documents. Generating the use counter histograms directly from Codegen.py turned out to be too much of a hassle in terms of getting the dependencies in the build system right. Also there was no corresponding place from which to generate them for CSS properties. So now there is a global use counter configuration file, dom/base/UseCounters.conf. You still also need to add [UseCounter] to the corresponding interface member in the Web IDL file. For properties, you make the CSS_PROP declaration take the CSS_PROPERTY_HAS_USE_COUNTER flag. I considered using JSON as the format for UseCounters.conf, but I realised that JSON doesn't allow comments and I wanted to put some notes in that file to describe the format and remind people to add [UseCounter] in the right place. So I've gone with a simple custom format. gps, do you want to take a quick look at my Makefile.in changes to make sure they're sane? bz, so far I've tested: * SVG images in <img>, background-image, list-style-image, 'content' * SVG resource documents, like |fill: url(otherdoc#something)| * hierarchies of <iframe>s * calling things on XHR/DOMParser-created documents to make sure the right documents get the per-document/per-page use counters set, and that the Telemetry calls happen. Any other interesting multiple document cases we need to make work?

Attachment #8372751 - Attachment is obsolete: true

Attachment #8372751 - Flags: feedback?(gps)

Attachment #8376953 - Flags: feedback?(gps)

Attachment #8376953 - Flags: feedback?(bzbarsky)

Cameron McCormack (:heycam)

Comment 28

•

11 years ago

https://tbpl.mozilla.org/?tree=Try&rev=a000c0fd26c3

Gregory Szorc [:gps]

Comment 29

•

11 years ago

You can abuse Python's eval() or execsource() to convert a JSON file to a Python data structure. Then, add comments and trailing commas. It still quacks like JSON but it's really Python. I'll try to do a full review shortly.

Cameron McCormack (:heycam)

Comment 30

•

11 years ago

Yeah I suppose using a Python object for UseCounters.conf is another option. I could change it if you really think it's preferable, but I think what I have now is fine for the moment.

Gregory Szorc [:gps]

Comment 31

•

11 years ago

Comment on attachment 8376953 [details] [diff] [review] WIP (v0.2) Review of attachment 8376953 [details] [diff] [review]: ----------------------------------------------------------------- I'm not a huge fan of: a) monolithic and centralized config files like UseCounters.conf b) monolithic and centralized header files for derived data On the surface "a" is a style nit. I just think things should be defined closer to their origin and sufficiently decoupled from the rest of the tree. i.e. if we remove feature X from toolkit/components/X I shouldn't need to update files in dom/base. "b" is a performance concern. Because TelemetryHistogramEnums.h is included by a lot of translation units, a small change to that file results in massive recompilation. On my OS X build when that file changes, we recompile ~100 .o files (in unified mode) and that adds ~70s wall and ~7:50 CPU to builds. "b" is the larger concern. More and more features and translation units will use Telemetry. This patch is proof :) Over time, the percentage of translation units including TelemetryHistogramEnums.h will approach 100%. Over time, the development cost to add/remove a histogram will approach the time for a clobber build. I don't believe this is something developers want to live with. I argue it's easier to transition into multiple header files if the histograms are defined in separate config files (one .h per histogram file is a natural mapping). Therefore, I think pursuing separate config files today is worth doing, especially since it's marginal extra effort. I would either define use counters inline in moz.build files: USE_COUNTERS['css_properties'] += ['fill'] Or have moz.build aggregate the set of files with use counters: USE_COUNTER_FILES += ['usecounters.conf'] We can then have the build backend emit a JSON (or other machine-readable) file describing what it sees and that file is consumed by gen-usecounters.py. I'm not going to block r+ if you use a central config file. I do want people aware of the performance implications and the technical debt a unified config file / header entails. It'll be making incremental builds slower for everyone. I strongly encourage us to get off that path at our earliest convenience. Separate config files with the intention that they eventually result in multiple .h files is a good first step. ::: dom/base/Makefile.in @@ +15,5 @@ > endif > + > +usecounter_files = UseCounterList.h PropertyUseCounterMap.inc UseCounterHistograms.json > +usecounter_conf_src = $(srcdir)/UseCounters.conf > +usecounter_conf_dest = $(CURDIR)/UseCounters.conf You'll want to add these generated files to GENERATED_FILES in moz.build. You shouldn't need $(CURDIR) in there. All paths are relative from the objdir. @@ +17,5 @@ > +usecounter_files = UseCounterList.h PropertyUseCounterMap.inc UseCounterHistograms.json > +usecounter_conf_src = $(srcdir)/UseCounters.conf > +usecounter_conf_dest = $(CURDIR)/UseCounters.conf > + > +$(usecounter_conf_dest) : $(usecounter_conf_src) Nit: Cuddle colon to the left. @@ +20,5 @@ > + > +$(usecounter_conf_dest) : $(usecounter_conf_src) > + $(call py_action,preprocessor,$(DEFINES) $(ACDEFINES) $^ -o $@) > + > +$(usecounter_files) : $(srcdir)/gen-usecounters.py $(usecounter_conf_dest) make will race here due to multiple outputs for one rule. See https://www.gnu.org/software/automake/manual/html_node/Multiple-Outputs.html (Yes, make is silly.) ::: dom/base/UseCounter.h @@ +1,2 @@ > +#ifndef UseCounter_h_ > +#define UseCounter_h_ Need moar MPL. ::: dom/base/UseCounters.conf @@ +16,5 @@ > +// > +// (d) one of three possible use counter declarations: > +// > +// method <IDL interface name>.<IDL operation name> > +// attribute <IDL interface name>.<IDL attribute name> It seems silly to me that we have to both put an attribute in a .webidl *and* update this file. Why can't we have the .webidl parser emit data of all discovered counters and have that output fed into gen-usecounters.py? ::: dom/base/gen-usecounters.py @@ +1,2 @@ > +#!/usr/bin/env python > + Need moar MPL. @@ +4,5 @@ > +import sys > + > +AUTOGENERATED_WARNING_COMMENT = "/* THIS FILE IS AUTOGENERATED BY gen-usercounters.py - DO NOT EDIT */" > + > +def read_conf(conf_filename): Just pass in a file object. It's more versatile. @@ +11,5 @@ > + with open(conf_filename, 'r') as f: > + line_num = 0 > + for line in f: > + line = line.rstrip('\n') > + line_num = line_num + 1 for line_num, line in enumerate(f): @@ +12,5 @@ > + line_num = 0 > + for line in f: > + line = line.rstrip('\n') > + line_num = line_num + 1 > + if line == '' or line.startswith('//'): if not line or line.startswith('//') @@ +17,5 @@ > + # empty line or comment > + continue > + m = re.match(r'method ([A-Za-z0-9]+)\.([A-Za-z0-9]+)$', line) > + if m: > + (interface_name, method_name) = m.groups() Nit: you don't need the parens. @@ +20,5 @@ > + if m: > + (interface_name, method_name) = m.groups() > + counters.append({ 'type': 'method', > + 'interface_name': interface_name, > + 'method_name': method_name }) Nit: cuddle braces @@ +42,5 @@ > + > +def generate_list(filename, counters): > + with open(filename, 'w') as f: > + def print_optional_macro_declare(name): > + print >>f, ''' Please use print() in all new Python so it is Python 3 compatible. (The print statement was removed in Python 3.) That being said, people typically use f.write() or f.writeline(). @@ +63,5 @@ > + print_optional_macro_declare('USE_COUNTER_DOM_METHOD') > + print_optional_macro_declare('USE_COUNTER_DOM_ATTRIBUTE') > + print_optional_macro_declare('USE_COUNTER_CSS_PROPERTY') > + > + for counter in counters: We strive for consistent and stable output from the build system. Any chance we could sort the counters? Although, I suppose if the order comes from the conf file, then that's as stable as we need. @@ +102,5 @@ > + items.append(''' "%s": { > + "expires_in_version": "never", > + "kind": "boolean", > + "description": "%s" > + }''' % (name, desc)) Python has a json module that will convert Python data structures to json and will handle Unicode properly. Please use it. ::: toolkit/components/telemetry/Makefile.in @@ +22,5 @@ > histoenums_TARGET := export > > include $(topsrcdir)/config/rules.mk > > +histogram_files := $(srcdir)/Histograms.json $(DEPTH)/dom/base/UseCounterHistograms.json This introduces a race condition in the build system between the processing of toolkit/components/telemetry/Makefile and dom/base/Makefile. You'll need to add a dependency to the bottom of /Makefile.in to prevent it. @@ +39,5 @@ > > +$(histogram_enum_file): $(histogram_files) $(enum_python_deps) > + $(PYTHON) $(srcdir)/gen-histogram-enum.py $(histogram_files) > $@ > +$(histogram_data_file): $(histogram_files) $(data_python_deps) > + $(PYTHON) $(srcdir)/gen-histogram-data.py $(histogram_files) > $@ It's a best practice to have all our Python scripts that perform code generation to emit a .pp make file that includes extra dependencies. Since these .py files are self-contained and don't import other/shared modules, we should be fine listing dependencies in the Makefile.in. Be forewarned. Also, I'm not sure why we have multiple scripts/actions for the same set of inputs. We should consolidate these scripts. Followup.

Attachment #8376953 - Flags: feedback?(gps) → feedback+

Boris Zbarsky [:bzbarsky]

Reporter

Updated

•

11 years ago

Blocks: 914360

Cameron McCormack (:heycam)

Comment 32

•

11 years ago

(In reply to Gregory Szorc [:gps] from comment #31) > a) monolithic and centralized config files like UseCounters.conf > b) monolithic and centralized header files for derived data > > On the surface "a" is a style nit. I just think things should be defined > closer to their origin and sufficiently decoupled from the rest of the tree. > i.e. if we remove feature X from toolkit/components/X I shouldn't need to > update files in dom/base. OK. We could easily split the configuration file out into say dom/base/UseCounters.conf for all IDL attribute/operation use counters, layout/style/UseCounters.conf for CSS properties, and other files for new types of use counters that we come up with. I feel like it is simpler for them to be defined in one place, though. > "b" is a performance concern. Because TelemetryHistogramEnums.h is included > by a lot of translation units, a small change to that file results in > massive recompilation. On my OS X build when that file changes, we recompile > ~100 .o files (in unified mode) and that adds ~70s wall and ~7:50 CPU to > builds. > > "b" is the larger concern. More and more features and translation units will > use Telemetry. This patch is proof :) Over time, the percentage of > translation units including TelemetryHistogramEnums.h will approach 100%. > Over time, the development cost to add/remove a histogram will approach the > time for a clobber build. I don't believe this is something developers want > to live with. I think adding and removing use counters will be infrequent enough that this doesn't really matter. But: if we did want to solve this, we could make the enum a sized enum and then forward declare it. In nearly all of the headers where it is included, we don't need to refer to specific values of the enum. > I argue it's easier to transition into multiple header files if the > histograms are defined in separate config files (one .h per histogram file > is a natural mapping). Therefore, I think pursuing separate config files > today is worth doing, especially since it's marginal extra effort. How do we ensure that each histogram-enum-value-defining .h file uses unique values, unless we just have a master TelemetryHistogramEnum.h that #includes all the individually generated ones? We're back to where we start then, though, where a change to any of the separate .h files will cause lots of things to rebuild. > I would either define use counters inline in moz.build files: > > USE_COUNTERS['css_properties'] += ['fill'] > > Or have moz.build aggregate the set of files with use counters: > > USE_COUNTER_FILES += ['usecounters.conf'] > > We can then have the build backend emit a JSON (or other machine-readable) > file describing what it sees and that file is consumed by gen-usecounters.py. > > I'm not going to block r+ if you use a central config file. I do want people > aware of the performance implications and the technical debt a unified > config file / header entails. It'll be making incremental builds slower for > everyone. I strongly encourage us to get off that path at our earliest > convenience. Separate config files with the intention that they eventually > result in multiple .h files is a good first step. If we go with the forward-declared enum, there won't be any performance problems with a single config file / header. Would that turn your "reluctant r+" into a "happy r+"? > ::: dom/base/Makefile.in > It seems silly to me that we have to both put an attribute in a .webidl > *and* update this file. Why can't we have the .webidl parser emit data of > all discovered counters and have that output fed into gen-usecounters.py? That is what I had initially (well, generating a histogram json file for the use counters directly), but I found it difficult to get the build system rules right that would cause the Web IDL python generation scripts to generate the histogram json file which the histogram .h file generation rule could then depend on. (Although below you talk about a race condition between toolkit/components/telemetry/Makefile and dom/base/Makefile which might have been the issue.) It also resulted in a bunch of code added to the Web IDL python scripts, and the current UseCounter.conf-reading script is shorter. I also realised that for other types of use counters, like properties, there wasn't a good place to declare them in existing files, and I wanted to have the declaration of use counters to be done in the same way regardless of what type they were. > ::: dom/base/gen-usecounters.py > @@ +63,5 @@ > > + print_optional_macro_declare('USE_COUNTER_DOM_METHOD') > > + print_optional_macro_declare('USE_COUNTER_DOM_ATTRIBUTE') > > + print_optional_macro_declare('USE_COUNTER_CSS_PROPERTY') > > + > > + for counter in counters: > > We strive for consistent and stable output from the build system. Any chance > we could sort the counters? Although, I suppose if the order comes from the > conf file, then that's as stable as we need. It does come from the conf file. I don't think sorting gains us any stability. > @@ +102,5 @@ > > + items.append(''' "%s": { > > + "expires_in_version": "never", > > + "kind": "boolean", > > + "description": "%s" > > + }''' % (name, desc)) > > Python has a json module that will convert Python data structures to json > and will handle Unicode properly. Please use it. > > ::: toolkit/components/telemetry/Makefile.in > @@ +22,5 @@ > > histoenums_TARGET := export > > > > include $(topsrcdir)/config/rules.mk > > > > +histogram_files := $(srcdir)/Histograms.json $(DEPTH)/dom/base/UseCounterHistograms.json > > This introduces a race condition in the build system between the processing > of toolkit/components/telemetry/Makefile and dom/base/Makefile. You'll need > to add a dependency to the bottom of /Makefile.in to prevent it. Could you explain how to do this please?

Flags: needinfo?(gps)

Gregory Szorc [:gps]

Comment 33

•

11 years ago

(In reply to Cameron McCormack (:heycam) from comment #32) > (In reply to Gregory Szorc [:gps] from comment #31) > > "b" is a performance concern. Because TelemetryHistogramEnums.h is included > > by a lot of translation units, a small change to that file results in > > massive recompilation. On my OS X build when that file changes, we recompile > > ~100 .o files (in unified mode) and that adds ~70s wall and ~7:50 CPU to > > builds. > > > > "b" is the larger concern. More and more features and translation units will > > use Telemetry. This patch is proof :) Over time, the percentage of > > translation units including TelemetryHistogramEnums.h will approach 100%. > > Over time, the development cost to add/remove a histogram will approach the > > time for a clobber build. I don't believe this is something developers want > > to live with. > > I think adding and removing use counters will be infrequent enough that this > doesn't really matter. $ hg log toolkit/components/telemetry/Histograms.json --template '{date(firstpushdate, "%Y-%m")}\n' | sort | uniq -c 5 2012-08 12 2012-09 17 2012-10 20 2012-11 10 2012-12 16 2013-01 11 2013-02 24 2013-03 15 2013-04 5 2013-05 11 2013-06 24 2013-07 13 2013-08 15 2013-09 16 2013-10 43 2013-11 15 2013-12 28 2014-01 16 2014-02 0.5 to ~1.5 average changes per day is not infrequent. This is probably masked by other changes to widely-used headers. But that shouldn't be an excuse to contribute to the problem. > But: if we did want to solve this, we could make the enum a sized enum and > then forward declare it. In nearly all of the headers where it is included, > we don't need to refer to specific values of the enum. Interesting idea. > > I argue it's easier to transition into multiple header files if the > > histograms are defined in separate config files (one .h per histogram file > > is a natural mapping). Therefore, I think pursuing separate config files > > today is worth doing, especially since it's marginal extra effort. > > How do we ensure that each histogram-enum-value-defining .h file uses unique > values, unless we just have a master TelemetryHistogramEnum.h that #includes > all the individually generated ones? We're back to where we start then, > though, where a change to any of the separate .h files will cause lots of > things to rebuild. I'm not sure how histogram enums are submitted as part of Telemetry. Would it be possible to define an individual histogram not as a single integer but as a tuple of (namespace, enum)? How does Telemetry know that HistoX from Nightly is the same as HistoX from Aurora if HistoX is represented by "42?" > > I'm not going to block r+ if you use a central config file. I do want people > > aware of the performance implications and the technical debt a unified > > config file / header entails. It'll be making incremental builds slower for > > everyone. I strongly encourage us to get off that path at our earliest > > convenience. Separate config files with the intention that they eventually > > result in multiple .h files is a good first step. > > If we go with the forward-declared enum, there won't be any performance > problems with a single config file / header. Would that turn your > "reluctant r+" into a "happy r+"? Yes. I'd still like multiple files because I think it is architecturally better (and results in fewer merge conflicts). But performance is my major concern. > > ::: dom/base/Makefile.in > > It seems silly to me that we have to both put an attribute in a .webidl > > *and* update this file. Why can't we have the .webidl parser emit data of > > all discovered counters and have that output fed into gen-usecounters.py? > > That is what I had initially (well, generating a histogram json file for the > use counters directly), but I found it difficult to get the build system > rules right that would cause the Web IDL python generation scripts to > generate the histogram json file which the histogram .h file generation rule > could then depend on. (Although below you talk about a race condition > between toolkit/components/telemetry/Makefile and dom/base/Makefile which > might have been the issue.) It also resulted in a bunch of code added to > the Web IDL python scripts, and the current UseCounter.conf-reading script > is shorter. I also realised that for other types of use counters, like > properties, there wasn't a good place to declare them in existing files, and > I wanted to have the declaration of use counters to be done in the same way > regardless of what type they were. As long as we're using recursive make, toolkit/components/telemetry and dom/base will not directly contain cross-directory dependencies because that's how recursive make works. The way you encode dependencies is by directory traversal order in the root Makefile.in. See reply below. You already have a separate mechanism for use counters from Telemetry Histograms. I don't think it's a huge deal to have a separate mechanism for WebIDL and CSS use counters. Think of it as putting in a little extra effort now to make the lives of WebIDL developers easier going forward. I can't say with certainty if that trade-off is worth it! and will handle Unicode properly. Please use it. > > > > ::: toolkit/components/telemetry/Makefile.in > > @@ +22,5 @@ > > > histoenums_TARGET := export > > > > > > include $(topsrcdir)/config/rules.mk > > > > > > +histogram_files := $(srcdir)/Histograms.json $(DEPTH)/dom/base/UseCounterHistograms.json > > > > This introduces a race condition in the build system between the processing > > of toolkit/components/telemetry/Makefile and dom/base/Makefile. You'll need > > to add a dependency to the bottom of /Makefile.in to prevent it. > > Could you explain how to do this please? The following will ensure that |make -C toolkit/components/telemetry export| occurs after |make -C dom/base export| and |make -C dom/bindings/export|: ifdef MOZ_PSEUDO_DERECURSE toolkit/components/telemetry/export: dom/base/export dom/bindings/export endif

Flags: needinfo?(gps)

Cameron McCormack (:heycam)

Comment 34

•

11 years ago

(In reply to Gregory Szorc [:gps] from comment #33) > 0.5 to ~1.5 average changes per day is not infrequent. This is probably > masked by other changes to widely-used headers. But that shouldn't be an > excuse to contribute to the problem. I was thinking of use counters rather than telemetry histograms more generally. I don't think people will need to be adding/removing use counters to measure the usage of Web platform features as frequently as these changes to TelemetryHistograms.json. > I'm not sure how histogram enums are submitted as part of Telemetry. Would > it be possible to define an individual histogram not as a single integer but > as a tuple of (namespace, enum)? How does Telemetry know that HistoX from > Nightly is the same as HistoX from Aurora if HistoX is represented by "42?" I have no idea really. > > If we go with the forward-declared enum, there won't be any performance > > problems with a single config file / header. Would that turn your > > "reluctant r+" into a "happy r+"? > > Yes. OK. I've filed bug 977430 for that. I'll get your feedback on that once the patch is up. > You already have a separate mechanism for use counters from Telemetry > Histograms. I don't think it's a huge deal to have a separate mechanism for > WebIDL and CSS use counters. Think of it as putting in a little extra effort > now to make the lives of WebIDL developers easier going forward. I can't say > with certainty if that trade-off is worth it! OK. I'll go back to using the [UseCounter] extended attribute in .webidl files as the key to generating DOM use counters, and having a separate .conf file for CSS properties. > > Could you explain how to do this please? > > The following will ensure that |make -C toolkit/components/telemetry export| > occurs after |make -C dom/base export| and |make -C dom/bindings/export|: > > ifdef MOZ_PSEUDO_DERECURSE > toolkit/components/telemetry/export: dom/base/export dom/bindings/export > endif Great, thanks!

Cameron McCormack (:heycam)

Comment 35

•

11 years ago

I'm thinking some more. If we generate the use counter enum and the telemetry histogram json files based on [UseCounter] in .webidl files, and given that that would happens as part of Codegen.py, I think any change to any .webidl file would cause the enum/json files to be regenerated. Unless we add the ability to invoke Codegen.py more than once, so not just from mozwebidlcodegen/__init__.py's generate_build_files. That would mean we have to parse all the .webidl files a second time though. Regarding splitting the .conf files into separate directories, and whether that can help avoid an enum header that causes various files to be rebuilt whenever a use counter is added/removed, I don't think we can. nsIDocument needs to know at compile time the total number of use counters N, and we need each use counter to have a unique value less than N. So using an enum is really the only thing that makes sense. Unless we want to use strings or something else to represent use counters IDs, and have a lookup table in nsIDocument. But I don't think we want to pay that performance cost, since we want to keep the function that records a use counter as fast as possible. So although bug 977430 will reduce the number of files that get recompiled when use counters change and hence when telemetry IDs change, by not including the enum file in headers, every cpp file that does telemetry calls will get rebuilt when a use counter gets added/removed. If as you say, eventually very many files will have telemetry calls, then we'll be rebuilding a lot. I don't see a good way around this, though. Happy to hear your further thoughts.

Flags: needinfo?(gps)

Boris Zbarsky [:bzbarsky]

Reporter

Comment 36

•

11 years ago

I'm sorry for the lag here.... Still reading through the patch, but why are we adding the new document observer notification? Do we have some code that does something useful with it? If not, what are the planned uses?

Flags: needinfo?(cam)

Cameron McCormack (:heycam)

Comment 37

•

11 years ago

(In reply to Boris Zbarsky [:bz] (reviews will be slow; ask someone else) from comment #36) > Still reading through the patch, but why are we adding the new document > observer notification? Do we have some code that does something useful with > it? If not, what are the planned uses? Yes, I'm using it in image/src/VectorImage.cpp to allow outer documents to listen for use counters that are incremented by SVG documents loaded as images. nsDocument::NotifyUseCounterSet is where I'm notifying the observers from.

Flags: needinfo?(cam)

Benjamin Smedberg

Comment 38

•

11 years ago

>Would it be possible to define an individual histogram not as a single integer but as a tuple > of (namespace, enum)? How does Telemetry know that HistoX from Nightly is the same as HistoX > from Aurora if HistoX is represented by "42?" Jumping in late here: in the telemetry payload must not use opaque integer enum values but should use string keys. Currently there is no way to represent a single histogram with an extra "key" value, but that doesn't mean we couldn't build something like that if it were important. Note that we'd still want to declare probe expiration (and in the future, probe sampling rate) separately for each measurement. I'm not sure I understand how the .webidl annotations fit into this bug. Not all of these measurements are generated through webIDL, right? Can we separate out the core "measure usage by document" and "automatically measure usage in webidl" bits to make this easier to understand? In particular (for example) it would have been nice to use this mechanism for bug 851917 instead of the hand-rolled system.

Boris Zbarsky [:bzbarsky]

Reporter

Comment 39

•

11 years ago

Comment on attachment 8376953 [details] [diff] [review] WIP (v0.2) > to allow outer documents to listen for use counters that are incremented by SVG documents > loaded as images. Ah, I see. It seems like a lot of machinery to get this data, but I'm not seeing a better way of doing it so far. :( It might be good to break out this boilerplate into a separate patch. Maybe two patches: one to add the document observer, and one to add all the imagelib bits. I'd like to understand the difference between the SetDocumentUseCounter and SetPageUseCounter APIs. Also, whether we need to keep the separate bitsets for "page counters" and "document counters". SetPageUseCounter shouldn't go through a presshell. You can get documents from a docshell directly via do_GetInterface. MappedAttrParser probably doesn't need an mNodePrincipal member anymore; it can just get it off the element. For codegen, I'm not sure CGPerSignatureCall is the right place to put this code. It's convenient because it covers all the callers, but it also means a copy of the code for every overload... and you're having to change all the callers anyway to pass in the useCounterName. So it might be better to do it in things like CGAbstractStaticBindingMethod and CGSpecializedGetter/Setter/Method. >+ # XXX Why do I get two "URL" interface descriptors? One's a worker descriptor. Since we don't particularly support use counters in workers so far with this setup, you could just filter those out in your config.getDescriptors() call. That said, getMembersWithUseCounters seems to be unused. No need to add "UseCounter" to the trailing bit in IDLMethod, since you're already checking for it earlier. Though I'm not sure what I think of "isSpecial" as a name... The global enum thing is a bit annoying, but I have no better ideas. :(

Attachment #8376953 - Flags: feedback?(bzbarsky) → feedback+

Boris Zbarsky [:bzbarsky]

Reporter

Comment 40

•

11 years ago

Oh, and the actual question you asked, about interesting document cases. XBL documents are worth testing (e.g. for the CSS use counters, in binding sheets). So are documents created via DOMImplementation, though I expect these to act like DOMParser. Another thing: chrome vs non-chrome. The patch here only sends telemetry for the latter, but it might be good to do it for both, with different names so we can catch extension usage too?

Gregory Szorc [:gps]

Comment 41

•

11 years ago

(In reply to Cameron McCormack (:heycam) from comment #35) > I'm thinking some more. If we generate the use counter enum and the > telemetry histogram json files based on [UseCounter] in .webidl files, and > given that that would happens as part of Codegen.py, I think any change to > any .webidl file would cause the enum/json files to be regenerated. Unless > we add the ability to invoke Codegen.py more than once, so not just from > mozwebidlcodegen/__init__.py's generate_build_files. That would mean we > have to parse all the .webidl files a second time though. We definitely do not want to parse the .webidl files multiple times in a build. We invested a lot of effort into making WebIDLs build the way they do now because of performance and build dependency reasons. > Regarding splitting the .conf files into separate directories, and whether > that can help avoid an enum header that causes various files to be rebuilt > whenever a use counter is added/removed, I don't think we can. nsIDocument > needs to know at compile time the total number of use counters N, and we > need each use counter to have a unique value less than N. So using an enum > is really the only thing that makes sense. Unless we want to use strings or > something else to represent use counters IDs, and have a lookup table in > nsIDocument. But I don't think we want to pay that performance cost, since > we want to keep the function that records a use counter as fast as possible. > > So although bug 977430 will reduce the number of files that get recompiled > when use counters change and hence when telemetry IDs change, by not > including the enum file in headers, every cpp file that does telemetry calls > will get rebuilt when a use counter gets added/removed. If as you say, > eventually very many files will have telemetry calls, then we'll be > rebuilding a lot. I don't see a good way around this, though. > > Happy to hear your further thoughts. Is there any way we can have multiple "namespaces" for use counters and histograms? By having a single shared namespace/array/list of integer enumerations, we've painted ourselves into this corner of any-changes-to-the-global-list-require-recompilations-to-the-world. What I'd like to see is separate enumerations for each domain/namespace. Each component will only know about the enumerations for its subset of the world. There would be a single component (a registrar if you will) that knows how to map each tuple of (namespace, local enumeration) to the master integer list (if such a unified list even exists). For example, instead of: void Accumulate(ID id, uint32_t sample); we have void Accumulate(ID ns, ID, id, uint32_t sample); Each histogram is annotated with its "namespace." When enumerations are generated, we group all histograms by their namespace. Say we have the "dom" and "http" namespaces. We'd write out multiple .h files: TelemetryEnumsDom.h and TelemetryEnumsHttp.h. .cpp files would only include the .h containing the enumerations they needed. When a histogram is recorded, we'd do something like: Accumulate(TELEMETRY_NS_HTTP, TELEMETRY_HTTP_PAGE_CACHE_READ_TIME, value); When telemetry values are aggregated, the internal data structure is a double hashmap instead of a single (namespace is outer, enum is inner). Full knowledge of all enumerations for all namespaces would only need to live inside the Telemetry aggregator. I imagine converting all existing Telemetry to use namespaces would be a big effort. I think we could implement use counters initially as a one-off and over time convert existing Telemetry code to use the new, namespace-aware APIs, slowly improving include hell in the process. Does this idea have legs?

Cameron McCormack (:heycam)

Comment 42

•

11 years ago

(In reply to Gregory Szorc [:gps] from comment #41) > For example, instead of: > > void Accumulate(ID id, uint32_t sample); > > we have > > void Accumulate(ID ns, ID, id, uint32_t sample); Doing a lookup based on integer (namespace, id) pairs is probably acceptable in terms of performance, compared to using strings. The signature would need to be something like: void Accumulate(Telemetry::Namespace ns, uint32_t id, uint32_t sample); where Telemetry::Namespace is as enum representing the domains/namespaces. The id argument can't be an enum any more, assuming they have overlapping values across different namespaces. So we lose some type safety there. Maybe we could have separate Accumulate functions per namespace. void AccumulateHTTP(Telemetry::HTTPID id, uint32_t sample); void AccumulateDOM(Telemtry::DOMID id, uint32_t sample); That'd let us keep the type safety. So the net result would be that if we add/remove a namespace, everything that uses telemetry gets rebuilt. If you add/remove an individual ID within a namespace, everything within that namespace that uses telemetry. Still, all of this improvement seems orthogonal to the use counter stuff I really want to implement in this bug. Anyway, I am not a Telemetry peer. (Oh, looks like there's no Telemetry module.) Vladan do you have a view on this?

Flags: needinfo?(vdjeric)

Cameron McCormack (:heycam)

Comment 43

•

11 years ago

(Or Taras?)

Gregory Szorc [:gps]

Comment 44

•

11 years ago

(In reply to Cameron McCormack (:heycam) from comment #42) > Still, all of this improvement seems orthogonal to the use counter stuff I > really want to implement in this bug. Sorry about that. This trap has been in place for months. You're just the unlucky person to step into it first. If use counters is super high importance, feel free to call stop energy on me. I'm just hoping we can get a scalable solution for new implementations without too much additional effort.

Flags: needinfo?(gps)

Florian Bender

Updated

•

11 years ago

No longer blocks: 914360

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=914360

Kohei Yoshino

Updated

•

11 years ago

Keywords: dev-doc-needed

OS: Mac OS X → All

Hardware: x86 → All

Vladan Djeric (:vladan)

Comment 45

•

10 years ago

Clearing old needinfo here. The Telemetry build improvements are unlikely to happen this quarter unless someone outside my team steps up. Converting all the existing Telemetry users is going to be a special pain. Use-counters seem like a decent idea although I think they can be implemented using existing Telemetry semantics, e.g. to track number of pages containing a Flash object you can use a boolean histogram and accumulate a "true" value every time you load a page with Flash and "false" every time you load a page without Flash. For "number of times a condition X is met on a page", a linear or exponential histogram would suffice. I don't think histograms need to be "reset" as long as you accumulate a zero value when a page does not meet condition X. The current Telemetry dashboard (and other Telemetry tools) would show the aggregate results without needing any changes. Anyway, we can revive this discussion if there's still interest in adding new semantics to Telemetry

Flags: needinfo?(vdjeric)

WIP (v0.1) 11 years ago Cameron McCormack (:heycam) (deleted), patch		Details \| Diff \| Splinter Review
WIP (v0.2) 11 years ago Cameron McCormack (:heycam) (deleted), patch	gps : feedback+ bzbarsky : feedback+	Details \| Diff \| Splinter Review
implement Blink-style use counters for CSS properties and WebIDL attributes and operations - WIP (v0.3) 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 1 - add infrastructure for defining use counters from UseCounters.conf 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 2 - change MappedAttrParser to store a nsSVGElement directly, instead of its nsIPrincipal 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 3 - add infrastructure for reporting use counters through documents 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 4 - hook up use counters to WebIDL bindings 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 1 - add infrastructure for defining use counters from UseCounters.conf 10 years ago Nathan Froyd [:froydnj] (deleted), patch	gps : feedback+	Details \| Diff \| Splinter Review
part 2 - change MappedAttrParser to store a nsSVGElement directly, instead of its nsIPrincipal 10 years ago Nathan Froyd [:froydnj] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 3 - add infrastructure for reporting use counters through documents 10 years ago Nathan Froyd [:froydnj] (deleted), patch	seth : feedback-	Details \| Diff \| Splinter Review
part 4 - hook up use counters to WebIDL bindings 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 5a - add nsIDOMWindowUtils::forceUseCounterFlush 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 5 - add tests for use counters 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 5a - add nsIDOMWindowUtils::forceUseCounterFlush 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 5 - add tests for use counters 10 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 1 - add infrastructure for defining use counters from UseCounters.conf 10 years ago Nathan Froyd [:froydnj] (deleted), patch	gfritzsche : review+ mshal : review+ heycam : review+	Details \| Diff \| Splinter Review
part 3a - add core DOM use counter functionality 9 years ago Nathan Froyd [:froydnj] (deleted), patch	smaug : review-	Details \| Diff \| Splinter Review
part 3b - propagating use counters from SVG images into owning/parent documents 9 years ago Nathan Froyd [:froydnj] (deleted), patch	seth : feedback+	Details \| Diff \| Splinter Review
part 3c - miscellaneous telemetry changes for use counters 9 years ago Nathan Froyd [:froydnj] (deleted), patch	gfritzsche : review+	Details \| Diff \| Splinter Review
part 3d - record use counter information from the CSS parser 9 years ago Nathan Froyd [:froydnj] (deleted), patch	dbaron : review+	Details \| Diff \| Splinter Review
part 4 - hook up use counters to WebIDL bindings 9 years ago Nathan Froyd [:froydnj] (deleted), patch	bzbarsky : review+	Details \| Diff \| Splinter Review
part 5a - add use counters for SVGSVGElement and CSS properties for testing purposes 9 years ago Nathan Froyd [:froydnj] (deleted), patch		Details \| Diff \| Splinter Review
part 5b - add nsIDOMWindowUtils::forceUseCounterFlush 9 years ago Nathan Froyd [:froydnj] (deleted), patch	bzbarsky : review+	Details \| Diff \| Splinter Review
part 5c - add tests for use counters 9 years ago Nathan Froyd [:froydnj] (deleted), patch	bzbarsky : review+	Details \| Diff \| Splinter Review
part 6 - add use counters for deprecated operations 9 years ago Nathan Froyd [:froydnj] (deleted), patch	bzbarsky : review+	Details \| Diff \| Splinter Review
part 3d - record use counter information from the CSS parser 9 years ago Nathan Froyd [:froydnj] (deleted), patch	froydnj : review+	Details \| Diff \| Splinter Review
part 5a - add use counters for SVGSVGElement and CSS properties for testing purposes 9 years ago Nathan Froyd [:froydnj] (deleted), patch	bzbarsky : review+	Details \| Diff \| Splinter Review
part 3a - add core DOM use counter functionality 9 years ago Nathan Froyd [:froydnj] (deleted), patch	smaug : review-	Details \| Diff \| Splinter Review
part 3a - add core DOM use counter functionality 9 years ago Nathan Froyd [:froydnj] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
fix bogus assert in nsSVGElement.cpp 9 years ago Nathan Froyd [:froydnj] (deleted), patch	heycam : review-	Details \| Diff \| Splinter Review
fix bogus assert in nsSVGElement.cpp 9 years ago Nathan Froyd [:froydnj] (deleted), patch	heycam : review+	Details \| Diff \| Splinter Review