28221 - [tracking bug] profile string usage; deploy new implementations where appropriate

Reporter

Description

•

25 years ago

I still believe that an immutable nsIString interface coupled with appropriate implementations could be a huge win for us in terms of both space and time. There would need to be at least 4 implementations to make this work: - nsUnicharString for double-byte encoding, - nsCString for single-byte encoding, - nsSubString would manage a lengh and offset into another nsIString to avoid copying, - nsConcatenatedString would manage a sequence of nsIStrings, treating them as a single concatenated string. To determine whether my hypothesis is correct, I think we can instrument nsString and nsCString to gather statistics that indicate how many copies of strings we make in the process of running our app. Specifically: - Count the number of times each nsString constructs a char/PRUnichar array. This is often done when passing them to IDL-generated interfaces. (This number could be completely eliminated with nsIString.) [ToNewString, ToNewCString, ToNewUnicode, ToCString] - Count the number of times we construct nsStrings from char/PRUnichar arrays. This is often done when we want to manipulate strings that come in from IDL-generated interfaces. (Some number of these could be eliminated with nsIString.) (How can we break down first-time constructions from copies - histogram?) - Count the number of times we assign the character sequence in a string. Also count the percentage of strings which are actually assigned. (This number would indicate the number of additional nsIStrings which would need to be created due to immutability.) [SetString, Assign, operator=] - Count the number of substring operations done on nsStrings. (This number could be replaced by an allocation of an nsSubString object.) [SetLength, Truncate, Trim, Left, Mid, Right, Cut] - Count the number of concatenation operations done on nsStrings. Also count the percentage of strings which are concatenated. (This number could be replaced by an allocation of an nsConcatenatedString object, saving space.) [operator+, operator+=, Append, Insert] - Count the number of mutation operations done on nsStrings. Also count the percentage of strings which are mutated. (This number would indicate how often an actual string buffer (e.g. the existing nsString implementation) would continue to be needed.) [SetCharAt, ToLowerCase, ToUpperCase, StripChars, StripWhitespace, ReplaceChar, ReplaceSubstring, CompressSet, CompressWhitespace] Once we have these counts, we can see can do some analysis to determine what sort of ramifications nsIString might have: Will we allocate far fewer strings because they're shared more? Will we need to allocate far more substring objects because they're mutated too often? What sort of space might we expect to save due to more sharing. What sort of space might we expect to loose due to more copies made as a result of mutation. Right now we're in the dark.

Warren Harris

Reporter

Updated

•

25 years ago

Keywords: perf

Summary: investigate nsIString → investigate nsIString

Suresh Duddi (gone)

Comment 1

•

25 years ago

Just counting constructors wont be good I think. We should factor in how many are destroyed to get a figure of how many will exist. That will influence the space issue.

Chris Waterson

Comment 2

•

25 years ago

n.b. that vidur & troy are exploring some kind of BSTR-like stuff to reduce copies in layout. Not sure if their stuff would cross interface boundaries...

nsStringStats.h 25 years ago Chris Waterson (deleted), text/plain		Details
diffs to xpcom/ds to implement string stats 25 years ago Chris Waterson (deleted), patch		Details \| Diff \| Splinter Review
better diffs. 25 years ago Chris Waterson (deleted), patch		Details \| Diff \| Splinter Review
Here is a patch containing the changes I made to measure COW efficacy 25 years ago Scott Collins (deleted), patch		Details \| Diff \| Splinter Review
diffs for mutated/unmutated accounting (in addition to Chris' diffs) 25 years ago Warren Harris (deleted), patch		Details \| Diff \| Splinter Review
attaching data <dougt@netscape.com> generated... 24 years ago Scott Collins (deleted), text/plain		Details
more <dougt@netscape.com> data... 24 years ago Scott Collins (deleted), text/plain		Details
Here's the patch Doug used to generate this data... 24 years ago Scott Collins (deleted), patch		Details \| Diff \| Splinter Review