Open Bug 582858 — opened 14 years ago, updated 1 year ago

Provide an nsTextFragment API which makes it possible for callers to skip ASCII scanning code

Categories: Core :: DOM: Core & HTML (defect)
People: Reporter: ehsan.akhgari; Unassigned
Keywords: perf
Whiteboard: [post-2.0]
Attachments: 4 files, 4 obsolete

We had a discussion with roc and bz on IRC about how much of a perf hit/win we'd get from storing all of our text node content as UTF-16 (aka m2b) in text fragments. I took it upon myself to write the patch and experiment with it. If all goes well, we can probably land that patch on trunk and see what our Talos numbers tell us.

Here is roc's idea on how to test such a patch before landing on trunk:

<roc> for a worst-case test you probably want a very large text file, many <p> elements with say 1000 characters per element, loaded locally from a file
<roc> compare to trunk with text-rendering: optimizeLegibility added, and without that added

I'll write that patch, and post the results here.
OS: Mac OS X → All
Hardware: x86 → All
Version: 1.9.1 Branch → Trunk
So, I have a patch (which I'll attach shortly) and a test case, which is basically the text of the top 9 Project Gutenberg books concatenated, with a paragraph break inserted every 1000 bytes. Here are the results: http://mzl.la/9nIoSI

I'm not quite sure how to interpret these results...
Attached patch Part 1: Remove the 1b text fragment APIs (obsolete) (deleted) — Splinter Review
This patch works (in the sense that Firefox starts and things seem to be fine judging by some smoke tests), but before I spend any more time on this, I want to make sure that the time I spend is worth it.
(Specifically, I have not yet removed any of the 8-bit fast path code in graphics, although it no longer kicks in because I'm not setting TEXT_IS_8BIT any more.)
Attached file Test case (bzipped) (deleted) —
Fix some memory allocation issues in nsTextFragment.
Attachment #461619 - Attachment is obsolete: true
Does anybody have any ideas? Do people think it's worthwhile for me to write the second part of the patch (to remove the TEXT_IS_8BIT stuff) and push them to see if catlee's scripts pick up any regression/improvement?
I'm not sure why you want to do this. Are you expecting some kind of performance win from moving all DOM nodes to UTF-16?
There is one, actually: textnodes would no longer have to walk over their text trying to decide how to store it, so textnode creation and modification could be faster.
(In reply to comment #8)
> There is one, actually: textnodes would no longer have to walk over their text
> trying to decide how to store it, so textnode creation and modification could
> be faster.

Yes, exactly. FWIW, my tests show that this *actually* makes us a lot better at handling things such as editing huge textareas (after bug 240933).
What are the cases that get a lot faster? It seems to me that all we have to do is, when text is inserted/appended, scan the new text for characters >= 256.
Yes, indeed. For the editor, the problem is the lack of a working insert method on text fragments, which I think we should fix. For Dromaeo's stupid createTextNode(huge-string) test, the scan is just ... expensive. It totally dwarfs all the other costs involved.
OK. I think it's totally OK for nsDocument::CreateTextNode (and any other DOM text methods that are a performance issue) to call a SetText method that doesn't scan, just assumes UTF16. We could even do that for editor too, if we wanted. I'm guessing that almost all the benefits of 8-bit text can be captured as long as we can create 8-bit text from the parser.
I'd be on board with that. Between that and a better insert API, do we really need this?
Well, that and I guess maybe converting editor textnodes to UTF-16.
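To make the idea from comment 12 concrete, here is a minimal, hypothetical sketch of such a non-scanning setter. The struct and member names are illustrative guesses modeled on the m1b/m2b storage mentioned in comment 0, not the real nsTextFragment internals:

    #include <cstdint>
    #include <cstring>

    struct TextFragment {
      char16_t* m2b = nullptr;   // 2-byte (UTF-16) storage, aka "m2b"
      uint32_t mLength = 0;
      bool mIs2b = false;

      // Proposed fast path: the caller vouches that UTF-16 storage is fine,
      // so we copy the buffer without scanning for code units >= 256.
      void SetToUnscannedUTF16(const char16_t* aBuffer, uint32_t aLength) {
        delete[] m2b;
        m2b = new char16_t[aLength];
        std::memcpy(m2b, aBuffer, aLength * sizeof(char16_t));
        mLength = aLength;
        mIs2b = true;
      }
    };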
One thing we should consider is using vectorized instructions for the scan. That could be significantly faster. Justin: Is that something you'd be interested in looking into?
(In reply to comment #15)
> One thing we should consider is using vectorized instructions for the scan.
> That could be significantly faster.
>
> Justin: Is that something you'd be interested in looking into?

Sure, but I'm not sure I'll be able to get to it by the FF4 feature freeze. I can try if we think it might really help.
I don't think improving performance counts as a feature, so I don't think we need to make beta4.
Is it just this loop?

    PRBool need2 = PR_FALSE;
    while (ucp < uend) {
      PRUnichar ch = *ucp++;
      if (ch >= 256) {
        need2 = PR_TRUE;
        break;
      }
    }

Should be pretty easy to vectorize.
That's the loop, yes. I think we should strongly consider having at least createTextNode just skip all that jazz and use UTF-16 in any case (separate bug?). There's no way we can get even a vectorized scan to the speed of no scan at all.
jst, you said you had an initial attempt at vectorizing this? I'm curious to take a look, since I don't see an unsigned compare operation in SSE/SSE2.

I think we could:

* shift each 16-bit word right 8 bits
* do a signed 16-bit int compare to 0
* pull out the high-order bit of each byte with pmovmskb
* compare the result to 0

but maybe there's a more clever solution.
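For illustration, here is a self-contained, untested sketch of that scan using SSE2 intrinsics. This is not jst's experiment; it also sidesteps the missing unsigned compare by testing whether the high byte of each 16-bit word is zero with an equality compare, which carries the same information as the signed-compare sequence above:

    #include <emmintrin.h>  // SSE2
    #include <cstddef>

    // Returns true if every UTF-16 code unit in str[0..len) is < 256.
    static bool Is8BitSSE2(const char16_t* str, size_t len) {
      size_t i = 0;
      // Process 8 code units (16 bytes) per iteration.
      for (; i + 8 <= len; i += 8) {
        __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(str + i));
        // Shift each 16-bit word right by 8 bits: the lane is nonzero
        // exactly when the code unit is >= 256.
        __m128i hi = _mm_srli_epi16(v, 8);
        // Lanes whose high byte was zero compare equal to zero (all ones).
        __m128i eq = _mm_cmpeq_epi16(hi, _mm_setzero_si128());
        // pmovmskb extracts one bit per byte; 0xFFFF means all lanes passed.
        if (_mm_movemask_epi8(eq) != 0xFFFF) {
          return false;  // found a code unit >= 256
        }
      }
      // Scalar tail for the remaining < 8 code units.
      for (; i < len; ++i) {
        if (str[i] >= 256) {
          return false;
        }
      }
      return true;
    }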
Here's an experiment I did quite some time ago but never got around to testing properly etc. Justin, please have a look at this and use it or dismiss it as you see fit. IIRC this was a performance win, but I forget by how much, and while it seemed to do the right thing, I didn't actually verify that.
Attached file Testcase (obsolete) (deleted) —
Trunk: 160ms
With patch: 180ms (first run), 120ms (later runs)

Looking into why my patch causes a slowdown on the initial run...
Attached file Testcase v1.1 (obsolete) (deleted) —
That last testcase file had some gunk in it.
Attachment #464197 - Attachment is obsolete: true
Oh, I think I was building my patched code on top of a different revision than I was using as the unpatched version. New numbers in a moment.
All right. With everything based atop b7836c3a63db and testcase v1.1, I get:

Trunk: 220ms (first run), 155ms (later runs)
Patched: 180ms (first run), 122ms (later runs)

This is surprisingly only 30% faster, but the assembly it's generating looks OK to me. I'll put the patch up soon; I just need to filter it into a variety of pieces.
Attached patch Patch v1 (obsolete) (deleted) — Splinter Review
Not sure who to request for a review here. I'd tag bz, but that's just out of habit. :)
This patch is on top of the patches in bug 585818 and bug 585708.
Attachment #464242 - Flags: review?(vladimir)
That patch has nothing to do with this bug. Can we please file a separate bug instead of hijacking this one?
Split into bug 585978 and bug 585980. Sorry for hijacking!
Attachment #464198 - Attachment is obsolete: true
Attachment #464242 - Attachment is obsolete: true
Attachment #464242 - Flags: review?(vladimir)
No longer depends on: 585708, 585818
Potential consumers of this API would be the editor and nsDocument::CreateTextNode.
Summary: Experiment with storing all DOM text node content as UTF-16 → Provide an nsTextFragment API which makes it possible for callers to skip ASCII scanning code
Whiteboard: [post-2.0]
Would a consumer of this API pass in a UTF-16 string, or a string known to be ASCII? Or would the API support both cases?
(In reply to comment #31)
> Would a consumer of this API pass in a UTF-16 string, or a string known to be
> ASCII? Or would the API support both cases?

A UTF-16 string. We don't need to scan strings which are known to be ASCII, after all. :-)
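As a hypothetical illustration of the calling convention, here is how a consumer such as nsDocument::CreateTextNode could hand a UTF-16 buffer straight to the fragment. This reuses the illustrative TextFragment/SetToUnscannedUTF16 sketch from earlier in this bug, which is not a real Gecko API:

    #include <string>

    // createTextNode already holds UTF-16 data, so it can skip the scan
    // entirely and store the buffer as 2-byte text.
    void CreateTextNodeFast(TextFragment& frag, const std::u16string& data) {
      frag.SetToUnscannedUTF16(data.data(),
                               static_cast<uint32_t>(data.size()));
    }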
Attached patch patch used for the tests (deleted) — Splinter Review
With ehsan's help, I investigated this issue a bit, and discovered that using 16-bit strings unconditionally for createTextNode does not change performance in general, and decreases performance a lot for long ASCII strings.

The first reason is that copying a UTF-16 buffer to m2b takes twice as long as copying an ASCII buffer to m1b. What we lose there is more than what we gain by skipping the Is8Bit check. The second, and main, reason is that we have to traverse the string anyway in updateBidiFlag if the string is UTF-16.

For the record, here is where most of the time is spent inside createTextNode:

For a 100,000-character ASCII string:
  Is8Bit call: 31 ms
  copy_string: 100 ms

For a 1,000,000-character string:
  Is8Bit: 931 ms
  copy_string: 1050 ms

For a 10,000,000-character string:
  Is8Bit: 4858 ms
  copy_string: 12100 ms

And when forcing UTF-16 (i.e. calling SetTo with aForceUTF16 set to true in the attached patch):

100,000-character string:
  memcpy: 187 ms
  bidi update: 550 ms

1,000,000-character string:
  memcpy: 2511 ms
  bidi update: 5500 ms

10,000,000-character string:
  memcpy: 26940 ms
  bidi update: 53000 ms
Makes sense that the memory churn is costing us more than the extra CPU cycles. Are we currently doing bidi scanning separately from the is8bit scanning? Could we merge the two and only do bidi scanning from the point where the 8-bit scanning stopped? If yes, could someone file a separate bug?
Good idea. I've opened bug 682592
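For reference, a hypothetical sketch of the merged scan from comment 35 (the real work is in bug 682592; the RTL ranges below are a simplified approximation, not Gecko's actual bidi classification). It relies on the fact that no character below U+0100 is right-to-left, so the bidi check only needs to start where the 8-bit scan stopped:

    #include <cstddef>

    struct ScanResult {
      bool is8Bit;   // all code units < 256
      bool hasRTL;   // saw a (probable) right-to-left character
    };

    static ScanResult ScanText(const char16_t* str, size_t len) {
      ScanResult r = { true, false };
      size_t i = 0;
      // Phase 1: find the first code unit >= 256.  Text that is entirely
      // Latin-1 cannot contain RTL characters, so we can stop there.
      for (; i < len; ++i) {
        if (str[i] >= 256) {
          r.is8Bit = false;
          break;
        }
      }
      // Phase 2: continue from the same position, looking for RTL ranges
      // (simplified: Hebrew/Arabic blocks and their presentation forms).
      for (; i < len; ++i) {
        char16_t ch = str[i];
        if ((ch >= 0x0590 && ch <= 0x08FF) ||
            (ch >= 0xFB1D && ch <= 0xFDFF) ||
            (ch >= 0xFE70 && ch <= 0xFEFC)) {
          r.hasRTL = true;
          break;
        }
      }
      return r;
    }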
Assignee: ehsan → nobody
Status: ASSIGNED → NEW
Severity: normal → S3