264412 - (innertext) Add support for element.innerText

Reporter

Description

•

20 years ago

User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; rv:1.7.3) Gecko/20040913 Firefox/0.10
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; rv:1.7.3) Gecko/20040913 Firefox/0.10

It seems that for some reason, the innerText property is not updatable with the script in the above URL.

Reproducible: Always

Steps to Reproduce:
1. Go to the URL listed in this bug report
2.
3.

Actual Results:
The innerText properties of the p, td, div and span elements are undefined when the javascript starts, and they never get updated by the script. The input element does actually get updated if the script is downloaded and modified to take out the null checking.

Expected Results:
The innerText property of the p, div, span, and td elements should have been updated via the javascript. If the innerText property in the 'clock.js' script is replaced by innerHTML, then everything works OK

Phil Ringnalda (:philor)

Comment 1

•

20 years ago

Gecko (and thus the Mozilla suite, Firefox, Netscape, Camino, Galeon, ...) never has supported innerText - http://www.mozilla.org/docs/web-developer/upgrade_2.html#dom_unsupp If you actually meant to file an enhancement bug (on Browser - DOM Level 0) for innerText support, well, in an era when we support document.all as long as you don't ask first, who knows?

Status: UNCONFIRMED → RESOLVED

QA Contact: general → general

Aryeh Gregor (:ayg) (no longer with Mozilla)

Comment 10

•

14 years ago

Is Gecko interested in implementing this if a detailed spec is written? It's complicated, but considerably more useful than textContent for getting a plaintext version of an HTML element, and every other browser has it. WebKit seems not to be interested in dropping it, and it would be nice if we could get interop here.

Status: VERIFIED → REOPENED

Ever confirmed: true

Resolution: INVALID → ---

Mounir Lamouri (:mounir)

Updated

•

14 years ago

Component: DOM → DOM: Core & HTML

OS: Windows XP → All

Hardware: x86 → All

Version: unspecified → Trunk

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 11

•

14 years ago

Why is innerText more useful than textContent? Based on http://msdn.microsoft.com/en-us/library/ms533899%28v=vs.85%29.aspx it is quite a bizarre API.

Aryeh Gregor (:ayg) (no longer with Mozilla)

Comment 12

•

14 years ago

innerText returns the element's HTML contents converted to plaintext. Conceptually, it works the same way as Selection stringification -- in fact, WebKit uses the same algorithm for both, and I'm going to specify both as the same algorithm. textContent just concatenates the text node descendants in tree order, so it preserves indentation and so forth. innerText collapses runs of whitespace to a single space, inserts newlines after block elements, etc. I don't have specific use-cases offhand for why anyone would want to use innerText instead of textContent. The large majority of authors seem to only set it, in which case aliasing it to textContent would work, and in fact Opera treats it similarly to textContent (although they apparently have some compat bugs as a result). But given that everyone needs to implement the plaintext conversion algorithm for Selection stringification anyway, I don't see any reason not to provide innerText too. The additional implementation burden should be low, and it makes things easier for authors if all browsers implement it instead of all but Firefox.
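
To make the difference described above concrete, here is a minimal sketch on a toy node tree. Everything in it (`Node`, `Text`, `Element`, `TextContent`, `SimplifiedInnerText`, `SampleTree`) is a hypothetical name made up for illustration, not Gecko or WebKit code, and the "innerText-like" function only models the two behaviors mentioned in the comment (collapsing whitespace runs and adding a newline after block elements). It is not the real plaintext conversion algorithm.

```cpp
#include <cassert>
#include <cctype>
#include <string>
#include <utility>
#include <vector>

// Hypothetical toy node: either a text node or an element with children.
struct Node {
  bool isText = false;
  bool isBlock = false;        // only meaningful for elements
  std::string text;            // only meaningful for text nodes
  std::vector<Node> children;  // only meaningful for elements
};

Node Text(std::string s) {
  Node n;
  n.isText = true;
  n.text = std::move(s);
  return n;
}

Node Element(bool isBlock, std::vector<Node> kids) {
  Node n;
  n.isBlock = isBlock;
  n.children = std::move(kids);
  return n;
}

// textContent-style: concatenate text descendants in tree order, untouched.
std::string TextContent(const Node& n) {
  if (n.isText) return n.text;
  std::string out;
  for (const Node& c : n.children) out += TextContent(c);
  return out;
}

// Simplified innerText-style walk (assumption: only two rules are modeled).
void AppendCollapsed(const Node& n, std::string& out) {
  if (n.isText) {
    for (unsigned char ch : n.text) {
      if (std::isspace(ch)) {
        // Collapse whitespace runs; drop whitespace at the start of a line.
        if (!out.empty() && out.back() != ' ' && out.back() != '\n') out += ' ';
      } else {
        out += static_cast<char>(ch);
      }
    }
    return;
  }
  for (const Node& c : n.children) AppendCollapsed(c, out);
  if (n.isBlock) {
    // Trim trailing spaces, then end the block with a single newline.
    while (!out.empty() && out.back() == ' ') out.pop_back();
    if (!out.empty() && out.back() != '\n') out += '\n';
  }
}

std::string SimplifiedInnerText(const Node& n) {
  std::string out;
  AppendCollapsed(n, out);
  return out;
}

// Models: <div><p>  Hello \n   world </p><p>  Bye </p></div>
Node SampleTree() {
  return Element(true, {Element(true, {Text("  Hello \n   world ")}),
                        Element(true, {Text("  Bye ")})});
}
```

For `SampleTree()`, `TextContent` keeps the source indentation and returns `"  Hello \n   world   Bye "`, while `SimplifiedInnerText` returns `"Hello world\nBye\n"`. Real implementations differ in details such as how many newlines a `<p>` produces and whether a trailing newline is kept.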

Aryeh Gregor (:ayg) (no longer with Mozilla)

•

9 years ago

Apart from the web compatibility gains, Juriy has written a great summary of innerText and a possible way forward in terms of a (pseudo) spec. If Chrome and IE are willing to converge, great. If not, we should pick one of the flavors and implement it. http://perfectionkills.com/the-poor-misunderstood-innerText/#naive-spec http://kangax.github.io/jstests/innerText/

Summary: innerText property on various elements not updatable with javascript → Add support for element.innerText

Mike Taylor [:miketaylr]

Updated

•

9 years ago

Blocks: 1170774

:shell escalante

Updated

•

Assignee

Comment 34

•

9 years ago

Breaks clicking on erroneous words in xkcd SimpleWriter. http://blog.xkcd.com/2015/09/22/a-thing-explainer-word-checker/

Carlos Alén Silva

Comment 35

•

9 years ago

For what it's worth, please note that the original URL ( http://www.sharepointcustomization.com/resources/codesamples/TimeZoneClocks/clock_example.htm ) is gone.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 36

•

9 years ago

Looking at nsPlainTextSerializer, it's pretty complex. Some changes we'd want to make would be quite disruptive, e.g. right now many HTML elements (such as <span>) ignore IsElementBlock() in nsPlainTextSerializer::DoOpenContainer. I'd feel a lot more comfortable about having a separate implementation for innerText that we can evolve independently. I also think it would be good to deploy an implementation of innerText whose behavior is as simple as possible, so that the spec can be as simple as possible while still being Web-compatible. Maybe we should try picking the simpler behavior everywhere IE and Chrome differ.

Karl Dubost💡 :karlcow

•

9 years ago

Attached file MozReview Request: Bug 264412. Refactor nsIFrame::GetRenderedText API to be more sane. r=mats,marcoz (deleted) — Details

Bug 264412. Refactor nsIFrame::GetRenderedText API to be more sane. r=mats

Attachment #8675486 - Flags: review?(mats)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 55

•

9 years ago

Attached file MozReview Request: Bug 264412. Implement HTMLElement.innerText. r=smaug,mats (deleted) — Details

Bug 264412. Implement HTMLElement.innerText. r=smaug,mats

Attachment #8675487 - Flags: review?(mats)

Attachment #8675487 - Flags: review?(bugs)

Chris Peterson [:cpeterson]

•

9 years ago

FYI, nsDocumentEncoder has some special code for handling ShadowRoot and IsHTMLElement(nsGkAtoms::rp) here: http://mxr.mozilla.org/mozilla-central/source/dom/base/nsDocumentEncoder.cpp#107 did you consider those?

Mats Palmgren (inactive)

Comment 59

•

9 years ago

It would be good to have a test that checks <table><tfoot>x</tfoot><tbody>y</tbody></table> since those are reordered in rendering. And some tests using shadow DOM.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 60

•

9 years ago

https://reviewboard.mozilla.org/r/22369/#review20043

> It's a bit hard to understand what the zero means at the call site:
>   GetRenderedText(0, aContentOffset, aContentOffset + 1);
> I think you should add a CONTENT_TEXT_OFFSETS to make it clear that the flag passed matches the offsets:
>   GetRenderedText(CONTENT_TEXT_OFFSETS, aContentOffset, aContentOffset + 1);
>
> Do you think it's likely that we'll add additional flags here in the future? If not, then I suggest you make it an enum class to avoid typos like:
>   GetRenderedText(aContentOffset, aContentOffset + 1);
> Actually, I think we should do that now anyway. Then if we need more flags in the future we add "operator|" etc.
>
> Alternatively, could we bundle these three params into an object, DOMOffsets/RenderedOffsets subclassed of SomeOffsets, or something like that? e.g.
>   GetRenderedText(DOMOffsets(aContentOffset, aContentOffset + 1));
>   GetRenderedText(RenderedOffsets(aRenderedOffset, aRenderedOffset + 1));

I thought I would have to add more flags but in the end I didn't. So making this an enum class parameter, and moving it to the end, seems like the way to go.

> |setOffsets| is a confusing name. It's used like so:
>   if (!setOffsets) {
>     // set the offsets
>     setOffsets = true;
>   }
> Which appears to be the opposite of what the name suggests.
> Perhaps |haveOffsets| is better?

Renamed to haveOffsets.
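
A rough sketch of the "enum class parameter, moved to the end" shape discussed in this review. The names `OffsetType` and `GetRenderedTextSketch` are hypothetical and the function works on a plain string, not on a frame. The only "rendering" it models is collapsing runs of spaces, so it is an illustration of the API shape and not the patch itself.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical: says which text the start/end offsets index into.
enum class OffsetType { InContentText, InRenderedText };

// Returns the rendered characters whose offset falls in [start, end).
// "Rendering" here just collapses runs of spaces to one (an assumption).
std::string GetRenderedTextSketch(const std::string& content,
                                  uint32_t start = 0,
                                  uint32_t end = UINT32_MAX,
                                  OffsetType type = OffsetType::InContentText) {
  std::string out;
  uint32_t renderedOffset = 0;
  bool prevSpace = false;
  for (uint32_t contentOffset = 0; contentOffset < content.size();
       ++contentOffset) {
    const char ch = content[contentOffset];
    const bool isSpace = (ch == ' ');
    if (isSpace && prevSpace) continue;  // collapsed: no rendered character
    prevSpace = isSpace;
    // Pick the offset space the caller asked for.
    const uint32_t offset =
        (type == OffsetType::InContentText) ? contentOffset : renderedOffset;
    if (offset >= start && offset < end) out += ch;
    ++renderedOffset;
  }
  return out;
}

// Note: GetRenderedTextSketch(text, 0, 4, 0) does not compile, because an
// integer is not implicitly convertible to an enum class. That is the kind
// of call-site mistake the review wants the compiler to catch.
```

With the content `"a   b c"` (three spaces after `a`), the same numeric range means different things in the two offset spaces: `GetRenderedTextSketch("a   b c", 4, 7)` yields `"b c"`, while `GetRenderedTextSketch("a   b c", 4, 7, OffsetType::InRenderedText)` yields `"c"`.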

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 61

•

9 years ago

https://reviewboard.mozilla.org/r/22371/#review20065

::: dom/base/nsRange.cpp:3201
(Diff revision 1)
> + int mRequiredLineBreakCount;

It can only be 0, 1 or 2. I'll make it int8_t.

::: dom/base/nsRange.cpp:3206
(Diff revision 1)
> + while (mRequiredLineBreakCount > 0) {

Because we need to add as many newlines as mRequiredLineBreakCount says. In particular sometimes (when a <p> is present) we need to add two newlines.

::: dom/base/nsRange.cpp:3279
(Diff revision 1)
> + if (aFrame->GetType() != nsGkAtoms::tableCellFrame) {

Yes, good catch.

::: dom/base/nsRange.cpp:3325
(Diff revision 1)
> + nsIContent* currentNode = static_cast<nsIContent*>(mStartParent.get());

Sure.

::: dom/base/nsRange.cpp:3328
(Diff revision 1)
> + nsGenericDOMDataNode* t = static_cast<nsGenericDOMDataNode*>(mStartParent.get());

OK

::: dom/base/nsRange.cpp:3358
(Diff revision 1)
> + nsIFrame::RenderedText text = f->GetRenderedText(0, 0, UINT32_MAX);

Sure.

::: dom/base/nsRange.cpp:3371
(Diff revision 1)
> + result.Append(NS_LITERAL_STRING("\n"));

OK, though I have had to add an Append(char) overload for that.

::: dom/base/nsRange.cpp:3373
(Diff revision 1)
> + uint8_t display = f->StyleDisplay()->mDisplay;

OK
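
The role of mRequiredLineBreakCount can be sketched roughly as follows. `TextAccumulator` and its methods are hypothetical names, and two behaviors are assumptions rather than things stated in the review: adjacent requests are merged by taking the maximum, and breaks requested before any text has been emitted are dropped.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical accumulator: line breaks are recorded lazily and only
// written out when the next piece of text arrives.
class TextAccumulator {
 public:
  // Record that at least `count` newlines must precede the next text.
  // Assumption: adjacent requests merge via max, so the count stays 0..2.
  void RequireLineBreaks(int8_t count) {
    mRequiredLineBreakCount = std::max(mRequiredLineBreakCount, count);
  }

  void AppendText(const std::string& text) {
    if (text.empty()) return;
    FlushLineBreaks();
    mResult += text;
  }

  const std::string& Result() const { return mResult; }

 private:
  void FlushLineBreaks() {
    // Assumption: breaks requested before any text are discarded.
    if (mResult.empty()) {
      mRequiredLineBreakCount = 0;
      return;
    }
    // Emit as many newlines as were requested (two after a <p>).
    while (mRequiredLineBreakCount > 0) {
      mResult += '\n';
      --mRequiredLineBreakCount;
    }
  }

  std::string mResult;
  int8_t mRequiredLineBreakCount = 0;  // only ever 0, 1 or 2
};

// Models: <p>one</p><div>two</div>three
std::string ParagraphThenDiv() {
  TextAccumulator acc;
  acc.RequireLineBreaks(2);  // start of <p>
  acc.AppendText("one");
  acc.RequireLineBreaks(2);  // end of <p>
  acc.RequireLineBreaks(1);  // start of <div>: max(2, 1) stays 2
  acc.AppendText("two");
  acc.RequireLineBreaks(1);  // end of <div>
  acc.AppendText("three");
  return acc.Result();
}
```

`ParagraphThenDiv()` returns `"one\n\ntwo\nthree"`: the paragraph boundary produces two newlines and the div boundary one, which is why the loop has to run more than once in the `<p>` case.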

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 62

•

9 years ago

(In reply to Mats Palmgren (:mats) from comment #58)
> FYI, nsDocumentEncoder has some special code for handling ShadowRoot

I don't understand what that code is for. It was added in bug 806506 without explanation. I think we should ignore that for now, but let's ask William. Unless I hear otherwise I'd like to completely ignore shadow DOM, like we're ignoring CSS anonymous content. I'll write some tests to check that it's ignored. Chrome excludes shadow DOM content from "innerText".

> and IsHTMLElement(nsGkAtoms::rp) here:
> http://mxr.mozilla.org/mozilla-central/source/dom/base/nsDocumentEncoder.cpp#107

Yes ... clearly <rp> contents should be included in innerText, though Chrome doesn't do it. I'll fix that in the spec and here.

> It would be good to have test that checks <table><tfoot>x</tfoot><tbody>y</tbody></table>
> since those are reordered in rendering. And some tests using shadow dom.

Sure.

Flags: needinfo?(wchen)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Updated

•

9 years ago

Attachment #8675486 - Flags: review?(mats)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 63

•

•

9 years ago

Hmm, this landed already? Is my review needed here? (Sorry, I've had some other stuff to review too)

Guilherme Lima

Comment 83

•

9 years ago

Roc landed by mistake, it seems: b70e89c03c56 Robert O'Callahan — Revert incorrectly committed changes ab657569f554 and a396f4262479

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 84

•

9 years ago

I landed it by mistake and backed it out immediately. So it still needs your review.

Flags: needinfo?(bugs)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 85

•

9 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=326d0927d8a0

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 86

•

9 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=bfdc26cd9ea4

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 87

•

9 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=4e7a7c3d1a11

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 88

•

9 years ago

•

9 years ago

Attachment #8675486 - Flags: review+ → review?(mats)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

•

9 years ago

(In reply to Marco Zehe (:MarcoZ) from comment #98)
> ::: accessible/generic/HyperTextAccessible.cpp:1989
> (Diff revision 4)
> > - // Only get info up to original offset, we know that will be larger than skipped offset
> > + *aRenderedOffset = text.mOffsetWithinNodeRenderedText;
>
> Can we keep the comment, please?

I don't think we should keep it, because the fact "we know that will be larger than skipped offset" is no longer relevant. The new code doesn't depend on that assumption.

> ::: accessible/generic/HyperTextAccessible.cpp:2013
> (Diff revision 4)
> > - // We only need info up to skipped offset -- that is what we're converting to original offset
> > + *aContentOffset = text.mOffsetWithinNodeText;
>
> Same here.

I don't think this comment really makes sense anymore either.

Flags: needinfo?(mzehe)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 113

•

9 years ago

Comment on attachment 8675487 [details] MozReview Request: Bug 264412. Implement HTMLElement.innerText. r=smaug,mats Bug 264412. Implement HTMLElement.innerText. r=smaug,mats

Attachment #8675487 - Flags: review+ → review?(bugs)

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

Comment 114

•

9 years ago

•

9 years ago

Attachment #8681097 - Flags: review?(mats) → review+

Mats Palmgren (inactive)

Comment 121

•

9 years ago

Comment on attachment 8681097 [details]
MozReview Request: Bug 264412. Optimize GetRenderedText. r=mats

https://reviewboard.mozilla.org/r/23769/#review21259

r=mats if my question below isn't an issue

::: dom/base/nsRange.cpp:3403
(Diff revision 2)
> - } else {
> + if (currentNode == endNode && currentState == endState) {
> + break;

Hmm, doesn't this skip the AFTER_NODE action for the endNode?

::: layout/generic/nsTextFrame.cpp:9050
(Diff revision 2)
> + runLength = std::min(runLength,
> + trimmedOffsets.GetEnd() - iter.GetOriginalOffset());

I'd prefer if the second arg to std::min lined up with the first.

Robert O'Callahan (:roc) (email my personal email if necessary)

Assignee

•

9 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/95346f49d048f5abb3d60df4beec4fe4ef412017
Bug 264412. Refactor nsIFrame::GetRenderedText API to be more sane. r=mats,marcoz

https://hg.mozilla.org/integration/mozilla-inbound/rev/5ebc59281c25fbb8ea288f24797b9ece1fdb21a5
Bug 264412. Implement HTMLElement.innerText. r=smaug,mats

https://hg.mozilla.org/integration/mozilla-inbound/rev/9160f08660b8290559e427fd80d617edd86fe2a6
Bug 264412. Optimize GetRenderedText. r=mats

Carsten Book [:Tomcat]

Comment 126

•

9 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/95346f49d048
https://hg.mozilla.org/mozilla-central/rev/5ebc59281c25
https://hg.mozilla.org/mozilla-central/rev/9160f08660b8

Status: NEW → RESOLVED

Closed: 20 years ago → 9 years ago

•

•

9 years ago

innerText is not supposed to return something that was set before. It returns the element's textual representation, which always exists and should always be a string.

Flags: needinfo?(kangax)

Loic

Updated

•

9 years ago

Depends on: 1260025

Kohei Yoshino

Updated

•

9 years ago

Depends on: 1268833

Olli Pettay [:smaug][bugs@pettay.fi]

Updated

•

8 years ago

Depends on: 1288975

Loic

Updated

•

8 years ago

Depends on: 1290937

Karl Dubost💡 :karlcow

Updated

•

4 years ago

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1709790

MozReview Request: Bug 264412. Refactor nsIFrame::GetRenderedText API to be more sane. r=mats,marcoz 9 years ago Robert O'Callahan (:roc) (email my personal email if necessary) (deleted), text/x-review-board-request	MatsPalmgren_bugz : review+ MarcoZ : review+	Details
MozReview Request: Bug 264412. Implement HTMLElement.innerText. r=smaug,mats 9 years ago Robert O'Callahan (:roc) (email my personal email if necessary) (deleted), text/x-review-board-request	MatsPalmgren_bugz : review+ smaug : review+	Details
MozReview Request: Bug 264412. Optimize GetRenderedText. r=mats 9 years ago Robert O'Callahan (:roc) (email my personal email if necessary) (deleted), text/x-review-board-request	MatsPalmgren_bugz : review+	Details