Closed Bug 1247935 Opened 9 years ago Closed 9 years ago

Tearing during composition in latest Nightly

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla47

Tracking Flags:

Tracking

Status

firefox47

---

fixed

People

(Reporter: marco, Assigned: lsalzman)

References

Details

(Keywords: perf, regression, Whiteboard: gfx-noted)

Attachments

(5 files, 1 obsolete file)

VID_20160219_120323.mp4 9 years ago Marco Castelluccio [:marco] (deleted), video/mp4		Details
Screenshot from video 9 years ago Kartikaya Gupta (email:kats@mozilla.staktrace.com) (deleted), image/png		Details
Screenshot from screencast 9 years ago Kartikaya Gupta (email:kats@mozilla.staktrace.com) (deleted), image/png		Details
part 1 - double-buffer nsShmImage 9 years ago Lee Salzman [:lsalzman] (deleted), patch	jrmuizel : review+	Details \| Diff \| Splinter Review
part 2 - use shared pixmaps with XShm for nsShmImage 9 years ago Lee Salzman [:lsalzman] (deleted), patch	jrmuizel : review+	Details \| Diff \| Splinter Review
part 2 - use shared pixmaps with XShm for nsShmImage 9 years ago Lee Salzman [:lsalzman] (deleted), patch	lsalzman : review+	Details \| Diff \| Splinter Review

Marco Castelluccio [:marco]

Reporter

Description

•

9 years ago

The scrolling experience is extremely painful. It's very noticeable, on any page.

Marco Castelluccio [:marco]

Reporter

Comment 1

•

9 years ago

mozregression points me to bug 1241832.

Blocks: 1241832

Marco Castelluccio [:marco]

Reporter

Comment 2

•

9 years ago

Indeed, setting `gfx.xrender.enabled` to true fixes this bug.

Marco Castelluccio [:marco]

Reporter

Comment 3

•

9 years ago

I think bug 753228 is related.

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=753228

Marco Castelluccio [:marco]

Reporter

Comment 4

•

9 years ago

https://cleopatra.io/#report=750d140ddf6cd395fa72c6bf615428697fc1dfd4

Mason Chang [Inactive] [:mchang]

Comment 5

•

9 years ago

Fallout from disabling xrender.

Flags: needinfo?(lsalzman)

Keywords: perf, regression

Whiteboard: gfx-noted

Lee Salzman [:lsalzman]

Assignee

Updated

•

9 years ago

Flags: needinfo?(lsalzman)

Lee Salzman [:lsalzman]

Assignee

Comment 6

•

9 years ago

This bug report seems somewhat subjective. I don't see anything that I wouldn't expect to see in profile. Could you explain more about what the problem is and how to reproduce it?

Flags: needinfo?(mcastelluccio)

Marco Castelluccio [:marco]

Reporter

Comment 7

•

9 years ago

Lee, I guess what I'm seeing is similar to what BenWa was describing here: http://www.hackermusings.com/2012/05/firefoxs-graphics-performance-on-x11/#comment-30. I don't know how to describe it. Firefox fails to keep up with the scrolling, it draws with a small delay, and this causes very noticeable glitches. It happens on almost any page when I scroll. Here's the Graphics section from my about:support page: Adapter Description: Intel Open Source Technology Center -- Mesa DRI Intel(R) Haswell Mobile Asynchronous Pan/Zoom: wheel input enabled Device ID: Mesa DRI Intel(R) Haswell Mobile Driver Version: 3.0 Mesa 11.0.2 GPU Accelerated Windows: 0/2 Basic (OMTC) Supports Hardware H264 Decoding: No Vendor ID: Intel Open Source Technology Center WebGL Renderer: Intel Open Source Technology Center -- Mesa DRI Intel(R) Haswell Mobile windowLayerManagerRemote: true AzureCanvasBackend: cairo AzureContentBackend: cairo AzureFallbackCanvasBackend: none AzureSkiaAccelerated: 0 CairoUseXRender: 0 I have taken a couple of screencasts, but I don't know if it's easy to see the problem from them. I'll attach them to the bug.

Flags: needinfo?(mcastelluccio)

Lee Salzman [:lsalzman]

Assignee

Comment 8

•

9 years ago

The only real thing I noticed in the profile on a second look is that about 13% of the compositor time was spent doing redundant copies that XShm should have been taking care of already. I just wrote up a fix for that in bug 1019856. Though shaving 13% off our compositing time for almost free is nothing to be ashamed of. :)

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1019856

Marco Castelluccio [:marco]

Reporter

Comment 9

•

9 years ago

https://dl.dropboxusercontent.com/u/34104919/xrender%20disabled.mkv https://dl.dropboxusercontent.com/u/34104919/xrender%20enabled.mkv Great, I'll test that build shortly.

Marco Castelluccio [:marco]

Reporter

Comment 10

•

9 years ago

Attached video VID_20160219_120323.mp4 (deleted) — Details

Still seeing the problem with the build from the other bug. This video is clearer, can you see the blur when I scroll up on second 2? My machine should be pretty common, if you are in a Mozilla office. It's a ThinkPad W541.

Karl Tomlinson (:karlt)

Comment 11

•

9 years ago

About 25% of non-idle compositor time is spent waiting in XSync() from nsShmImage::Put(). That offers rooms for improvement, but I don't know whether it is the cause of the issue here.

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 12

•

9 years ago

This doesn't appear to be related to checkerboarding or APZ. From the videos it looks like a tearing issue during composition.

Summary: Checkerboarding regression in latest Nightly → Tearing during composition in latest Nightly

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 13

•

9 years ago

Attached image Screenshot from video (deleted) — Details

Here is a screenshot from the video that shows how part of the screen is updated and other part hasn't yet updated.

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 14

•

9 years ago

Attached image Screenshot from screencast (deleted) — Details

Here's a better one taken from the screencast, so hopefully no camera artifacts. There are two instances of tearing - one in the comment with the needinfo near the top of the screen, one just above the "Vendor ID" text in the bottom field.

Lee Salzman [:lsalzman]

Assignee

Comment 15

•

9 years ago

Talking with #dri-devel people, it seems that XShmPutImage is not exactly atomic with respect to window scan-out. So scan-out can actually happen during this operation, and thus lead to the tearing that is evident in the screenshot above. This will take some reworking of how we're doing that to fix.

Lee Salzman [:lsalzman]

Assignee

Comment 16

•

9 years ago

Attached patch part 1 - double-buffer nsShmImage (deleted) — Details — Splinter Review

This double buffers our usage of nsShmImage to get rid of the XSync. The clip rectangle usage and LastKnownRequestProcessed handling was based on some suggestions from #dri-devel and looking into how recent versions of Cairo deal with XShm synchronization. Put now just does an XFlush instead of an XSync, and we do the actual waiting in CreateDrawTarget.

Assignee: nobody → lsalzman

Status: NEW → ASSIGNED

Attachment #8724156 - Flags: review?(jmuizelaar)

Lee Salzman [:lsalzman]

Assignee

Comment 17

•

9 years ago

Attached patch part 2 - use shared pixmaps with XShm for nsShmImage (obsolete) (deleted) — Details — Splinter Review

This makes nsShmImage use shared pixmaps when possible so that eventually we can work towards using the Present extension, which requires pixmaps. There is some new ickyness to deal with initialized the extension and detecting pixmap supports. Also we have to manually inject a ClientMessage into the event queue to make sure our polling terminates in a timely fashion for the XCopyArea path that pixmaps use. Some other cleanups suggestions from #dri-devel too, mainly to get rid of our Xlibint.h usage, so overall we should be handling this a bit more cleanly than Cairo does.

Attachment #8724157 - Flags: review?(jmuizelaar)

Lee Salzman [:lsalzman]

Assignee

Comment 18

•

9 years ago

Also of note, with this double-buffering in place, it eliminates the sync overhead from the profile. So we spend more of our time in the compositor profile just compositing now, with the actual pixmap -> window copy being entirely in parallel.

Jeff Muizelaar [:jrmuizel]

Comment 19

•

9 years ago

Comment on attachment 8724156 [details] [diff] [review] part 1 - double-buffer nsShmImage Review of attachment 8724156 [details] [diff] [review]: ----------------------------------------------------------------- This looks really good except for my worry about hanging while waiting for our request. ::: widget/nsShmImage.cpp @@ +212,5 @@ > + if (!mRequest) { > + return; > + } > + > + while (long(LastKnownRequestProcessed(mDisplay) - mRequest) < 0) { It seems like we could be in a situation where we wrap before waiting for a request(). e.g. - request = Present() (= 5) - the browser is idle for a long time - we call WaitForRequest() and LastKnownRequestProcessed() returns 1<<31 - we end up hanging for another 1<<31 events. If you call LastKnownRequestProcessed before calling XNextRequest you can then have a waiting condition like: while (long(LastKnownRequestProcessed(mDisplay) - mRequest) < 0 && long(LastKnownRequestProcessed(mDisplay) - mOriginalLastKnown) > 0)

Attachment #8724156 - Flags: review?(jmuizelaar) → review+

Jeff Muizelaar [:jrmuizel]

Updated

•

9 years ago

Attachment #8724157 - Attachment is patch: true

Jeff Muizelaar [:jrmuizel]

Comment 20

•

9 years ago

Comment on attachment 8724157 [details] [diff] [review] part 2 - use shared pixmaps with XShm for nsShmImage Review of attachment 8724157 [details] [diff] [review]: ----------------------------------------------------------------- It would be good to add some comments about the differences between using pixmaps and images for XShm ::: widget/nsShmImage.cpp @@ +304,5 @@ > } > > + if (long(LastKnownRequestProcessed(mDisplay) - mRequest) < 0) { > + XEvent event; > + XPeekIfEvent(mDisplay, &event, FindEvent, (XPointer)this); What's the advantage of this XPeekIfEvent vs. the previous idiom?

Attachment #8724157 - Flags: review?(jmuizelaar) → review+

Lee Salzman [:lsalzman]

Assignee

Comment 21

•

9 years ago

Marco, can you try running this try build here and see if the situation is improved at all? http://archive.mozilla.org/pub/firefox/try-builds/lsalzman@mozilla.com-1d6f9675984270b56c6af4f3afa63eff69204ffd/try-linux64/ It has my shared pixmaps changes in it, and I'm wondering if they alone are sufficient to fix your apparent regression.

Lee Salzman [:lsalzman]

Assignee

Updated

•

9 years ago

Flags: needinfo?(mcastelluccio)

Marco Castelluccio [:marco]

Reporter

Comment 22

•

9 years ago

Yes, as far as I can tell it fixes the problem for me (I don't have very good eyes, so it might still be happening from time to time in a much less noticeable way :D).

Flags: needinfo?(mcastelluccio)

Lee Salzman [:lsalzman]

Assignee

Comment 23

•

9 years ago

Attached patch part 2 - use shared pixmaps with XShm for nsShmImage (deleted) — Details — Splinter Review

Addressed the issue with LastKnownRequestProcessed possibly wrapping around. Since the reporter's issue appears to be resolved with this usage of pixmaps and XCopyArea here, I am going to skip incorporation of the Present extension since that is a more onerous change to make and probably shoots past what this bug really called for. Try run: https://treeherder.mozilla.org/#/jobs?repo=try&revision=9538b4b0f33e

Attachment #8724157 - Attachment is obsolete: true

Attachment #8724509 - Flags: review+

Pulsebot

Comment 24

•

9 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/f1ada5d7a092 https://hg.mozilla.org/integration/mozilla-inbound/rev/f37626eba5c6

Carsten Book [:Tomcat]

Comment 25

•

9 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/f1ada5d7a092 https://hg.mozilla.org/mozilla-central/rev/f37626eba5c6

Status: ASSIGNED → RESOLVED

Closed: 9 years ago

status-firefox47: affected → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla47

Joel Maher ( :jmaher ) (UTC -8)

Comment 26

•

9 years ago

here are the talos differences from this change: https://treeherder.allizom.org/perf.html#/alerts?id=310 we have good wins on tresize/tscroll/tp5o_scroll, but a small loss on tp5 xres. :lsalzman, any concerns with the xres increase? I am happy with the wins we have, just want to make sure we didn't do anything silly to increase xres during tp5o.

Flags: needinfo?(lsalzman)

Lee Salzman [:lsalzman]

Assignee

Comment 27

•

9 years ago

(In reply to Joel Maher (:jmaher) from comment #26) > here are the talos differences from this change: > https://treeherder.allizom.org/perf.html#/alerts?id=310 > > we have good wins on tresize/tscroll/tp5o_scroll, but a small loss on tp5 > xres. > > :lsalzman, any concerns with the xres increase? I am happy with the wins we > have, just want to make sure we didn't do anything silly to increase xres > during tp5o. Not concerned. Since this patch both fixes a visual regression and reduces jank in many cases.

Flags: needinfo?(lsalzman)

Marco Castelluccio [:marco]

Reporter

Comment 28

•

9 years ago

I'm still seeing tearing from time to time, but much less noticeable than before.

Lee Salzman [:lsalzman]

Assignee

Comment 29

•

9 years ago

(In reply to Marco Castelluccio [:marco] from comment #28) > I'm still seeing tearing from time to time, but much less noticeable than > before. I talked with the various people in #dri-devel and there will always be some unavoidable tearing. The only way around this is basically to use accelerated layers right now, as that's the only place we implement proper vsync. None of the APIs we were using before for basic X11 can really take care of it, and until DRI3 becomes a thing, the Present API is still wait-and-see.

Marco Castelluccio [:marco]

Reporter

Comment 30

•

9 years ago

It's happening really often now, could it be a regression from some other change? Or maybe it depends on the contents of the page? Scrolling on http://www.ilfattoquotidiano.it/2016/03/06/primarie-usa-cruz-e-lanti-trump-la-louisiana-a-clinton-per-sanders-kansas-e-nebraska/2522644/ is particularly painful.

Milan Sreckovic [:milan] (needinfo for best results)

Updated

•

9 years ago

Blocks: 1263222

Karl Tomlinson (:karlt)

Updated

•

8 years ago

No longer blocks: 1263222

(no longer active)

Updated

•

8 years ago

Depends on: 1321129

You need to log in before you can comment on or make changes to this bug.