Closed Bug 985193 Opened 11 years ago Closed 10 years ago

High heap unclassified/display driver crash during slideshow

Tracking

Status:

RESOLVED FIXED

Milestone:

mozilla38

Tracking Flags:

Tracking

Status

firefox37

fixed

firefox38

---

fixed

People

(Reporter: ferongr, Assigned: seth)

References

(Blocks 1 open bug, URL)

Details

(Whiteboard: [MemShrink:P2])

Attachments

(6 files, 4 obsolete files)

about:memory snapshot without OMTC 11 years ago ferongr (deleted), application/gzip		Details
about:memory snapshot with OMTC 11 years ago ferongr (deleted), application/gzip		Details
Graphics section of about:support 11 years ago ferongr (deleted), text/plain		Details
about:memory snapshot with OMTC, fresh profile and session 11 years ago ferongr (deleted), application/gzip		Details
gpu-memory-usage.svg 10 years ago Jeff Muizelaar [:jrmuizel] (deleted), image/svg+xml		Details
image-deallocation-stacks.txt 10 years ago Jeff Muizelaar [:jrmuizel] (deleted), text/plain		Details
Stop holding a strong reference to RasterImage's ImageContainer 10 years ago Seth Fowler [:seth] [:s2h] (deleted), patch	mattwoodrow : review+	Details \| Diff \| Splinter Review
Stop holding a strong reference to RasterImage's ImageContainer 10 years ago Seth Fowler [:seth] [:s2h] (deleted), patch		Details \| Diff \| Splinter Review
Stop holding a strong reference to RasterImage's ImageContainer 10 years ago Seth Fowler [:seth] [:s2h] (deleted), patch		Details \| Diff \| Splinter Review
Stop holding a strong reference to RasterImage's ImageContainer 10 years ago Seth Fowler [:seth] [:s2h] (deleted), patch	lmandel : approval-mozilla-aurora+	Details \| Diff \| Splinter Review

ferongr

Reporter

Description

•

11 years ago

Attached file about:memory snapshot without OMTC (deleted) — Details

Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Firefox/31.0 ID:20140318030202 CSet: 082761b7bc54

2014-03-18 Nightly build.

STR
1. Follow URL
2. Enable auto-next at the bottom left panel and set it to a low value (for faster reproduction; the crash eventually happens regardless of speed).
3. Wait for a minute or so
4. Observe about:memory
5. Wait for 5 minutes or so

6a. With layers.offmainthreadcomposition.enabled;false: Display driver/DWM crash (resolution switches to 640x480, theme switches to Aero Basic, Windows recovers after a few seconds). Nightly becomes inoperable (the window shrinks completely, showing only a small titlebar) and has to be restarted.

6b. With layers.offmainthreadcomposition.enabled;true: The browser crashes. Example crash ID: 203e780f-5466-400a-9132-e02802140318

Without OMTC, the heap-unclassified leaf node keeps growing until the driver crashes. With OMTC enabled it's the GFX -> heap-textures leaf node that keeps growing.

Forcing a GC/CC without OMTC does lower the memory used (heap-unclassified goes to normal levels). With OMTC though the memory stays around until the browser is restarted.

GPU: AMD HD6850 with 13.12 Catalyst.

ferongr

Reporter

Comment 1

•

11 years ago

Attached file about:memory snapshot with OMTC (obsolete) (deleted) — Details

ferongr

Reporter

Comment 2

•

11 years ago

Attached file Graphics section of about:support (deleted) — Details

ferongr

Reporter

Updated

•

11 years ago

Blocks: DarkMatter

Andrew McCreight [:mccr8]

Updated

•

11 years ago

Component: General → Graphics

Whiteboard: [MemShrink]

Nicholas Nethercote [inactive]

Comment 3

•

11 years ago

Bad behaviour with and without OMTC, yikes.

Whiteboard: [MemShrink] → [MemShrink:P2]

Timothy Nikkel (:tnikkel)

Comment 4

•

11 years ago

In my testing images uncompressed heap never went up with OMTC on or off, so I'm guessing the presence of significant images uncompressed heap in the OMTC off report attachment is due to other activity in the session? There was lots of memory usage in heap-textures and/or heap-unclassified though.

Status: UNCONFIRMED → NEW

Ever confirmed: true

Timothy Nikkel (:tnikkel)

Comment 5

•

11 years ago

On mac (with omtc on, the default) heap-textures grows to 1 GB in about a minute. So this seems bad.

ferongr

Reporter

Comment 6

•

11 years ago

No, I don't think so. The snapshot with OMTC off is with a brand-new profile and a session a few seconds old. The OMTC snapshot is from my normal profile, but I will upload a new snapshot with a fresh profile and OMTC enabled in a bit just to be sure.

ferongr

Reporter

Comment 7

•

11 years ago

Attached file about:memory snapshot with OMTC, fresh profile and session (deleted) — Details

Attachment #8393206 - Attachment is obsolete: true

Timothy Nikkel (:tnikkel)

Comment 8

•

11 years ago

(In reply to ferongr from comment #6)
> No, I don't think so. The snapshot with OMTC off is with a brand-new profile
> and a session a few seconds old. The OMTC snapshot is from my normal profile
> but I will upload a new snapshot with a fresh profile and OMTC enable in a
> bit just to be sure.

Interesting. I tested on a Windows machine (previous results were for mac) and I still never get any values on images uncompressed over 1 MB. I wish I could reproduce what you are seeing. Either way those numbers are but a sideshow to heap-textures and heap-unclassified.

Timothy Nikkel (:tnikkel)

Comment 9

•

11 years ago

Ah, so I can get large images uncompressed-heap numbers in the release channel. I'm guessing that bug 962670 moved that to heap-textures or unclassified? The site uses background-image to show the images, for which we don't have any smarts about keeping around decoded data like we do for img elements.

ferongr

Reporter

Comment 10

•

11 years ago

In my case, it's the layers.offmainthreadcomposition.enabled (that's false by default on Windows Nightly) pref that makes the high number go from heap-unclassified to heap-textures.

Timothy Nikkel (:tnikkel)

Comment 11

•

11 years ago

We appear to be discarding images fine. The difference between the number of RasterImage::DecodingComplete calls and RasterImage::Discard calls (on non-chrome images) is never more than 15, even after my laptop starts going into an unusable state due to memory use. And I traced one Discard call down to the vm_deallocate call, so that appears to be working fine. So the problem must be somewhere else.
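The decode/discard bookkeeping described in this comment can be sketched as a pair of counters. This is a hypothetical illustration, not the instrumentation actually used: `DecodeDiscardStats`, `OnDecodingComplete`, `OnDiscard`, and `Outstanding` are made-up names, and the sketch assumes the two hooks would be called from RasterImage::DecodingComplete and RasterImage::Discard respectively.

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical instrumentation, not Gecko code: one counter per event.
struct DecodeDiscardStats {
  std::atomic<std::int64_t> decoded{0};
  std::atomic<std::int64_t> discarded{0};

  // Assumed to be called once per completed decode.
  void OnDecodingComplete() { decoded.fetch_add(1, std::memory_order_relaxed); }

  // Assumed to be called once per discard of decoded data.
  void OnDiscard() { discarded.fetch_add(1, std::memory_order_relaxed); }

  // Decodes that have not been matched by a discard yet.
  std::int64_t Outstanding() const {
    return decoded.load(std::memory_order_relaxed) -
           discarded.load(std::memory_order_relaxed);
  }
};
```

If `Outstanding()` stays small while memory keeps climbing, as reported here, the decoded image data itself is probably not what is accumulating.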

Timothy Nikkel (:tnikkel)

Comment 12

•

11 years ago

The images are getting layerized and the image container that RasterImage uses seems to be the source of the problem. We don't touch it in ::Discard, so if proper disposal of it is happening, it must be done by the layers subsystem.

Timothy Nikkel (:tnikkel)

Updated

•

11 years ago

Blocks: 1006295

(Away)

Comment 13

•

10 years ago

Looks like this is still an issue. With a late Nightly 37 on Windows, after about an hour I see:

463.56 MB (100.0%) -- explicit
├──273.78 MB (59.06%) ── heap-unclassified
├──117.12 MB (25.27%) -- images

5,468.42 MB ── gpu-committed
367.62 MB ── gpu-dedicated
122.00 MB ── gpu-shared
422.81 MB ── heap-allocated
473 ── heap-chunks
1.00 MB ── heap-chunksize
426.46 MB ── heap-committed
473.00 MB ── heap-mapped
0.86% ── heap-overhead-ratio
2.42 MB ── imagelib-surface-cache-estimated-locked
2.42 MB ── imagelib-surface-cache-estimated-total
0.34 MB ── js-main-runtime-temporary-peak
0 ── low-commit-space-events
734.10 MB ── private
139.82 MB ── resident
3,906.76 MB ── vsize
17.00 MB ── vsize-max-contiguous

VMMap says I have about 2700MB of "Shareable" write-combine regions. (Is the gpu-committed number double-counting?) A sample WinDbg !address dump on one of those regions looks like:

Usage: MappedFile
Base Address: dab00000
End Address: db600000
Region Size: 00b00000
State: 00001000 MEM_COMMIT
Protect: 00000404 PAGE_READWRITE|PAGE_WRITECOMBINE
Type: 00040000 MEM_MAPPED
Allocation Base: dab00000
Allocation Protect: 00000404 PAGE_READWRITE|PAGE_WRITECOMBINE
Mapped file name: PageFile

I can't tell where those regions are coming from. AFAICT it's not through the VirtualAlloc or MapViewOfFile families. On the bright side, those regions do get cleaned up if I navigate away and Minimize Memory.
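For readers decoding the !address output above, here is a small sketch of the arithmetic. The helper names `RegionSize` and `DescribeProtect` are hypothetical. The two constants carry the documented Windows values for PAGE_READWRITE (0x04) and PAGE_WRITECOMBINE (0x400), but are redefined locally so the sketch does not need the Windows headers.

```cpp
#include <cstdint>
#include <string>

// Local copies of the Windows memory-protection constant values.
constexpr std::uint32_t kPageReadWrite = 0x04;      // PAGE_READWRITE
constexpr std::uint32_t kPageWriteCombine = 0x400;  // PAGE_WRITECOMBINE

// Size of a region in bytes, given its base and end addresses.
constexpr std::uint64_t RegionSize(std::uint64_t base, std::uint64_t end) {
  return end - base;
}

// Names the two flags this sketch knows about; other bits are ignored.
std::string DescribeProtect(std::uint32_t protect) {
  std::string out;
  if (protect & kPageReadWrite) out += "PAGE_READWRITE";
  if (protect & kPageWriteCombine) {
    if (!out.empty()) out += "|";
    out += "PAGE_WRITECOMBINE";
  }
  return out;
}
```

With the values from the dump, `RegionSize(0xdab00000, 0xdb600000)` is 0x00b00000 bytes, which is 11 MiB and matches the reported Region Size, and `DescribeProtect(0x404)` yields the same flag pair that WinDbg prints.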

(Away)

Updated

•

10 years ago

No longer blocks: 1006295

(Away)

Updated

•

10 years ago

Blocks: 1062065

Jeff Muizelaar [:jrmuizel]

Comment 14

•

10 years ago

Attached image gpu-memory-usage.svg (deleted) — Details

So I had a look at gpuview trace of this situation. We definitely seem to be accumulating gpu allocations. I've attached a graph of gpu allocations from David's gpuview trace.

Jeff Muizelaar [:jrmuizel]

Comment 15

•

10 years ago

Looking more closely, it appears as though we are accumulating textures. I do not yet know why or how.

Jeff Muizelaar [:jrmuizel]

Comment 16

•

10 years ago

I can reproduce this locally and it looks like the gpuview logs contain allocation stacks so I should be able to figure out what's going on tomorrow.

Jeff Muizelaar [:jrmuizel]

Comment 17

•

10 years ago

Attached file image-deallocation-stacks.txt (deleted) — Details

Here's a sample deallocation from when everything gets freed.

Jeff Muizelaar [:jrmuizel]

Comment 18

•

10 years ago

It looks like imagelib is keeping all of the images on the page in the image cache. This is keeping a bunch of the gpu stuff alive. How are images supposed to be removed from the cache? What's keeping them there on this page?

Jeff Muizelaar [:jrmuizel]

Comment 19

•

10 years ago

A few more details. mCache is growing in size. The cacheQueue does not have very many items in it. It seems like the imageRequestProxy destructor is not being called much.

Flags: needinfo?(seth)

Jeff Muizelaar [:jrmuizel]

Updated

•

10 years ago

Flags: needinfo?(tnikkel)

Timothy Nikkel (:tnikkel)

Comment 20

•

10 years ago

I think the problem is that there is nothing to release mImageContainer on RasterImage. So as long as the image is kept alive (the imgCache is only for keeping the source data around) the image container sticks around. Instead it should be treated the same as the rest of the decoded data for images. I think all we should need is nulling out mImageContainer in RasterImage::Discard.
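The idea in this comment can be illustrated with a toy model. This is a sketch under stated assumptions, not the Gecko implementation: `ToyRasterImage` and `ToyImageContainer` are hypothetical stand-ins, and `std::shared_ptr` plays the role of the strong reference the real class holds in mImageContainer.

```cpp
#include <memory>

// Stand-in for the layers image container that keeps GPU-backed data alive.
struct ToyImageContainer {};

// Stand-in for RasterImage; only the container lifetime is modelled.
class ToyRasterImage {
 public:
  // Lazily creates the container and hands out a strong reference to it.
  std::shared_ptr<ToyImageContainer> GetImageContainer() {
    if (!mImageContainer) {
      mImageContainer = std::make_shared<ToyImageContainer>();
    }
    return mImageContainer;
  }

  // Treat the container like the rest of the decoded data: drop it on discard.
  void Discard() {
    mImageContainer = nullptr;
  }

  bool HoldsContainer() const { return mImageContainer != nullptr; }

 private:
  std::shared_ptr<ToyImageContainer> mImageContainer;
};
```

In this model, once `Discard()` runs and no other owner holds the container, it is destroyed even though the image object itself stays alive, which is the behaviour the comment is asking for.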

Flags: needinfo?(tnikkel)

Jeff Muizelaar [:jrmuizel]

Comment 21

•

10 years ago

(In reply to Timothy Nikkel (:tn) from comment #20)
> I think the problem is that there is nothing to release mImageContainer on
> RasterImage. So as long as the image is kept alive (the imgCache is only for
> keeping the source data around) the image container sticks around. Instead
> it should be treated the same as the rest of the decoded data for images.
>
> I think all we should need is nulling out mImageContainer in
> RasterImage::Discard.

This seems to have worked.

Assignee: nobody → tnikkel