Closed Bug 1202006 Opened 9 years ago Closed 8 years ago

Blob XHRs kept in memory

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla52

Tracking Flags:

Tracking

Status

firefox52

---

fixed

People

(Reporter: azakai, Assigned: baku)

References

(Blocks 1 open bug)

Details

(Whiteboard: [MemShrink:P2])

Attachments

(7 files, 5 obsolete files)

emscripten_temp.tar.bz2 9 years ago Alon Zakai (:azakai) (deleted), application/x-bzip		Details
emscripten_temp.tar.bz2 9 years ago Alon Zakai (:azakai) (deleted), application/octet-stream		Details
part 1 - BlobSet - indentation, moving code, etc 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 2 - Memory to Temporary BlobImpl 8 years ago Andrea Marchesini [:baku] (deleted), patch		Details \| Diff \| Splinter Review
part 2 - Memory to Temporary BlobImpl 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review-	Details \| Diff \| Splinter Review
part 1 - MutableBlobStorage for XHR 8 years ago Andrea Marchesini [:baku] (deleted), patch		Details \| Diff \| Splinter Review
part 2 - BlobSet just for MultipartBlobImpl 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 1 - MutableBlobStorage for XHR 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 3 - BlobSet and MutableBlobStorage for XHR 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 4 - temp files 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review
part 5 - tests 8 years ago Andrea Marchesini [:baku] (deleted), patch		Details \| Diff \| Splinter Review
part 5 - tests 8 years ago Andrea Marchesini [:baku] (deleted), patch	smaug : review+	Details \| Diff \| Splinter Review

Alon Zakai (:azakai)

Reporter

Description

•

9 years ago

Attached file emscripten_temp.tar.bz2 (deleted) — Details

Comparing memory usage between Firefox and Chrome, Firefox uses a lot more memory when receiving large Blob XHRs. They appear to be kept in memory, while in Chrome, they seem to be kept on disk. The attachment has a simple testcase that runs in a worker, and fetches 300MB of binary data in a Blob XHR. It then uses FileReaderSync to read from various locations in the file. In Chrome, memory usage hardly rises when loading the testcase; in Firefox, there is over 300MB of memory allocated for the Blob, showing up as "memory-file-data/large". This is also noticeable when viewing the OS processes. Chrome's behavior where Blob XHRs are kept on disk would be helpful for applications with massive datasets, that can't even fit in memory. And FileReaderSync allows us to read from Blobs in a convenient manner in workers, which together would let us use massive datasets easily.

Alon Zakai (:azakai)

Reporter

Comment 1

•

9 years ago

There are downsides to the Chrome approach as well. Firefox takes less than a second to load and read from those data files, while Chrome takes 4.5 seconds. Of which 3 seconds is the actual reads which for Firefox is just 0.023 (i.e. 23 ms). In other words, Chrome does a lot more disk IO here. Times are similar on two machines, on with an SSD and one without.

Kyle Huey (Exited; not receiving bugmail, old account, do not use)

Comment 2

•

9 years ago

We've talked about paging out large blobs to disk before, but it's never something we've felt necessary to implement.

Alon Zakai (:azakai)

Reporter

Comment 3

•

9 years ago

I understand. It's hard for me to guess at what the best thing here is, but I am looking into ways to reduce memory usage for large applications (games, image editors, etc.), that have lots of data and currently are struggling to run on the web. Luke suggested Blob+FileReaderSync as one promising approach. In some more testing now, I tried to load a Blob, then store it to IDB, then read it from there and use FileReaderSync. This still takes the full amount of memory at first, but once in IDB, it looks like we do actually keep the Blob on disk efficiently. However, the initial process of retrieving the file and storing it in IDB is very memory-intensive, perhaps there are even two copies in memory at some point? In Chrome on the other hand, memory usage is almost flat during the whole process. It receives the XHR, stores it to IDB, loads it from there, and uses it, all without it ever being actually in memory. This does seem to support their approach here, I think. Benchmarking the times also supports them: Unlike before, both browsers now do save the file on disk. Now the times are quite similar, suggesting that Chrome stores the XHR Blob and then repurposes it for IDB. So both browsers end up spending the time to write and then read, and Chrome avoids a double write. I'll attach a testcase showing this.

Blocks: gecko-games

Whiteboard: [MemShrink]

Alon Zakai (:azakai)

Reporter

Comment 4

•

9 years ago

Attached file emscripten_temp.tar.bz2 (deleted) — Details

Testcase of XHR => IDB => use Blob in FileReaderSync.

Luke Wagner [:luke]

Comment 5

•

9 years ago

khuey: I don't think we need anything fancy like paging out Blobs that were initially created from an in-memory source (like Blob(ArrayBuffer)), just for Blobs retrieved from the network.

Alon Zakai (:azakai)

Reporter

Comment 6

•

9 years ago

Yes, I think Luke is right, after thinking about this some more I believe the issue here is binary XHRs where the user requested 'arraybuffer' vs 'blob' as the xhr.responseType. It looks like right now in Firefox we handle memory the same in both cases, so the two are in effect almost the same, but in Chrome, when the user asked for a Blob, it is actually a normal on-disk Blob and memory is not used for it. That seems like very useful behavior.

Nicholas Nethercote [inactive]

Updated

•

9 years ago

Whiteboard: [MemShrink] → [MemShrink:P2]

Masatoshi Kimura [:emk]

Comment 7

•

9 years ago

We used to store XHR-blobs on the disk (bug 649133), but the ability was removed (bug 725993 and 758296), How does Chrome prevent XHR-blobs from running-out the disk space?

Luke Wagner [:luke]

Comment 8

•

9 years ago

Ah, good to know the backstory. So, scanning bug 725993, it looks like there was every intention to use files to avoid the OOM issues we're seeing here and it was only removed because the HTTP cache was the wrong place for it (and it sounds like there was no quota checking in place at the time). It also appears there was the intention to implement a new file-backed mechanism (shared by both WebSockets and XHR) but we just never got around to doing it b/c XHR+Blob were still new and so it was a lower priority. I'm don't know what Chrome does, but I have to assume there is a quota mechanism to prevent disk exhaustion.

Alon Zakai (:azakai)

Reporter

Comment 9

•

9 years ago

Perhaps we could use the same quota mechanism we already have for IndexedDB, precompiled asm.js binaries, etc.? (and if a blob xhr fails the quota, we'd store it in memory; and if it is on disk, we delete it when the page is closed)

Marco Castelluccio [:marco]

Comment 10

•

9 years ago

This would have been useful for PluotSorbet (j2me.js) as well, where we implemented a FileSystem on top of IndexedDB.