1398322 - stylo: PropertyDeclarationBlock storage is inefficient

Reporter

Description

•

7 years ago

On gmail, bz measures 5.5MBs of PDBs, which is a lot. I think we can shrink it. The definition is at [1]. The main inefficiency is that we store an Importance alongside every PropertyDeclaration, costing a word where we only need a bit. We should instead store this bit out-of-band, and use minimal storage for doing so. My idea is as follows. Adjustments / counter-proposals welcome: * We introduce a type called SmallBitVec, that just wraps a usize. * If the rightmost bit of the usize is unset, then we treat it as inline storage. * In the inline storage case, the rightmost bit is a sentinel from which len() can be derived. * If the rightmost bit is set, then the usize is treated as heap pointer (modulo the rightmost bit, which needs to be masked off) to a buffer that is capacity + len + buffer. The type would expose indexed access to the bits, as well as methods for computing whether all of the bits are set or none of the bits are set. This would allow us to eliminate important_count, since that's all the consumer really wants to know. This means that the inline size of PDB will not increase, which is a nice bonus. [1] http://searchfox.org/mozilla-central/rev/44c693914255638d74bcf1ec3b0bcd448dfab2fd/servo/components/style/properties/declaration_block.rs#72

Bobby Holley (:bholley)

Reporter

Comment 1

•

7 years ago

(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #0) > * In the inline storage case, the rightmost bit is a sentinel from which > len() can be derived. > * If the rightmost bit is set, then the usize is treated as heap pointer > (modulo the rightmost bit, which needs to be masked off) to a buffer that is > capacity + len + buffer. I realize that I'm overloading "rightmost". In the first case (the sentinel), it refers to "the lowest order bit that is set". In the second case (the discriminant), it refers to "the bit of order zero".

Bobby Holley (:bholley)

Reporter

Comment 2

•

7 years ago

Matt is going to take this. Thanks Matt!

Flags: needinfo?(mbrubeck)

Bobby Holley (:bholley)

Reporter

Updated

•

7 years ago

Priority: -- → P3

Matt Brubeck (:mbrubeck)

Assignee

Updated

•

7 years ago

Flags: needinfo?(mbrubeck)

Xidorn Quan [:xidorn] UTC+11

Comment 3

•

7 years ago

I have slightly different idea that we introduce a new type called VecWithBit which contains a Vec and a union of usize and pointer, which serves as bitvec with length identical to the capacity of the Vec, so that we don't need any extra bits for checking whether the bitvec is inlined as well as the length of bitvec, because we can simply derive them from Vec's length.

Bobby Holley (:bholley)

Reporter

Comment 4

•

7 years ago

(In reply to Xidorn Quan [:xidorn] UTC+10 from comment #3) > I have slightly different idea that we introduce a new type called > VecWithBit which contains a Vec and a union of usize and pointer, which > serves as bitvec with length identical to the capacity of the Vec, so that > we don't need any extra bits for checking whether the bitvec is inlined as > well as the length of bitvec, because we can simply derive them from Vec's > length. That works too, at the cost of being slightly less reusable. If we made it reusable, we could replace the BitVec in [1] and save some heap-allocations, which would be nice. [1] http://searchfox.org/mozilla-central/rev/44c693914255638d74bcf1ec3b0bcd448dfab2fd/servo/components/style/sharing/mod.rs#126

Bobby Holley (:bholley)

Reporter

Comment 5

•

7 years ago

(I think eliminating the BitVec for revalidation_match_results would be a nice improvement, so I'm inclined to prefer the approach in comment 0 unless we discover other downsides)

Matt Brubeck (:mbrubeck)

Assignee

Comment 6

•

7 years ago

I have started implementing the SmallBitVec type at https://github.com/mbrubeck/smallbitvec

Matt Brubeck (:mbrubeck)

Assignee

Comment 7

•

7 years ago

https://github.com/servo/servo/pull/18431

Bobby Holley (:bholley)

Reporter

Comment 8

•

7 years ago

(In reply to Matt Brubeck (:mbrubeck) from comment #6) > I have started implementing the SmallBitVec type at > https://github.com/mbrubeck/smallbitvec Nice! Looks like it's missing a Drop impl though?

Bobby Holley (:bholley)

Reporter

Comment 9

•

7 years ago

Also might be worth adding some cargo bench microbenchmarks to compare perf against BitVec.

Bobby Holley (:bholley)

Reporter

Comment 10

•

7 years ago

Also: probably worth using an unchecked get from the iterator, so that we don't have to do the somewhat-complicated len() computation on each call to next()?

Bobby Holley (:bholley)

Reporter

Comment 11

•

7 years ago

(In reply to Matt Brubeck (:mbrubeck) from comment #7) > https://github.com/servo/servo/pull/18431 This looks great on a quick skim. May be worth pushing through gecko try before landing.

Matt Brubeck (:mbrubeck)

Assignee

Comment 12

•

7 years ago

Added the Drop impl, get_unchecked method, and some microbenchmarks. Also made some tweaks to improve performance on some of the benchmarks. Currently, SmallBitVec is about 20% slower than BitVec in benchmarks that test the `set` method, and about 200% slower in benchmarks that iterate over the vector.

Matt Brubeck (:mbrubeck)

Assignee

Comment 13

•

7 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=9d02bbe8a27a4f741ed13ef4ccb7c51c6a143933

Bobby Holley (:bholley)

Reporter

Comment 14

•

7 years ago

(In reply to Matt Brubeck (:mbrubeck) from comment #12) > Added the Drop impl, get_unchecked method, and some microbenchmarks. Also > made some tweaks to improve performance on some of the benchmarks. > Currently, SmallBitVec is about 20% slower than BitVec in benchmarks that > test the `set` method, and about 200% slower in benchmarks that iterate over > the vector. I noticed you added a commit to iterator performance by fiddling with inlining. How much did it help? In practice, we probably only really care about the non-spilled case, so we could make that logic inline and make the spill-handling #[inline(never)] / #[cold]. Also, what about eq performance? I noticed that SmallBitVec compares via iterators, whereas BitVec compares the words. That potentially makes BitVec 64x faster on eq, which probably matters when comparing the bitvecs for revalidation selectors. Should be straightforward to add a similar eq impl?

Matt Brubeck (:mbrubeck)

Assignee

Comment 15

•

7 years ago

The inlining changes improved iteration speed about 30% in both spilled and non-spilled cases. I'll push a fix for eq performance shortly.

Matt Brubeck (:mbrubeck)

Assignee

Comment 16

•

7 years ago

https://hg.mozilla.org/integration/autoland/rev/002ebb2eb2b7

Bobby Holley (:bholley)

Reporter

Comment 17

•

7 years ago

Thanks Matt!

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → FIXED

Ryan VanderMeulen [:RyanVM]

Updated

•

7 years ago

status-firefox57: --- → fixed

Bugzilla

stylo: PropertyDeclarationBlock storage is inefficient

Categories

(Core :: CSS Parsing and Computation, enhancement, P3)

Tracking

()

People

(Reporter: bholley, Assigned: mbrubeck)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Updated

Updated

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 16

Comment 17

Updated