Open Bug 1795511 Opened 2 years ago Updated 1 year ago

Switch motionmark to use 'ramp' mode and report complexity score

Categories

(Testing :: Raptor, task, P2)

Default
task

Tracking

(Not tracked)

ASSIGNED

People

(Reporter: jrmuizel, Assigned: aglavic, NeedInfo)

References

(Blocks 1 open bug)

Details

(Whiteboard: [fxp])

Attachments

(1 file, 2 obsolete files)

To get stable numbers for comparing Firefox with WebRender vs. without, we chose the current configuration. (See bug 1423267 comment 3.)

However, I don't think this configuration is working well:

  1. The units reported here are (ms) but maybe they're fps? https://treeherder.mozilla.org/jobs?repo=mozilla-central&revision=c14f7934269f333be9e65958c7a012899b3123bd&group_state=expanded&selectedTaskRun=bYAN6l1qTH63dIoAgKRD0w.0
  2. The values seem to cap at around 60 (which suggests that they are fps)
  3. This configuration is not representative of the way that people actually run MotionMark
  4. Chrome appears to do worse than Firefox in CI but that doesn't match the results when running it manually.

In bug 1778575 we're looking to fix frame scheduling, which should make the measurements we get in ramp mode much more stable. I don't think we need to wait for that to land before changing the mode, though. For now, I'd rather have numbers that are closer to what MotionMark reports than numbers that are stable.

Summary: Switch motionmark to use ramp and report complexity → Switch motionmark to use 'ramp' mode and report complexity

Joel, does this seem reasonable?

Flags: needinfo?(jmaher)

In general this seems reasonable. If the numbers most people get are not represented by our CI tests, then we should change our CI. Keep in mind we can also change the labels we use (the default is 'ms'; we can add 'fps') and make sure things are marked lower_is_better or higher_is_better as appropriate.

Keep in mind we also have older hardware that runs these tests; maybe it is representative. There are plans in place to upgrade the CPU (and keep the Intel GPU), with prototypes being ordered this month.

I would leave this up to the perf tooling team to prioritize/change/review as needed. :kimberlythegeek, can you chime in here if there are other things to consider.

Flags: needinfo?(jmaher) → needinfo?(ksereduck)

:jrmuizel Could you provide more information on how the configuration is not representative, and on using ramp mode?

Flags: needinfo?(ksereduck) → needinfo?(jmuizelaar)
Priority: -- → P3
Severity: -- → S3
Whiteboard: [perftest:triage]

When you run https://browserbench.org/MotionMark/ in its default configuration it uses ramp mode. The constant complexity mode that we run it in is only accessible through https://browserbench.org/MotionMark/developer.html.

Flags: needinfo?(jmuizelaar) → needinfo?(ksereduck)

:jrmuizel, regarding point (4), have you seen this on multiple machines and platforms, or only your own so far?

Also, can you elaborate on why you want the complexity to be reported? We could add this to our extra-options, but it's unclear if we'll ever have more than 1 complexity variation of motionmark running at once.

Type: enhancement → task
Priority: P3 → P2
Flags: needinfo?(ksereduck) → needinfo?(jmuizelaar)

(In reply to Greg Mierzwinski [:sparky] from comment #5)

:jrmuizel, regarding point (4), have you seen this on multiple machines and platforms, or only you're own so far?

I've run it on a couple of other machines now and the results are mixed.

Also, can you elaborate on why you want the complexity to be reported? We could add this to our extra-options, but it's unclear if we'll ever have more than 1 complexity variation of motionmark running at once.

Complexity is the score reported by MotionMark when you run it in its default configuration. I just want that. That will prevent tests from getting capped at 60 fps like they currently do.
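A toy sketch (not MotionMark source; the per-item cost model is made up) of why a fixed-workload fps metric saturates at the display refresh rate while a ramp-mode complexity score can still separate two browsers of different speeds:

```python
# Hedged illustration: with a fixed workload, any browser faster than the
# display just reports the refresh rate; a complexity score does not clamp.

VSYNC_HZ = 60  # typical display refresh rate

def reported_fps(frame_time_ms):
    # Presentation is vsync-limited, so reported fps clamps at the refresh rate.
    return min(1000.0 / frame_time_ms, VSYNC_HZ)

def ramp_complexity(frame_time_ms, per_item_cost_ms=0.05):
    # Toy model: how many extra animated items fit in one 60 Hz frame budget.
    budget_ms = 1000.0 / VSYNC_HZ
    return (budget_ms - frame_time_ms) / per_item_cost_ms

# A 2 ms/frame browser and an 8 ms/frame browser both report 60 fps,
# but their ramp complexities differ.
print(reported_fps(2.0), reported_fps(8.0))
print(ramp_complexity(2.0), ramp_complexity(8.0))
```

Both browsers clamp to the same fps, which is exactly the indistinguishability described above; only the complexity numbers differ.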

Flags: needinfo?(jmuizelaar)

Ah ok, perfect, thanks for the additional info!

Summary: Switch motionmark to use 'ramp' mode and report complexity → Switch motionmark to use 'ramp' mode and report complexity score
Whiteboard: [perftest:triage]

Who should do this work?

The Jira task wasn't set up properly, so it evaded our grooming filter; sorry about that. We'll find someone to look into this at the next grooming session (on Monday, Dec 19).

Assignee: nobody → aglavic
Status: NEW → ASSIGNED

:jrmuizel a few questions about the switch:

  1. Would you prefer mean or median for the complexity scores?
  2. What are the units for complexity score? Should we use a unit of 'score'?
  3. Do you want this to be changed for both motionmark-html and motionmark-animometer?
Flags: needinfo?(jmuizelaar)

As well if we are tracking score, is lower still better?

(In reply to Andrej Glavic (:andrej) from comment #10)

:jrmuizel a few questions about the switch:

  1. Would you prefer mean or median for the complexity scores?
  2. What are the units for complexity score? Should we use a unit of 'score'?
  3. Do you want this to be changed for both motionmark-html and motionmark-animometer?
Attachment #9310719 - Attachment is obsolete: true
Priority: P2 → P1

(In reply to Andrej Glavic (:andrej) from comment #10)

:jrmuizel a few questions about the switch:

  1. Would you prefer mean or median for the complexity scores?

probably the median

  2. What are the units for complexity score? Should we use a unit of 'score'?

yep, score seems best

  3. Do you want this to be changed for both motionmark-html and motionmark-animometer?

Yes

As well if we are tracking score, is lower still better?

No, higher is better
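The answers above can be summarized in a small sketch; the function and dictionary keys here are illustrative, not Raptor's actual API:

```python
# Sketch, assuming per-run complexity scores from several benchmark iterations.
from statistics import median

def summarize(scores):
    # Median (per the answer above) is robust to a single noisy run;
    # the score is unitless and higher is better, unlike an ms-style metric.
    return {"value": median(scores), "unit": "score", "lower_is_better": False}

print(summarize([310.2, 295.8, 401.5]))  # value is the median, 310.2
```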

Flags: needinfo?(jmuizelaar)

Since we are already changing the parameters for the controller, would you like to keep all other existing preferences listed below?

  • test-interval=15
  • display=minimal
  • tiles=big
  • frame-rate=30
  • kalman-process-error=1
  • kalman-measurement-error=4
  • time-measurement=performance
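For illustration, the preferences above map naturally onto URL query parameters for the developer page; the parameter names below simply mirror that list and the `controller=ramp` entry is the mode change proposed in this bug, not a verified harness option:

```python
# Hypothetical sketch: build a MotionMark developer.html URL from the
# preference list above. Parameter names are assumptions, not verified.
from urllib.parse import urlencode

BASE = "https://browserbench.org/MotionMark/developer.html"

options = {
    "controller": "ramp",  # the mode change proposed in this bug
    "test-interval": 15,
    "display": "minimal",
    "tiles": "big",
    "frame-rate": 30,
    "kalman-process-error": 1,
    "kalman-measurement-error": 4,
    "time-measurement": "performance",
}

url = BASE + "?" + urlencode(options)
print(url)
```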
Flags: needinfo?(jmuizelaar)

I think the defaults look more like:

  • frame-rate=50
  • test-interval=30

I think everything else can stay the same.

Flags: needinfo?(jmuizelaar)
Attachment #9311131 - Attachment is obsolete: true
Priority: P1 → P2

We are working on changing MotionMark to use ramp mode, but for Chrome and Chromium on Macs, switching to ramp mode gives us a return value of 1 for all tests and subtests:
https://treeherder.mozilla.org/jobs?repo=try&revision=344b651c2a66fb39b8e4b65fe033d0a7117fc8ed
This is on the 1300 M2 machines, but the 1015 machines behaved similarly.

We've been seeing a number of scoring issues with MotionMark in general, including a reported score of 0 for Chrome on the Multiply test on very fast devices. But that doesn't affect all tests, so I suspect something else is going wrong here. We're hoping to fix some of the structural scoring problems in MotionMark 2. In the meantime, how difficult would it be to make a brand new taskcluster job so we can at least track ramp results for Firefox?

Flags: needinfo?(aglavic)

We can definitely do that :) I can look into it and get to it sometime soon after All Hands!

Flags: needinfo?(aglavic)

Leaving the needinfo open.

Flags: needinfo?(aglavic)
