Closed Bug 526234 Opened 15 years ago Closed 13 years ago

[Perf suite] Ensure that self validating testcases will fail

Categories

(Tamarin Graveyard :: Tools, defect, P3)

defect

Tracking

(Not tracked)

VERIFIED FIXED
Q3 11 - Serrano

People

(Reporter: brbaker, Assigned: brbaker)

References

Details

Attachments

(1 file, 10 obsolete files)

1) There are some performance tests that have self validation that does not have a correct "reference" value 2) If validation currently fails not all tests will be tracked as a failure when running via runtests.py
Flags: in-testsuite+
Flags: flashplayer-triage+
Flags: flashplayer-qrb?
Attached patch scimark/FFT and LU self validation (obsolete) (deleted) — Splinter Review
This patch was previously approved by dschaffe on bug 513229
Attachment #409940 - Flags: review+
Attached patch jsbench untyped (obsolete) (deleted) — Splinter Review
untyped jsbench testcases will now properly self validate and if they fail validation they will not output the "metric" causing them to properly fail when run via performance/runtests.py
Attachment #410222 - Flags: review?(dschaffe)
Attached patch jsbench untyped validation (obsolete) (deleted) — Splinter Review
- important change here is the moldyn now actually works, previously was generating bogus calculations (NaN), now works properly
Attachment #410222 - Attachment is obsolete: true
Attachment #410261 - Flags: review?(dschaffe)
Attachment #410222 - Flags: review?(dschaffe)
Attached patch jsbench typed validation (obsolete) (deleted) — Splinter Review
- important change here is the moldyn now actually works, previously was generating bogus calculations (NaN), now works properly
Attachment #410263 - Flags: review?(dschaffe)
I am getting RayTracer failure on my desktop mac for typed and untyped. Have you seen it? Validation failed Pixel checksum = 302690 Reference value = 302532 Validation failed Pixel checksum = 302690 Reference value = 302532 test failed validation! jsbench/RayTracer.as no test result - test output: ['Validation failed\n', 'Pixel checksum = 302690\n', 'Reference value = 302532\n', 'Validation failed\n', 'Pixel checksum = 302690\n', 'Reference value = 302532\n', 'test failed validation!\n']
Flags: flashplayer-qrb? → flashplayer-qrb+
Priority: -- → P3
Target Milestone: --- → flash10.1
Attachment #410261 - Flags: review?(dschaffe) → review-
Attachment #410263 - Flags: review?(dschaffe) → review-
Target Milestone: flash10.1 → flash10.2
The following performance tests are failing verification: jsbench/Euler.as jsbench/FFT.as jsbench/RayTracer.as jsbench/typed/Euler.as jsbench/typed/FFT.as jsbench/typed/RayTracer.as sunspider/as3vector/s3d-cube.as
Simple patch that checks for validation failure in performance tests and then does not record metric when validation fails. Tests need to be fixed before submitting this patch as it will fail in the performance phase.
Attachment #431365 - Flags: review?(brbaker)
Attachment #431365 - Flags: review?(brbaker) → review+
Comment on attachment 431365 [details] [diff] [review] Explicitly look for validation failures in tests, do not record results Should this code just be removed instead of just commented out? for f in out1: for line in f: print(line.strip()) Other patches will need to be scrubbed and rebased before this can land
Are we certain that all failures result in the string "Validation failed" ? For example, jsbench/LUFact.as prints a computed and a reference value which differ in my test run, but no explicit error message is displayed.
Depends on: 594836
Attached patch v2 jsbench self validating (obsolete) (deleted) — Splinter Review
Rebased patch. RayTracer now has 2 'valid' results since Math.tan() produces different results on mac-intel which changes the checksum of the image (bug #594836) - important change here is the moldyn now actually works, previously was generating bogus calculations (NaN), now works properly
Attachment #410261 - Attachment is obsolete: true
Attachment #410263 - Attachment is obsolete: true
Attached patch v3. jsbench self validating (obsolete) (deleted) — Splinter Review
Attachment #474021 - Attachment is obsolete: true
Attached patch v2. scimark/FFT and LU self validation (obsolete) (deleted) — Splinter Review
Attachment #409940 - Attachment is obsolete: true
Attachment #431365 - Attachment is obsolete: true
With the current set of patches self validating tests are working correctly and if self validation fails the runtests will not record a metric value (both because no metric is generated but also because "validation failed" is found in the output).
Blocks: 607714
Depends on: 645018
status?
Attached patch Combined patch (deleted) — Splinter Review
This is a rebased and combined patch of previous patches. I have run this patch through the following configurations: mac 32/64, win32/64, linux 32/64 and android-arm
Attachment #474038 - Attachment is obsolete: true
Attachment #474039 - Attachment is obsolete: true
Attachment #474040 - Attachment is obsolete: true
Attachment #474041 - Attachment is obsolete: true
Attachment #529659 - Flags: review?(dschaffe)
Flags: flashplayer-bug-
Attachment #529659 - Flags: review?(dschaffe) → review+
changeset: 6255:c69cfc4d7c51 user: Brent Baker <brbaker@adobe.com> summary: Bug 526234: enable self validation to fail a performance testcase and not record a metric (r=dschaffe) http://hg.mozilla.org/tamarin-redux/rev/c69cfc4d7c51
jsbench validation for LUFact, Moldyn and Raytrace are failing on linux32, this is not an injection and has been found to happen as far back as coral. This is being tracked via bug# 654498. Closing this bug since all available work/patches have been committed
Status: ASSIGNED → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Status: RESOLVED → VERIFIED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: