1449518 - Win64 CFI unwind tests failing due to stack overflow on clang builds

Assignee

Description

•

7 years ago

The following tests fail due to a stack overflow as discovered by dmajor in Bug 1420947 comment 9: test_crash_win64cfi_alloc_small.js test_crash_win64cfi_infinite_entry_chain.js test_crash_win64cfi_infinite_code_chain.js test_crash_win64cfi_invalid_exception_rva.js test_crash_win64cfi_not_a_pe.js These tests all begin by crashing firefox via CRASH_X64CFI_ALLOC_SMALL. That crash attempts to trash RBP to obfuscate the stack, which works fine on MSVC builds, but under clang causes this stack overflow. https://searchfox.org/mozilla-central/rev/028cd8342899357d80fafba2d528a0fd5819e316/toolkit/crashreporter/test/win64UnwindInfoTests.asm#179 It's not really worth digging into why this difference occurs; this stack obfuscation is not essential to the test anyway.

Comment hidden (mozreview-request)

(Away)

Comment 2

•

7 years ago

Great find yet again! I see there are still some fail-if ccov lines left in the file. Were they fixed by your previous bug? If so then I'd recommend pushing a followup fix on that bug. Or, are they still an outstanding issue?

(Away)

Comment 3

•

7 years ago

mozreview-review

Comment on attachment 8963040 [details] Bug 1449518: More accurate testing for invalid CFI in Win64 unwind info CFI tests; https://reviewboard.mozilla.org/r/231878/#review237518 I don't really grok the details of why the bp-trashing is or isn't needed, but as long as this passes on both MSVC and clang-cl, and it still tests the thing you're wanting to test, then go for it.

Attachment #8963040 - Flags: review?(dmajor) → review+

Carl Corcoran [:ccorcoran]

Assignee

Comment 4

•

7 years ago

Yep I submitted a patch to pass those other fail-if ccov tests in bug 1449392

Carl Corcoran [:ccorcoran]

Assignee

Updated

•

7 years ago

Keywords: checkin-needed

Pulsebot

Comment 5

•

7 years ago

Pushed by cbrindusan@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/21db86504db5 Remove unnecessary RBP trashing in x64CrashCFITest_ALLOC_SMALL; r=dmajor

Keywords: checkin-needed

Cosmin Sabou [:CosminS]

Comment 6

•

7 years ago

Backed out for xpcshell failures on toolkit/crashreporter/test/unit/test_crash_win64cfi_invalid_exception_rva.js Push with failures: https://treeherder.mozilla.org/#/jobs?repo=autoland&filter-resultStatus=testfailed&filter-resultStatus=busted&filter-resultStatus=exception&fromchange=21db86504db5454c5c3f7052c651ca3c565076c2&tochange=10f662b4d0b30f53d12618f4c82a07127ec83e89&filter-searchStr=xpcshell&selectedJob=170820751 Failure log: https://treeherder.mozilla.org/logviewer.html#?job_id=170820751&repo=autoland&lineNumber=10222 Backout link: https://hg.mozilla.org/integration/autoland/rev/10f662b4d0b30f53d12618f4c82a07127ec83e89

Flags: needinfo?(ccorcoran)

Comment hidden (mozreview-request)

Code Review Bot [:reviewbot]

Comment 8

•

7 years ago

mozreview-review

Comment on attachment 8963040 [details] Bug 1449518: More accurate testing for invalid CFI in Win64 unwind info CFI tests; https://reviewboard.mozilla.org/r/231878/#review237846 Code analysis found 1 defect in this patch: - 1 defect found by mozlint You can run this analysis locally with: - `./mach lint path/to/file` (JS/Python) If you see a problem in this automated review, please report it here: http://bit.ly/2y9N9Vx ::: toolkit/crashreporter/test/unit/head_win64cfi.js:128 (Diff revision 2) > + dumpThisFrame(); > + info("Expected frame trust matched when it should not have."); > + ok(false); > + } > + } else { > - if (frame.trust !== expectedFrame.trust) { > + if (frame.trust !== expectedFrame.trust) { Error: Unexpected if as the only statement in an else block. [eslint: no-lonely-if]

Comment hidden (mozreview-request)

Carl Corcoran [:ccorcoran]

Assignee

Comment 10

•

7 years ago

Some background on these tests and the fix... These 4 tests... - test_crash_win64cfi_infinite_entry_chain.js - test_crash_win64cfi_infinite_code_chain.js - test_crash_win64cfi_invalid_exception_rva.js - test_crash_win64cfi_not_a_pe.js ...all crash firefox.exe, but instead of reading stack unwind info from firefox.exe itself, they get it from a test EXE file that's been doctored with certain corruptions. The point of testing these is to ensure that even under the worst conditions, the stack walker will at least continue to work as well as it used to before we started supporting Win64 unwind info. For example in "test_crash_win64cfi_infinite_code_chain.js", the EXE providing unwind info has an infinite loop in its entry chains. Here we want to make sure the unwind info parser 1) doesn't freeze in an infinite loop, 2) bails out and continues to walk the stack as it used to. Trashing RBP was an attempt to force the behavior of checking for the "as it used to" behavior. Without CFI, breakpad's stack scanner will try to deduce RBP. If it can, it will reconstruct the stack frame with "frame_pointer" trust. If not, it will scan the stack ("scan" trust). The problem with our method is that trashing RBP is unpredictable and caused the stack overflow as seen above. And since trashing RBP is not essential to this test, I stopped trashing RBP, but this means we cannot 100% predict when breakpad will choose to stack scan or use the frame pointer, as seen in the latest failure (comment 6). A more accurate way to test for fallback behavior is to just ensure it's NOT using CFI, so it doesn't matter whether breakpad chooses to stack scan or frame_pointer. This change is reflected in the latest patch.

Flags: needinfo?(ccorcoran)

(Away)

Updated

•

7 years ago

Attachment #8963040 - Flags: review?(gsvelto)

Gabriele Svelto [:gsvelto]

Comment 11

•

7 years ago

mozreview-review

Comment on attachment 8963040 [details] Bug 1449518: More accurate testing for invalid CFI in Win64 unwind info CFI tests; https://reviewboard.mozilla.org/r/231878/#review238160 This looks robust, great!

Attachment #8963040 - Flags: review?(gsvelto) → review+

Carl Corcoran [:ccorcoran]

Assignee

Updated

•

7 years ago

Keywords: checkin-needed

Pulsebot

Comment 12

•

7 years ago

Pushed by csabou@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/48dab6fbabf7 More accurate testing for invalid CFI in Win64 unwind info CFI tests; r=dmajor,gsvelto

Keywords: checkin-needed

Bogdan Tara[:bogdan_tara | bogdant]

Comment 13

•

7 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/48dab6fbabf7

Status: ASSIGNED → RESOLVED

Closed: 7 years ago

status-firefox61: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla61

Bugzilla

Win64 CFI unwind tests failing due to stack overflow on clang builds

Categories

(Toolkit :: Crash Reporting, defect, P1)

Tracking

()

People

(Reporter: ccorcoran, Assigned: ccorcoran)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Updated

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Updated

Comment 11

Updated

Comment 12

Comment 13

Attachment

General

Description

File Name

Content Type