1282638 - Intermittent gfx/layers/apz/test/mochitest/test_group_touchevents.html | Test timed out.

Try push with logging: https://treeherder.mozilla.org/#/jobs?repo=try&revision=a2f0e74c5929&group_state=expanded At least some of the failures seem to be because helper_touch_action ends up triggering a fling which is unexpected. Not only does it throw off the scroll position, future touch events get ignored because of the fast-motion. There are other failure too, so I'll spin off another bug to fix this issue.

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Updated

•

8 years ago

Depends on: 1297408

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 14

•

8 years ago

I put a patch on bug 1297408. Next try run with more logging at https://treeherder.mozilla.org/#/jobs?repo=try&revision=58443a139413&selectedJob=26227850 seems to indicate that during helper_long_tap, the APZ that gets long-tapped upon is destroyed during the long-tap. Not sure why yet.

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 16

•

8 years ago

Seems like maybe waitUntilApzStable returns too soon, before the previous page is unloaded, and so the tap events end up going to the wrong page. That's a little concerning.

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 18

•

8 years ago

Based on additional logging [1], I think the problem is that waitUntilApzStable does a waitForAllPaints instead of a waitForAllPaintsFlushed. For some reason on OS X we can run through that waitForAllPaints call without having done a paint and without a paint being pending, and so the test goes on to run on the previous page's layer tree. This naturally doesn't work and usually results in the timeouts. I have a new try push [2] which changes that call to flush the paints to see if it fixes the issue. [1] https://treeherder.mozilla.org/#/jobs?repo=try&revision=b2f9d5411f9e&selectedJob=26520124 [2] https://treeherder.mozilla.org/#/jobs?repo=try&revision=5fed88798269

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 19

•

8 years ago

So that seemed to help fix the helper_tap timeouts, but I see the exact same issue on helper_long_tap (which I was seeing before as well). So I suspect that really the patch did nothing. I pored over the logs a bit more and as far as I can tell, the compositor is in fact getting the layers update (and the screenshot indicates that the correct page is visible) but maybe the APZ tree rebuild is getting skipped? I don't see any APZC NLU calls with aIsFirstPaint=1 for the subtest that fails. Continuing investigation along those lines...

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 20

•

8 years ago

What I'm finding is that normally when subtests inside test_group_touchevents load, the window they spawn in first loads about:blank and then loads the subtest file. This is normal and expected, and it means that the compositor gets two aIsFirstPaint transactions (one for about:blank and the other for the page). In the bad cases it seems like about:blank doesn't load, it goes straight to the subtest file, and that may or may not be related to the problem. The latest log I have is from this job: https://treeherder.mozilla.org/#/jobs?repo=try&revision=4caf56adfe12&selectedJob=26682790

Priority: P5 → P3

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 22

•

8 years ago

I'm not really sure where to go from here, and there are higher-volume intermittent failures that I can look at, so I'm unassigning this one.

Assignee: bugmail → nobody

Comment hidden (Intermittent Failures Robot)

Ryan VanderMeulen [:RyanVM]

Comment 27

•

8 years ago

Looks like this is basically permafail on Win8 e10s at the moment. I'll try to bisect when that happened. https://treeherder.mozilla.org/logviewer.html#?job_id=29204519&repo=try

Ryan VanderMeulen [:RyanVM]

Comment 28

•

8 years ago

Oh, that answer is obvious I guess. It was only re-enabled on Windows a week ago in bug 1291381. Kats, can you please take a look? Win8 M-e10s(4) is available on Try.

Flags: needinfo?(bugmail)

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 30

•

8 years ago

First try push with logging seems to show [1] that it's hanging during loading of helper_bug1162771.html. Not really surprising since that's what the screenshot was showing as well. I'll do more try pushes with more logging to figure out what's going on. [1] https://treeherder.mozilla.org/#/jobs?repo=try&revision=c7fdd4cb15c554e976116f1fbf9a1f37f58453a1&selectedJob=29211204

Flags: needinfo?(bugmail)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Updated

•

8 years ago

Assignee: nobody → bugmail

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 31

•

8 years ago

After many try pushes with successively more logging, it looks like we do call the injectTouchEvent windows API with the touchstart event on helper_bug1162771.html, but we never get the WM_TOUCH. Similar previous calls work fine. The only notable thing about this touchstart event that I can see is that it has a lower x-coordinate (x=16) than the previous ones. So my guess is that windows 8 has some sort of edge swipe gesture detector that's holding on to the touchstart to see if it's an edge gesture. I'm running a try push with a higher x-coordinate to see if it helps.

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 32

•

8 years ago

That seems to have worked: https://treeherder.mozilla.org/#/jobs?repo=try&revision=aa2d935b69374f1135532da15f7a6f04645aba47&selectedJob=29455183 I'll spin out a new bug with the fix, since it's probably a separate issue from the intermittent this bug was originally tracking.

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Updated

•

8 years ago

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1311406

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 33

•

8 years ago

Split the fix into bug 1311406, throwing this back into the unassigned pool.

Assignee: bugmail → nobody

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 43

•

7 years ago

The failures reported in the previous comment were introduced with bug 1376519 and stopped when it was backed out.

Geoff Brown [:gbrown]

Updated

•

7 years ago

Whiteboard: [gfx-noted] → [gfx-noted][stockwell fixed:backout]

Comment hidden (Intermittent Failures Robot)

Firefox Bug Husbandry Bot

Comment 46

•

7 years ago

Bulk priority update of open intermittent test failure bugs. P3 => P5 https://bugzilla.mozilla.org/show_bug.cgi?id=1381960

Priority: P3 → P5

Comment hidden (Intermittent Failures Robot)

Sebastian Hengst [:aryx] (needinfo me if it's about an intermittent or backout)

Comment 51

•

7 years ago

This started permafailing today and doing retriggers on past green jobs also permafail. First there was bug 1411144, not it's this one. Any ideas what could be causing this? Thank you in advance. https://treeherder.mozilla.org/#/jobs?repo=mozilla-inbound&revision=405cc8ca7f764e985c6ab1e6d4365681b6ff2e10&filter-resultStatus=testfailed&filter-resultStatus=busted&filter-resultStatus=exception&filter-resultStatus=usercancel&filter-resultStatus=runnable&filter-resultStatus=retry

Flags: needinfo?(rthijssen)

Flags: needinfo?(bugmail)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 52

•

7 years ago

From a quick look at the log it looks like touch event injection is failing. All the failures are on Windows so probably the releng or taskcluster folks updated something in the OS that caused this to start failing. If they can't roll it back I can probably update the tests to use our "synthetic touch events" rather than OS-injected touch events but the test will be less representative of real-world scenarios so I would prefer to avoid that.

Comment hidden (Intermittent Failures Robot)

Rob Thijssen [:grenade (EET/UTC+0300)]

Comment 54

•

7 years ago

some months ago i added a registry key to disable touch events (bug 1382988, comment 4). that patch never actually worked due to a syntax error that quietly logged a failure. recently, whilst cleaning up these syntax errors to reduce noise in the logs, the registry hack was corrected: https://github.com/mozilla-releng/OpenCloudConfig/commit/801ef77f468b7e6bc5778a7e231f196af17fee65 https://github.com/mozilla-releng/OpenCloudConfig/commit/51ecff2f17159a1ec9d13242438b7402a0d908b1 i assume that when the registry hack to disable touch events started working, it broke these tests. i've removed the registry hack today (https://github.com/mozilla-releng/OpenCloudConfig/commit/614792e280811e80422a5732c6304d348757e9e3) and will retrigger tests when the rebuilt amis have propagated to see if they go green. the ami rebuild is here: https://tools.taskcluster.net/groups/NY_vYQDuQZGZji0QY49cvQ

Flags: needinfo?(rthijssen)

Rob Thijssen [:grenade (EET/UTC+0300)]

Comment 55

•

7 years ago

retrigger is green: https://treeherder.mozilla.org/#/jobs?repo=mozilla-inbound&revision=405cc8ca7f764e985c6ab1e6d4365681b6ff2e10&filter-searchStr=windows10&selectedJob=142219153&group_state=expanded

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → FIXED

Comment hidden (Intermittent Failures Robot)

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 57

•

7 years ago

Thanks Rob! Since this intermittent failure was happening even before the permafail I'm not sure it's correct to actually mark this bug FIXED but we can leave it for now and reopen it if we continue to see intermittent instances of this failure.

Flags: needinfo?(bugmail)

Phil Ringnalda (:philor)

Updated

•

7 years ago

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Comment hidden (Intermittent Failures Robot)

Firefox Bug Husbandry Bot

Comment 131

•

5 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup

Status: REOPENED → RESOLVED

Closed: 7 years ago → 5 years ago

Resolution: --- → INCOMPLETE

Narcis Beleuzu [:NarcisB]

Updated

•

5 years ago

Status: RESOLVED → REOPENED

Resolution: INCOMPLETE → ---

Comment hidden (Intermittent Failures Robot)

BugBot [:suhaib / :marco/ :calixte]

Comment 166

•

4 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup
For more information, please visit auto_nag documentation.

Status: REOPENED → RESOLVED

Closed: 5 years ago → 4 years ago

Resolution: --- → INCOMPLETE

Atila Butkovits

Comment 167

•

4 years ago

Recent failure: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=308906330&repo=mozilla-beta&lineNumber=26602

Atila Butkovits

Updated

•

4 years ago

Status: RESOLVED → REOPENED

Resolution: INCOMPLETE → ---

Comment hidden (Intermittent Failures Robot)

BugBot [:suhaib / :marco/ :calixte]

Comment 171

•

4 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup
For more information, please visit auto_nag documentation.

Status: REOPENED → RESOLVED

Closed: 4 years ago → 4 years ago

Resolution: --- → INCOMPLETE

Bogdan Tara[:bogdan_tara | bogdant]

Comment 172

•

4 years ago

This is still happening.

Recent failure: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=315927457&repo=autoland&lineNumber=25961

Status: RESOLVED → REOPENED

Resolution: INCOMPLETE → ---

Comment hidden (Intermittent Failures Robot)

BugBot [:suhaib / :marco/ :calixte]

Comment 174

•

4 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup
For more information, please visit auto_nag documentation.

Status: REOPENED → RESOLVED

Closed: 4 years ago → 4 years ago

Resolution: --- → INCOMPLETE

Razvan Maries

Comment 175

•

4 years ago

New occurrence: https://treeherder.mozilla.org/logviewer.html#/jobs?job_id=318660377&repo=autoland&lineNumber=26236

Status: RESOLVED → REOPENED

Resolution: INCOMPLETE → ---

Comment hidden (Intermittent Failures Robot)

BugBot [:suhaib / :marco/ :calixte]

Comment 177

•

4 years ago

https://wiki.mozilla.org/Bug_Triage#Intermittent_Test_Failure_Cleanup
For more information, please visit auto_nag documentation.

Status: REOPENED → RESOLVED

Closed: 4 years ago → 4 years ago

Resolution: --- → INCOMPLETE

Atila Butkovits

Comment 178

•

4 years ago

New failure log: https://treeherder.mozilla.org/logviewer?job_id=321397734&repo=autoland&lineNumber=30772

Status: RESOLVED → VERIFIED

Comment hidden (Intermittent Failures Robot)