990580 - Tests are failing; app freezes after being launched

Reporter

Description

•

11 years ago

Tests on Jenkins are failing because of a timeout in the app launch method. The timeout occurs when trying to switch to the frame of the newly opened app. This issue is not reproduced locally with manual or automated testing. Stacktrace: 06:42:52 ERROR: None 06:42:52 ---------------------------------------------------------------------- 06:42:52 Traceback (most recent call last): 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/.env/local/lib/python2.7/site-packages/marionette_client-0.7.5-py2.7.egg/marionette/marionette_test.py", line 163, in run 06:42:52 testMethod() 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/tests/python/gaia-ui-tests/gaiatest/tests/functional/keyboard/test_keyboard.py", line 19, in test_keyboard_basic 06:42:52 contacts_app.launch() 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/tests/python/gaia-ui-tests/gaiatest/apps/contacts/app.py", line 32, in launch 06:42:52 Base.launch(self) 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/tests/python/gaia-ui-tests/gaiatest/apps/base.py", line 25, in launch 06:42:52 self.app = self.apps.launch(self.name, launch_timeout=launch_timeout) 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/tests/python/gaia-ui-tests/gaiatest/gaia_test.py", line 60, in launch 06:42:52 self.marionette.switch_to_frame(app.frame_id) 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/.env/local/lib/python2.7/site-packages/marionette_client-0.7.5-py2.7.egg/marionette/marionette.py", line 895, in switch_to_frame 06:42:52 response = self._send_message('switchToFrame', 'ok', element=frame.id, focus=focus) 06:42:52 File "/var/jenkins/workspace/b2g.hamachi.mozilla-central.ui.2/.env/local/lib/python2.7/site-packages/marionette_client-0.7.5-py2.7.egg/marionette/marionette.py", line 604, in _send_message 06:42:52 "Connection timed out", status=ErrorCodes.TIMEOUT) 06:42:52 TimeoutException: TimeoutException: Connection timed out 06:42:52 06:42:52 06:42:52 Most recent errors/exceptions are: 06:42:52 06:42:52 04-01 06:36:45.129 E/GeckoConsole( 6688): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:397 in loadSVConfFileError: Failed parsing singleVariant configuration file [js/singlevariantconf.json]: [Exception... "File error: Not found" nsresult: "0x80520012 (NS_ERROR_FILE_NOT_FOUND)" location: "JS frame :: app://homescreen.gaiamobile.org/gaia_build_defer_index.js :: loadFile :: line 389" data: no] 06:42:52 04-01 06:36:47.209 E/GeckoConsole( 6688): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:60 in retrieve: Got an exception when trying to load icon "app://music2.gaiamobile.org/style/icons/60/Music.png +" falling back to cached icon. Exception is: 06:42:52 TEST-UNEXPECTED-FAIL | test_keyboard.py test_keyboard.TestKeyboard.test_keyboard_basic | The B2G process has restarted after crashing during the tests so Marionette can't respond due to either a Gecko, Gaia or Marionette error. Above, the 5 most recent errors are listed. Check logcat for all errors if these errors are not the cause of the failure. Latest build on which it is reproduced: Gaia 874fe42b82e8d819d592690e74db91c07179e68c Gecko https://hg.mozilla.org/mozilla-central/rev/1417d180a1d8 BuildID 20140401040202 Version 31.0a1 ro.build.version.incremental=eng.tclxa.20131223.163538 ro.build.date=Mon Dec 23 16:36:04 CST 2013 Latest test run where the issue occured: http://selenium.qa.mtv2.mozilla.com:8080/view/B2G%20Hamachi/job/b2g.hamachi.mozilla-central.ui.2/255/console

Robert Chira [:RobertC]

Reporter

Comment 1

•

11 years ago

This issue first appeared in yesterdays build (31 March). Showing version details application_buildid: 20140331040200 application_changeset: d8e8f13bd4ae application_name: B2G application_repository: https://hg.mozilla.org/mozilla-central application_version: 31.0a1 build_changeset: 1ad48c4be51b279f7f63c1a13025b52fe087d231 device_firmware_date: 1387433095 device_firmware_version_incremental: 324 device_firmware_version_release: 4.0.4 device_id: msm7627a gaia_changeset: 26839cb46f856d610b192f5655a8c38a6bfe0829 gaia_date: 1396251714 gecko_changeset: 34c6e4261eb036cb6050d5b1a73cd9bc4f5f6251 platform_buildid: 20140331040200 platform_changeset: d8e8f13bd4ae platform_repository: https://hg.mozilla.org/mozilla-central

Robert Chira [:RobertC]

Reporter

Updated

•

11 years ago

Summary: Tests are failing on Jenkins with timeout when trying to launch apps → Tests are failing on Jenkins with timeout when trying to switch to the app frame

Robert Chira [:RobertC]

Reporter

Comment 2

•

11 years ago

It seems the error is not limited to the launch method. It appears when calling self.marionette.switch_to_frame(app.frame_id) regardless of the method it's called from.

Florin Strugariu [:Bebe]

Comment 3

•

11 years ago

Adding qawanted as we can not reproduce this locally with automation and manual testing.

Keywords: qawanted

Jason Smith [:jsmith]

Comment 4

•

11 years ago

(In reply to Florin Strugariu [:Bebe] from comment #3) > Adding qawanted as we can not reproduce this locally with automation and > manual testing. We really only need to do qawanted in cases where there hasn't been a manual investigation already. From what I can tell from the comments above, that already happened here.

Keywords: qawanted

Stephen Donner [:stephend] Not actively reading bugmail

Comment 5

•

11 years ago

Andrei, team: when you see timeouts like this, can you start attaching debugging logs, if we suspect a crash? See https://wiki.mozilla.org/B2G/QA/Tips_And_Tricks#Debugging_OOMs for info; thanks!

Malini Das [:mdas] - Away, not checking bugmail

Comment 6

•

11 years ago

From the error message: (In reply to Robert Chira [:RobertC] from comment #0) > test_keyboard.TestKeyboard.test_keyboard_basic | The B2G process has > restarted after crashing during the tests so Marionette can't respond due to > either a Gecko, Gaia or Marionette error. Above, the 5 most recent errors > are listed. Check logcat for all errors if these errors are not the cause of > the failure. This means that during this test, b2g crashed and restarted, so Marionette can't respond since the server was restarted. The crash might be due to the errors that was printed out: "04-01 06:36:45.129 E/GeckoConsole( 6688): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:397 in loadSVConfFileError: Failed parsing singleVariant configuration file [js/singlevariantconf.json]: [Exception... "File error: Not found" nsresult: "0x80520012 (NS_ERROR_FILE_NOT_FOUND)" location: "JS frame :: app://homescreen.gaiamobile.org/gaia_build_defer_index.js :: loadFile :: line 389" data: no] 06:42:52 04-01 06:36:47.209 E/GeckoConsole( 6688): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:60 in retrieve: Got an exception when trying to load icon "app://music2.gaiamobile.org/style/icons/60/Music.png +" falling back to cached icon. Exception is: " But more information about the crash would be in the logcat, or in the minidumps if you have that enabled.

Florin Strugariu [:Bebe]

Comment 7

•

11 years ago

I found: Bug 918982 - FILE_NOT_FOUND error fired in logcat looking for singleVariant configuration file (singlevariantconf.json) Bug 980814 - Fix test_system_message.py, JavaScript Error: "NS_ERROR_ILLEGAL_VALUE: Component returned failure code: 0x80070057 (NS_ERROR_ILLEGAL_VALUE) That look to contain the same error: E/GeckoConsole( 2241): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:397 in loadSVConfFileError: Failed parsing singleVariant configuration file [js/singlevariantconf.json]: [Exception... "File error: Not found" nsresult: "0x80520012 (NS_ERROR_FILE_NOT_FOUND)" location: "JS frame :: app://homescreen.gaiamobile.org/gaia_build_defer_index.js :: loadFile :: line 389" data: no] Also when we reboot the phone we see the same issue: E/GeckoConsole( 371): Content JS ERROR at app://homescreen.gaiamobile.org/gaia_build_defer_index.js:397 in loadSVConfFileError: Failed parsing singleVariant configuration file [js/singlevariantconf.json]: [Exception... "File error: Not found" nsresult: "0x80520012 (NS_ERROR_FILE_NOT_FOUND)" location: "JS frame :: app://homescreen.gaiamobile.org/gaia_build_defer_index.js :: loadFile :: line 389" data: no] Manual steps: 1. flash latest build 2. open adb log cat and watch for the error BuildID: Gaia 0e974ff33ba47f3d1e59df1e0ad534f1bbe3ef8a Gecko https://hg.mozilla.org/mozilla-central/rev/91be2828f17e BuildID 20140403040201 Version 31.0a1 ro.build.version.incremental=324 ro.build.date=Thu Dec 19 14:04:55 CST 2013

Florin Strugariu [:Bebe]

Comment 8

•

11 years ago

Attached file Logcat when rebooting the phone (deleted) — Details

at line 231 you can find the error

Florin Strugariu [:Bebe]

Comment 9

•

11 years ago

This issue is reproducible in today’s Jenkins build. Gaia 0e974ff33ba47f3d1e59df1e0ad534f1bbe3ef8a Gecko https://hg.mozilla.org/mozilla-central/rev/91be2828f17e BuildID 20140403040201 Version 31.0a1 ro.build.version.incremental=324 ro.build.date=Thu Dec 19 14:04:55 CST 2013 We can't reproduce the test failure locally.

[:AndreiH]

Comment 10

•

11 years ago

I actually managed to replicate this when running test_url_keyboard.TestUrlKeyboard locally. When I saw it failed, the UI Tests app failed to launch. After being tapped the app opening animation did not finish. Hamachi build: Gaia d9a574284d672f532f7c562a091bb01f531202b1 Gecko https://hg.mozilla.org/mozilla-central/rev/6c924a018540 BuildID 20140404040204 Version 31.0a1

Stephen Donner [:stephend] Not actively reading bugmail

Comment 11

•

11 years ago

(In reply to [:AndreiH] from comment #10) > I actually managed to replicate this when running > test_url_keyboard.TestUrlKeyboard locally. > When I saw it failed, the UI Tests app failed to launch. After being tapped > the app opening animation did not finish. > > Hamachi build: > Gaia d9a574284d672f532f7c562a091bb01f531202b1 > Gecko https://hg.mozilla.org/mozilla-central/rev/6c924a018540 > BuildID 20140404040204 > Version 31.0a1 Thanks for isolating that; in the future, as requested in comment 5, can you get a minidump/crash log attached, here? See https://wiki.mozilla.org/B2G/QA/Tips_And_Tricks#Debugging_OOMs

Jason Smith [:jsmith]

Comment 12

•

11 years ago

Have we determined here that this is a gecko bug or a bug in the test?

Flags: needinfo?(stephen.donner)

Stephen Donner [:stephend] Not actively reading bugmail

Comment 13

•

11 years ago

(In reply to Jason Smith [:jsmith] from comment #12) > Have we determined here that this is a gecko bug or a bug in the test? Looks like a core B2G process/Gecko crasher, to me. Bebe, please work with the team to get this further debugged (look at http://selenium.qa.mtv2.mozilla.com:8080/job/b2g.hamachi.mozilla-central.tinderbox.ui.receive_call/42/ and http://selenium.qa.mtv2.mozilla.com:8080/job/b2g.hamachi.mozilla-central.unittests/370/) for smaller tests to help reproduce. Malini/Gregor, can you take a look with the information we'll provide? Thx.

Flags: needinfo?(stephen.donner) → needinfo?(florin.strugariu)

Florin Strugariu [:Bebe]

Comment 14

•

11 years ago

Attached file Full logcat of failing test (deleted) — Details

Full logcat from http://selenium.qa.mtv2.mozilla.com:8080/job/b2g.hamachi.mozilla-central.tinderbox.ui.receive_call/42

Flags: needinfo?(florin.strugariu)

Logcat when rebooting the phone 11 years ago Florin Strugariu [:Bebe] (deleted), text/plain		Details
Full logcat of failing test 11 years ago Florin Strugariu [:Bebe] (deleted), text/plain		Details
test_camera_multiple_shots.py Debugging OOM 11 years ago [:AndreiH] (deleted), application/x-gzip		Details
locally run logcat.txt 11 years ago [:AndreiH] (deleted), text/plain		Details
test_gallery_view logcat 11 years ago Florin Strugariu [:Bebe] (deleted), text/plain		Details
Full logcat of the latest failure in test_email_keyboard 11 years ago [:AndreiH] (deleted), text/plain		Details
github pr 11 years ago Zac C (:zac) (deleted), text/x-github-pull-request		Details
bug-990580_gaia-cold-launch_qemu-ics_adb-logcat.zip 11 years ago Vicamo Yang [:vicamo][:vyang] (deleted), application/zip		Details
bug-990580_gaia-cold-launch_qemu-ics_console.txt 11 years ago Vicamo Yang [:vicamo][:vyang] (deleted), text/plain		Details
bug-990580_gaia-cold-launch_nexus-5_adb-logcat.zip 11 years ago Vicamo Yang [:vicamo][:vyang] (deleted), application/zip		Details
Backout the workaround from gaiatest 11 years ago Zac C (:zac) (deleted), text/x-github-pull-request	Bebe : review+	Details