Closed
Bug 906793
Opened 11 years ago
Closed 10 years ago
B2G mochitests intermittently just stop after "IMediaResourceManagerDeathNotifier: media server died" and/or "ServiceManager: service 'media.resource_manager' died"
Categories
(Firefox OS Graveyard :: General, defect)
Tracking
(Not tracked)
RESOLVED
WORKSFORME
People
(Reporter: RyanVM, Unassigned)
References
Details
(Keywords: intermittent-failure)
Attachments
(1 obsolete file)
Worst of all, the error only shows up in the dump, so TBPL can't suggest anything for it.
https://tbpl.mozilla.org/php/getParsedLog.php?id=26722508&tree=B2g-Inbound
b2g_emulator_vm b2g-inbound opt test mochitest-3 on 2013-08-19 08:37:33 PDT for push 24fbe2665d27
slave: tst-linux64-ec2-108
09:15:48 INFO - 08-19 16:08:39.423 699 699 I GeckoDump: 15720 INFO TEST-PASS | /tests/content/smil/test/test_smilMappedAttrFromTo.xhtml | image-rendering: checking that 'from' value is set at start of animation
09:15:48 INFO - 08-19 16:08:39.433 699 699 I GeckoDump: 15721 INFO TEST-PASS | /tests/content/smil/test/test_smilMappedAttrFromTo.xhtml | image-rendering: checking value halfway through animation
09:15:48 INFO - 08-19 16:14:09.692 699 1015 W IMediaResourceManagerDeathNotifier: media server died
09:15:48 INFO - 08-19 16:14:09.692 33 33 I ServiceManager: service 'media.resource_manager' died
Reporter | ||
Comment 1•11 years ago
|
||
Summary: B2G mochitests intermittently just stop (after "IMediaResourceManagerDeathNotifier: media server died") → B2G mochitests intermittently just stop after "IMediaResourceManagerDeathNotifier: media server died" and/or "ServiceManager: service 'media.resource_manager' died"
Comment 3•11 years ago
|
||
(In reply to Ryan VanderMeulen [:RyanVM UTC-4] from comment #0)
> 09:15:48 INFO - 08-19 16:14:09.692 699 1015 W IMediaResourceManagerDeathNotifier: media server died
> 09:15:48 INFO - 08-19 16:14:09.692 33 33 I ServiceManager: service 'media.resource_manager' died
The log above indicates that the b2g process died for some reason while a video codec was being instantiated. On the emulator, a software video codec is created.
Reporter | ||
Comment 15•11 years ago
|
||
Sotaro, can you please suggest an assignee for this bug?
https://tbpl.mozilla.org/php/getParsedLog.php?id=26959050&full=1&branch=b2g-inbound
Flags: needinfo?(sotaro.ikeda.g)
Comment 19•11 years ago
|
||
(In reply to Ryan VanderMeulen [:RyanVM UTC-4] from comment #15)
> Sotaro, can you please suggest an assignee for this bug?
>
> https://tbpl.mozilla.org/php/getParsedLog.php?id=26959050&full=1&branch=b2g-inbound
I'll take this bug. Bug 907745 needs to be fixed before I can investigate this problem; it looks like it will be fixed in the near future.
Assignee: nobody → sotaro.ikeda.g
Updated•11 years ago
|
Blocks: b2g-mochitests
Comment 56•11 years ago
|
||
I still haven't been able to figure out the interleaving, but figured I might as well upload the fix to the timeout error message while I was thinking about it.
Attachment #809412 -
Flags: review?(jhammel)
Comment 57•11 years ago
|
||
Comment on attachment 809412 [details] [diff] [review]
Patch 1.0 - fix format errors in the timeout error message
Beh, sorry. I meant to upload this to bug 917252
Attachment #809412 -
Attachment is obsolete: true
Attachment #809412 -
Flags: review?(jhammel)
Reporter | ||
Comment 70•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=30163533&tree=Mozilla-B2g26-v1.2
Andrew, this is probably the most frequent B2G emulator mochitest failure we hit and there hasn't been any action on it in months. Can we please get someone to look at this?
Flags: needinfo?(overholt)
Comment 72•11 years ago
|
||
It looks like mochitests may be randomly running out of memory. In each of the failing logs, I see lots of lines like:
log : OOM Message Logger Started
Comment 73•11 years ago
|
||
Bug 929005 might be related to this bug, but that is not clear.
Comment 74•11 years ago
|
||
Sotaro's the best person to investigate here. Let's hope the work in bug 929005 helps.
Flags: needinfo?(overholt)
Comment 81•11 years ago
|
||
The following log line just says that the b2g process died for some reason. It has basically no relationship to media.
> ServiceManager: service 'media.resource_manager' died
Comment 82•11 years ago
|
||
If there is no crash log and the b2g process was killed for some reason, a possible cause is that it was killed by the low memory killer or by the out-of-memory killer.
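One way to check for the two causes suggested above is to scan saved kernel log text (for example, the output of "adb shell dmesg") for kill messages. The following is a minimal sketch, not part of the test harness: the message fragments are assumptions about typical kernel wording, which varies by kernel version, and `find_kill_messages` is a hypothetical helper name.

```python
import re

# Assumed message fragments; the actual wording depends on the kernel version.
KILL_PATTERNS = [
    re.compile(r"out of memory", re.IGNORECASE),    # Linux OOM killer (assumed)
    re.compile(r"lowmemorykiller", re.IGNORECASE),  # Android low memory killer (assumed)
    re.compile(r"send sigkill", re.IGNORECASE),     # older low memory killer wording (assumed)
]


def find_kill_messages(dmesg_text):
    """Return the lines of dmesg_text that match any assumed kill pattern."""
    return [
        line
        for line in dmesg_text.splitlines()
        if any(pattern.search(line) for pattern in KILL_PATTERNS)
    ]
```

An empty result would not prove that no kill happened, since the kernel ring buffer may already have wrapped by the time it is dumped.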
Comment 84•11 years ago
|
||
Many of the logs being pasted here are no longer related to the issue in the summary.
I've filed bug 937684 to ensure that the "TEST-UNEXPECTED-FAIL | /foo/bar/baz.html | application timed out after 330.0 seconds with no output" lines don't get interleaved, so that we have more to star against and can hopefully avoid false positives here.
Reporter | ||
Comment 85•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=30464686&full=1&branch=b2g-inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=30473561&tree=B2g-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=30476516&tree=B2g-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=30469719&full=1&branch=mozilla-inbound
Reporter | ||
Comment 95•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=30843693&tree=Mozilla-Inbound&full=1
Sotaro, it seems unclear to me where we go next. Any ideas?
Flags: needinfo?(sotaro.ikeda.g)
Comment 101•11 years ago
|
||
(In reply to Ryan VanderMeulen [:RyanVM UTC-5] from comment #95)
> https://tbpl.mozilla.org/php/getParsedLog.php?id=30843693&tree=Mozilla-Inbound&full=1
>
> Sotaro, it seems unclear to me where we go next. Any ideas?
We need to determine whether the scenario described in comment 82 is what is happening.
Flags: needinfo?(sotaro.ikeda.g)
Comment 102•11 years ago
|
||
dhylands, is it possible to include the output of "adb shell dmesg" in the log?
Flags: needinfo?(dhylands)
Comment 103•11 years ago
|
||
We recently added oom-msg-logger for on-device use. It looks for OOM-related messages, which then get captured and placed into the logcat log.
Currently, this only works on phones where our init.b2g.rc gets included with the kernel and we flash the kernel.
It looks like the OOM Message Logger is, in fact, trying to run. I see these near the end of the log:
10:55:35 INFO - 11-08 18:54:24.896 1288 1288 I log : OOM Message Logger Started
We discovered that it really needs to run as root, and this was fixed just recently in Bug 939370.
Looking at the more recent logs (after Nov 15) I see that the OOM Message Logger is running, but I'm not seeing any output from it, so it would appear that the media server isn't dying due to the low memory OOM killer.
Looking at the logs:
02:06:10 INFO - 11-25 10:04:23.585 702 702 I GeckoDump: 18058 INFO TEST-START | Shutdown
02:06:10 INFO - 11-25 10:04:23.595 702 702 I GeckoDump: 18059 INFO Passed: 31704
02:06:10 INFO - 11-25 10:04:23.595 702 702 I GeckoDump: 18060 INFO Failed: 0
02:06:10 INFO - 11-25 10:04:23.610 702 702 I GeckoDump: 18061 INFO Todo: 578
02:06:10 INFO - 11-25 10:04:23.625 702 702 I GeckoDump: 18062 INFO Slowest: 164922ms - /tests/dom/tests/mochitest/ajax/jquery/test_jQuery.html
02:06:10 INFO - 11-25 10:04:23.655 702 702 I GeckoDump: 18063 INFO SimpleTest FINISHED
02:06:10 INFO - 11-25 10:04:23.675 702 702 I GeckoDump: 18064 INFO TEST-INFO | Ran 1 Loops
02:06:10 INFO - 11-25 10:04:23.675 702 702 I GeckoDump: 18065 INFO SimpleTest FINISHED
02:06:10 INFO - 11-25 10:04:26.505 662 662 I Gecko:ProcessPriorityManager: [Mochitest, child-id=1, pid=702] Scheduling reset timer to fire in 1000ms.
02:06:10 INFO - 11-25 10:04:26.535 662 683 E Profiler: BPUnw: [1 total] thread_unregister_for_profiling(me=0x22b10) (NOT REGISTERED)
02:06:10 INFO - 11-25 10:04:26.625 662 827 E Profiler: BPUnw: [1 total] thread_unregister_for_profiling(me=0x22fe0) (NOT REGISTERED)
02:06:10 INFO - 11-25 10:04:26.745 662 662 I Gecko:ProcessPriorityManager: [Mochitest, child-id=1, pid=702] ScheduleResetPriority bailing; the timer is already running.
02:06:10 INFO - 11-25 10:04:26.755 662 662 I Gecko:ProcessPriorityManager: Making background LRU pool with size(31)
02:06:10 INFO - 11-25 10:04:26.755 662 662 I Gecko:ProcessPriorityManager: [child-id=1, pid=-1] Destroying ParticularProcessPriorityManager.
02:06:10 INFO - 11-25 10:04:28.626 33 33 I ServiceManager: service 'media.resource_manager' died
It looks like the tests all completed successfully, and that mochitest was in the process of shutting down when the mediaserver died.
So maybe something was opened but not closed and that then caused the media server to die?
Flags: needinfo?(dhylands)
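The ordering check described in this comment (did the service death appear only after the test run had finished?) can be sketched as below. This is an illustrative helper, not part of the harness; `died_after_finish` is a hypothetical name, and the matching relies only on the two message strings quoted in the logs above.

```python
def died_after_finish(log_lines):
    """Report whether a service death message follows 'SimpleTest FINISHED'.

    Returns True if the first "ServiceManager: service ... died" line comes
    after a "SimpleTest FINISHED" line, False if it comes before one, and
    None if no death message is present at all.
    """
    finished = False
    for line in log_lines:
        if "SimpleTest FINISHED" in line:
            finished = True
        elif "ServiceManager: service" in line and line.rstrip().endswith("died"):
            return finished
    return None
```

A True result matches the pattern seen in the log above, where the run completed and the death message showed up during shutdown.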
Comment 104•11 years ago
|
||
Thanks for the comment! 'media.resource_manager' is not the media server. It is instantiated in the b2g process. Do you know whether the b2g process is being closed at this point? I noticed that shutdown is not implemented yet for these services. If the b2g process shuts down while the service manager is still active, the service manager detects that 'media.resource_manager' died.
Comment 105•11 years ago
|
||
Created Bug 943039.
Comment 117•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=32683057&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32683062&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32683632&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32672430&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32683225&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32684647&tree=Mozilla-Aurora
https://tbpl.mozilla.org/php/getParsedLog.php?id=32684722&tree=Mozilla-Aurora
Comment 121•11 years ago
|
||
In the most recent instances of all these failures, the tests are passing and completing successfully, but for some reason the mozharness script isn't reporting this correctly. This looks like a stdout buffering/threading problem, e.g.,
13:55:41 INFO - 39652 INFO TEST-START | Shutdown
13:55:41 INFO - 39653 INFO Passed: 37433
13:56:52 INFO - 39654 Mochitest INFO | runtestsb2g.py | Running tests: end.
13:57:53 INFO - INFO Failed: 0
13:57:53 INFO - 39655 INFO Todo: 157
13:57:53 INFO - 39656 INFO Slowest: 57602ms - /tests/content/html/content/test/forms/test_input_typing_sanitization.html
13:57:53 INFO - 39657 INFO SimpleTest FINISHED
13:57:53 INFO - 39658 INFO TEST-INFO | Ran 1 Loops
13:57:53 INFO - 39659 INFO SimpleTest FINISHED
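The interleaving in the excerpt above can be spotted mechanically: a well-formed harness line carries a counter followed by "INFO", while the damaged lines have a counter followed by other text, or an "INFO" message with no counter. The sketch below is a hypothetical helper that assumes the "HH:MM:SS INFO - " prefix seen in this excerpt; it is not part of mozharness.

```python
import re

# Timestamp and level prefix as seen in the excerpt above (an assumption
# about the general log format).
PREFIX = re.compile(r"^\d{2}:\d{2}:\d{2}\s+INFO\s+-\s+")
# A counter that is NOT followed by "INFO ", e.g. "39654 Mochitest INFO | ...".
ORPHAN_COUNTER = re.compile(r"^\d+ (?!INFO )")
# A message that lost its counter, e.g. "INFO Failed: 0".
ORPHAN_MESSAGE = re.compile(r"^INFO ")


def find_interleaved(lines):
    """Return the lines whose counter and message appear to have been split."""
    suspects = []
    for line in lines:
        payload = PREFIX.sub("", line.strip())
        if ORPHAN_COUNTER.match(payload) or ORPHAN_MESSAGE.match(payload):
            suspects.append(line)
    return suspects
```

Run against the excerpt above, this would flag the "Running tests: end." line and the "INFO Failed: 0" line, and leave the remaining lines alone.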
Comment 130•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=33787854&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=33788903&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=33789145&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=33788821&tree=B2g-Inbound
Comment 132•11 years ago
|
||
https://tbpl.mozilla.org/php/getParsedLog.php?id=33799235&tree=Fx-Team&full=1
https://tbpl.mozilla.org/php/getParsedLog.php?id=33795194&tree=B2g-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=33800224&tree=B2g-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=33798481&tree=B2g-Inbound
Updated•11 years ago
|
Assignee: sotaro.ikeda.g → nobody
Comment 143•10 years ago
|
||
We haven't seen this in a while. I think this message indicated a crash, which is now being reported correctly, and so we can close this bug.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → WORKSFORME