<a class="header-button" href="https://bugzilla-dev.allizom.org/home" title="Go to home page"> Bugzilla

Reporter

Comment 2

•

11 years ago

There are recent crashes.

Status: RESOLVED → REOPENED

Resolution: DUPLICATE → ---

Ben Turner (not reading bugmail, use the needinfo flag!)

Reporter

Comment 3

•

11 years ago

It's #7 crasher in B2G 18.0.

Comment 4

•

11 years ago

These crashes indicate memory corruption. We'll need some STR and valgrind probably :(

Ben Turner (not reading bugmail, use the needinfo flag!)

Comment 5

•

11 years ago

Basically we're crashing on a null deref of 'actor' here: PIndexedDBRequestChild::OnMessageReceived(const Message& __msg) { ... PIndexedDBRequestChild* actor; if ((!(Read((&(actor)), (&(__msg)), (&(__iter)), false)))) { FatalError("Error deserializing 'PIndexedDBRequestChild'"); return MsgValueError; } ... (actor)->DestroySubtree(Deletion); ... } But: PIndexedDBRequestChild::Read( PIndexedDBRequestChild** __v, const Message* __msg, void** __iter, bool __nullable) { int32_t id; if ((!(Read((&(id)), __msg, __iter)))) { FatalError("Error deserializing 'id' for 'PIndexedDBRequestChild'"); return false; } if (((1) == (id)) || (((0) == (id)) && ((!(__nullable))))) { mozilla::ipc::ProtocolErrorBreakpoint("bad ID for PIndexedDBRequest"); return false; } if ((0) == (id)) { (*(__v)) = 0; return true; } ... } That can only return a null actor if '__nullable' is true, which it can't be in this case. So somewhere between 'Read(&actor)' and 'actor->DestroySubtree()' our actor pointer is being overwritten.

Reporter

Comment 6

•

11 years ago

It's now #4 top crasher in B2G 18.0.

blocking-b2g: --- → leo?

Keywords: topcrash

Wayne Chang [:wchang]

Comment 7

•

11 years ago

Reporter: Can you describe the user impact when this crash occurs? ahuang: Can you analyze what we have already first and provide some insights so we can see the severity?

Flags: needinfo?(ahuang)

Naoki Hirata :nhirata (please use needinfo instead of cc)

Comment 8

•

11 years ago

(In reply to Wayne Chang [:wchang] from comment #7) > Reporter: Can you describe the user impact when this crash occurs? > > ahuang: Can you analyze what we have already first and provide some insights > so we can see the severity? We don't see this at least after 5/15 build, right? I believe the severity is low. According to Ben in comment 4 and comment 5, I think coredump may provide us little help here. Minidump from partner is not enough to solve this bug for sure, but I think it's barely possible to let partners run Valgrind in stress tests as well. Maybe bug 847268, enabling coredump is much more reasonable for partners and us to dig into this bug.

Flags: needinfo?(ahuang)

Alex Keybl [:akeybl]

Comment 9

•

11 years ago

leo+ (at least temporarily) given comment 6, but comment 8 may lead to a resolved/worksforme.

blocking-b2g: leo? → leo+

Comment 10

•

11 years ago

https://crash-stats.mozilla.com/report/index/b6f3c329-07fc-4c0a-8f94-39ef52130618 6/13 build.

Naoki Hirata :nhirata (please use needinfo instead of cc)

Comment 11

•

11 years ago

To note : these are all keon or peak crashes.

Comment 12

•

11 years ago

(In reply to ben turner [:bent] from comment #5) > That can only return a null actor if '__nullable' is true, which it can't be > in this case. So somewhere between 'Read(&actor)' and > 'actor->DestroySubtree()' our actor pointer is being overwritten. Let's try wether we can reproduce this on emulator-x86 or not. We can enable hardware watchpoint with gdb 7.4 (or later) (bug 865582) on emulator-x86. Valgrind seems to be a good choice, too.

Comment 13

•

11 years ago

(In reply to Scoobidiver from comment #3) > It's #7 crasher in B2G 18.0. Hi, I want to check this bug, using HW watchpoint on emulator-x86. Can you provide 100% reproduciable steps? Thanks.

Reporter

Comment 14

•

11 years ago

(In reply to Wayne Chang [:wchang] from comment #7) > Reporter: Can you describe the user impact when this crash occurs? (In reply to Alan Huang [:ahuang] from comment #13) > Can you provide 100% reproduciable steps? I don't have. This bug was filed against crash stats. In addition, users can't add a comment when crashing so no clue except maybe from URLs if available.

Updated

•

11 years ago

Assignee: nobody → ahuang

Wayne Chang [:wchang]

Comment 15

•

11 years ago

Are we still seeing this on more recent builds?

Flags: needinfo?(scoobidiver)

Reporter

Comment 16

•

11 years ago

(In reply to Wayne Chang [:wchang] from comment #15) > Are we still seeing this on more recent builds? It happens on Peak and Keon up to B2G 18.0/20130613 which seems to be the latest FxOS-1.0.1 build.

Flags: needinfo?(scoobidiver)

Robert Kaiser

Comment 17

•

11 years ago

(In reply to Scoobidiver from comment #16) > (In reply to Wayne Chang [:wchang] from comment #15) > > Are we still seeing this on more recent builds? > It happens on Peak and Keon up to B2G 18.0/20130613 which seems to be the > latest FxOS-1.0.1 build. Have we seen it on 1.1 or trunk/1.2 builds recently as well?

Reporter

Comment 18

•

11 years ago

(In reply to Robert Kaiser (:kairo@mozilla.com) from comment #17) > Have we seen it on 1.1 or trunk/1.2 builds recently as well? ZTE phones don't have symbols so I can't say for 1.1. In trunk/1.2, there are only 3 crashes over the last week, none for this bug, so it's not statistically representative.

Robert Kaiser

Comment 19

•

11 years ago

(In reply to Scoobidiver from comment #18) > ZTE phones don't have symbols so I can't say for 1.1. The shipped ZTE phones are running 1.0.1 - both 1.1 and 1.2 are only in use in internal testing builds/devices (unagi etc.), or for 1.2, on Geeksphones devices with very daring users.

Reporter

Comment 20

•

11 years ago

It's #21 crasher in B2G for all versions.

blocking-b2g: leo+ → leo?

Keywords: topcrash

Comment 21

•

11 years ago

Hello Al, As we talked before, we may need QA help us to find STR for this. Can Taiwan QA provide some help here? Thanks!

Keywords: qawanted

QA Contact: atsai

Wayne Chang [:wchang]

Comment 22

•

11 years ago

Triage- Leo-ing until we can find an STR or the occurrence rate rises.

blocking-b2g: leo? → ---

William Hsu [:whsu]

Comment 23

•

11 years ago

Hi, Alan, Sorry to jump in. I have no idea regarding provided logs. All that we can do is run the scenarios that Bug 863500 comment 24 mentioned. Do you think this makes sense? If you know that there have any specific methods to trigger this crash, please feel free to contact us. I will also go to your cubicle to discuss this problem with you after I did the test. Thanks!

William Hsu [:whsu]

Comment 24

•

11 years ago

Hi, Alan and all, I automated the test steps that Bug 863500 comment 24 mentioned recently and run it on the following V1-TRAIN build with unagi device. * 2013-07-03-07-02-10 * 2013-07-18-23-02-25 I still cannot reproduce it. This bug was reported 2 months ago. I cannot sure if we had any patch impact the bug and became a potential issue. By the way, I also doubt that if the crash reports were caused by QA since we ran the Leo test during the period. But I don't have any finding. I will continue to monitor this issue form automation server but not spend too much time. If you have further suggestions, comments, or findings, please feel free to contact. Thanks!

Ben Turner (not reading bugmail, use the needinfo flag!)

Reporter

Updated

•

11 years ago

Crash Signature: [@ @0x0 | mozilla::dom::indexedDB::PIndexedDBRequestChild::OnMessageReceived] → [@ @0x0 | mozilla::dom::indexedDB::PIndexedDBRequestChild::OnMessageReceived] [@ @0x0 | mozilla::dom::indexedDB::PIndexedDBRequestChild::OnMessageReceived(IPC::Message const&)]

Jason Smith [:jsmith]

Comment 25

•

11 years ago

Based on the comment above I think QA has done what we can to reproduce this. If we get more information later during daily testing, we'll try to action it from there. For now, there's not much we can do here.

Keywords: qawanted

Comment 26

•

11 years ago

(In reply to William Hsu [:whsu] from comment #24) > I automated the test steps that Bug 863500 comment 24 mentioned recently and > run it on the following V1-TRAIN build with unagi device. It might help to run this series of steps under valgrind and see if it reports anything unusual. Please ping qDot for help on setting it up.

Kyle Machulis [:qdot] [:kmachulis] (INACTIVE)

Comment 27

•

11 years ago

Valgrind unfortunately only runs on >= v1.2 on the nexus 4.

Ben Turner (not reading bugmail, use the needinfo flag!)

Comment 28

•

11 years ago

(In reply to Kyle Machulis [:kmachulis] [:qdot] from comment #27) > Valgrind unfortunately only runs on >= v1.2 on the nexus 4. Eh? I was able to run it on v1.0.1 unagi before.

Kyle Machulis [:qdot] [:kmachulis] (INACTIVE)

Comment 29

•

11 years ago

bent's original instructions for getting valgrind up and running on v1.0/1.1 are at https://bug854517.bugzilla.mozilla.org/attachment.cgi?id=729283 See if you can work through these. I'm hoping my valgrind patches for v1.2 will land soon, and will try to backport them to 1.0/1.1 when that happens.

Naoki Hirata :nhirata (please use needinfo instead of cc)

Updated

•

11 years ago

Keywords: topcrash-b2g

Jason Smith [:jsmith]

Updated

•

11 years ago

Component: General → DOM: IndexedDB

Product: Firefox OS → Core

Comment 30

•

11 years ago

Alan, this has been a top-crasher for a while with no action on it, can you please help here ?

Flags: needinfo?(ahuang)

Naoki Hirata :nhirata (please use needinfo instead of cc)

Comment 31

•

11 years ago

I have no idea of this for a while, and I am currently occupied by tarako. Un-take this first.

Assignee: ahuang → nobody

Flags: needinfo?(ahuang)

Comment 32

•

11 years ago

I don't see any recent crash in anything higher than 18. I am not sure if this bug will appear in new Gecko levels. Should we keep this open?

Flags: needinfo?(bbajaj)

Comment 33

•

11 years ago

(In reply to Naoki Hirata :nhirata (please use needinfo instead of cc) from comment #32) > I don't see any recent crash in anything higher than 18. I am not sure if > this bug will appear in new Gecko levels. Should we keep this open? lets close it for now and we can reopen if need be.

Flags: needinfo?(bbajaj)