Closed Bug 1421963 Opened 7 years ago Closed 7 years ago

Intermittent GECKO(3202) | ==3255==ERROR: AddressSanitizer: heap-use-after-free on address 0x61d000a582b4 at pc 0x7f1472493c22 bp 0x7f14688f2d30 sp 0x7f14688f2d28

Categories

(Core :: WebRTC: Audio/Video, defect, P1)

defect

Tracking


RESOLVED FIXED
mozilla59
Tracking Status
firefox-esr52 - wontfix
firefox57 --- wontfix
firefox58 --- wontfix
firefox59 + fixed

People

(Reporter: intermittent-bug-filer, Assigned: jesup)

References

(Blocks 1 open bug)

Details

(Keywords: csectype-uaf, intermittent-failure, sec-high, Whiteboard: [post-critsmash-triage][adv-main59+])

Attachments

(1 file)

Nico, is this a dupe of the other one you're looking at?
Group: media-core-security
Component: Build Config → WebRTC: Audio/Video
Flags: needinfo?(na-g)
Ryan, this doesn't look related to me; this appears to be DataChannels.
Flags: needinfo?(na-g)
Should be fixed by backout of bug 1297418. Please reopen if you think otherwise.
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → DUPLICATE
Reopening to track UAF issues in the new library, which may also be relevant to the current impl and cause the existing security crashes.

(Copied from an email.) I *finally* reproed it with ASAN + rr, so I could look into why it happens. The memory was freed by setsockopt() called from SendOutgoingStreamReset() in DataChannels.cpp. It's freed on "MainThread" aka thread 0. The crash happens when the memory is accessed on thread 6 ("Socket Thread") in response to a packet input. We appear to be getting a stream reset while we're in the middle of closing the stream from our end. The ordering is apparently (per rr):

  t0: DataChannel::Close() {
        MutexAutoLock lock(mLock)
        ....
        setsockopt()
        ...
  t6: <packet input>
      usrsctp_conninput() {
        ... sctp_reset_out_streams() {
          ... sctp_ulp_notify(SCTP_NOTIFY_STR_RESET_SEND,...) {
            ... sctp_notify_stream_reset() {
              ... sctp_add_to_readq() {
                ... sctp_invoke_recv_callback() {
                  ... inp->recv_callback() {
                    ... mozilla::receive_cb() {
                      ... DataChannelConnection::ReceiveCallback() {
                        MutexAutoLock lock(mLock);  <wait on lock>
  t0: <finish freeing buffers>
      <release lock in Close()>
      ....
  t6: <wake up with lock>
      HandleStreamResetEvent()
      finish sctp_reset_out_streams();
      call sctp_reset_clear_pending() {
        crash

Now to figure out what's the right thing to do here, and whether this is in some way a hole in the library or in how Close() is implemented...

Note that the sctp library doesn't lock access while it's invoking the recv callback or running sctp_reset_out_streams(); setsockopt() does lock inside sctp. The lock in ReceiveCallback() just makes this more likely/repeatable; if you wrote code that called setsockopt() on one thread, and it interleaved with conninput() and you simply task-switched at the right points, you'd have the same problem.

Grabbing the global lock before calling conninput(), instead of in ReceiveCallback(), would be another option, though I worry about possible deadlocks. Another solution would be to force Close() to proxy the setsockopt() to the Socket thread, though that raises the question of what else needs to be proxied. Is it supposed to be allowed to call conninput() on a different thread than other calls like setsockopt()? (Or is it allowed with external locking?) In general I thought the API was supposed to be threadsafe.

Phew... that wasn't easy (and solutions aren't obvious either).

Further notes: it appears that moving the lock to SctpDtlsInput() (which calls usrsctp_conninput()) works (one minor mod: assert that the lock is held in the threshold code instead of grabbing it there). This might be worth doing in beta 58 if we believe that's safe deadlock-wise. It's tough to believe it wouldn't be safe in general; there are only a couple of callbacks into our code, and this acts as a giant lock around the library's input code.
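To make the window concrete, here is a minimal standalone C++ sketch of the interleaving described above. StreamState, gStream, the sleeps, and the gFreed check are invented for the demo; they stand in for the usrsctp stream bookkeeping and are not the real DataChannel or library code. The point is that taking the lock only inside the receive callback is too late to protect the library's surrounding reset processing.

#include <atomic>
#include <chrono>
#include <cstdio>
#include <mutex>
#include <thread>

// Stand-in for the SCTP stream bookkeeping freed by Close()/setsockopt().
struct StreamState { int pendingResets = 1; };

std::mutex gLock;                               // analogue of mLock
std::atomic<StreamState*> gStream{new StreamState()};
std::atomic<bool> gFreed{false};

// Analogue of ReceiveCallback(): the only place the old code took the lock,
// which is too late to protect the library's reset processing around it.
void ReceiveCallback() {
  std::lock_guard<std::mutex> guard(gLock);     // blocks until Close() is done
  // ...handle the stream-reset notification...
}

void MainThreadClose() {                        // t0: DataChannel::Close()
  std::lock_guard<std::mutex> guard(gLock);
  std::this_thread::sleep_for(std::chrono::milliseconds(50));  // widen window
  delete gStream.load();                        // "setsockopt() frees state"
  gStream = nullptr;
  gFreed = true;
}

void SocketThreadInput() {                      // t6: usrsctp_conninput()
  StreamState* state = gStream.load();          // library keeps its own pointer
  ReceiveCallback();                            // waits on the lock...
  // ...then finishes the reset (sctp_reset_clear_pending() analogue):
  if (gFreed.load()) {
    std::puts("use-after-free would happen here");
  } else {
    state->pendingResets = 0;
  }
}

int main() {
  std::thread t0(MainThreadClose);
  std::this_thread::sleep_for(std::chrono::milliseconds(10));
  std::thread t6(SocketThreadInput);
  t0.join();
  t6.join();
  return 0;
}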
Status: RESOLVED → REOPENED
Flags: needinfo?(tuexen)
Flags: needinfo?(lennart.grahl)
Resolution: DUPLICATE → ---
sec-high since this may affect trunk
Assignee: nobody → rjesup
Rank: 7
Priority: P5 → P1
Lock around the entire input processing of the sctp library, since there appears to be incomplete locking within the library. Assert we own the lock on the callbacks from the library instead of taking the lock. Ran 1000 iterations of the failing test with no problems; before it would fail in 5-50 iterations. This may also solve bug 1400563 and related issues if applied on the old library, since that seems related to resets also.
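A rough sketch of that locking pattern, assuming the input path looks roughly like SctpDtlsInput() -> usrsctp_conninput() -> receive callback; FakeConnection and FakeConnInput are invented stand-ins for illustration, not the real DataChannel or usrsctp API.

#include <cassert>
#include <mutex>

// Invented stand-in for DataChannelConnection; only the locking pattern is
// meant to mirror the patch description, not the real implementation.
struct FakeConnection {
  std::mutex mLock;
  bool mLockHeld = false;  // illustrative; real code would assert thread
                           // ownership with a debug-only check

  // With the patch, the callback asserts the lock is already held instead of
  // taking it, since the caller now locks around all of the input processing.
  void ReceiveCallback() {
    assert(mLockHeld && "caller must hold mLock around library input");
    // ...consume the message / stream-reset notification...
  }
};

// Stand-in for usrsctp_conninput(): may synchronously call back into
// ReceiveCallback() while processing the packet.
static void FakeConnInput(FakeConnection* conn, const unsigned char*, size_t) {
  conn->ReceiveCallback();
}

// Analogue of SctpDtlsInput(): hold the lock around the *entire* library
// input call, so Close()/setsockopt() on another thread cannot interleave
// with the library's reset processing.
static void SctpDtlsInput(FakeConnection* conn, const unsigned char* pkt,
                          size_t len) {
  std::lock_guard<std::mutex> guard(conn->mLock);
  conn->mLockHeld = true;
  FakeConnInput(conn, pkt, len);
  conn->mLockHeld = false;
}

int main() {
  FakeConnection conn;
  unsigned char pkt[1] = {0};
  SctpDtlsInput(&conn, pkt, sizeof(pkt));
  return 0;
}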
Attachment #8934387 - Flags: review?(tuexen)
Attachment #8934387 - Flags: review?(drno)
Comment on attachment 8934387 [details] [diff] [review]
lock around SCTP input processing, not just the receive callback

Review of attachment 8934387 [details] [diff] [review]:
-----------------------------------------------------------------

In general the patch looks good to me. I am worried, though, that our mochitests for data channels don't cover much of what (else) could go wrong. Dispatching from MainThread to STS seems to be the safer option to me, but I understand it's probably harder to get right. Michael should have the final verdict here, I think.
Attachment #8934387 - Flags: review?(drno) → review+
Flags: needinfo?(lennart.grahl)
Blocks: 1408485
Blocks: 1400563
Status: REOPENED → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla59
Where does this need uplift to?
Flags: needinfo?(rjesup)
Beta, and ESR52 - though I'm tempted to wait for confirmation in beta before uplifting to ESR. The signal in beta is strong enough to know.
Flags: needinfo?(rjesup)
Comment on attachment 8934387 [details] [diff] [review]
lock around SCTP input processing, not just the receive callback

Approval Request Comment
[Feature/Bug causing the regression]: N/A
[User impact if declined]: Continued UAFs
[Is this code covered by automated tests?]: Yes, but they don't hit it; it requires simultaneous closes from both ends, plus luck in timing.
[Has the fix been verified in Nightly?]: No; will watch for crashes after the merge date. Volume is low but not 0 in Nightly. Easily verified in beta via crash-stats.
[Needs manual test from QE? If yes, steps to reproduce]: No
[List of other uplifts needed for the feature/fix]: None
[Is the change risky?]: Not very
[Why is the change risky/not risky?]: The only risk is of deadlock. Analysis indicates it shouldn't deadlock, and in tests it hasn't.
[String changes made/needed]: None
Attachment #8934387 - Flags: review?(tuexen) → approval-mozilla-beta?
This still needs sec-approval, right?
Flags: needinfo?(rjesup)
Tracking this and bug 1400563; it sounds like this might fix both bugs.
(In reply to Julien Cristau [:jcristau] from comment #12)
> This still needs sec-approval, right?

This report was for a new checkin that was then backed out; we landed this fix with the new code. We're supposing that it will also fix a series of existing UAF crashes. For completeness, even though this code is already in Nightly as part of the major SCTP update, I'll ask for sec-approval.
Flags: needinfo?(rjesup)
Comment on attachment 8934387 [details] [diff] [review]
lock around SCTP input processing, not just the receive callback

[Security approval request comment]
How easily could an exploit be constructed based on the patch?
Very hard.

Do comments in the patch, the check-in comment, or tests included in the patch paint a bulls-eye on the security problem?
No. Clearly we move where the lock is grabbed, which gives a very indirect idea that there's a locking issue somewhere, but not what it might be or cause.

Which older supported branches are affected by this flaw?
All; this appears to be a bug in the imported library.

If not all supported branches, which bug introduced the flaw?

Do you have backports for the affected branches? If not, how different, hard to create, and risky will they be?
Trivial; the patch should apply cleanly.

How likely is this patch to cause regressions; how much testing does it need?
Very unlikely to cause regressions; none reported yet (this is on Nightly as part of a bigger SCTP update). The only likely regression would be deadlocks.
Attachment #8934387 - Flags: sec-approval?
Asking for sec-approval and beta, but please don't land on beta until I do more checking of crash-stats to see if we see an improvement. The crashes in the field may be related to this, but not the same bug. Crashes in the field are (mostly) due to timeouts, while this bug was directly related to crossing an internal close with an external close packet. This may point to the problem when a timeout occurs, however. We may need a different patch to solve the crash-stats failures, or this patch plus another.
Comment on attachment 8934387 [details] [diff] [review]
lock around SCTP input processing, not just the receive callback

Sec-approval+ and beta approval.
Attachment #8934387 - Flags: sec-approval?
Attachment #8934387 - Flags: sec-approval+
Attachment #8934387 - Flags: approval-mozilla-beta?
Attachment #8934387 - Flags: approval-mozilla-beta+
Thanks - a reminder that we don't want to actually uplift this until I do more checking of whether this problem exists in the older SCTP library.
Group: media-core-security → core-security-release
Any news on this?
Flags: needinfo?(rjesup)
No noted regressions in 59. It doesn't fix the timeout in beta, but it should block the same issue if it exists in 58 (58 doesn't have the updated SCTP library, and sctp_add_to_readq() has been significantly rewritten). It *appears* that the same sort of issue could occur, but it's hard to be 100% sure. Given the lack of in-the-field crashes, I don't think we should risk it with the older SCTP library. The other bug is worth trying to fix in 58 if we can.
Flags: needinfo?(rjesup)
I assume that means ESR52 is also wontfix.
Comment on attachment 8934387 [details] [diff] [review]
lock around SCTP input processing, not just the receive callback

This wasn't uplifted to 58 after all; clearing the flag.
Attachment #8934387 - Flags: approval-mozilla-beta+
Flags: needinfo?(tuexen)
Flags: qe-verify-
Whiteboard: [post-critsmash-triage]
Whiteboard: [post-critsmash-triage] → [post-critsmash-triage][adv-main59+]
Group: core-security-release