Closed Bug 241440 Opened 21 years ago Closed 20 years ago

memory overflow in UTF8ToNewUnicode

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla1.8beta1

People

(Reporter: wind.li, Assigned: caillon)

References

Details

(Keywords: fixed-aviary1.0.1, fixed1.4.5, fixed1.7.6, Whiteboard: [sg:fix])

Attachments

(3 files)

remove mLength = 0; 21 years ago wind li (deleted), patch	dbaron : review+ dveditz : superreview+ dveditz : approval-aviary1.0.1+ dveditz : approval1.7.6+ dveditz : approval1.8b+	Details \| Diff \| Splinter Review
Fix attempt 20 years ago Christopher Aillon (sabbatical, not receiving bugmail) (deleted), patch	dbaron : review-	Details \| Diff \| Splinter Review
Patch for 1.4 branch 20 years ago Leon Sha (deleted), patch	dveditz : superreview+	Details \| Diff \| Splinter Review

wind li

Reporter

Description

•

21 years ago

User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.6) Gecko/20040113 Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.6) Gecko/20040113 in UTF8ToNewUnicode >copy_string(aSource.BeginReading(start), aSource.EndReading(end), > calculator); if aSource include none-UTF8 chachacter such as 0xFC 0xDF calculator.Length() == 0 >PRUnichar *result = NS_STATIC_CAST(PRUnichar*, > nsMemory::Alloc(sizeof(PRUnichar) * (calculator.Length() + 1))); length of result = 2 > > ConvertUTF8toUTF16 converter(result); > copy_string(aSource.BeginReading(start), aSource.EndReading(end), converter).write_terminator(); if aSource is like "something(0xFC 0xDF)" result will be "something". Danger. A walk through is to remove the mLength = 0; in xpcom/string/public/nsUTF8Utils.h:256 Reproducible: Always Steps to Reproduce: 1. 2. 3.

timeless

Comment 1

•

21 years ago

why would non utf8 characters be in a class clearly labeled utf8?

Component: XPCOM → String

Simon Montagu :smontagu

Comment 2

•

21 years ago

(In reply to comment #1) > why would non utf8 characters be in a class clearly labeled utf8? It happens ;-) See bug 236941 for a recent example.

Christian :Biesinger (don't email me, ping me on IRC)

Comment 3

•

21 years ago

that sounds more like a bug in the caller

wind li

Reporter

Comment 4

•

21 years ago

Attached patch remove mLength = 0; (deleted) — Details — Splinter Review

wind li

Reporter

Updated

•

21 years ago

Attachment #147104 - Flags: review?(scc)

Daniel Veditz [:dveditz]

Comment 5

•

21 years ago

Confirming bug. Exploitability would depend on the ability of an attacker to get bad UTF8 into this section of code, but there are enough places where we parse UTF8 off the web that there's probably some easy ways. I'm not sure I like the proposed fix. Instead of returning a known error value (0) it would return a partial length. Then we have to hope ConvertUTF8toUTF16 fails no later than CalculateUTF8Length. It seems better for UTF8toNewUnicode to check for a zero length before allocating and deal with the error at that level.

Status: UNCONFIRMED → NEW

Ever confirmed: true

Whiteboard: [sg:fix] heap overrun

Christian :Biesinger (don't email me, ping me on IRC)

Comment 6

•

21 years ago

I'd expect UTF-8 off the web to go to a different place, namely intl's nsUTF8ToUnicode...

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Blocks: sg-ff101, sg-moz176

Flags: blocking1.8b?

Flags: blocking-aviary1.1?

David Baron :dbaron:

Updated

•

20 years ago

Assignee: dougt → string

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Blocks: sg-tb101

Flags: blocking1.7.6?

Asa Dotzler [:asa]

Updated

•

20 years ago

Flags: blocking1.8b?

Flags: blocking1.8b+

Flags: blocking-aviary1.1?

Flags: blocking-aviary1.1+

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Flags: blocking-aviary1.0.1?

chris hofmann

Comment 7

•

20 years ago

callion to try and work on this a bit. can use some help

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Flags: blocking1.7.6?

Flags: blocking1.7.6+

Flags: blocking-aviary1.0.1?

Flags: blocking-aviary1.0.1+

chris hofmann

Updated

•

20 years ago

Assignee: string → caillon

chris hofmann

Updated

•

20 years ago

Whiteboard: [sg:fix] heap overrun → [sg:fix] heap overrun - eta 2/14

Christopher Aillon (sabbatical, not receiving bugmail)

Assignee

Comment 8

•

20 years ago

Attached patch Fix attempt (deleted) — Details — Splinter Review

Another fix for the issue, handling the zero-length given by the calculator. The question to ask is do we protect against this when we clearly state that UTF8ToNewUnicode takes a UTF-8 string? Sure, callers should get fixed, but we probably shouldn't stomp memory if they hand us garbage... I'm also wondering if the previously submitted patch (removing the mLength = 0 in the calculatr) is correct, since the converter will in fact write up to the invalid character and hand back a string of that length. It makes sense to have the converter and calculator match, I think, with their output length. I also took a look at ToNewUTF8String which I thought might have had the same issue, though I don't believe it does after reading through its respective Convert and Calculate classes.

Attachment #147104 - Attachment is obsolete: true

Attachment #174183 - Flags: review?(dbaron)

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Whiteboard: [sg:fix] heap overrun - eta 2/14 → [sg:fix] heap overrun - eta 2/14 [need review dbaron]

David Baron :dbaron:

Comment 9

•

20 years ago

Comment on attachment 147104 [details] [diff] [review] remove mLength = 0; The converter and the calculator should definitely match. It looks to me like this makes them do so, so r=dbaron on this patch. It would probably be good to add comments to the header of all 4 classes (the two converters and the two calculators) about that (i.e., that ConvertUTF8toUTF16 and CalculateUTF8Length should match and that ConvertUTF16toUTF8 and CalculateUTF8Size should match).

Attachment #147104 - Attachment is obsolete: false

Attachment #147104 - Flags: superreview?(darin)

Attachment #147104 - Flags: review?(scc)

Attachment #147104 - Flags: review+

David Baron :dbaron:

Comment 10

•

20 years ago

Comment on attachment 174183 [details] [diff] [review] Fix attempt I prefer the other approach.

Attachment #174183 - Flags: review?(dbaron) → review-

Daniel Veditz [:dveditz]

Comment 11

•

20 years ago

Comment on attachment 147104 [details] [diff] [review] remove mLength = 0; sr=dveditz

Attachment #147104 - Flags: superreview?(darin) → superreview+

Daniel Veditz [:dveditz]

Comment 12

•

20 years ago

Comment on attachment 147104 [details] [diff] [review] remove mLength = 0; a=dveditz for landing everywhere

Attachment #147104 - Flags: approval1.8b+

Attachment #147104 - Flags: approval1.7.6+

Attachment #147104 - Flags: approval-aviary1.0.1+

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Whiteboard: [sg:fix] heap overrun - eta 2/14 [need review dbaron] → [sg:fix] need checkin

Simon Montagu :smontagu

Comment 13

•

20 years ago

If I'm not mistaken, the convertor and calculator will still not match in the case of an incomplete multi-byte sequence (e.g. 110xxxxx not followed by 10xxxxxx, or 1110xxxx not followed by two of 10xxxxxx, etc.). If by "match" you just mean the convertor should never write more than the length returned by the calculator, that's fine.

Simon Montagu :smontagu

Comment 14

•

20 years ago

(In reply to comment #9) > (From update of attachment 147104 [details] [diff] [review] [edit]) > The converter and the calculator should definitely match. It looks to me like > this makes them do so, so r=dbaron on this patch. It would probably be good to > add comments to the header of all 4 classes (the two converters and the two > calculators) about that (i.e., that ConvertUTF8toUTF16 and CalculateUTF8Length > should match and that ConvertUTF16toUTF8 and CalculateUTF8Size should match). It would also be good to have comments explaining which errors the converter considers fatal and which it considers recoverable by emitting a REPLACEMENT CHARACTER, and why. I assume that the distinction is between byte sequences that look like they are in another encoding (fatal) and byte sequences that look like UTF-8 with errors (recoverable), but that's only a guess.

David Baron :dbaron:

Comment 15

•

20 years ago

Actually, though, there's another case where the computed length could be less -- which is if we actually get a 5-byte or 6-byte UTF-8 sequence. It looks like we'll attempt to fit it into a surrogate pair even though it doesn't fit.

David Baron :dbaron:

Comment 16

•

20 years ago

Er, actually, never mind. We do check for that case.

David Baron :dbaron:

Comment 17

•

20 years ago

Fix checked in to trunk, 2005-02-17 12:17 -0800. Fix checked in to MOZILLA_1_7_BRANCH, 2005-02-17 12:24 -0800. Fix checked in to AVIARY_1_0_1_20050124_BRANCH, 2005-02-17 12:29 -0800. Thanks for the patch.

Status: NEW → RESOLVED

Closed: 20 years ago

Keywords: fixed-aviary1.0.1, fixed1.7.6

Resolution: --- → FIXED

Whiteboard: [sg:fix] need checkin → [sg:fix]

Target Milestone: --- → mozilla1.8beta1

Jay Patel [:jay]

Comment 18

•

20 years ago

Does anyone have a testcase we can use to verify this fix?

Daniel Veditz [:dveditz]

Updated

•

20 years ago

Group: security

Leon Sha

Comment 19

•

20 years ago

Attached patch Patch for 1.4 branch (deleted) — Details — Splinter Review

Attachment #187760 - Flags: superreview?(dveditz)

Daniel Veditz [:dveditz]

Comment 20

•

20 years ago

Comment on attachment 187760 [details] [diff] [review] Patch for 1.4 branch sr=dveditz

Attachment #187760 - Flags: superreview?(dveditz) → superreview+

Leon Sha

Comment 21

•

20 years ago

Checking in nsUTF8Utils.h; /cvsroot/mozilla/xpcom/string/public/nsUTF8Utils.h,v <-- nsUTF8Utils.h new revision: 1.3.2.1; previous revision: 1.3 done

Ginn Chen

Updated

•

20 years ago

Keywords: fixed1.4.5

Nobody; OK to take it and work on it

Updated

•

4 years ago

Component: String → XPCOM

You need to log in before you can comment on or make changes to this bug.