Closed Bug 5723 Opened 26 years ago Closed 26 years ago

parser strip out Unicode U+xx00 from html attribute

Tracking

()

Status:

VERIFIED FIXED

Milestone:

People

(Reporter: ftang, Assigned: rickg)

References

(
URL
)

Details

(Whiteboard: DEPEND - Intl)

Frank Tang

Reporter

Description

•

26 years ago

I find this problem when I try to fix the form submission for non ISO-8859-1 character set. 1. Select "View:Default Character Set" to "Shift_JIS" 2. Set a break point in SetAttribute function (static) in nsHTMLContentSink.cpp 3. Go to the above url. 4. You will find out all those ALTTEXT which should have 4 characters only have two characters. All the characters in U+xx00 (for example U+6700 ) are strip off by Tokenizer. Note: you don't need to use Japanese system or even install Japanese font to debug this. Just look at your debugger I have one time trace back to parser code, and I am sure the problem is in the parser. Maybe tokenizer. all the U+xx00 characters have problem. Not sure about other characters.

Frank Tang

Reporter

Updated

•

26 years ago

Priority: P3 → P2

Frank Tang

Reporter

Comment 1

•

26 years ago

change priority to p2.

rickg

Assignee

Updated

•

26 years ago

Status: NEW → ASSIGNED

Priority: P2 → P3

rickg

Assignee

Comment 2

•

26 years ago

This is a legitimate bug, and is fixed with nsString2. As soon as that becomes the defacto string, this will go away.

Jan Carpenter

Updated

•

26 years ago

QA Contact: 3847 → 4141

rickg

Assignee

Updated

•

26 years ago

Target Milestone: M6

rickg

Assignee

Updated

•

26 years ago

Assignee: rickg → ftang

Status: ASSIGNED → NEW

rickg

Assignee

Comment 3

•

26 years ago

Handing this back to you to keep track of. See my earlier comments.

bobj

Updated

•

26 years ago

Target Milestone: M6 → M7

bobj

Comment 4

•

26 years ago

Rick said he will land nsString2 shortly after M6, so moving this to M7. Rick, if you provide QA with an nsString2 enabled binary, maybe they can see if this really fixes the problem.

Frank Tang

Reporter

Updated

•

26 years ago

Status: NEW → ASSIGNED

chris hofmann

Updated

•

26 years ago

Whiteboard: DEPEND - Intl

chris hofmann

Updated

•

26 years ago

Blocks: 7228

Frank Tang

Reporter

Updated

•

26 years ago

Assignee: ftang → rickg

Status: ASSIGNED → NEW

Frank Tang

Reporter

Comment 5

•

26 years ago

It is fixed in this case. Reassign it back to rickg but mark it fix

Frank Tang

Reporter

Updated

•

26 years ago

Status: NEW → RESOLVED

Closed: 26 years ago

Resolution: --- → FIXED

gem

Updated

•

25 years ago

Status: RESOLVED → VERIFIED

gem

Comment 6

•

25 years ago

verified

You need to log in before you can comment on or make changes to this bug.

Bugzilla

parser strip out Unicode U+xx00 from html attribute

Categories

(Core :: DOM: HTML Parser, defect, P3)

Tracking

()

People

(Reporter: ftang, Assigned: rickg)

References

(
URL
)

Details

(Whiteboard: DEPEND - Intl)

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Updated

Comment 2

Updated

Updated

Updated

Comment 3

Updated

Comment 4

Updated

Updated

Updated

Updated

Comment 5

Updated

Updated

Comment 6