Closed Bug 106803 Opened 23 years ago Closed 19 years ago

problem in rendering 0x80~0x9f in win1252 encoding

Categories

(Core :: Internationalization, defect)

x86
Windows NT
defect
Not set
normal

Tracking

()

RESOLVED DUPLICATE of bug 134733

People

(Reporter: shanjian, Assigned: jshin1987)

Details

(Keywords: intl)

This problem was spinned from 97343. It is a different problem from 0xb7 and it is more complicated to fix. Related comment below: >>shanjian With my test case, I tried all raster font on winNT and win98, here is a summary of my findings: . Winnt: "MS Sans Serif" and "MS Serif" has problem with 0x82 to 0x8c, 0x93 to 0x9c "Modern", "Courier", "Small Fonts", "Roman" and all truetype font are OK. . On Win98 , non-A-version "MS Sans Serif" and "MS Serif" has problem with 0x82 to 0x8c, 0x93 to 0x9c, 0xb7 "Modern", "Courier", "Small Fonts", "Roman" and all truetype font are OK. . On Win98, A-version All looks fine. Since 0x80 to 0x9f are left for C1 in 8859-x, the problem is not serious. 0xb7 is something we have to deal with. In Erik's existing code, charmap for certain charset is generated using system's conversion API. Only a character in certain charset can be converted to wchar, do we declare the support for such char in charmap. >> ftang >You might also have noticed that 0x82 to 0x8c, 0x93 to 0x9c are untouched. >IMO that is not worth fixing. Why ? these seems important characters for some language too. Probably we need a different fix for it; I think we should not generate the multibyte by for loop from 0 to 255 (0xff). The origional definitation of ANSI charcter set from my Win3.1 and Win95 programming book show it does not contains the 0x80-0x9F part. see the following Page 130 (Figure 3-8. The ANSI character set arranged by character code.) of Programming Windows 3.1 3rd Edition, Charles Petzold, Microsoft Press, ISBN 1-55615-395-3 also see Page 281 (Figure 5-8. The ANSI character set arranged by character code.) of Programming Windows 95, Charles Petzold, Microsoft Press, ISBN 1-55615-676-6 Page 162 (Figure 3-8. The ANSI character set) of Programming Windows 95 with MFC, Jeff Prosise, Microsoft Press, ISBN 1-55615-902-1 I think your approach for B7 is needed. But I think we should also fix other characters. They might be used somewhere else. The unicode character of these char are: 0x80 0x20AC #EURO SIGN 0x82 0x201A #SINGLE LOW-9 QUOTATION MARK 0x83 0x0192 #LATIN SMALL LETTER F WITH HOOK 0x84 0x201E #DOUBLE LOW-9 QUOTATION MARK 0x85 0x2026 #HORIZONTAL ELLIPSIS 0x86 0x2020 #DAGGER 0x87 0x2021 #DOUBLE DAGGER 0x88 0x02C6 #MODIFIER LETTER CIRCUMFLEX ACCENT 0x89 0x2030 #PER MILLE SIGN 0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON 0x8B 0x2039 #SINGLE LEFT-POINTING ANGLE QUOTATION MARK 0x8C 0x0152 #LATIN CAPITAL LIGATURE OE 0x8E 0x017D #LATIN CAPITAL LETTER Z WITH CARON 0x91 0x2018 #LEFT SINGLE QUOTATION MARK 0x92 0x2019 #RIGHT SINGLE QUOTATION MARK 0x93 0x201C #LEFT DOUBLE QUOTATION MARK 0x94 0x201D #RIGHT DOUBLE QUOTATION MARK 0x95 0x2022 #BULLET 0x96 0x2013 #EN DASH 0x97 0x2014 #EM DASH 0x98 0x02DC #SMALL TILDE 0x99 0x2122 #TRADE MARK SIGN 0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON 0x9B 0x203A #SINGLE RIGHT-POINTING ANGLE QUOTATION MARK 0x9C 0x0153 #LATIN SMALL LIGATURE OE 0x9E 0x017E #LATIN SMALL LETTER Z WITH CARON 0x9F 0x0178 #LATIN CAPITAL LETTER Y WITH DIAERESIS you can see the following characters are wrong on my NT4J 0x82 0x201A #SINGLE LOW-9 QUOTATION MARK 0x83 0x0192 #LATIN SMALL LETTER F WITH HOOK 0x84 0x201E #DOUBLE LOW-9 QUOTATION MARK 0x85 0x2026 #HORIZONTAL ELLIPSIS 0x86 0x2020 #DAGGER 0x87 0x2021 #DOUBLE DAGGER 0x88 0x02C6 #MODIFIER LETTER CIRCUMFLEX ACCENT 0x89 0x2030 #PER MILLE SIGN 0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON 0x8B 0x2039 #SINGLE LEFT-POINTING ANGLE QUOTATION MARK 0x8C 0x0152 #LATIN CAPITAL LIGATURE OE 0x8E 0x017D #LATIN CAPITAL LETTER Z WITH CARON 0x93 0x201C #LEFT DOUBLE QUOTATION MARK 0x94 0x201D #RIGHT DOUBLE QUOTATION MARK 0x95 0x2022 #BULLET 0x96 0x2013 #EN DASH 0x97 0x2014 #EM DASH 0x98 0x02DC #SMALL TILDE 0x99 0x2122 #TRADE MARK SIGN 0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON 0x9B 0x203A #SINGLE RIGHT-POINTING ANGLE QUOTATION MARK 0x9C 0x0153 #LATIN SMALL LIGATURE OE 0x9E 0x017E #LATIN SMALL LETTER Z WITH CARON 0x9F 0x0178 #LATIN CAPITAL LETTER Y WITH DIAERESIS MS Sans Serif can display 0x80 0x20AC #EURO SIGN but not "MS Serif" B7, 2018 and 2019 have no problem on both "MS Sans Serif" and "MS Serif"
Keywords: intl
I need to testing this in various windows versions and localization builds. We might need to escalated this bug if user begin to complain. Set it to 1.01 now.
Status: NEW → ASSIGNED
Target Milestone: --- → mozilla1.0.1
shanjian is no longer working on mozilla for 2 years and these bugs are still here. Mark them won't fix. If you want to reopen it, find a good owner first.
Status: ASSIGNED → RESOLVED
Closed: 20 years ago
Resolution: --- → WONTFIX
Mass Reassign Please excuse the spam
Assignee: shanjian → nobody
Mass Re-opening Bugs Frank Tang Closed on Wensday March 02 for no reason, all the spam is his fault feel free to tar and feather him
Status: RESOLVED → REOPENED
Resolution: WONTFIX → ---
Reassigning Franks old bugs to Jungshik Shin for triage - Sorry for spam
Assignee: nobody → jshin1987
Status: REOPENED → NEW
Can someone reproduce this bug on current Trunk? Mr. Kimura said, this bug may be dup of bug 130617.
Target Milestone: mozilla1.0.1 → ---
Very sorry. I told a wrong bug number. This bug may be dup of bug 134733.
*** This bug has been marked as a duplicate of 134733 ***
Status: NEW → RESOLVED
Closed: 20 years ago19 years ago
Resolution: --- → DUPLICATE
You need to log in before you can comment on or make changes to this bug.