Closed Bug 72525 Opened 24 years ago Closed 23 years ago

RFE: GB18030 support for P.R. China market

Categories

(Core :: Internationalization, defect, P1)

Sun
Solaris
defect

Tracking

()

VERIFIED FIXED
mozilla0.9.2

People

(Reporter: byuan, Assigned: ftang)

References

Details

(Keywords: intl, meta)

Attachments

(3 files)

GB18030 is a mandatory Standard in China, it has obsoleted the previouse GBK1.0.
Changing QA Contact to ylong@netscape.com.
QA Contact: andreasb → ylong
Status: UNCONFIRMED → NEW
Ever confirmed: true
Reassign to ftang, cc to bstell.
Assignee: nhotta → ftang
Please provide more information about GB18030 here. How Solaris is going to support GB18030 ? Which fonts will be used? in what encoding? How MacOS and Window are going to support GB18030 ? How to convert GB18030 from/to Unicode ?
Target Milestone: --- → Future
Where can I find an official copy of GB18030 ?
Attached file GB18030 to Unicode mapping table (deleted) —
There is no online copy of the standard, but we do have an e-mail from the Standard Committee about the mapping table I attached to this bug. In Solaris next, we are going to enhance the existing GBK locale (zh_CN.GBK/zh.GBK) to support GB18030, also we may provide symbolic locales zh_CN.GB18030/zh.GB18030 based on the requirement from the Chinese Government. Again, we'd like to suggest NSCP to use the following fonts fallback for all of the CJK characters in Simplified Chinese locales: Use GB2312 (gb2312.1980-0) as the default for performance purpose ---> If it's not enough, try GBK (gbk-0) fonts first ---> for all other CJK characters, use GB18030 (gb18030-0) fonts always. BTW, MSFT, IBM, ORCALE, ADOBE and HP, etc are also working on this standard. Thanks. Brian@Sun
P1 bug for Sun. Setting Priority = P1; adding intl, nsbeta1 keywords
Keywords: intl, nsbeta1
Priority: -- → P1
Blocks: 60916
remove Future from target milestone. Get more information from Sun.
Status: NEW → ASSIGNED
Target Milestone: Future → ---
Some information: Mar 2001 09:38:07 -0800 (PST) From: Brian Yuan <Brian.Yuan@Sun.COM> As for the PostScript printing, please refer to the following documents from Adobe: http://partners.adobe.com/asn/developer/pdfs/tn/5079.Adobe-GB1-4.pdf ftp://ftp.ora.com/published/oreilly/nutshell/cjkv/adobe/ag14.tar.Z Date: Mon, 2 Apr 2001 10:53:00 -0700 (PDT) From: Brian Yuan <Brian.Yuan@Sun.COM> Frank, We are implementing the zh_CN.GB18030 locale support in Solaris, the following information are for your information so that you can start the GB18030 support in Mozilla/NSCP: 1. We will use the existing Roman fonts used in Simplified Chinese locales for the 128 1-byte characters: -sun-song-*-r-normal--*-*-*-*-c-*-gb1988.1989-0 or -sun-song-*-r-normal--*-*-*-*-c-*-iso8859-1 2. To keep consistent with the existing GBK fonts, we will use gb18030.2000-0 (not -1) for the two-byte characters, there are 23940 codepoints (21003 CJK + 884 Symbols + 1894 User Define Area + 159 Blanks) 3. We will use gb18030.2000-1 as the font name for the current four-byte characters, there are 39420 codepoints but only 6530 (6582 - 52) Unicode 3.0 "Extension A" CJK characters. Date: Mon, 2 Apr 2001 12:24:11 -0700 (PDT) From: Brian Yuan <Brian.Yuan@sun.com> >For your GB180303 supports, you are mapping characters from 32 bits >space to a 16 bits font encoding space (which is limited by X windows >protocol). We need to know the details how you map it. Since there is no simple mechanism to map the four-byte characters into two-byte using turning of the MSB or ignoring some of the bytes, we want to use the very basic algorithm to convert the 4-byte to two-byte: The 0x81308130 will be mapped to 0x0000: GB18030 Code Index Unicode 0x81308130 0x0000 0x0080 0x81308131 0x0001 0x0081 0x81308132 0x0002 0x0082 0x81308133 0x0003 0x0083 0x81308134 0x0004 0x0084 0x81308135 0x0005 0x0085 0x81308136 0x0006 0x0086 0x81308137 0x0007 0x0087 0x81308138 0x0008 0x0088 0x81308139 0x0009 0x0089 0x81308230 0x000a 0x008a 0x81308231 0x000b 0x008b BTW, the index can be calculated using 12600*(byte1 - 0x81) + 1260*(byte2 - 0x30) + 10*(byte3 - 0x81) + (byte4 - 0x30). Date: Mon, 2 Apr 2001 14:41:22 -0700 (PDT) From: Brian Yuan <Brian.Yuan@sun.com> Frank, Just want to let you know that the way we handle the GB18030 fonts at this time hasn't been discussed among all of the Unix Vendors, so you can try to implement the GB18030 support based on those information, but please be prepared there may have some changes regarding to the grouping, and XLFD CHARSET-ENCODING name, etc. I will let you know if we have any updates about the fonts.
Depends on: 75928
Depends on: 75929
Depends on: 75930
mark it as moz0.9.1
Target Milestone: --- → mozilla0.9.1
Whiteboard: depend on 80772 expect date 5/17
remove moz0.9.1 here and mark it as moz0.9.2 the converter have been check in the X font support is also checked in The reason I mark this as moz0.9.2 is because this bug is now a meta bug. We really depend on 75928, 75929 and 75930 to be complete to support this one.
Keywords: meta
Target Milestone: mozilla0.9.1 → mozilla0.9.2
Depends on: 81200
Whiteboard: depend on 80772 expect date 5/17
changing to nsbeta1-. this one does not meet beta stopper guidelines.
Keywords: nsbeta1nsbeta1-
Verified in nightly 2001051622 on Solaris 9, All the GB18030 characters can display OK, but all the English characters are disappeared. a test text and snapshot image attatched at bug 81200
This is needed for RTM for Sun to distribute.
Keywords: nsCatFood, rtm
the feature is in. open other bugs for details issues. I will mark this bug as fixed for now.
Status: ASSIGNED → RESOLVED
Closed: 23 years ago
Resolution: --- → FIXED
Changed QA contact to teruko@netscape.com.
QA Contact: ylong → teruko
Per Ervin yan's comment, mark this as verified.
Status: RESOLVED → VERIFIED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: