Closed
Bug 359179
Opened 18 years ago
Closed 1 year ago
Unicode: bad Old Persian Line Breaking
Categories
(Core :: Internationalization, defect)
Tracking
()
RESOLVED
DUPLICATE
of bug 1719535
People
(Reporter: behaafarid_hezaarmard, Unassigned)
References
(Depends on 1 open bug)
Details
(Keywords: intl)
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1) Gecko/20061010 Firefox/2.0
According to Unicode Standard Annex #14 (Line Breaking Properties: http://unicode.org/reports/tr14/), the character U+103D0 (Old Persian Word Divider) is a visible word divider, so a line break should be allowed after it. However, Firefox treats this character like a regular letter and therefore treats the whole text as one giant word. So far the problem has been observed on every system tested, namely Firefox 2.0 under Windows and Firefox 1.5 under Linux and Windows.
Reproducible: Always
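To illustrate the expected behaviour, here is a minimal Python sketch that treats U+103D0 as a break-after character, the line break class (BA) that UAX #14 gives to visible word dividers. It is not Gecko code and not a full UAX #14 implementation: the names `break_opportunities` and `wrap` are hypothetical, and every character other than the divider is assumed to be unbreakable.

```python
from __future__ import annotations

# U+103D0 OLD PERSIAN WORD DIVIDER; UAX #14 allows a break after it.
OLD_PERSIAN_WORD_DIVIDER = "\U000103D0"


def break_opportunities(text: str) -> list[int]:
    """Return code point offsets at which a line may break.

    Simplification: only the break-after rule for U+103D0 is modelled;
    all other characters are treated as unbreakable.
    """
    return [
        i + 1
        for i, ch in enumerate(text)
        if ch == OLD_PERSIAN_WORD_DIVIDER and i + 1 < len(text)
    ]


def wrap(text: str, width: int) -> list[str]:
    """Greedily wrap text, breaking only at the offsets found above.

    A single word longer than `width` is kept whole on its own line.
    """
    lines, start, last_break = [], 0, None
    for pos in break_opportunities(text) + [len(text)]:
        if pos - start > width and last_break is not None:
            lines.append(text[start:last_break])
            start = last_break
        last_break = pos
    lines.append(text[start:])
    return lines
```

With the behaviour reported here, the equivalent of `break_opportunities` would return an empty list for Old Persian text, which is why the whole run ends up on one line.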
Updated•18 years ago
Status: UNCONFIRMED → NEW
Ever confirmed: true
Updated•18 years ago
Keywords: intl
OS: Windows XP → All
Summary: Unicode: Old Persian Line Breaking → Unicode: bad Old Persian Line Breaking
Version: unspecified → 2.0 Branch
Updated•18 years ago
Assignee: nobody → smontagu
Component: General → Internationalization
Product: Firefox → Core
QA Contact: general → i18n
Version: 2.0 Branch → 1.8 Branch
Comment 2•17 years ago
Linux builds with Pango enabled shouldn't have this problem BTW.
Updated•4 years ago
Updated•2 years ago
Severity: normal → S3
Comment 3•1 year ago
We've integrated the ICU4X line segmenter in bug 1719535, which is UAX #14 compatible.
The following testcase is taken from the URL.
𐎲𐎥𐏐𐎺𐏀𐎼𐎣𐏐𐎠𐎢𐎼𐎶𐏀𐎭𐎠𐏐𐏃𐎹𐏐𐎡𐎶𐎠𐎶𐏐𐎲𐎢𐎷𐎡𐎶𐏐𐎠𐎭𐎠𐏐𐏃𐎹𐏐𐎠𐎺𐎶𐏐𐎠𐎿𐎶𐎠𐎴𐎶𐏐𐎠𐎭𐎠𐏐𐏃𐎹𐏐𐎶𐎼𐎫𐎡𐎹𐎶𐏐𐎠𐎭𐎠𐏐𐏃𐎹𐏐𐏁𐎡𐎹𐎠𐎫𐎡𐎶𐏐𐎠𐎭𐎠𐏐𐎶𐎼𐎫𐎡𐎹𐏃𐎹𐎠𐏐
Tested on Firefox Nightly 2023-08-07. Double-clicking the fourth character, 𐎺, selects "𐎺𐏀𐎼𐎣", with each end of the selection stopping at '𐏐' (U+103D0 Old Persian Word Divider).
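For anyone who wants to check the selection result outside the browser, the observation above can be modelled with a small Python sketch. This is only an approximation of double-click selection, assuming that U+103D0 is the sole word boundary in the text; `select_word` is a hypothetical helper, not a Firefox or ICU4X API.

```python
# U+103D0 OLD PERSIAN WORD DIVIDER, assumed to be the only word boundary.
OLD_PERSIAN_WORD_DIVIDER = "\U000103D0"


def select_word(text: str, index: int) -> str:
    """Return the run of characters around `index` bounded by dividers.

    `index` is a code point offset (Python strings index by code point,
    so supplementary-plane characters count as one position each).
    """
    if text[index] == OLD_PERSIAN_WORD_DIVIDER:
        # Clicking the divider itself selects just the divider.
        return text[index]
    # rfind returns -1 when there is no earlier divider, giving start == 0.
    start = text.rfind(OLD_PERSIAN_WORD_DIVIDER, 0, index) + 1
    end = text.find(OLD_PERSIAN_WORD_DIVIDER, index)
    if end == -1:
        end = len(text)
    return text[start:end]


# The beginning of the testcase; offset 3 is the fourth character.
sample = "𐎲𐎥𐏐𐎺𐏀𐎼𐎣𐏐𐎠𐎢𐎼𐎶𐏀𐎭𐎠𐏐"
print(select_word(sample, 3))
```

The printed word should match the selection reported above if the segmenter is stopping at the divider on both sides.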