View Issue Details
ID | Project | Category | View Status | Date Submitted | Last Update |
---|---|---|---|---|---|
0002071 | Composr | core | public | 2015-11-11 04:53 | 2015-11-11 12:46 |
Reporter | Patrick Schmalstig | Assigned To | Chris Graham | ||
Severity | Minor-bug | ||||
Status | closed | Resolution | fixed | ||
Product Version | |||||
Fixed in Version | |||||
Summary | 0002071: Some characters, such as quotations, don't get rendered properly via. lang | ||||
Description | Sometimes if there is a special character such as a quotation mark in a language string, it will render as bizzare characters. Example: “:activityâ€. | ||||
Steps To Reproduce | 1. Find a language string with quotation marks in it. 2. Using PHP code, pass the language string through an cms template to be rendered as a page. 3. Access that page. | ||||
Tags | No tags attached. | ||||
Time estimation (hours) | |||||
Sponsorship open | |||||
|
Misunderstanding of character sets. Unicode characters cannot display if not unicode. |
|
Checking the LANG files shows they are not unicode; there are valid quotations in the LANG file. However, when passed through the templates etc, they show up with these random characters. I'm wondering if maybe the LANG file itself needs converted to UTF8 even if it looks to already be in that format. |
|
A file isn't necessarily unicode or not, it is a stream of bytes. There is a system called "byte order mark" which some editors use to say if a file is unicode or not, but you can have unicode characters (multi-byte) characters in files without that mark. The characters here are not regular quote marks, they are the left/right leaning quotes, which are not in the Western European character set. We use some of these by reference to their HTML entities in the default language files, but we never use the characters directly because the default pack is not unicode. |