Unicode and Microsoft Internet Explorer
August 20, 2007
A scientific publishing client writes:
“We are making great progress converting all our documents to HTML (from SGML). One challenge we are facing is how to convert Unicode character entities into characters displayable in Internet Explorer. It appears that Netscape and Firefox work much better than IE in displaying Unicode. One option is to create glyphs for all of the non displayable characters; but, there are hundreds of them and that is not realistic for us.
Do you know how other publishers are handling the display of these special characters? If the characters appear in display equations, we are creating gifs. Our challenge is for those characters that appear in text, which are now displaying as boxes in IE. For example, the entity bsime is used for similar or equal to. Unicode represents this as ⋍ and it should display as ⋍ (Editor’s note: you are seeing this if you viewing this in Firefox or Netscape!)
Are there plug ins or sites that have all the glyphs or does Microsoft have special setups, etc? We have the same question out to a few of our vendors to see if they can help as well. This has become the critical path for us.”
Thoughts?
UPDATE:
I forgot to post this awhile ago. My colleague Marc Dashevsky worked with the client and they came up with the following:
In short, the problem is solely with Internet Explorer V6.x. The Mozilla-based browsers and Internet Explorer V7.x all display the same subset of Unicode. Following is a description of the testing.
He set up his system as follows:
* The font Arial Unicode MS was already installed on his system.
* He explicitly set, in Internet Explorer, Arial Unicode MS to be the Web Page Font (Tools->Internet Options->Fonts).
* He ensured that Internet Explorer, Firefox and Netscape were all using UTF-8 encoding.
He then visited a web page that lists many characters in ISO 8859-1.
Just as the client had experienced with with uncommon characters displayed in its HTML pages, on this page the Mozilla-based browsers and Internet Explorer V7 displayed many characters not displayed by Internet Explorer V6. All browsers successfully displayed all characters listed in the Latin Extended-A block. However when he got to characters in the Minimum European Subset (a.k.a. the Multilingual European Subset No. 2), Internet Explorer V6 displayed open rectangles while Mozilla browsers displayed appropriate glyphs. (An open rectangle means that Internet Explorer knows what character it has encountered, but it cannot find a glyph to display it.)
There clearly is some problem with Internet Explorer V6, and it is not likely that there is a work-around for it. Microsoft fixed the problem in V7 and he is certain they have no interest in retrofitting it to V6.
Marc’s solution is to have everyone switch to Firefox.
Makes sense to me.
Posted by Bill Trippe at August 20, 2007 5:32 PM








