Re: International Document Server Support

Alexsander Totic (atotic@ncsa.uiuc.edu)
Tue, 7 Dec 1993 13:05:42 -0600 (CST)


> I am including an optional CHARSET attribute in nearly all elements in
> the next revision to the HTML+ DTD. This will allow browsers to switch
> char sets for a paragraph etc, e.g. <P charset="ISO-2022-JP"> ....
> as described in RFC 1468 which is used for Japanese character encoding
> for email and network news.
>
> Should I also include a LANGUAGE attribute for the ISO3316 language codes?

Are these character sets all 8bit sets, or is there support for
multi-byte characters. I am not familiar with the way things work on X,
but on a Mac, Chinese and some other languages use 2-byte characters.

How is the parsing going to be done for character sets where all the
characters are used? Current parser depends on special characters, such
as '<', and '&'. In different character sets, I do not think that we can
depend on this.

Aleks