Re: Unicode and HTML

Christian L. Mogensen (mogens@CS.Stanford.EDU)
Tue, 25 Oct 1994 17:25:22 -0700


As I understood it, both sides were right. :-)

LANGUAGE markup was important, and it is an issue orthogonal to
charset markup.

So you could write (wrong syntax, but you get the idea):
<LANGUAGE FRENCH> Sacr&eacute; bleu!</LANGUAGE>
is a French exclamation.

The character set is the same as for the surrounding English text,
but the language is different.

Similarly, charset markup is important. As far as I remember, this
was not completely resolved.

<LANGUAGE FRENCH CHARSET="ISO-8859-1">Sacr&eacute;</LANGUAGE>

<LANGUAGE GREEK CHARSET="ISO-xxxx-y">Zevs</LANGUAGE>

where Zevs stands for the bytes to be rendered in the appropriate
Greek character set. Knowing the semantic nugget "this is in Greek"
is a bonus, but it is not required for the character set to be marked.
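
To make the charset point concrete, here is a minimal Python sketch
(not part of the original discussion). It assumes ISO-8859-7 as the
Greek character set, which the example above leaves unspecified as
ISO-xxxx-y, and the helper name decode_with is made up for
illustration. The same bytes read as Greek or as Latin-1 text
depending only on the declared charset:

```python
def decode_with(raw: bytes, charset: str) -> str:
    """Interpret raw bytes under the given charset label (hypothetical helper)."""
    return raw.decode(charset)


# Greek "Zeus" (zeta, epsilon, upsilon, final sigma), written with escapes
# so this file itself stays pure ASCII.
zeus = "\u0396\u03b5\u03c5\u03c2"

# Assumption: ISO-8859-7 is the Greek charset; one byte per character.
raw = zeus.encode("iso8859_7")

as_greek = decode_with(raw, "iso8859_7")    # the intended reading
as_latin1 = decode_with(raw, "iso8859_1")   # same bytes, wrong charset

print(raw.hex())    # the four bytes on the wire
print(as_greek)     # Greek letters
print(as_latin1)    # accented Latin letters instead

# The reverse confusion: the Latin-1 byte for e-acute is a Greek iota
# under ISO-8859-7.
sacre = "Sacr\u00e9".encode("iso8859_1")
sacre_as_greek = decode_with(sacre, "iso8859_7")
print(sacre_as_greek)
```

Nothing in the bytes themselves says which reading is intended, which
is why the charset has to be marked whether or not the language is.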

Christian "feel free to correct this vague recollection"