[[ This table is based upon sections 9.7, 13 and 14 in Tim Berners-Lee and Daniel W. Connolly HTML 2.0 Specification, and chapter 24 in Dave Reggett, Arnaud Le Hors and Ian Jacobs HTML 4.0 Specification. It gives the code positions and SGML entity names (as defined in annex D of ISO 8879) for the non-ASCII characters in the ISO Latin 1 (ISO 8859-1) character set referenced by the HTML DTD. The entity names should be supported by all HTML compliant browsers. The numeric character entity references (given in the "Position" column) are also valid designations, but seem to be more prone to non-compliant (i.e. buggy) browser implementation. ]]
This list details the code positions and the characters of the default HTML document character set. The default HTML coded character set is based on ISO 8859-1.
The HTML DTD references the "Added Latin 1" entity set, which only supplies named entities for a subset of the non-ASCII characters in ISO 8859-1, namely the accented characters. The following entities should be supported so that all ISO 8859-1 characters may only be referenced symbolically. The names for these entities are taken from the annexes of ISO 8879 (SGML):
"ISO 8879-1986//ENTITIES Added Latin 1//EN"
"ISO 8879-1986//ENTITIES Numeric and Special Graphic//EN"
"ISO 8879-1986//ENTITIES Diacritical Marks//EN"
Symbol | Posistion | Name | Description |
---|---|---|---|
� -  | Unused | ||
	 | Horizontal tab | ||
| Line feed | ||
 -  | Unused | ||
| Carriage Return | ||
 -  | Unused | ||
  | Space | ||
! | ! | Exclamation mark | |
" | " | " | Quotation mark |
# | # | Number sign | |
$ | $ | Dollar sign | |
% | % | Percent sign | |
& | & | & | Ampersand |
' | ' | ' | Apostrophe |
( - ; | Misc. ASCII characters | ||
< | < | < | Less-than sign |
= | = | Equals sign | |
> | > | > | Greater-than sign |
? - ~ | Misc. ASCII characters | ||
 - Ÿ | Unused | ||
  | | no-break space | |
¡ | ¡ | ¡ | inverted exclamation mark |
¢ | ¢ | ¢ | cent sign |
£ | £ | £ | pound sterling sign |
¤ | ¤ | ¤ | general currency sign |
¥ | ¥ | ¥ | yen sign |
¦ | ¦ | ¦ | broken (vertical) bar |
§ | § | § | section sign |
¨ | ¨ | ¨ | umlaut (dieresis) |
© | © | © | copyright sign |
ª | ª | ª | ordinal indicator, feminine |
« | « | « | angle quotation mark, left |
¬ | ¬ | ¬ | not sign |
- | ­ | ­ | soft hyphen |
® | ® | ® | registered sign |
¯ | ¯ | ¯ | macron |
° | ° | ° | degree sign |
± | ± | ± | plus-or-minus sign |
² | ² | ² | superscript two |
³ | ³ | ³ | superscript three |
´ | ´ | ´ | acute accent |
µ | µ | µ | micro sign |
¶ | ¶ | ¶ | pilcrow (paragraph sign) |
· | · | · | middle dot |
¸ | ¸ | ¸ | cedilla |
¹ | ¹ | ¹ | superscript one |
º | º | º | ordinal indicator, masculine |
» | » | » | angle quotation mark, right |
¼ | ¼ | ¼ | fraction one-quarter |
½ | ½ | ½ | fraction one-half |
¾ | ¾ | ¾ | fraction three-quarters |
¿ | ¿ | ¿ | inverted question mark |
À | À | À | capital A, grave accent |
Á | Á | Á | capital A, acute accent |
 |  |  | capital A, circumflex accent |
à | à | à | capital A, tilde |
Ä | Ä | Ä | capital A, dieresis or umlaut mark |
Å | Å | Å | capital A, ring |
Æ | Æ | Æ | capital AE diphthong |
Ç | Ç | Ç | capital C, cedilla |
È | È | È | capital E, grave accent |
É | É | É | capital E, acute accent |
Ê | Ê | Ê | capital E, circumflex accent |
Ë | Ë | Ë | capital E, dieresis or umlaut mark |
Ì | Ì | Ì | capital I, grave accent |
Í | Í | Í | capital I, acute accent |
Î | Î | Î | capital I, circumflex accent |
Ï | Ï | Ï | capital I, dieresis or umlaut mark |
Ð | Ð | Ð | capital Eth, Icelandic |
Ñ | Ñ | Ñ | capital N, tilde |
Ò | Ò | Ò | capital O, grave accent |
Ó | Ó | Ó | capital O, acute accent |
Ô | Ô | Ô | capital O, circumflex accent |
Õ | Õ | Õ | capital O, tilde |
Ö | Ö | Ö | capital O, dieresis or umlaut mark |
× | × | × | multiply sign |
Ø | Ø | Ø | capital O, slash |
Ù | Ù | Ù | capital U, grave accent |
Ú | Ú | Ú | capital U, acute accent |
Û | Û | Û | capital U, circumflex accent |
Ü | Ü | Ü | capital U, dieresis or umlaut mark |
Ý | Ý | Ý | capital Y, acute accent |
Þ | Þ | Þ | capital THORN, Icelandic |
ß | ß | ß | small sharp s, German sz |
à | à | à | small a, grave accent |
á | á | á | small a, acute accent |
â | â | â | small a, circumflex accent |
ã | ã | ã | small a, tilde |
ä | ä | ä | small a, dieresis or umlaut mark |
å | å | å | small a, ring |
æ | æ | æ | small ae diphthong |
ç | ç | ç | small c, cedilla |
è | è | è | small e, grave accent |
é | é | é | small e, acute accent |
ê | ê | ê | small e, circumflex accent |
ë | ë | ë | small e, dieresis or umlaut mark |
ì | ì | ì | small i, grave accent |
í | í | í | small i, acute accent |
î | î | î | small i, circumflex accent |
ï | ï | ï | small i, dieresis or umlaut mark |
ð | ð | ð | small eth, Icelandic |
ñ | ñ | ñ | small n, tilde |
ò | ò | ò | small o, grave accent |
ó | ó | ó | small o, acute accent |
ô | ô | ô | small o, circumflex accent |
õ | õ | õ | small o, tilde |
ö | ö | ö | small o, dieresis or umlaut mark |
÷ | ÷ | ÷ | divide sign |
ø | ø | ø | small o, slash |
ù | ù | ù | small u, grave accent |
ú | ú | ú | small u, acute accent |
û | û | û | small u, circumflex accent |
ü | ü | ü | small u, dieresis or umlaut mark |
ý | ý | ý | small y, acute accent |
þ | þ | þ | small thorn, Icelandic |
ÿ | ÿ | ÿ | small y, dieresis or umlaut mark |
Gisle Hannemyr, 1998-09-01