HTML 4 Character Entities
Table of the character entities required by HTML 4, to test web browser compliance

This page is ripped out of the HTML 4 documentation at the W3C and massaged so you can test your browser’s rendering of special characters.

Contents

  1. Introduction to character entity references
  2. Character entity references for ISO 8859-1 characters
    1. The list of characters
  3. Character entity references for symbols, mathematical symbols, and Greek letters
    1. The list of characters
  4. Character entity references for markup-significant and internationalization characters
    1. The list of characters

24.1 Introduction to character entity references

A character entity reference is an SGML construct that references a character of the document character set.

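This version of HTML supports several sets of character entity references: ISO 8859-1 characters; symbols, mathematical symbols, and Greek letters; and markup-significant and internationalization characters.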

The following sections present the complete lists of character entity references. Although [ISO10646] by convention writes the comments following each entry in uppercase letters, we have converted them to lowercase in this specification for readability.

24.2 Character entity references for ISO 8859-1 characters

The character entity references in this section produce characters whose numeric equivalents should already be supported by conforming HTML 2.0 user agents. Thus, the character entity reference &divide; is a more convenient form than &#247; for obtaining the division sign (÷).
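As a quick illustration of that equivalence, the sketch below resolves the named, decimal, and hexadecimal forms of the division sign and compares the results. It uses Python's standard `html` module purely as a stand-in for a user agent's reference resolver; the variable names are illustrative only.

```python
import html
from html.entities import name2codepoint

named = html.unescape("&divide;")       # named character entity reference
numeric = html.unescape("&#247;")       # decimal numeric character reference
hexadecimal = html.unescape("&#xF7;")   # hexadecimal numeric character reference

# All three forms should resolve to the same character, U+00F7.
same_character = named == numeric == hexadecimal == "\u00f7"

# The entity name maps to the decimal code point used in the numeric form.
divide_code_point = name2codepoint["divide"]
```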

To support these named entities, user agents need only recognize the entity names and convert them to characters that lie within the repertoire of [ISO88591].
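A minimal sketch of that requirement, assuming Python's `html.entities.name2codepoint` table as the source of entity names: it selects the names whose code points fall in the upper half of ISO 8859-1 and shows that each resolves to a single byte in that encoding. The helper `to_latin1_byte` is a hypothetical name, not part of any specification.

```python
from html.entities import name2codepoint

# Entity names whose code points lie in the range 160-255 (U+00A0 to U+00FF).
latin1_names = {
    name: cp for name, cp in name2codepoint.items() if 0xA0 <= cp <= 0xFF
}

def to_latin1_byte(name: str) -> bytes:
    """Resolve an entity name and encode its character as one ISO 8859-1 byte."""
    return chr(latin1_names[name]).encode("iso-8859-1")
```

For example, `to_latin1_byte("divide")` yields the single byte `0xF7`.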

Character 65533 (FFFD hexadecimal) is the last valid character in UCS-2. 65534 (FFFE hexadecimal) is unassigned and reserved as the byte-swapped version of ZERO WIDTH NON-BREAKING SPACE for byte-order detection purposes. 65535 (FFFF hexadecimal) is unassigned.
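The arithmetic in that paragraph can be checked directly. The sketch below confirms the decimal and hexadecimal pairs and shows why FFFE is reserved: it is what the byte-order mark U+FEFF looks like when its two bytes are read in the wrong order. Variable names are illustrative.

```python
# Decimal values quoted in the text, paired with their hexadecimal forms.
code_points = {65533: 0xFFFD, 65534: 0xFFFE, 65535: 0xFFFF}
values_match = all(dec == hx for dec, hx in code_points.items())

# ZERO WIDTH NON-BREAKING SPACE (U+FEFF) serialized big-endian is FE FF.
bom_bytes = "\ufeff".encode("utf-16-be")

# Reading the same two bytes in reversed order gives the value FFFE,
# which is how a decoder can detect mismatched byte order.
swapped = int.from_bytes(bom_bytes[::-1], "big")
```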

24.2.1 The list of characters

<!-- Portions © International Organization for Standardization 1986
     Permission to copy in any form is granted for use with
     conforming SGML systems and applications as defined in
     ISO 8879, provided this notice is included in all copies.
-->
<!-- Character entity set. Typical invocation:
     <!ENTITY % HTMLlat1 PUBLIC
       "-//W3C//ENTITIES Latin 1//EN//HTML">
     %HTMLlat1;
-->
nbsp   no-break space = non-breaking space U+00A0 ISOnum
iexcl ¡ inverted exclamation mark U+00A1 ISOnum
cent ¢ cent sign U+00A2 ISOnum
pound £ pound sign U+00A3 ISOnum
curren ¤ currency sign U+00A4 ISOnum
yen ¥ yen sign = yuan sign U+00A5 ISOnum
brvbar ¦ broken bar = broken vertical bar U+00A6 ISOnum
sect § section sign U+00A7 ISOnum
uml ¨ diaeresis = spacing diaeresis U+00A8 ISOdia
copy © copyright sign U+00A9 ISOnum
ordf ª feminine ordinal indicator U+00AA ISOnum
laquo « left-pointing double angle quotation mark = left pointing guillemet U+00AB ISOnum
not ¬ not sign U+00AC ISOnum
shy ­ soft hyphen = discretionary hyphen U+00AD ISOnum
reg ® registered sign = registered trade mark sign U+00AE ISOnum
macr ¯ macron = spacing macron = overline = APL overbar U+00AF ISOdia
deg ° degree sign U+00B0 ISOnum
plusmn ± plus-minus sign = plus-or-minus sign U+00B1 ISOnum
sup2 ² superscript two = superscript digit two = squared U+00B2 ISOnum
sup3 ³ superscript three = superscript digit three = cubed U+00B3 ISOnum
acute ´ acute accent = spacing acute U+00B4 ISOdia
micro µ micro sign U+00B5 ISOnum
para ¶ pilcrow sign = paragraph sign U+00B6 ISOnum
middot · middle dot = Georgian comma = Greek middle dot U+00B7 ISOnum
cedil ¸ cedilla = spacing cedilla U+00B8 ISOdia
sup1 ¹ superscript one = superscript digit one U+00B9 ISOnum
ordm º masculine ordinal indicator U+00BA ISOnum
raquo » right-pointing double angle quotation mark = right pointing guillemet U+00BB ISOnum
frac14 ¼ vulgar fraction one quarter = fraction one quarter U+00BC ISOnum
frac12 ½ vulgar fraction one half = fraction one half U+00BD ISOnum
frac34 ¾ vulgar fraction three quarters = fraction three quarters U+00BE ISOnum
iquest ¿ inverted question mark = turned question mark U+00BF ISOnum
Agrave À latin capital letter A with grave = latin capital letter A grave U+00C0 ISOlat1
Aacute Á latin capital letter A with acute U+00C1 ISOlat1
Acirc Â latin capital letter A with circumflex U+00C2 ISOlat1
Atilde Ã latin capital letter A with tilde U+00C3 ISOlat1
Auml Ä latin capital letter A with diaeresis U+00C4 ISOlat1
Aring Å latin capital letter A with ring above = latin capital letter A ring U+00C5 ISOlat1
AElig Æ latin capital letter AE = latin capital ligature AE U+00C6 ISOlat1
Ccedil Ç latin capital letter C with cedilla U+00C7 ISOlat1
Egrave È latin capital letter E with grave U+00C8 ISOlat1
Eacute É latin capital letter E with acute U+00C9 ISOlat1
Ecirc Ê latin capital letter E with circumflex U+00CA ISOlat1
Euml Ë latin capital letter E with diaeresis U+00CB ISOlat1
Igrave Ì latin capital letter I with grave U+00CC ISOlat1
Iacute Í latin capital letter I with acute U+00CD ISOlat1
Icirc Î latin capital letter I with circumflex U+00CE ISOlat1
Iuml Ï latin capital letter I with diaeresis U+00CF ISOlat1
ETH Ð latin capital letter ETH U+00D0 ISOlat1
Ntilde Ñ latin capital letter N with tilde U+00D1 ISOlat1
Ograve Ò latin capital letter O with grave U+00D2 ISOlat1
Oacute Ó latin capital letter O with acute U+00D3 ISOlat1
Ocirc Ô latin capital letter O with circumflex U+00D4 ISOlat1
Otilde Õ latin capital letter O with tilde U+00D5 ISOlat1
Ouml Ö latin capital letter O with diaeresis U+00D6 ISOlat1
times × multiplication sign U+00D7 ISOnum
Oslash Ø latin capital letter O with stroke = latin capital letter O slash U+00D8 ISOlat1
Ugrave Ù latin capital letter U with grave U+00D9 ISOlat1
Uacute Ú latin capital letter U with acute U+00DA ISOlat1
Ucirc Û latin capital letter U with circumflex U+00DB ISOlat1
Uuml Ü latin capital letter U with diaeresis U+00DC ISOlat1
Yacute Ý latin capital letter Y with acute U+00DD ISOlat1
THORN Þ latin capital letter THORN U+00DE ISOlat1
szlig ß latin small letter sharp s = ess-zed U+00DF ISOlat1
agrave à latin small letter a with grave = latin small letter a grave U+00E0 ISOlat1
aacute á latin small letter a with acute U+00E1 ISOlat1
acirc â latin small letter a with circumflex U+00E2 ISOlat1
atilde ã latin small letter a with tilde U+00E3 ISOlat1
auml ä latin small letter a with diaeresis U+00E4 ISOlat1
aring å latin small letter a with ring above = latin small letter a ring U+00E5 ISOlat1
aelig æ latin small letter ae = latin small ligature ae U+00E6 ISOlat1
ccedil ç latin small letter c with cedilla U+00E7 ISOlat1
egrave è latin small letter e with grave U+00E8 ISOlat1
eacute é latin small letter e with acute U+00E9 ISOlat1
ecirc ê latin small letter e with circumflex U+00EA ISOlat1
euml ë latin small letter e with diaeresis U+00EB ISOlat1
igrave ì latin small letter i with grave U+00EC ISOlat1
iacute í latin small letter i with acute U+00ED ISOlat1
icirc î latin small letter i with circumflex U+00EE ISOlat1
iuml ï latin small letter i with diaeresis U+00EF ISOlat1
eth ð latin small letter eth U+00F0 ISOlat1
ntilde ñ latin small letter n with tilde U+00F1 ISOlat1
ograve ò latin small letter o with grave U+00F2 ISOlat1
oacute ó latin small letter o with acute U+00F3 ISOlat1
ocirc ô latin small letter o with circumflex U+00F4 ISOlat1
otilde õ latin small letter o with tilde U+00F5 ISOlat1
ouml ö latin small letter o with diaeresis U+00F6 ISOlat1
divide ÷ division sign U+00F7 ISOnum
oslash ø latin small letter o with stroke = latin small letter o slash U+00F8 ISOlat1
ugrave ù latin small letter u with grave U+00F9 ISOlat1
uacute ú latin small letter u with acute U+00FA ISOlat1
ucirc û latin small letter u with circumflex U+00FB ISOlat1
uuml ü latin small letter u with diaeresis U+00FC ISOlat1
yacute ý latin small letter y with acute U+00FD ISOlat1
thorn þ latin small letter thorn U+00FE ISOlat1
yuml ÿ latin small letter y with diaeresis U+00FF ISOlat1
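Each row of the list pairs an entity name with a code point, so a row can be checked mechanically. The following is a rough sketch, assuming a row begins with the entity name and contains a `U+XXXX` code point; the pattern `ROW` and the helper `check_row` are hypothetical names, and Python's `html.entities.name2codepoint` serves as the reference table.

```python
import re
from html.entities import name2codepoint

# Entity name first, then free text, then a four-digit U+XXXX code point,
# optionally followed by the name of the ISO entity set.
ROW = re.compile(
    r"^(?P<name>\w+)\s+(?P<body>.*?)\s*U\+(?P<hex>[0-9A-F]{4})(?:\s+(?P<set>.+))?$"
)

def check_row(row: str) -> bool:
    """Return True when the row's code point matches the reference table."""
    match = ROW.match(row)
    if match is None:
        return False
    return name2codepoint.get(match["name"]) == int(match["hex"], 16)
```

A row such as `divide ÷ division sign U+00F7 ISOnum` passes the check, while a row whose code point does not belong to its entity name fails it.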

24.3 Character entity references for symbols, mathematical symbols, and Greek letters

The character entity references in this section produce characters that may be represented by glyphs in the widely available Adobe Symbol font, including Greek characters, various bracketing symbols, and a selection of mathematical operators such as gradient, product, and summation symbols.

To support these entities, user agents may support full [ISO10646] or use other means. Display of glyphs for these characters may be obtained by being able to display the relevant [ISO10646] characters or by other means, such as internally mapping the listed entities, numeric character references, and characters to the appropriate position in some font that contains the requisite glyphs.

When to use Greek entities. This entity set contains all the letters used in modern Greek. However, it does not include Greek punctuation, precomposed accented characters nor the non-spacing accents (tonos, dialytika) required to compose them. There are no archaic letters, Coptic-unique letters, or precomposed letters for Polytonic Greek. The entities defined here are not intended for the representation of modern Greek text and would not be an efficient representation; rather, they are intended for occasional Greek letters used in technical and mathematical works.
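The coverage described above can be spot-checked against a reference table. This sketch assumes Python's `html.entities.codepoint2name` reflects the HTML 4 entity sets; it confirms that the basic capital and small letters all have names while a precomposed accented letter does not. The variable names are illustrative.

```python
from html.entities import codepoint2name

# Capital alpha (U+0391) through omega (U+03A9); U+03A2 is not assigned.
capitals = [cp for cp in range(0x0391, 0x03AA) if cp != 0x03A2]
# Small alpha (U+03B1) through omega (U+03C9), including final sigma (U+03C2).
smalls = list(range(0x03B1, 0x03CA))

all_named = all(cp in codepoint2name for cp in capitals + smalls)

# A precomposed accented letter, U+03AC (small alpha with tonos), for comparison.
accented_named = 0x03AC in codepoint2name
```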

24.3.1 The list of characters

<!-- Mathematical, Greek and Symbolic characters for HTML -->

<!-- Character entity set. Typical invocation:
     <!ENTITY % HTMLsymbol PUBLIC
       "-//W3C//ENTITIES Symbols//EN//HTML">
     %HTMLsymbol;
-->

<!-- Portions © International Organization for Standardization 1986:
     Permission to copy in any form is granted for use with
     conforming SGML systems and applications as defined in
     ISO 8879, provided this notice is included in all copies.
-->

<!-- Relevant ISO entity set is given unless names are newly introduced.
     New names (i.e., not in ISO 8879 list) do not clash with any
     existing ISO 8879 entity names. ISO 10646 character numbers
     are given for each character, in hex. CDATA values are decimal
     conversions of the ISO 10646 values and refer to the document
     character set. Names are ISO 10646 names.
-->
Latin Extended-B
fnof ƒ latin small f with hook = function = florin U+0192 ISOtech
Greek
Alpha Α greek capital letter alpha U+0391
Beta Β greek capital letter beta U+0392
Gamma Γ greek capital letter gamma U+0393 ISOgrk3
Delta Δ greek capital letter delta U+0394 ISOgrk3
Epsilon Ε greek capital letter epsilon U+0395
Zeta Ζ greek capital letter zeta U+0396
Eta Η greek capital letter eta U+0397
Theta Θ greek capital letter theta U+0398 ISOgrk3
Iota Ι greek capital letter iota U+0399
Kappa Κ greek capital letter kappa U+039A
Lambda Λ greek capital letter lambda U+039B ISOgrk3
Mu Μ greek capital letter mu U+039C
Nu Ν greek capital letter nu U+039D
Xi Ξ greek capital letter xi U+039E ISOgrk3
Omicron Ο greek capital letter omicron U+039F
Pi Π greek capital letter pi U+03A0 ISOgrk3
Rho Ρ greek capital letter rho U+03A1
    Note: there is no Sigmaf, and no U+03A2 character either.
Sigma Σ greek capital letter sigma U+03A3 ISOgrk3
Tau Τ greek capital letter tau U+03A4
Upsilon Υ greek capital letter upsilon U+03A5 ISOgrk3
Phi Φ greek capital letter phi U+03A6 ISOgrk3
Chi Χ greek capital letter chi U+03A7
Psi Ψ greek capital letter psi U+03A8 ISOgrk3
Omega Ω greek capital letter omega U+03A9 ISOgrk3
alpha α greek small letter alpha U+03B1 ISOgrk3
beta β greek small letter beta U+03B2 ISOgrk3
gamma γ greek small letter gamma U+03B3 ISOgrk3
delta δ greek small letter delta U+03B4 ISOgrk3
epsilon ε greek small letter epsilon U+03B5 ISOgrk3
zeta ζ greek small letter zeta U+03B6 ISOgrk3
eta η greek small letter eta U+03B7 ISOgrk3
theta θ greek small letter theta U+03B8 ISOgrk3
iota ι greek small letter iota U+03B9 ISOgrk3
kappa κ greek small letter kappa U+03BA ISOgrk3
lambda λ greek small letter lambda U+03BB ISOgrk3
mu μ greek small letter mu U+03BC ISOgrk3
nu ν greek small letter nu U+03BD ISOgrk3
xi ξ greek small letter xi U+03BE ISOgrk3
omicron ο greek small letter omicron U+03BF NEW
pi π greek small letter pi U+03C0 ISOgrk3
rho ρ greek small letter rho U+03C1 ISOgrk3
sigmaf ς greek small letter final sigma U+03C2 ISOgrk3
sigma σ greek small letter sigma U+03C3 ISOgrk3
tau τ greek small letter tau U+03C4 ISOgrk3
upsilon υ greek small letter upsilon U+03C5 ISOgrk3
phi φ greek small letter phi U+03C6 ISOgrk3
chi χ greek small letter chi U+03C7 ISOgrk3
psi ψ greek small letter psi U+03C8 ISOgrk3
omega ω greek small letter omega U+03C9 ISOgrk3
thetasym ϑ greek small letter theta symbol U+03D1 NEW
upsih ϒ greek upsilon with hook symbol U+03D2 NEW
piv ϖ greek pi symbol U+03D6 ISOgrk3
General Punctuation
bull bullet = black small circle U+2022 ISOpub
    Note: bullet is NOT the same as bullet operator, U+2219.
hellip horizontal ellipsis = three dot leader U+2026 ISOpub
prime prime = minutes = feet U+2032 ISOtech
Prime double prime = seconds = inches U+2033 ISOtech
oline overline = spacing overscore U+203E NEW
frasl fraction slash U+2044 NEW
Letterlike Symbols
weierp script capital P = power set = Weierstrass p U+2118 ISOamso
image black letter capital I = imaginary part U+2111 ISOamso
real black letter capital R = real part symbol U+211C ISOamso
trade trade mark sign U+2122 ISOnum
alefsym alef symbol = first transfinite cardinal U+2135 NEW
    Note: alef symbol is NOT the same as hebrew letter alef, U+05D0, although the same glyph could be used to depict both characters.
Arrows
larr leftwards arrow U+2190 ISOnum
uarr upwards arrow U+2191 ISOnum
rarr rightwards arrow U+2192 ISOnum
darr downwards arrow U+2193 ISOnum
harr left right arrow U+2194 ISOamsa
crarr downwards arrow with corner leftwards = carriage return U+21B5 NEW
lArr leftwards double arrow U+21D0 ISOtech
    Note: ISO 10646 does not say that lArr is the same as the 'is implied by' arrow but also does not have any other character for that function, so lArr can be used for 'is implied by' as ISOtech suggests.
uArr upwards double arrow U+21D1 ISOamsa
rArr rightwards double arrow U+21D2 ISOtech
    Note: ISO 10646 does not say this is the 'implies' character but does not have another character with this function, so rArr can be used for 'implies' as ISOtech suggests.
dArr downwards double arrow U+21D3 ISOamsa
hArr left right double arrow U+21D4 ISOamsa
Mathematical Operators
forall for all U+2200 ISOtech
part partial differential U+2202 ISOtech
exist there exists U+2203 ISOtech
empty empty set = null set = diameter U+2205 ISOamso
nabla nabla = backward difference U+2207 ISOtech
isin element of U+2208 ISOtech
notin not an element of U+2209 ISOtech
ni contains as member U+220B ISOtech
    Note: should there be a more memorable name than 'ni'?
prod n-ary product = product sign U+220F ISOamsb
    Note: prod is NOT the same character as U+03A0 'greek capital letter pi' though the same glyph might be used for both.
sum n-ary summation U+2211 ISOamsb
    Note: sum is NOT the same character as U+03A3 'greek capital letter sigma' though the same glyph might be used for both.
minus minus sign U+2212 ISOtech
lowast asterisk operator U+2217 ISOtech
radic square root = radical sign U+221A ISOtech
prop proportional to U+221D ISOtech
infin infinity U+221E ISOtech
ang angle U+2220 ISOamso
and logical and = wedge U+2227 ISOtech
or logical or = vee U+2228 ISOtech
cap intersection = cap U+2229 ISOtech
cup union = cup U+222A ISOtech
int integral U+222B ISOtech
there4 therefore U+2234 ISOtech
sim tilde operator = varies with = similar to U+223C ISOtech
    Note: tilde operator is NOT the same character as the tilde, U+007E, although the same glyph might be used to represent both.
cong approximately equal to U+2245 ISOtech
asymp almost equal to = asymptotic to U+2248 ISOamsr
ne not equal to U+2260 ISOtech
equiv identical to U+2261 ISOtech
le less-than or equal to U+2264 ISOtech
ge greater-than or equal to U+2265 ISOtech
sub subset of U+2282 ISOtech
sup superset of U+2283 ISOtech
    Note: nsup, 'not a superset of' (U+2285), is not covered by the Symbol font encoding and is not included. Should it be, for symmetry? It is in ISOamsn.
nsub not a subset of U+2284 ISOamsn
sube subset of or equal to U+2286 ISOtech
supe superset of or equal to U+2287 ISOtech
oplus circled plus = direct sum U+2295 ISOamsb
otimes circled times = vector product U+2297 ISOamsb
perp up tack = orthogonal to = perpendicular U+22A5 ISOtech
sdot dot operator U+22C5 ISOamsb
    Note: dot operator is NOT the same character as U+00B7 middle dot.
Miscellaneous Technical
lceil left ceiling = apl upstile U+2308 ISOamsc
rceil right ceiling U+2309 ISOamsc
lfloor left floor = apl downstile U+230A ISOamsc
rfloor right floor U+230B ISOamsc
lang left-pointing angle bracket = bra U+2329 ISOtech
    Note: lang is NOT the same character as U+003C 'less than' or U+2039 'single left-pointing angle quotation mark'.
rang right-pointing angle bracket = ket U+232A ISOtech
    Note: rang is NOT the same character as U+003E 'greater than' or U+203A 'single right-pointing angle quotation mark'.
Geometric Shapes
loz lozenge U+25CA ISOpub
Miscellaneous Symbols
spades black spade suit U+2660 ISOpub
    Note: black here seems to mean filled as opposed to hollow.
clubs black club suit = shamrock U+2663 ISOpub
hearts black heart suit = valentine U+2665 ISOpub
diams black diamond suit U+2666 ISOpub
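Several notes in this list warn that an operator is not the same character as a similar-looking letter or mark. A small sketch of how that distinction shows up in practice, assuming Python's `html.entities.name2codepoint` and `unicodedata` as references; `pairs`, `describe`, and `report` are illustrative names.

```python
import unicodedata
from html.entities import name2codepoint

# Look-alike pairs called out in the notes: operator first, look-alike second.
pairs = [("prod", "Pi"), ("sum", "Sigma"), ("sdot", "middot")]

def describe(entity: str) -> str:
    """Return the code point and Unicode character name for an entity name."""
    cp = name2codepoint[entity]
    return f"U+{cp:04X} {unicodedata.name(chr(cp))}"

# Each pair maps to two different code points with different character names.
report = {(a, b): (describe(a), describe(b)) for a, b in pairs}
```

A renderer may draw both members of a pair with the same glyph, but comparing code points, as here, keeps them distinct.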

24.4 Character entity references for markup-significant and internationalization characters

The character entity references in this section are for escaping markup-significant characters (these are the same as those in HTML 2.0 and 3.2) and for denoting spaces and dashes. Other characters in this section apply to internationalization issues such as the disambiguation of bidirectional text (see the section on bidirectional text for details).
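To make the escaping role concrete, here is a short sketch using `html.escape` from Python's standard library as one possible escaper; the sample string is invented for illustration.

```python
import html

# Text containing the markup-significant characters <, & and ".
raw = 'if a < b && name == "x"'

# quote=True also escapes quotation marks, which matters inside attribute values.
escaped = html.escape(raw, quote=True)

# Unescaping the result should recover the original text.
round_trip = html.unescape(escaped)
```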

Entities have also been added for the remaining characters occurring in CP-1252 which do not occur in the HTMLlat1 or HTMLsymbol entity sets. These all occur in the 128 to 159 range within the CP-1252 charset. These entities permit the characters to be denoted in a platform-independent manner.
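One way to explore that range is to decode each byte from 128 to 159 as CP-1252 and look up the resulting character's entity name. The sketch below assumes Python's `cp1252` codec and `html.entities.codepoint2name` as references, which may differ in detail from the CP-1252 definition the text had in mind; it collects separately any decoded character that lacks a name.

```python
from html.entities import codepoint2name

named = {}     # byte value -> entity name
unnamed = []   # byte values whose character has no entity name in the table

for byte in range(0x80, 0xA0):
    try:
        char = bytes([byte]).decode("cp1252")
    except UnicodeDecodeError:
        continue  # this position is not assigned in the codec
    name = codepoint2name.get(ord(char))
    if name is None:
        unnamed.append(byte)
    else:
        named[byte] = name
```

For instance, byte 0x80 resolves to `euro` and byte 0x99 to `trade`, so either can be written without depending on the platform's code page.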

To support these entities, user agents may support full [ISO10646] or use other means. Display of glyphs for these characters may be obtained by being able to display the relevant [ISO10646] characters or by other means, such as internally mapping the listed entities, numeric character references, and characters to the appropriate position in some font that contains the requisite glyphs.
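The internal mapping mentioned above could start by normalizing named references to numeric ones. This is only a sketch of that idea, not a description of any real user agent: `NAMED_REF` and `to_numeric` are hypothetical names, and the lookup table is Python's `html.entities.name2codepoint`.

```python
import re
from html.entities import name2codepoint

# A named reference: ampersand, a name starting with a letter, then a semicolon.
NAMED_REF = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")

def to_numeric(text: str) -> str:
    """Rewrite known named references as decimal numeric character references."""
    def replace(match):
        cp = name2codepoint.get(match.group(1))
        # Unknown names are left untouched rather than guessed at.
        return match.group(0) if cp is None else f"&#{cp};"
    return NAMED_REF.sub(replace, text)
```

For example, `to_numeric("&euro;5")` gives `&#8364;5`, while a name that is not in the table passes through unchanged.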

24.4.1 The list of characters

<!-- Special characters for HTML -->

<!-- Character entity set. Typical invocation:
     <!ENTITY % HTMLspecial PUBLIC
       "-//W3C//ENTITIES Special//EN//HTML">
     %HTMLspecial; -->

<!-- Portions © International Organization for Standardization 1986:
     Permission to copy in any form is granted for use with
     conforming SGML systems and applications as defined in
     ISO 8879, provided this notice is included in all copies.
-->

<!-- Relevant ISO entity set is given unless names are newly introduced.
     New names (i.e., not in ISO 8879 list) do not clash with any
     existing ISO 8879 entity names. ISO 10646 character numbers
     are given for each character, in hex. CDATA values are decimal
     conversions of the ISO 10646 values and refer to the document
     character set. Names are ISO 10646 names. 
-->
C0 Controls and Basic Latin
quot " quotation mark = APL quote U+0022 ISOnum
amp & ampersand U+0026 ISOnum
lt < less-than sign U+003C ISOnum
gt > greater-than sign U+003E ISOnum
Latin Extended-A
OElig Œ latin capital ligature OE U+0152 ISOlat2
oelig œ latin small ligature oe U+0153 ISOlat2
    Note: ligature is a misnomer; this is a separate character in some languages.
Scaron Š latin capital letter S with caron U+0160 ISOlat2
scaron š latin small letter s with caron U+0161 ISOlat2
Yuml Ÿ latin capital letter Y with diaeresis U+0178 ISOlat2
Spacing Modifier Letters
circ ˆ modifier letter circumflex accent U+02C6 ISOpub
tilde ˜ small tilde U+02DC ISOdia
General Punctuation
ensp en space U+2002 ISOpub
emsp em space U+2003 ISOpub
thinsp thin space U+2009 ISOpub
zwnj zero width non-joiner U+200C NEW RFC 2070
zwj zero width joiner U+200D NEW RFC 2070
lrm left-to-right mark U+200E NEW RFC 2070
rlm right-to-left mark U+200F NEW RFC 2070
ndash en dash U+2013 ISOpub
mdash em dash U+2014 ISOpub
lsquo left single quotation mark U+2018 ISOnum
rsquo right single quotation mark U+2019 ISOnum
sbquo single low-9 quotation mark U+201A NEW
ldquo left double quotation mark U+201C ISOnum
rdquo right double quotation mark U+201D ISOnum
bdquo double low-9 quotation mark U+201E NEW
dagger dagger U+2020 ISOpub
Dagger double dagger U+2021 ISOpub
permil per mille sign U+2030 ISOtech
lsaquo single left-pointing angle quotation mark U+2039 ISO proposed
    Note: lsaquo is proposed but not yet ISO standardized.
rsaquo single right-pointing angle quotation mark U+203A ISO proposed
    Note: rsaquo is proposed but not yet ISO standardized.
euro euro sign U+20AC NEW