Non-ASCII Symbols in the SCA Armorial Database

by Iulstan Sigewealding
26 June 1999

Since January 1996, non-ASCII symbols in names and blazons have been encoded in the SCA Ordinary database (oanda.db). The encoding is mostly complete for items registered since July 1980, but only sporadic for items registered before that date. In other words, over 90% of the database has been revised.

When a Latin-1 encoding exists, the non-ASCII symbol is encoded in accordance with that standard. The resulting code is an 8-bit byte with the most-significant bit set to 1, as detailed in the table below. (Unfortunately, these 8-bit codes are NOT compatible with the "437" code page normally active on PCs running MS-DOS in the United States.)
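The incompatibility with code page 437 can be illustrated with a minimal Python sketch. It does not read the database; the sample byte string and the variable names are made up for illustration.

```python
# Hypothetical sample: the name "José" as it would be stored using Latin-1.
raw = b"Jos\xe9"

# Every non-ASCII Latin-1 byte has its most-significant bit set (value >= 128).
high_bytes = [b for b in raw if b & 0x80]

# The same bytes read under two different assumptions about the encoding.
as_latin1 = raw.decode("latin-1")  # the intended reading
as_cp437 = raw.decode("cp437")     # the reading under MS-DOS code page 437
```

Here `high_bytes` holds the single value 233 (hex E9). Decoded as Latin-1 the name ends in "é", but under code page 437 the same byte is the Greek capital theta, so the name would display as "JosΘ".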

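The columns in the table are as follows:

 Cnt           number of occurrences of the symbol in the database
 Dec, Oct, Hx  the Latin-1 code in decimal, octal, and hexadecimal
 (unlabeled)   the symbol itself
 Eq            the nearest ASCII equivalent
 Daud          the symbol in Da'ud notation
 Name          the descriptive name of the symbol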

 Cnt Dec Oct Hx   Eq Daud Name..................................
   7 192 300 C0 À A  {'A} LATIN CAPITAL LETTER A WITH GRAVE
  54 193 301 C1 Á A  {A'} LATIN CAPITAL LETTER A WITH ACUTE
   3 194 302 C2 Â A  {A^} LATIN CAPITAL LETTER A WITH CIRCUMFLEX
  15 196 304 C4 Ä A  {A:} LATIN CAPITAL LETTER A WITH DIAERESIS
   6 197 305 C5 Å A  {Ao} LATIN CAPITAL LETTER A WITH RING ABOVE
 196 198 306 C6 Æ AE {AE} LATIN CAPITAL LIGATURE AE
   1 199 307 C7 Ç C  {C,} LATIN CAPITAL LETTER C WITH CEDILLA
 134 201 311 C9 É E  {E'} LATIN CAPITAL LETTER E WITH ACUTE
  12 205 315 CD Í I  {I'} LATIN CAPITAL LETTER I WITH ACUTE
   2 206 316 CE Î I  {I^} LATIN CAPITAL LETTER I WITH CIRCUMFLEX
   1 208 320 D0 Ð Dh {Dh} LATIN CAPITAL LETTER ETH
   6 210 322 D2 Ò O  {'O} LATIN CAPITAL LETTER O WITH GRAVE
 208 211 323 D3 Ó O  {O'} LATIN CAPITAL LETTER O WITH ACUTE
  18 214 326 D6 Ö O  {O:} LATIN CAPITAL LETTER O WITH DIAERESIS
  23 216 330 D8 Ø O  {O/} LATIN CAPITAL LETTER O WITH STROKE
  26 218 332 DA Ú U  {U'} LATIN CAPITAL LETTER U WITH ACUTE
   1 221 335 DD Ý Y  {Y'} LATIN CAPITAL LETTER Y WITH ACUTE
  17 222 336 DE Þ Th {Th} LATIN CAPITAL LETTER THORN
  10 223 337 DF ß s  {sz} LATIN SMALL LETTER SHARP S
  70 224 340 E0 à a  {'a} LATIN SMALL LETTER A WITH GRAVE
1280 225 341 E1 á a  {a'} LATIN SMALL LETTER A WITH ACUTE
  46 226 342 E2 â a  {a^} LATIN SMALL LETTER A WITH CIRCUMFLEX
   1 227 343 E3 ã a  {a~} LATIN SMALL LETTER A WITH TILDE
 288 228 344 E4 ä a  {a:} LATIN SMALL LETTER A WITH DIAERESIS
  57 229 345 E5 å a  {ao} LATIN SMALL LETTER A WITH RING ABOVE
 121 230 346 E6 æ ae {ae} LATIN SMALL LIGATURE AE
  48 231 347 E7 ç c  {c,} LATIN SMALL LETTER C WITH CEDILLA
 176 232 350 E8 è e  {'e} LATIN SMALL LETTER E WITH GRAVE
1250 233 351 E9 é e  {e'} LATIN SMALL LETTER E WITH ACUTE
  68 234 352 EA ê e  {e^} LATIN SMALL LETTER E WITH CIRCUMFLEX
  23 235 353 EB ë e  {e:} LATIN SMALL LETTER E WITH DIAERESIS
  15 236 354 EC ì i  {'i} LATIN SMALL LETTER I WITH GRAVE
1011 237 355 ED í i  {i'} LATIN SMALL LETTER I WITH ACUTE
  14 238 356 EE î i  {i^} LATIN SMALL LETTER I WITH CIRCUMFLEX
   3 239 357 EF ï i  {i:} LATIN SMALL LETTER I WITH DIAERESIS
 102 240 360 F0 ð dh {dh} LATIN SMALL LETTER ETH
  40 241 361 F1 ñ n  {n~} LATIN SMALL LETTER N WITH TILDE
  61 242 362 F2 ò o  {'o} LATIN SMALL LETTER O WITH GRAVE
 616 243 363 F3 ó o  {o'} LATIN SMALL LETTER O WITH ACUTE
  28 244 364 F4 ô o  {o^} LATIN SMALL LETTER O WITH CIRCUMFLEX
 391 246 366 F6 ö o  {o:} LATIN SMALL LETTER O WITH DIAERESIS
 104 248 370 F8 ø o  {o/} LATIN SMALL LETTER O WITH STROKE
  28 249 371 F9 ù u  {'u} LATIN SMALL LETTER U WITH GRAVE
 192 250 372 FA ú u  {u'} LATIN SMALL LETTER U WITH ACUTE
  30 251 373 FB û u  {u^} LATIN SMALL LETTER U WITH CIRCUMFLEX
 331 252 374 FC ü u  {u:} LATIN SMALL LETTER U WITH DIAERESIS
  28 253 375 FD ý y  {y'} LATIN SMALL LETTER Y WITH ACUTE
   4 254 376 FE þ th {th} LATIN SMALL LETTER THORN
   2 255 377 FF ÿ y  {y:} LATIN SMALL LETTER Y WITH DIAERESIS
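Rows of the table can be used as lookup tables. The following Python sketch copies only a handful of rows; it expands Da'ud notation into Latin-1 bytes and folds Latin-1 bytes to the ASCII equivalents in the Eq column. The names `ROWS`, `daud_to_latin1`, and `latin1_to_ascii` are hypothetical, and the assumption that Da'ud codes appear in text as brace-delimited two-character tokens is drawn from the Daud column, not from any description of the database tools.

```python
import re

# A few rows copied from the table:
# Da'ud code -> (Latin-1 byte value, nearest ASCII equivalent).
ROWS = {
    "{a'}": (0xE1, "a"),
    "{e'}": (0xE9, "e"),
    "{o:}": (0xF6, "o"),
    "{ae}": (0xE6, "ae"),
    "{Th}": (0xDE, "Th"),
    "{dh}": (0xF0, "dh"),
}

# Assumed token shape: two characters between braces.
DAUD_TOKEN = re.compile(r"\{[^{}]{2}\}")

# Latin-1 byte value -> ASCII equivalent, derived from ROWS.
ASCII_EQ = {code: eq for code, eq in ROWS.values()}


def daud_to_latin1(text: str) -> bytes:
    """Replace known Da'ud tokens with Latin-1 bytes; unknown tokens are left alone."""
    def repl(match):
        row = ROWS.get(match.group(0))
        return chr(row[0]) if row else match.group(0)
    return DAUD_TOKEN.sub(repl, text).encode("latin-1")


def latin1_to_ascii(data: bytes) -> str:
    """Fold Latin-1 bytes listed in ROWS to ASCII; other bytes pass through unchanged."""
    return "".join(ASCII_EQ.get(b, chr(b)) for b in data)
```

For example, `daud_to_latin1("Andr{e'}")` yields the bytes `b"Andr\xe9"`, and `latin1_to_ascii(b"Andr\xe9")` yields `"Andre"`. A complete version would need every row of the table.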

Twenty-nine of the rarer symbols do not have Latin-1 encodings. In the database, these symbols have been reduced to the nearest ASCII equivalent:

 Cnt Eq Daud Name................................
   4 C  {Cv} LATIN CAPITAL LETTER C WITH CARON
   2 OE {OE} LATIN CAPITAL LIGATURE OE
   3 S  {S'} LATIN CAPITAL LETTER S WITH ACUTE
   3 S  {Sv} LATIN CAPITAL LETTER S WITH CARON
   8 Z  {Zv} LATIN CAPITAL LETTER Z WITH CARON
  12 a  {a-} LATIN SMALL LETTER A WITH MACRON
   2 b  {b-} LATIN SMALL LETTER B WITH TOPBAR
   8 c  {cv} LATIN SMALL LETTER C WITH CARON
   2 e  {e,} LATIN SMALL LETTER E WITH OGONEK
  12 e  {e-} LATIN SMALL LETTER E WITH MACRON
   2 e  {ev} LATIN SMALL LETTER E WITH CARON
  16 i  {i-} LATIN SMALL LETTER I WITH MACRON
   1 l  {l/} LATIN SMALL LETTER L WITH STROKE
   7 m  {m.} LATIN SMALL LETTER M WITH DOT BELOW
   1 n  {n'} LATIN SMALL LETTER N WITH ACUTE
   7 n  {n.} LATIN SMALL LETTER N WITH DOT BELOW
   9 o  {o,} LATIN SMALL LETTER O WITH OGONEK
   5 o  {o-} LATIN SMALL LETTER O WITH MACRON
   1 oe {oe} LATIN SMALL LIGATURE OE
   9 r  {rv} LATIN SMALL LETTER R WITH CARON
   7 s  {s.} LATIN SMALL LETTER S WITH DOT BELOW
   4 s  {sv} LATIN SMALL LETTER S WITH CARON
   2 u  {u,} LATIN SMALL LETTER U WITH OGONEK
  10 u  {u-} LATIN SMALL LETTER U WITH MACRON
   7 w  {w^} LATIN SMALL LETTER W WITH CIRCUMFLEX
   2 y  {y^} LATIN SMALL LETTER Y WITH CIRCUMFLEX
   1 y  {y~} LATIN SMALL LETTER Y WITH TILDE
   6 z  {z'} LATIN SMALL LETTER Z WITH ACUTE
   6 z  {zv} LATIN SMALL LETTER Z WITH CARON
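The reduction described above can be sketched in Python as an encoder that uses Latin-1 where possible and otherwise falls back to the ASCII equivalent from this second table. Only five rows are copied here, and the names `FALLBACK` and `encode_for_database` are hypothetical; this is an illustration of the rule, not the procedure actually used to build the database.

```python
# A few rows from the table above: character -> nearest ASCII equivalent.
FALLBACK = {
    "\u010c": "C",   # capital C with caron, {Cv}
    "\u0161": "s",   # small s with caron, {sv}
    "\u0101": "a",   # small a with macron, {a-}
    "\u0142": "l",   # small l with stroke, {l/}
    "\u0153": "oe",  # small ligature oe, {oe}
}


def encode_for_database(text: str) -> bytes:
    """Encode as Latin-1 where possible, else use the ASCII equivalent."""
    out = bytearray()
    for ch in text:
        try:
            out += ch.encode("latin-1")
        except UnicodeEncodeError:
            # No Latin-1 code exists; a character missing from FALLBACK
            # raises KeyError rather than being silently dropped.
            out += FALLBACK[ch].encode("ascii")
    return bytes(out)
```

With these rows, a name beginning with C-caron is stored with a plain "C", while a name containing "é" keeps its Latin-1 byte E9.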
