forked from raphinesse/atom-character-table
-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathrfc1345.txt
6078 lines (5361 loc) · 244 KB
/
rfc1345.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
Network Working Group K. Simonsen
Request for Comments: 1345 Rationel Almen Planlaegning
June 1992
Character Mnemonics & Character Sets
Status of the Memo
This memo provides information for the Internet community. It does
not specify an Internet standard. Distribution of this memo is
unlimited.
Summary
This memo lists a selection of characters and their presence in some
coded character sets. To facilitate the coded character set
tabulations an unambiguous mnemonic for each character is used, and a
format for tabulating the coded character sets is defined. The coded
character sets are given names for easy reference. A family of coded
character sets called the mnemonic character sets and conversion
between these coded character set without information loss is
defined.
The character set names are registered with the Internet Assigned
Numbers Authority (IANA). Additional character sets not described in
this memo should be registered with the IANA. This memo may be
updated periodically, or additional specifications may be published,
to reflect other coded character sets.
Please send any comments including comments about the accuracy of the
tables to the author, Keld Simonsen <[email protected]>.
1. INTRODUCTION
With the growing internationalization of the Internet, support for
many coded character sets is required. It is the intention of this
memo to document precisely the mapping between all characters and
their corresponding coded representations in various coded character
sets, and give names to these coded character sets, so they can be
referenced unambiguously in Internet standards.
This memo does not indicate anything about the validity of using
these specifications in any Internet standard, so you should consult
each individual Internet standard to see which coded character sets
and names are allowed there.
Unambiguous character mnemonics are specified, which provide a
practical way of identifying a character, without reference to a
coded character set and its code in this coded character set. The
mnemonics are written in a minimal set of characters, namely the
invariant 83 graphical characters of ISO 646, which is a kind of
greatest common subset to be found between the majority of coded
Simonsen [Page 1]
RFC 1345 Character Mnemonics & Character Sets June 1992
character sets, including ASCII, national variants of the ISO 646 7-
bit character set and various EBCDICs. In addition, the numeric
value of the coded representations of all these characters are the
same in all coded character sets compatible with ISO standards. All
of them except two, EXCLAMATION MARK and QUOTATION MARK, have the
same coded representation in all variants of EBCDIC. This minimal
set of characters is called the reference character set in this memo.
The mnemonics can be used in Internet standards for easy and
unambiguous reference, and they can also serve as a fallback
representation in various Internet specifications.
The coded character sets covered include all parts of ISO 8859, ISO
6937-2 and all ISO 646 conforming coded character sets in the ISO
character set registry managed by ECMA according to ISO 2375. Almost
all graphic coded character sets in the ECMA registry (1) are
covered. The graphic coded character sets not included are registry
numbers 31, 38, 39, 53, 59, 68, 71, 72, 129 and 137. In addition
many vendor defined character sets are covered, including PC
codepages (4), (7), (8), many EBCDIC character sets (4), (5), (6) and
HP, DEC and Apple character sets (8), (9), (10), (13), (14). The
East-Asian 16-bit character sets from the ECMA registry is also
included in this memo.
2. CHARACTER MNEMONICS
2.1 General Syntax
The character mnemonics are taken from the ISO committee draft (CD)
of the POSIX.2 standard (3). They are classified into two groups:
1. A group with two-character mnemonics
- Primarily intended for alphabetic scripts like Latin, Greek,
Cyrillic, Hebrew and Arabic, and special characters.
2. A group with variable-length mnemonics
- primarily intended for non-alphabetic scripts like Japanese and
Chinese, but also used for some accented letters and special
characters.
In the two-character mnemonics, all invariant graphic character in
the ISO 646 character codes except "&" are used, i.e. the following
characters:
! " % ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z _
a b c d e f g h i j k l m n o p q r s t u v w x y z
The character "_" is not used as the first character.
In the variable-length mnemonics, the character "_" is not used as
the first character. If it is used in a name, its presence is
doubled.
Simonsen [Page 2]
RFC 1345 Character Mnemonics & Character Sets June 1992
The mnemonics can be used in several different ways for different
purposes. One of these is description of coded character sets, which
is detailed in section 3. Another is for extending a given coded
character set to a mnemonic character set. This is described in
section 4. The restrictions on the use of the characters "&" and "_"
are due to demands of the compositional methods of these techniques.
2.2 ISO Official Long Descriptive Character Name
For all mnemonics, the character for which it stands is indicated in
the following table by a long descriptive name. This name is
identical to the ISO name of the character as given in reference (2).
For a few characters that are not included there, descriptive names
of the same kind are introduced in this memo. The source of each
character is stated in the table after the name and should be
consulted for a reliable identification of the character.
These long descriptive names consists only of the capital Latin
letters of the invariant part of ISO 646, the digits, "-", and SPACE.
Digits are only used in names of ideographic and Hangul characters
and never as the first character.
2.3 The 2-character Mnemonics
The two-character mnemonics include various accented Latin letters,
Greek, Cyrillic, Hebrew, Arabic, Hiragana and Katakana. Also a fair
number of special characters are included. Almost all ISO or ISO
registered 7- and 8-bit graphical coded character sets are covered
with these two-character mnemonics.
The two characters are chosen so the graphical appearance in the
reference set resembles as much as possible (within the possibilities
available) the graphical appearance of the character. The basic
character set of ISO 646 is used as the reference set, as mentioned
above.
The characters in the reference character set are chosen to represent
themselves.
For control characters from ISO 646 the two-character acronyms of ISO
2047 are used as mnemonics. For the other control characters of ISO
6429, two-character mnemonics have been selected based on the
variable-length acronyms used in that standard.
Letters, including Greek, Cyrillic, Arabic and Hebrew, are
represented with the base letter as the first letter, and the second
letter represents an accent or relation to a non-Latin script. Non-
Latin letters are transliterated to Latin letters, following
transliteration standards as closely as possible. This is also done
with the Latin letters such as ETH and THORN, and the
Danish/Norwegian/Swedish letter A WITH RING ABOVE is transliterated
into "aa".
Simonsen [Page 3]
RFC 1345 Character Mnemonics & Character Sets June 1992
After a letter, the second character signifies the following:
Exclamation mark ! Grave
Apostrophe ' Acute accent
Greater-Than sign > Circumflex accent
Question Mark ? tilde
Hyphen-Minus - Macron
Left parenthesis ( Breve
Full Stop . Dot Above
Colon : Diaeresis
Comma , Cedilla
Underline _ Underline
Solidus / Stroke
Quotation mark " Double acute accent
Semicolon ; Ogonek
Less-Than sign < Caron
Zero 0 Ring above
Two 2 Hook
Nine 9 Horn
Equals = Cyrillic
Asterisk * Greek
Percent sign % Greek/Cyrillic special
Plus + smalls: Arabic, capitals: Hebrew
Three 3 some Latin/Greek/Cyrillic letters
Four 4 Bopomofo
Five 5 Hiragana
Six 6 Katakana
In designing the mnemonics the following special characters were
reserved: The ampersand is reserved as an intro character, indicating
that the following string is in the mnemonic character set. The
underline character is reserved for the variable-length mnemonics.
This use does not eliminate usage as an accent or language
identifier.
Special characters are encoded with some mnemonic value. These are
not systematic thruout, but most mnemonics start with a related
special character of the reference set.
2.4 The Variable-length Character Mnemonics
The Variable-length Character Mnemonics are primarily meant for the
ideographic characters in larger Asian character sets, but are also
used for accented characters with several accents and some special
characters. To have the mnemonics as short as possible, which both
saves storage and is easier to input, a quite short name is
preferred. Considering the Chinese standard GB 2312-1980, the
Japanese standards JIS X0208 and JIS X0212, and the Korean standard
KS C 5601, they are all given by row and column numbers between 1 and
94. So two positions for row and column and a character set
identifier of one character would be almost as short as possible.
The following character set identifiers are defined:
Simonsen [Page 4]
RFC 1345 Character Mnemonics & Character Sets June 1992
c GB 2312-1980
j JIS X0208-1990
J JIS X0212-1990
k KS C 5601-1987
This system for the representation of ideographic characters and
Hangul characters is not truly mnemonic, but it provides short
representations that are easy to connect to the corresponding
character by means of the code table of an official character set
standard. Alternative methods based on the graphic appearance or the
pronunciation of the characters are thought to be unfeasible.
One prominent character in the reference character set is reserved
for identifying variable-length mnemonics, namely the underline
character "_". This character is intended as a delimiter both in the
front and in the end of the mnemonic. An example of its use would be:
(&=intro):
&_j3210_ &_j4436_&_j6530_
3. CHARACTER MNEMONIC TABLE
The following table contains the character mnemonic and the encoding
and long descriptive name of ISO 2DIS 10646 (2). Although the ISO
10646 is only at DIS stage at this moment of writing and there is
quite some debate about it, the long descriptive naming in the DIS is
considered to be stable and the best official ISO reference to
character names. The 2-octet encoded value of the ISO 2DIS 10646 is
also used, but only as an identification of the character, and it
should only be used for identification purposes as the coded
representation may be changed in the final 10646 international
standard. Some characters not in the ISO 2DIS 10646 are allocated
values in the private use zone and given names and references to a
character set where it is used.
The format of the table is:
1st field is the character mnemonic (mostly 2 characters).
2nd field is the ISO 2DIS 10646 code in hexadecimal.
3rd field is the long descriptive name of ISO 2DIS 10646.
SP 0020 SPACE
! 0021 EXCLAMATION MARK
" 0022 QUOTATION MARK
Nb 0023 NUMBER SIGN
DO 0024 DOLLAR SIGN
% 0025 PERCENT SIGN
& 0026 AMPERSAND
' 0027 APOSTROPHE
( 0028 LEFT PARENTHESIS
) 0029 RIGHT PARENTHESIS
* 002a ASTERISK
+ 002b PLUS SIGN
Simonsen [Page 5]
RFC 1345 Character Mnemonics & Character Sets June 1992
, 002c COMMA
- 002d HYPHEN-MINUS
. 002e FULL STOP
/ 002f SOLIDUS
0 0030 DIGIT ZERO
1 0031 DIGIT ONE
2 0032 DIGIT TWO
3 0033 DIGIT THREE
4 0034 DIGIT FOUR
5 0035 DIGIT FIVE
6 0036 DIGIT SIX
7 0037 DIGIT SEVEN
8 0038 DIGIT EIGHT
9 0039 DIGIT NINE
: 003a COLON
; 003b SEMICOLON
< 003c LESS-THAN SIGN
= 003d EQUALS SIGN
> 003e GREATER-THAN SIGN
? 003f QUESTION MARK
At 0040 COMMERCIAL AT
A 0041 LATIN CAPITAL LETTER A
B 0042 LATIN CAPITAL LETTER B
C 0043 LATIN CAPITAL LETTER C
D 0044 LATIN CAPITAL LETTER D
E 0045 LATIN CAPITAL LETTER E
F 0046 LATIN CAPITAL LETTER F
G 0047 LATIN CAPITAL LETTER G
H 0048 LATIN CAPITAL LETTER H
I 0049 LATIN CAPITAL LETTER I
J 004a LATIN CAPITAL LETTER J
K 004b LATIN CAPITAL LETTER K
L 004c LATIN CAPITAL LETTER L
M 004d LATIN CAPITAL LETTER M
N 004e LATIN CAPITAL LETTER N
O 004f LATIN CAPITAL LETTER O
P 0050 LATIN CAPITAL LETTER P
Q 0051 LATIN CAPITAL LETTER Q
R 0052 LATIN CAPITAL LETTER R
S 0053 LATIN CAPITAL LETTER S
T 0054 LATIN CAPITAL LETTER T
U 0055 LATIN CAPITAL LETTER U
V 0056 LATIN CAPITAL LETTER V
W 0057 LATIN CAPITAL LETTER W
X 0058 LATIN CAPITAL LETTER X
Y 0059 LATIN CAPITAL LETTER Y
Z 005a LATIN CAPITAL LETTER Z
<( 005b LEFT SQUARE BRACKET
// 005c REVERSE SOLIDUS
)> 005d RIGHT SQUARE BRACKET
'> 005e CIRCUMFLEX ACCENT
_ 005f LOW LINE
'! 0060 GRAVE ACCENT
Simonsen [Page 6]
RFC 1345 Character Mnemonics & Character Sets June 1992
a 0061 LATIN SMALL LETTER A
b 0062 LATIN SMALL LETTER B
c 0063 LATIN SMALL LETTER C
d 0064 LATIN SMALL LETTER D
e 0065 LATIN SMALL LETTER E
f 0066 LATIN SMALL LETTER F
g 0067 LATIN SMALL LETTER G
h 0068 LATIN SMALL LETTER H
i 0069 LATIN SMALL LETTER I
j 006a LATIN SMALL LETTER J
k 006b LATIN SMALL LETTER K
l 006c LATIN SMALL LETTER L
m 006d LATIN SMALL LETTER M
n 006e LATIN SMALL LETTER N
o 006f LATIN SMALL LETTER O
p 0070 LATIN SMALL LETTER P
q 0071 LATIN SMALL LETTER Q
r 0072 LATIN SMALL LETTER R
s 0073 LATIN SMALL LETTER S
t 0074 LATIN SMALL LETTER T
u 0075 LATIN SMALL LETTER U
v 0076 LATIN SMALL LETTER V
w 0077 LATIN SMALL LETTER W
x 0078 LATIN SMALL LETTER X
y 0079 LATIN SMALL LETTER Y
z 007a LATIN SMALL LETTER Z
(! 007b LEFT CURLY BRACKET
!! 007c VERTICAL LINE
!) 007d RIGHT CURLY BRACKET
'? 007e TILDE
NS 00a0 NO-BREAK SPACE
!I 00a1 INVERTED EXCLAMATION MARK
Ct 00a2 CENT SIGN
Pd 00a3 POUND SIGN
Cu 00a4 CURRENCY SIGN
Ye 00a5 YEN SIGN
BB 00a6 BROKEN BAR
SE 00a7 SECTION SIGN
': 00a8 DIAERESIS
Co 00a9 COPYRIGHT SIGN
-a 00aa FEMININE ORDINAL INDICATOR
<< 00ab LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
NO 00ac NOT SIGN
-- 00ad SOFT HYPHEN
Rg 00ae REGISTERED SIGN
'm 00af MACRON
DG 00b0 DEGREE SIGN
+- 00b1 PLUS-MINUS SIGN
2S 00b2 SUPERSCRIPT TWO
3S 00b3 SUPERSCRIPT THREE
'' 00b4 ACUTE ACCENT
My 00b5 MICRO SIGN
PI 00b6 PILCROW SIGN
Simonsen [Page 7]
RFC 1345 Character Mnemonics & Character Sets June 1992
.M 00b7 MIDDLE DOT
', 00b8 CEDILLA
1S 00b9 SUPERSCRIPT ONE
-o 00ba MASCULINE ORDINAL INDICATOR
>> 00bb RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
14 00bc VULGAR FRACTION ONE QUARTER
12 00bd VULGAR FRACTION ONE HALF
34 00be VULGAR FRACTION THREE QUARTERS
?I 00bf INVERTED QUESTION MARK
A! 00c0 LATIN CAPITAL LETTER A WITH GRAVE
A' 00c1 LATIN CAPITAL LETTER A WITH ACUTE
A> 00c2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX
A? 00c3 LATIN CAPITAL LETTER A WITH TILDE
A: 00c4 LATIN CAPITAL LETTER A WITH DIAERESIS
AA 00c5 LATIN CAPITAL LETTER A WITH RING ABOVE
AE 00c6 LATIN CAPITAL LETTER AE
C, 00c7 LATIN CAPITAL LETTER C WITH CEDILLA
E! 00c8 LATIN CAPITAL LETTER E WITH GRAVE
E' 00c9 LATIN CAPITAL LETTER E WITH ACUTE
E> 00ca LATIN CAPITAL LETTER E WITH CIRCUMFLEX
E: 00cb LATIN CAPITAL LETTER E WITH DIAERESIS
I! 00cc LATIN CAPITAL LETTER I WITH GRAVE
I' 00cd LATIN CAPITAL LETTER I WITH ACUTE
I> 00ce LATIN CAPITAL LETTER I WITH CIRCUMFLEX
I: 00cf LATIN CAPITAL LETTER I WITH DIAERESIS
D- 00d0 LATIN CAPITAL LETTER ETH (Icelandic)
N? 00d1 LATIN CAPITAL LETTER N WITH TILDE
O! 00d2 LATIN CAPITAL LETTER O WITH GRAVE
O' 00d3 LATIN CAPITAL LETTER O WITH ACUTE
O> 00d4 LATIN CAPITAL LETTER O WITH CIRCUMFLEX
O? 00d5 LATIN CAPITAL LETTER O WITH TILDE
O: 00d6 LATIN CAPITAL LETTER O WITH DIAERESIS
*X 00d7 MULTIPLICATION SIGN
O/ 00d8 LATIN CAPITAL LETTER O WITH STROKE
U! 00d9 LATIN CAPITAL LETTER U WITH GRAVE
U' 00da LATIN CAPITAL LETTER U WITH ACUTE
U> 00db LATIN CAPITAL LETTER U WITH CIRCUMFLEX
U: 00dc LATIN CAPITAL LETTER U WITH DIAERESIS
Y' 00dd LATIN CAPITAL LETTER Y WITH ACUTE
TH 00de LATIN CAPITAL LETTER THORN (Icelandic)
ss 00df LATIN SMALL LETTER SHARP S (German)
a! 00e0 LATIN SMALL LETTER A WITH GRAVE
a' 00e1 LATIN SMALL LETTER A WITH ACUTE
a> 00e2 LATIN SMALL LETTER A WITH CIRCUMFLEX
a? 00e3 LATIN SMALL LETTER A WITH TILDE
a: 00e4 LATIN SMALL LETTER A WITH DIAERESIS
aa 00e5 LATIN SMALL LETTER A WITH RING ABOVE
ae 00e6 LATIN SMALL LETTER AE
c, 00e7 LATIN SMALL LETTER C WITH CEDILLA
e! 00e8 LATIN SMALL LETTER E WITH GRAVE
e' 00e9 LATIN SMALL LETTER E WITH ACUTE
e> 00ea LATIN SMALL LETTER E WITH CIRCUMFLEX
e: 00eb LATIN SMALL LETTER E WITH DIAERESIS
Simonsen [Page 8]
RFC 1345 Character Mnemonics & Character Sets June 1992
i! 00ec LATIN SMALL LETTER I WITH GRAVE
i' 00ed LATIN SMALL LETTER I WITH ACUTE
i> 00ee LATIN SMALL LETTER I WITH CIRCUMFLEX
i: 00ef LATIN SMALL LETTER I WITH DIAERESIS
d- 00f0 LATIN SMALL LETTER ETH (Icelandic)
n? 00f1 LATIN SMALL LETTER N WITH TILDE
o! 00f2 LATIN SMALL LETTER O WITH GRAVE
o' 00f3 LATIN SMALL LETTER O WITH ACUTE
o> 00f4 LATIN SMALL LETTER O WITH CIRCUMFLEX
o? 00f5 LATIN SMALL LETTER O WITH TILDE
o: 00f6 LATIN SMALL LETTER O WITH DIAERESIS
-: 00f7 DIVISION SIGN
o/ 00f8 LATIN SMALL LETTER O WITH STROKE
u! 00f9 LATIN SMALL LETTER U WITH GRAVE
u' 00fa LATIN SMALL LETTER U WITH ACUTE
u> 00fb LATIN SMALL LETTER U WITH CIRCUMFLEX
u: 00fc LATIN SMALL LETTER U WITH DIAERESIS
y' 00fd LATIN SMALL LETTER Y WITH ACUTE
th 00fe LATIN SMALL LETTER THORN (Icelandic)
y: 00ff LATIN SMALL LETTER Y WITH DIAERESIS
A- 0100 LATIN CAPITAL LETTER A WITH MACRON
a- 0101 LATIN SMALL LETTER A WITH MACRON
A( 0102 LATIN CAPITAL LETTER A WITH BREVE
a( 0103 LATIN SMALL LETTER A WITH BREVE
A; 0104 LATIN CAPITAL LETTER A WITH OGONEK
a; 0105 LATIN SMALL LETTER A WITH OGONEK
C' 0106 LATIN CAPITAL LETTER C WITH ACUTE
c' 0107 LATIN SMALL LETTER C WITH ACUTE
C> 0108 LATIN CAPITAL LETTER C WITH CIRCUMFLEX
c> 0109 LATIN SMALL LETTER C WITH CIRCUMFLEX
C. 010a LATIN CAPITAL LETTER C WITH DOT ABOVE
c. 010b LATIN SMALL LETTER C WITH DOT ABOVE
C< 010c LATIN CAPITAL LETTER C WITH CARON
c< 010d LATIN SMALL LETTER C WITH CARON
D< 010e LATIN CAPITAL LETTER D WITH CARON
d< 010f LATIN SMALL LETTER D WITH CARON
D/ 0110 LATIN CAPITAL LETTER D WITH STROKE
d/ 0111 LATIN SMALL LETTER D WITH STROKE
E- 0112 LATIN CAPITAL LETTER E WITH MACRON
e- 0113 LATIN SMALL LETTER E WITH MACRON
E( 0114 LATIN CAPITAL LETTER E WITH BREVE
e( 0115 LATIN SMALL LETTER E WITH BREVE
E. 0116 LATIN CAPITAL LETTER E WITH DOT ABOVE
e. 0117 LATIN SMALL LETTER E WITH DOT ABOVE
E; 0118 LATIN CAPITAL LETTER E WITH OGONEK
e; 0119 LATIN SMALL LETTER E WITH OGONEK
E< 011a LATIN CAPITAL LETTER E WITH CARON
e< 011b LATIN SMALL LETTER E WITH CARON
G> 011c LATIN CAPITAL LETTER G WITH CIRCUMFLEX
g> 011d LATIN SMALL LETTER G WITH CIRCUMFLEX
G( 011e LATIN CAPITAL LETTER G WITH BREVE
g( 011f LATIN SMALL LETTER G WITH BREVE
G. 0120 LATIN CAPITAL LETTER G WITH DOT ABOVE
Simonsen [Page 9]
RFC 1345 Character Mnemonics & Character Sets June 1992
g. 0121 LATIN SMALL LETTER G WITH DOT ABOVE
G, 0122 LATIN CAPITAL LETTER G WITH CEDILLA
g, 0123 LATIN SMALL LETTER G WITH CEDILLA
H> 0124 LATIN CAPITAL LETTER H WITH CIRCUMFLEX
h> 0125 LATIN SMALL LETTER H WITH CIRCUMFLEX
H/ 0126 LATIN CAPITAL LETTER H WITH STROKE
h/ 0127 LATIN SMALL LETTER H WITH STROKE
I? 0128 LATIN CAPITAL LETTER I WITH TILDE
i? 0129 LATIN SMALL LETTER I WITH TILDE
I- 012a LATIN CAPITAL LETTER I WITH MACRON
i- 012b LATIN SMALL LETTER I WITH MACRON
I( 012c LATIN CAPITAL LETTER I WITH BREVE
i( 012d LATIN SMALL LETTER I WITH BREVE
I; 012e LATIN CAPITAL LETTER I WITH OGONEK
i; 012f LATIN SMALL LETTER I WITH OGONEK
I. 0130 LATIN CAPITAL LETTER I WITH DOT ABOVE
i. 0131 LATIN SMALL LETTER I DOTLESS
IJ 0132 LATIN CAPITAL LIGATURE IJ
ij 0133 LATIN SMALL LIGATURE IJ
J> 0134 LATIN CAPITAL LETTER J WITH CIRCUMFLEX
j> 0135 LATIN SMALL LETTER J WITH CIRCUMFLEX
K, 0136 LATIN CAPITAL LETTER K WITH CEDILLA
k, 0137 LATIN SMALL LETTER K WITH CEDILLA
kk 0138 LATIN SMALL LETTER KRA (Greenlandic)
L' 0139 LATIN CAPITAL LETTER L WITH ACUTE
l' 013a LATIN SMALL LETTER L WITH ACUTE
L, 013b LATIN CAPITAL LETTER L WITH CEDILLA
l, 013c LATIN SMALL LETTER L WITH CEDILLA
L< 013d LATIN CAPITAL LETTER L WITH CARON
l< 013e LATIN SMALL LETTER L WITH CARON
L. 013f LATIN CAPITAL LETTER L WITH MIDDLE DOT
l. 0140 LATIN SMALL LETTER L WITH MIDDLE DOT
L/ 0141 LATIN CAPITAL LETTER L WITH STROKE
l/ 0142 LATIN SMALL LETTER L WITH STROKE
N' 0143 LATIN CAPITAL LETTER N WITH ACUTE
n' 0144 LATIN SMALL LETTER N WITH ACUTE
N, 0145 LATIN CAPITAL LETTER N WITH CEDILLA
n, 0146 LATIN SMALL LETTER N WITH CEDILLA
N< 0147 LATIN CAPITAL LETTER N WITH CARON
n< 0148 LATIN SMALL LETTER N WITH CARON
'n 0149 LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
NG 014a LATIN CAPITAL LETTER ENG (Lappish)
ng 014b LATIN SMALL LETTER ENG (Lappish)
O- 014c LATIN CAPITAL LETTER O WITH MACRON
o- 014d LATIN SMALL LETTER O WITH MACRON
O( 014e LATIN CAPITAL LETTER O WITH BREVE
o( 014f LATIN SMALL LETTER O WITH BREVE
O" 0150 LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
o" 0151 LATIN SMALL LETTER O WITH DOUBLE ACUTE
OE 0152 LATIN CAPITAL LIGATURE OE
oe 0153 LATIN SMALL LIGATURE OE
R' 0154 LATIN CAPITAL LETTER R WITH ACUTE
r' 0155 LATIN SMALL LETTER R WITH ACUTE
Simonsen [Page 10]
RFC 1345 Character Mnemonics & Character Sets June 1992
R, 0156 LATIN CAPITAL LETTER R WITH CEDILLA
r, 0157 LATIN SMALL LETTER R WITH CEDILLA
R< 0158 LATIN CAPITAL LETTER R WITH CARON
r< 0159 LATIN SMALL LETTER R WITH CARON
S' 015a LATIN CAPITAL LETTER S WITH ACUTE
s' 015b LATIN SMALL LETTER S WITH ACUTE
S> 015c LATIN CAPITAL LETTER S WITH CIRCUMFLEX
s> 015d LATIN SMALL LETTER S WITH CIRCUMFLEX
S, 015e LATIN CAPITAL LETTER S WITH CEDILLA
s, 015f LATIN SMALL LETTER S WITH CEDILLA
S< 0160 LATIN CAPITAL LETTER S WITH CARON
s< 0161 LATIN SMALL LETTER S WITH CARON
T, 0162 LATIN CAPITAL LETTER T WITH CEDILLA
t, 0163 LATIN SMALL LETTER T WITH CEDILLA
T< 0164 LATIN CAPITAL LETTER T WITH CARON
t< 0165 LATIN SMALL LETTER T WITH CARON
T/ 0166 LATIN CAPITAL LETTER T WITH STROKE
t/ 0167 LATIN SMALL LETTER T WITH STROKE
U? 0168 LATIN CAPITAL LETTER U WITH TILDE
u? 0169 LATIN SMALL LETTER U WITH TILDE
U- 016a LATIN CAPITAL LETTER U WITH MACRON
u- 016b LATIN SMALL LETTER U WITH MACRON
U( 016c LATIN CAPITAL LETTER U WITH BREVE
u( 016d LATIN SMALL LETTER U WITH BREVE
U0 016e LATIN CAPITAL LETTER U WITH RING ABOVE
u0 016f LATIN SMALL LETTER U WITH RING ABOVE
U" 0170 LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
u" 0171 LATIN SMALL LETTER U WITH DOUBLE ACUTE
U; 0172 LATIN CAPITAL LETTER U WITH OGONEK
u; 0173 LATIN SMALL LETTER U WITH OGONEK
W> 0174 LATIN CAPITAL LETTER W WITH CIRCUMFLEX
w> 0175 LATIN SMALL LETTER W WITH CIRCUMFLEX
Y> 0176 LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
y> 0177 LATIN SMALL LETTER Y WITH CIRCUMFLEX
Y: 0178 LATIN CAPITAL LETTER Y WITH DIAERESIS
Z' 0179 LATIN CAPITAL LETTER Z WITH ACUTE
z' 017a LATIN SMALL LETTER Z WITH ACUTE
Z. 017b LATIN CAPITAL LETTER Z WITH DOT ABOVE
z. 017c LATIN SMALL LETTER Z WITH DOT ABOVE
Z< 017d LATIN CAPITAL LETTER Z WITH CARON
z< 017e LATIN SMALL LETTER Z WITH CARON
O9 01a0 LATIN CAPITAL LETTER O WITH HORN
o9 01a1 LATIN SMALL LETTER O WITH HORN
OI 01a2 LATIN CAPITAL LETTER OI
oi 01a3 LATIN SMALL LETTER OI
yr 01a6 LATIN LETTER YR
U9 01af LATIN CAPITAL LETTER U WITH HORN
u9 01b0 LATIN SMALL LETTER U WITH HORN
Z/ 01b5 LATIN CAPITAL LETTER Z WITH STROKE
z/ 01b6 LATIN SMALL LETTER Z WITH STROKE
ED 01b7 LATIN CAPITAL LETTER EZH
A< 01cd LATIN CAPITAL LETTER A WITH CARON
a< 01ce LATIN SMALL LETTER A WITH CARON
Simonsen [Page 11]
RFC 1345 Character Mnemonics & Character Sets June 1992
I< 01cf LATIN CAPITAL LETTER I WITH CARON
i< 01d0 LATIN SMALL LETTER I WITH CARON
O< 01d1 LATIN CAPITAL LETTER O WITH CARON
o< 01d2 LATIN SMALL LETTER O WITH CARON
U< 01d3 LATIN CAPITAL LETTER U WITH CARON
u< 01d4 LATIN SMALL LETTER U WITH CARON
U:- 01d5 LATIN CAPITAL LETTER U WITH DIAERESIS AND MACRON
u:- 01d6 LATIN SMALL LETTER U WITH DIAERESIS AND MACRON
U:' 01d7 LATIN CAPITAL LETTER U WITH DIAERESIS AND ACUTE
u:' 01d8 LATIN SMALL LETTER U WITH DIAERESIS AND ACUTE
U:< 01d9 LATIN CAPITAL LETTER U WITH DIAERESIS AND CARON
u:< 01da LATIN SMALL LETTER U WITH DIAERESIS AND CARON
U:! 01db LATIN CAPITAL LETTER U WITH DIAERESIS AND GRAVE
u:! 01dc LATIN SMALL LETTER U WITH DIAERESIS AND GRAVE
A1 01de LATIN CAPITAL LETTER A WITH DIAERESIS AND MACRON
a1 01df LATIN SMALL LETTER A WITH DIAERESIS AND MACRON
A7 01e0 LATIN CAPITAL LETTER A WITH DOT ABOVE AND MACRON
a7 01e1 LATIN SMALL LETTER A WITH DOT ABOVE AND MACRON
A3 01e2 LATIN CAPITAL LETTER AE WITH MACRON
a3 01e3 LATIN SMALL LETTER AE WITH MACRON
G/ 01e4 LATIN CAPITAL LETTER G WITH STROKE
g/ 01e5 LATIN SMALL LETTER G WITH STROKE
G< 01e6 LATIN CAPITAL LETTER G WITH CARON
g< 01e7 LATIN SMALL LETTER G WITH CARON
K< 01e8 LATIN CAPITAL LETTER K WITH CARON
k< 01e9 LATIN SMALL LETTER K WITH CARON
O; 01ea LATIN CAPITAL LETTER O WITH OGONEK
o; 01eb LATIN SMALL LETTER O WITH OGONEK
O1 01ec LATIN CAPITAL LETTER O WITH OGONEK AND MACRON
o1 01ed LATIN SMALL LETTER O WITH OGONEK AND MACRON
EZ 01ee LATIN CAPITAL LETTER EZH WITH CARON
ez 01ef LATIN SMALL LETTER EZH WITH CARON
j< 01f0 LATIN SMALL LETTER J WITH CARON
G' 01f4 LATIN CAPITAL LETTER G WITH ACUTE
g' 01f5 LATIN SMALL LETTER G WITH ACUTE
AA' 01fa LATIN CAPITAL LETTER A WITH RING ABOVE AND ACUTE
aa' 01fb LATIN SMALL LETTER A WITH RING ABOVE AND ACUTE
AE' 01fc LATIN CAPITAL LETTER AE WITH ACUTE
ae' 01fd LATIN SMALL LETTER AE WITH ACUTE
O/' 01fe LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
o/' 01ff LATIN SMALL LETTER O WITH STROKE AND ACUTE
;S 02bf MODIFIER LETTER LEFT HALF RING
'< 02c7 CARON
'( 02d8 BREVE
'. 02d9 DOT ABOVE
'0 02da RING ABOVE
'; 02db OGONEK
'" 02dd DOUBLE ACUTE ACCENT
A% 0386 GREEK CAPITAL LETTER ALPHA WITH ACUTE
E% 0388 GREEK CAPITAL LETTER EPSILON WITH ACUTE
Y% 0389 GREEK CAPITAL LETTER ETA WITH ACUTE
I% 038a GREEK CAPITAL LETTER IOTA WITH ACUTE
O% 038c GREEK CAPITAL LETTER OMICRON WITH ACUTE
Simonsen [Page 12]
RFC 1345 Character Mnemonics & Character Sets June 1992
U% 038e GREEK CAPITAL LETTER UPSILON WITH ACUTE
W% 038f GREEK CAPITAL LETTER OMEGA WITH ACUTE
i3 0390 GREEK SMALL LETTER IOTA WITH ACUTE AND DIAERESIS
A* 0391 GREEK CAPITAL LETTER ALPHA
B* 0392 GREEK CAPITAL LETTER BETA
G* 0393 GREEK CAPITAL LETTER GAMMA
D* 0394 GREEK CAPITAL LETTER DELTA
E* 0395 GREEK CAPITAL LETTER EPSILON
Z* 0396 GREEK CAPITAL LETTER ZETA
Y* 0397 GREEK CAPITAL LETTER ETA
H* 0398 GREEK CAPITAL LETTER THETA
I* 0399 GREEK CAPITAL LETTER IOTA
K* 039a GREEK CAPITAL LETTER KAPPA
L* 039b GREEK CAPITAL LETTER LAMDA
M* 039c GREEK CAPITAL LETTER MU
N* 039d GREEK CAPITAL LETTER NU
C* 039e GREEK CAPITAL LETTER XI
O* 039f GREEK CAPITAL LETTER OMICRON
P* 03a0 GREEK CAPITAL LETTER PI
R* 03a1 GREEK CAPITAL LETTER RHO
S* 03a3 GREEK CAPITAL LETTER SIGMA
T* 03a4 GREEK CAPITAL LETTER TAU
U* 03a5 GREEK CAPITAL LETTER UPSILON
F* 03a6 GREEK CAPITAL LETTER PHI
X* 03a7 GREEK CAPITAL LETTER CHI
Q* 03a8 GREEK CAPITAL LETTER PSI
W* 03a9 GREEK CAPITAL LETTER OMEGA
J* 03aa GREEK CAPITAL LETTER IOTA WITH DIAERESIS
V* 03ab GREEK CAPITAL LETTER UPSILON WITH DIAERESIS
a% 03ac GREEK SMALL LETTER ALPHA WITH ACUTE
e% 03ad GREEK SMALL LETTER EPSILON WITH ACUTE
y% 03ae GREEK SMALL LETTER ETA WITH ACUTE
i% 03af GREEK SMALL LETTER IOTA WITH ACUTE
u3 03b0 GREEK SMALL LETTER UPSILON WITH ACUTE AND DIAERESIS
a* 03b1 GREEK SMALL LETTER ALPHA
b* 03b2 GREEK SMALL LETTER BETA
g* 03b3 GREEK SMALL LETTER GAMMA
d* 03b4 GREEK SMALL LETTER DELTA
e* 03b5 GREEK SMALL LETTER EPSILON
z* 03b6 GREEK SMALL LETTER ZETA
y* 03b7 GREEK SMALL LETTER ETA
h* 03b8 GREEK SMALL LETTER THETA
i* 03b9 GREEK SMALL LETTER IOTA
k* 03ba GREEK SMALL LETTER KAPPA
l* 03bb GREEK SMALL LETTER LAMDA
m* 03bc GREEK SMALL LETTER MU
n* 03bd GREEK SMALL LETTER NU
c* 03be GREEK SMALL LETTER XI
o* 03bf GREEK SMALL LETTER OMICRON
p* 03c0 GREEK SMALL LETTER PI
r* 03c1 GREEK SMALL LETTER RHO
*s 03c2 GREEK SMALL LETTER FINAL SIGMA
s* 03c3 GREEK SMALL LETTER SIGMA
Simonsen [Page 13]
RFC 1345 Character Mnemonics & Character Sets June 1992
t* 03c4 GREEK SMALL LETTER TAU
u* 03c5 GREEK SMALL LETTER UPSILON
f* 03c6 GREEK SMALL LETTER PHI
x* 03c7 GREEK SMALL LETTER CHI
q* 03c8 GREEK SMALL LETTER PSI
w* 03c9 GREEK SMALL LETTER OMEGA
j* 03ca GREEK SMALL LETTER IOTA WITH DIAERESIS
v* 03cb GREEK SMALL LETTER UPSILON WITH DIAERESIS
o% 03cc GREEK SMALL LETTER OMICRON WITH ACUTE
u% 03cd GREEK SMALL LETTER UPSILON WITH ACUTE
w% 03ce GREEK SMALL LETTER OMEGA WITH ACUTE
'G 03d8 GREEK NUMERAL SIGN
,G 03d9 GREEK LOWER NUMERAL SIGN
T3 03da GREEK CAPITAL LETTER STIGMA
t3 03db GREEK SMALL LETTER STIGMA
M3 03dc GREEK CAPITAL LETTER DIGAMMA
m3 03dd GREEK SMALL LETTER DIGAMMA
K3 03de GREEK CAPITAL LETTER KOPPA
k3 03df GREEK SMALL LETTER KOPPA
P3 03e0 GREEK CAPITAL LETTER SAMPI
p3 03e1 GREEK SMALL LETTER SAMPI
'% 03f4 ACUTE ACCENT AND DIAERESIS (Tonos and Dialytika)
j3 03f5 GREEK IOTA BELOW
IO 0401 CYRILLIC CAPITAL LETTER IO
D% 0402 CYRILLIC CAPITAL LETTER DJE (Serbocroatian)
G% 0403 CYRILLIC CAPITAL LETTER GJE (Macedonian)
IE 0404 CYRILLIC CAPITAL LETTER UKRAINIAN IE
DS 0405 CYRILLIC CAPITAL LETTER DZE (Macedonian)
II 0406 CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
YI 0407 CYRILLIC CAPITAL LETTER YI (Ukrainian)
J% 0408 CYRILLIC CAPITAL LETTER JE
LJ 0409 CYRILLIC CAPITAL LETTER LJE
NJ 040a CYRILLIC CAPITAL LETTER NJE
Ts 040b CYRILLIC CAPITAL LETTER TSHE (Serbocroatian)
KJ 040c CYRILLIC CAPITAL LETTER KJE (Macedonian)
V% 040e CYRILLIC CAPITAL LETTER SHORT U (Byelorussian)
DZ 040f CYRILLIC CAPITAL LETTER DZHE
A= 0410 CYRILLIC CAPITAL LETTER A
B= 0411 CYRILLIC CAPITAL LETTER BE
V= 0412 CYRILLIC CAPITAL LETTER VE
G= 0413 CYRILLIC CAPITAL LETTER GHE
D= 0414 CYRILLIC CAPITAL LETTER DE
E= 0415 CYRILLIC CAPITAL LETTER IE
Z% 0416 CYRILLIC CAPITAL LETTER ZHE
Z= 0417 CYRILLIC CAPITAL LETTER ZE
I= 0418 CYRILLIC CAPITAL LETTER I
J= 0419 CYRILLIC CAPITAL LETTER SHORT I
K= 041a CYRILLIC CAPITAL LETTER KA
L= 041b CYRILLIC CAPITAL LETTER EL
M= 041c CYRILLIC CAPITAL LETTER EM
N= 041d CYRILLIC CAPITAL LETTER EN
O= 041e CYRILLIC CAPITAL LETTER O
P= 041f CYRILLIC CAPITAL LETTER PE
Simonsen [Page 14]
RFC 1345 Character Mnemonics & Character Sets June 1992
R= 0420 CYRILLIC CAPITAL LETTER ER
S= 0421 CYRILLIC CAPITAL LETTER ES
T= 0422 CYRILLIC CAPITAL LETTER TE
U= 0423 CYRILLIC CAPITAL LETTER U
F= 0424 CYRILLIC CAPITAL LETTER EF
H= 0425 CYRILLIC CAPITAL LETTER HA
C= 0426 CYRILLIC CAPITAL LETTER TSE
C% 0427 CYRILLIC CAPITAL LETTER CHE
S% 0428 CYRILLIC CAPITAL LETTER SHA
Sc 0429 CYRILLIC CAPITAL LETTER SHCHA
=" 042a CYRILLIC CAPITAL LETTER HARD SIGN
Y= 042b CYRILLIC CAPITAL LETTER YERU
%" 042c CYRILLIC CAPITAL LETTER SOFT SIGN
JE 042d CYRILLIC CAPITAL LETTER E
JU 042e CYRILLIC CAPITAL LETTER YU
JA 042f CYRILLIC CAPITAL LETTER YA
a= 0430 CYRILLIC SMALL LETTER A
b= 0431 CYRILLIC SMALL LETTER BE
v= 0432 CYRILLIC SMALL LETTER VE
g= 0433 CYRILLIC SMALL LETTER GHE
d= 0434 CYRILLIC SMALL LETTER DE
e= 0435 CYRILLIC SMALL LETTER IE
z% 0436 CYRILLIC SMALL LETTER ZHE
z= 0437 CYRILLIC SMALL LETTER ZE
i= 0438 CYRILLIC SMALL LETTER I
j= 0439 CYRILLIC SMALL LETTER SHORT I
k= 043a CYRILLIC SMALL LETTER KA
l= 043b CYRILLIC SMALL LETTER EL
m= 043c CYRILLIC SMALL LETTER EM
n= 043d CYRILLIC SMALL LETTER EN
o= 043e CYRILLIC SMALL LETTER O
p= 043f CYRILLIC SMALL LETTER PE
r= 0440 CYRILLIC SMALL LETTER ER
s= 0441 CYRILLIC SMALL LETTER ES
t= 0442 CYRILLIC SMALL LETTER TE
u= 0443 CYRILLIC SMALL LETTER U
f= 0444 CYRILLIC SMALL LETTER EF
h= 0445 CYRILLIC SMALL LETTER HA
c= 0446 CYRILLIC SMALL LETTER TSE
c% 0447 CYRILLIC SMALL LETTER CHE
s% 0448 CYRILLIC SMALL LETTER SHA
sc 0449 CYRILLIC SMALL LETTER SHCHA
=' 044a CYRILLIC SMALL LETTER HARD SIGN
y= 044b CYRILLIC SMALL LETTER YERU
%' 044c CYRILLIC SMALL LETTER SOFT SIGN
je 044d CYRILLIC SMALL LETTER E
ju 044e CYRILLIC SMALL LETTER YU
ja 044f CYRILLIC SMALL LETTER YA
io 0451 CYRILLIC SMALL LETTER IO
d% 0452 CYRILLIC SMALL LETTER DJE (Serbocroatian)
g% 0453 CYRILLIC SMALL LETTER GJE (Macedonian)
ie 0454 CYRILLIC SMALL LETTER UKRAINIAN IE
ds 0455 CYRILLIC SMALL LETTER DZE (Macedonian)
Simonsen [Page 15]
RFC 1345 Character Mnemonics & Character Sets June 1992
ii 0456 CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
yi 0457 CYRILLIC SMALL LETTER YI (Ukrainian)
j% 0458 CYRILLIC SMALL LETTER JE
lj 0459 CYRILLIC SMALL LETTER LJE
nj 045a CYRILLIC SMALL LETTER NJE
ts 045b CYRILLIC SMALL LETTER TSHE (Serbocroatian)
kj 045c CYRILLIC SMALL LETTER KJE (Macedonian)
v% 045e CYRILLIC SMALL LETTER SHORT U (Byelorussian)
dz 045f CYRILLIC SMALL LETTER DZHE
Y3 0462 CYRILLIC CAPITAL LETTER YAT
y3 0463 CYRILLIC SMALL LETTER YAT
O3 046a CYRILLIC CAPITAL LETTER BIG YUS
o3 046b CYRILLIC SMALL LETTER BIG YUS
F3 0472 CYRILLIC CAPITAL LETTER FITA
f3 0473 CYRILLIC SMALL LETTER FITA
V3 0474 CYRILLIC CAPITAL LETTER IZHITSA
v3 0475 CYRILLIC SMALL LETTER IZHITSA
C3 0480 CYRILLIC CAPITAL LETTER KOPPA
c3 0481 CYRILLIC SMALL LETTER KOPPA
G3 0490 CYRILLIC CAPITAL LETTER GHE WITH UPTURN
g3 0491 CYRILLIC SMALL LETTER GHE WITH UPTURN
A+ 05d0 HEBREW LETTER ALEF
B+ 05d1 HEBREW LETTER BET
G+ 05d2 HEBREW LETTER GIMEL
D+ 05d3 HEBREW LETTER DALET
H+ 05d4 HEBREW LETTER HE
W+ 05d5 HEBREW LETTER VAV
Z+ 05d6 HEBREW LETTER ZAYIN
X+ 05d7 HEBREW LETTER HET
Tj 05d8 HEBREW LETTER TET
J+ 05d9 HEBREW LETTER YOD
K% 05da HEBREW LETTER FINAL KAF
K+ 05db HEBREW LETTER KAF
L+ 05dc HEBREW LETTER LAMED
M% 05dd HEBREW LETTER FINAL MEM
M+ 05de HEBREW LETTER MEM
N% 05df HEBREW LETTER FINAL NUN
N+ 05e0 HEBREW LETTER NUN
S+ 05e1 HEBREW LETTER SAMEKH
E+ 05e2 HEBREW LETTER AYIN
P% 05e3 HEBREW LETTER FINAL PE
P+ 05e4 HEBREW LETTER PE
Zj 05e5 HEBREW LETTER FINAL TSADI
ZJ 05e6 HEBREW LETTER TSADI
Q+ 05e7 HEBREW LETTER QOF
R+ 05e8 HEBREW LETTER RESH
Sh 05e9 HEBREW LETTER SHIN
T+ 05ea HEBREW LETTER TAV
,+ 060c ARABIC COMMA
;+ 061b ARABIC SEMICOLON
?+ 061f ARABIC QUESTION MARK
H' 0621 ARABIC LETTER HAMZA
aM 0622 ARABIC LETTER ALEF WITH MADDA ABOVE
Simonsen [Page 16]
RFC 1345 Character Mnemonics & Character Sets June 1992
aH 0623 ARABIC LETTER ALEF WITH HAMZA ABOVE
wH 0624 ARABIC LETTER WAW WITH HAMZA ABOVE
ah 0625 ARABIC LETTER ALEF WITH HAMZA BELOW
yH 0626 ARABIC LETTER YEH WITH HAMZA ABOVE
a+ 0627 ARABIC LETTER ALEF
b+ 0628 ARABIC LETTER BEH
tm 0629 ARABIC LETTER TEH MARBUTA
t+ 062a ARABIC LETTER TEH
tk 062b ARABIC LETTER THEH
g+ 062c ARABIC LETTER JEEM
hk 062d ARABIC LETTER HAH
x+ 062e ARABIC LETTER KHAH
d+ 062f ARABIC LETTER DAL
dk 0630 ARABIC LETTER THAL
r+ 0631 ARABIC LETTER REH
z+ 0632 ARABIC LETTER ZAIN
s+ 0633 ARABIC LETTER SEEN
sn 0634 ARABIC LETTER SHEEN
c+ 0635 ARABIC LETTER SAD
dd 0636 ARABIC LETTER DAD
tj 0637 ARABIC LETTER TAH
zH 0638 ARABIC LETTER ZAH
e+ 0639 ARABIC LETTER AIN
i+ 063a ARABIC LETTER GHAIN
++ 0640 ARABIC TATWEEL
f+ 0641 ARABIC LETTER FEH
q+ 0642 ARABIC LETTER QAF
k+ 0643 ARABIC LETTER KAF
l+ 0644 ARABIC LETTER LAM
m+ 0645 ARABIC LETTER MEEM
n+ 0646 ARABIC LETTER NOON
h+ 0647 ARABIC LETTER HEH
w+ 0648 ARABIC LETTER WAW
j+ 0649 ARABIC LETTER ALEF MAKSURA
y+ 064a ARABIC LETTER YEH
:+ 064b ARABIC FATHATAN
"+ 064c ARABIC DAMMATAN
=+ 064d ARABIC KASRATAN
/+ 064e ARABIC FATHA
'+ 064f ARABIC DAMMA
1+ 0650 ARABIC KASRA
3+ 0651 ARABIC SHADDA
0+ 0652 ARABIC SUKUN
aS 0670 SUPERSCRIPT ARABIC LETTER ALEF
p+ 067e ARABIC LETTER PEH
v+ 06a4 ARABIC LETTER VEH
gf 06af ARABIC LETTER GAF
0a 06f0 EASTERN ARABIC-INDIC DIGIT ZERO
1a 06f1 EASTERN ARABIC-INDIC DIGIT ONE
2a 06f2 EASTERN ARABIC-INDIC DIGIT TWO
3a 06f3 EASTERN ARABIC-INDIC DIGIT THREE
4a 06f4 EASTERN ARABIC-INDIC DIGIT FOUR