ISO/IEC 8859-5

ISO 8859-5, also known as Cyrillic is an 8-bit character encoding, part of the ISO 8859 standard. It was designed originally to cover languages using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian and Macedonian, but was never widely used. It would also have been usable for Ukrainian in the Soviet Union from 1933–1990, but it is missing the Ukrainian letter "ge", ґ, which is required in Ukrainian orthography before and since, and during that period outside of Soviet Ukraine.

ISO_8859-5:1988, more commonly known by its preferred MIME name of ISO-8859-5 (note extra hyphen) is the IANA charset consisting of this standard combined with the control codes from ISO/IEC 6429 for the C0 (0x00–0x1F) and C1 (0x80–0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not to be interpreted. This charset also has the aliases iso-ir-144, ISO_8859-5, Cyrillic and csISOLatinCyrillic.

The 8-bit encodings KOI8-R and KOI8-U, CP866, and also Windows-1251 are far more commonly used. Another possible way to represent Cyrillic is Unicode.

Codepage layout

] |125
chset-color-punct|chset-cell3|007E|~|126
chset-color-undef
-!chset-left|8
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
-!chset-left|9
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
chset-color-undef
-!chset-left|A
chset-color-punct|chset-ctrl3|00A0|NBSP|160
chset-color-alpha|chset-cell3|0401|Ё|161
chset-color-alpha|chset-cell3|0402|Ђ|162
chset-color-alpha|chset-cell3|0403|Ѓ|163
chset-color-alpha|chset-cell3|0404|Є|164
chset-color-alpha|chset-cell3|0405|Ѕ|165
chset-color-alpha|chset-cell3|0406|І|166
chset-color-alpha|chset-cell3|0407|Ї|167
chset-color-alpha|chset-cell3|0408|Ј|168
chset-color-alpha|chset-cell3|0409|Љ|169
chset-color-alpha|chset-cell3|040A|Њ|170
chset-color-alpha|chset-cell3|040B|Ћ|171
chset-color-alpha|chset-cell3|040C|Ќ|172
chset-color-punct|chset-ctrl3|00AD|SHY|173
chset-color-alpha|chset-cell3|040E|Ў|174
chset-color-alpha|chset-cell3|040F|Џ|175
-!chset-left|B
chset-color-alpha|chset-cell3|0410|А|176
chset-color-alpha|chset-cell3|0411|Б|177
chset-color-alpha|chset-cell3|0412|В|178
chset-color-alpha|chset-cell3|0413|Г|179
chset-color-alpha|chset-cell3|0414|Д|180
chset-color-alpha|chset-cell3|0415|Е|181
chset-color-alpha|chset-cell3|0416|Ж|182
chset-color-alpha|chset-cell3|0417|З|183
chset-color-alpha|chset-cell3|0418|И|184
chset-color-alpha|chset-cell3|0419|Й|185
chset-color-alpha|chset-cell3|041A|К|186
chset-color-alpha|chset-cell3|041B|Л|187
chset-color-alpha|chset-cell3|041C|М|188
chset-color-alpha|chset-cell3|041D|Н|189
chset-color-alpha|chset-cell3|041E|О|190
chset-color-alpha|chset-cell3|041F|П|191
-!chset-left|C
chset-color-alpha|chset-cell3|0420|Р|192
chset-color-alpha|chset-cell3|0421|С|193
chset-color-alpha|chset-cell3|0422|Т|194
chset-color-alpha|chset-cell3|0423|У|195
chset-color-alpha|chset-cell3|0424|Ф|196
chset-color-alpha|chset-cell3|0425|Х|197
chset-color-alpha|chset-cell3|0426|Ц|198
chset-color-alpha|chset-cell3|0427|Ч|199
chset-color-alpha|chset-cell3|0428|Ш|200
chset-color-alpha|chset-cell3|0429|Щ|201
chset-color-alpha|chset-cell3|042A|Ъ|202
chset-color-alpha|chset-cell3|042B|Ы|203
chset-color-alpha|chset-cell3|042C|Ь|204
chset-color-alpha|chset-cell3|042D|Э|205
chset-color-alpha|chset-cell3|042E|Ю|206
chset-color-alpha|chset-cell3|042F|Я|207
-!chset-left|D
chset-color-alpha|chset-cell3|0430|а|208
chset-color-alpha|chset-cell3|0431|б|209
chset-color-alpha|chset-cell3|0432|в|210
chset-color-alpha|chset-cell3|0433|г|211
chset-color-alpha|chset-cell3|0434|д|212
chset-color-alpha|chset-cell3|0435|е|213
chset-color-alpha|chset-cell3|0436|ж|214
chset-color-alpha|chset-cell3|0437|з|215
chset-color-alpha|chset-cell3|0438|и|216
chset-color-alpha|chset-cell3|0439|й|217
chset-color-alpha|chset-cell3|043A|к|218
chset-color-alpha|chset-cell3|043B|л|219
chset-color-alpha|chset-cell3|043C|м|220
chset-color-alpha|chset-cell3|043D|н|221
chset-color-alpha|chset-cell3|043E|о|222
chset-color-alpha|chset-cell3|043F|п|223
-!chset-left|E
chset-color-alpha|chset-cell3|0440|р|224
chset-color-alpha|chset-cell3|0441|с|225
chset-color-alpha|chset-cell3|0442|т|226
chset-color-alpha|chset-cell3|0443|у|227
chset-color-alpha|chset-cell3|0444|ф|228
chset-color-alpha|chset-cell3|0445|х|229
chset-color-alpha|chset-cell3|0446|ц|230
chset-color-alpha|chset-cell3|0447|ч|231
chset-color-alpha|chset-cell3|0448|ш|232
chset-color-alpha|chset-cell3|0449|щ|233
chset-color-alpha|chset-cell3|044A|ъ|234
chset-color-alpha|chset-cell3|044B|ы|235
chset-color-alpha|chset-cell3|044C|ь|236
chset-color-alpha|chset-cell3|044D|э|237
chset-color-alpha|chset-cell3|044E|ю|238
chset-color-alpha|chset-cell3|044F|я|239
-!chset-left|F
chset-color-alpha|chset-cell3|2116||240
chset-color-alpha|chset-cell3|0451|ё|241
chset-color-alpha|chset-cell3|0452|ђ|242
chset-color-alpha|chset-cell3|0453|ѓ|243
chset-color-alpha|chset-cell3|0454|є|244
chset-color-alpha|chset-cell3|0455|ѕ|245
chset-color-alpha|chset-cell3|0456|і|246
chset-color-alpha|chset-cell3|0457|ї|247
chset-color-alpha|chset-cell3|0458|ј|248
chset-color-alpha|chset-cell3|0459|љ|249
chset-color-alpha|chset-cell3|045A|њ|250
chset-color-alpha|chset-cell3|045B|ћ|251
chset-color-alpha|chset-cell3|045C|ќ|252
chset-color-alpha|chset-cell3|00A7|§|253
chset-color-alpha|chset-cell3|045E|ў|254
chset-color-alpha|chset-cell3|045F|џ|255

In the table above, 20 is the regular SPACE character, and A0 is the NO-BREAK SPACE. AD is a SOFT HYPHEN, which should not appear at all in compliant web browsers.

Code values 0x00–0x1F, 0x7F, and 0x80–0x9F are not assigned to characters by ISO/IEC 8859-5.

External links

* [http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=28249&ICS1=35&ICS2=40&ICS3= ISO/IEC 8859-5:1999]
* [http://www.ecma-international.org/publications/standards/Ecma-113.htm Standard ECMA-113] : 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet "3rd edition (December 1999)"
* [http://www.itscj.ipsj.or.jp/ISO-IR/144.pdf ISO-IR 144] Cyrillic part of the Latin/Cyrillic Alphabet "(May 1, 1988, from ISO 8859-5 2nd version)"


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • ISO/IEC 8859 — is a joint ISO and IEC standard for 8 bit character encodings for use by computers. The standard is divided into numbered, separately published parts, such as ISO/IEC 8859 1, ISO/IEC 8859 2, etc., each of which may be informally referred to as a… …   Wikipedia

  • ISO/IEC 8859-1 — ISO 8859 1 Latin 1, Westeuropäisch 2 Latin 2, Mitteleuropäisch 3 Latin 3, Südeuropäisch 4 Latin 4, Baltisch 5 Kyrillisch 6 Arabisch 7 Griechisch 8 …   Deutsch Wikipedia

  • ISO/IEC 8859-1 — ISO 8859 1, more formally cited as ISO/IEC 8859 1 is part 1 of ISO/IEC 8859, a standard character encoding of the Latin alphabet. It is less formally referred to as Latin 1. It was originally developed by the ISO, but later jointly maintained by… …   Wikipedia

  • ISO/IEC 8859-11 — ISO/IEC 8859 11:2001, Information technology 8 bit single byte coded graphic character sets Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII based standard character encodings, first edition published in 2001. It is… …   Wikipedia

  • ISO/IEC 8859-8 — ISO 8859 8, more formally cited as ISO/IEC 8859 8 (but not as Latin 8!), is part 8 of ISO/IEC 8859, a standard character encoding defined by ISO.ISO 8859 8 contains all the Hebrew letters (no Hebrew vowel signs). ISO 8859 8:1988, more commonly… …   Wikipedia

  • ISO/IEC 8859-6 — ISO/IEC 8859 6:1999, Information technology 8 bit single byte coded graphic character sets Part 6: Latin/Arabic alphabet, is part of the ISO/IEC 8859 series of ASCII based standard character encodings, first edition published in 1987. It is… …   Wikipedia

  • ISO/IEC 8859-15 — ISO 8859 15 is part 15 of ISO 8859, a standard character encoding defined by International Organization for Standardization. It is also known as Latin 9, and unofficially as Latin 0 but not as Latin 15. It is similar to ISO 8859 1 but replaces… …   Wikipedia

  • ISO/IEC 8859-2 — ISO 8859 2, more formally cited as ISO/IEC 8859 2 or less formally as Latin 2, is part 2 of ISO/IEC 8859, a standard character encoding defined by ISO. It encodes what it refers to as Latin alphabet no. 2, consisting of 191 characters from the… …   Wikipedia

  • ISO/IEC 8859-7 — ISO 8859 7, also known as Greek, is an 8 bit character encoding, part of the ISO 8859 standard. It was designed originally to cover the modern Greek language as well as mathematical symbols derived from the Greek.The original 1987 version of the… …   Wikipedia

  • ISO/IEC 8859-13 — ISO 8859 13, also known as Latin 7 or Baltic Rim , is an 8 bit character encoding, part of the ISO 8859 standard. It was designed originally to cover the Baltic languages, and added characters missing from the earlier encodings ISO 8859 4 and ISO …   Wikipedia

  • ISO/IEC 8859-16 — ISO 8859 16, also known as Latin 10 or South Eastern European , is an 8 bit character encoding, part of the ISO 8859 standard. It was designed to cover Albanian, Croatian, Hungarian, Polish, Romanian and Slovenian, but also French, German,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”