Extended Binary Coded Decimal Interchange Code

Extended Binary Coded Decimal Interchange Code

Extended Binary Coded Decimal Interchange Code (EBCDIC) is an 8-bit character encoding (code page) used on IBM mainframe operating systems such as z/OS, OS/390, VM and VSE, as well as IBM midrange computer operating systems such as OS/400 and i5/OS (see also Binary Coded Decimal). It is also employed on various non-IBM platforms such as Fujitsu-Siemens' BS2000/OSD, HP MPE/iX, and Unisys MCP. It descended from punched cards and the corresponding six bit binary-coded decimal code that most of IBM's computer peripherals of the late 1950s and early 1960s used.

History

EBCDIC was devised in 1963 and 1964 by IBM and was announced with the release of the IBM System/360 line of mainframe computers. It was created to extend the Binary-Coded Decimal encoding that existed at the time. It is an 8-bit character encoding, in contrast to, and developed separately from, the 7-bit ASCII encoding scheme.

Interestingly, IBM was a chief proponent of the ASCII standardization committee. However, IBM did not have time to prepare ASCII peripherals (such as card punch machines) to ship with its System/360 computers, so the company settled on EBCDIC at the time. The System/360 became wildly successful, and thus so did EBCDIC.

All IBM mainframe peripherals and operating systems (except Linux on zSeries or iSeries) use EBCDIC as their inherent encoding.cite web
url=http://publib.boulder.ibm.com/infocenter/zos/v1r9/index.jsp?topic=/com.ibm.zos.r9.adms700/adms7a05158.htm
title=IBM confirms the use of EBCDIC in their mainframes as a default practice
year=2008
accessdate=2008-06-16
author=IBM
] but software can translate to and from other encodings. Many hardware peripherals provide translation as well and modern mainframes (such as IBM zSeries) include processor instructions, at the hardware level, to accelerate translation between character sets.

At the time it was devised, EBCDIC made it relatively easy to enter data into a computer with punch cards. Since punch cards are no longer used on mainframes, EBCDIC is used in modern mainframes solely for backwards compatibility. It has no real technical advantage over ASCII-based code pages such as the ISO-8859 series or Unicode. There are some technical niceties in each, e.g., ASCII and EBCDIC both have one bit which indicates upper or lower case. But there are some aspects of EBCDIC which make it much less pleasant to work with than ASCII (such as a non-contiguous alphabet). As with single-byte extended ASCII codepages, most EBCDIC codepages only allow up to 2 languages (English and one other language) to be used in a database or text file.

Where true support for multilingual text is desired, a system supporting far more characters is needed. Generally this is done with some form of Unicode support. There is an EBCDIC Unicode Transformation Format called UTF-EBCDIC proposed by the Unicode consortium, but it is not intended to be used in open interchange environments and, even on EBCDIC-based systems, it is almost never used. IBM mainframes support UTF-16, but they do not support UTF-EBCDIC natively.

Arabic EBCDIC versions are typically in presentation order, in left to right order as displayed by an older mainframe or line printer, rather than in the right to left logical order used by modern encodings such as Unicode.

Codepage layout

The table below is derived from CCSID 500, one of the code page variants of EBCDIC, showing only the basic (English) EBCDIC characters. Characters 00–3F and FF are controls, 40 is space, 41 is no-break space ("RSP": "Required Space"), E1 is numeric space ("NSP": "Numeric Space"), and CA is soft hyphen. Characters are shown with their equivalent Unicode codes. Invariant alphanumeric, punctuation, and control characters common to all EBCDIC code pages are shown in color. Unassigned codes are typically filled with international or region-specific characters in the various EBCDIC code page variants.

Criticism and humor

Open source software advocate and hacker Eric S. Raymond writes in his "Jargon File" that EBCDIC was almost universally loathed by early hackers and programmers because of its multitude of different versions, none of which resembled the other versions, and that IBM produced it in direct competition with the already-established ASCII.

The Jargon file 4.4.7 gives the following definition:cquote2|quotetext=EBCDIC: /eb´s@·dik/, /eb´see`dik/, /eb´k@·dik/, n. [abbreviation, Extended Binary Coded Decimal Interchange Code] An alleged character set used on IBM dinosaurs. It exists in at least six mutually incompatible versions, all featuring such delights as non-contiguous letter sequences and the absence of several ASCII punctuation characters fairly important for modern computer languages (exactly which characters are absent varies according to which version of EBCDIC you're looking at). IBM adapted EBCDIC from punched card code in the early 1960s and promulgated it as a customer-control tactic (see "connector conspiracy"), spurning the already established ASCII standard. Today, IBM claims to be an open-systems company, but IBM's own description of the EBCDIC variants and how to convert between them is still internally classified top-secret, burn-before-reading. Hackers blanch at the very name of EBCDIC and consider it a manifestation of purest evil.
quotesource=The Jargon file 4.4.7

Another popular complaint is that the EBCDIC alphabetic characters follow an archaic punch card encoding rather than a linear ordering like ASCII. The upshot of this is that incrementing the character code for "I" does not produce the code for "J", and likewise there is a gap between the codes for "R" and "S". Thus programming a simple control loop to cycle through only the alphabetic characters is problematic.

These incompatibilities were also the source of many jokes. A popular one went: :"Professor: "So the American government went to IBM to come up with an encryption standard, and they came up with—" Student: "EBCDIC!"

See also

* EBCDIC-codepages with Latin-1-charset
* codepage 037 (English, Portuguese)
* codepage 285 (Ireland, United Kingdom)

ources

External links

* [http://www.hansenb.pdx.edu/DMKB/dict/tutorials/ebcdic.php EBCDIC] character list, including decimal and hex values, symbolic name, and character/function
* [http://www.ibm.com/software/globalization/g11n-res.jsp Character Data Representation Architecture (CDRA) from IBM] Contains IBM's official information on codepages and charsets.
* [http://publib.boulder.ibm.com/cgi-bin/bookmgr/BOOKS/QB3AQ501/F.0 F.0 Appendix F. Code Pages] from [http://publib.boulder.ibm.com/cgi-bin/bookmgr/BOOKS/QB3AQ501/CCONTENTS AS/400 International Application Development V4R2]
* [http://demo.icu-project.org/icu-bin/convexp ICU Converter Explorer] Contains more information about EBCDIC, including DBCS EBCDIC (Double Byte Character Set EBCDIC)
* [http://www.icu-project.org/charts/charset/ ICU Charset Mapping Tables] Contains Unicode mapping tables for EBCDIC and many other character sets
* [http://www.legacyj.com/cobol/ebcdic.html LegacyJ- EBCDIC Table]
* [http://www.geocities.com/mikes_javascript/hex_table_ascii.html Computer Character Set Table]
* [http://www.unicode.org/reports/tr16/#Table%201 Unicode Technical Report #16: UTF-EBCDIC]
* [http://www.iconv.com/asciiebcdic.htm iconv.com] Online tool to convert from ASCII to/from EBCDIC
* [http://home.arcor.de/wzwz.de/wiki/ebcdic/cc_en.htm EBCDIC-codepages with Latin-1-charset (JavaScript)]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Extended binary coded decimal interchange code — L Extended Binary Coded Decimal Interchange Code (EBCDIC) est un mode de codage des caractères sur 8 bits créé par IBM à l époque des cartes perforées. Il existe au moins 6 versions différentes bien documentées (et de nombreuses variantes parfois …   Wikipédia en Français

  • Extended Binary Coded Decimal Interchange Code — L Extended Binary Coded Decimal Interchange Code (EBCDIC) est un mode de codage des caractères sur 8 bits créé par IBM à l époque des cartes perforées. Il existe au moins 6 versions différentes bien documentées (et de nombreuses variantes parfois …   Wikipédia en Français

  • Extended Binary Coded Decimal Interchange Code —    Abbreviated EBCDIC; pronounced [eb se dic]. EBCDIC is the character set commonly used on large IBM mainframe computers, most IBM minicomputers, and computers from many other manufacturers.    It is an 8 bit code, allowing 256 different… …   Dictionary of networking

  • ASCII extended character set —    The second group of characters, from 128 through 255, in the ASCII character set. The extended ASCII character set is assigned variable sets of characters by computer hardware manufacturers and software developers, and it is not necessarily… …   Dictionary of networking

  • American Standard Code for Information Interchange —    Abbreviated ASCII, pronounced [as kee]. A standard coding scheme that assigns numeric values to letters, numbers, punctuation characters, and control characters to achieve compatibility among different computers and peripheral devices. In… …   Dictionary of networking

  • character code —    A code that represents one specific alphanumeric or control character in a set of characters.    See also American Standard Code for Information Interchange; Extended Binary Coded Decimal Interchange Code; Unicode …   Dictionary of networking

  • EBCDI-Code —   [ koːt, englisch iːbiːciːdiːaɪ kəʊd], EBCDIC [Abkürzung für englisch extended binary coded decimal interchange code »erweiterter binär codierter Dezimalzahlenumwandlungscode«], Informatik: 8 Bit BCD Code zur …   Universal-Lexikon

  • EBCDIC — extended binary coded decimal interchange code …   Military dictionary

  • EBCDIC — Extended Binary Coded Decimal Interchange Code …   Medical dictionary

  • EBCDIC — • Extended Binary Coded Decimal Interchange Code …   Maritime acronyms and abbreviations

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”