 Check digit

A check digit is a form of redundancy check used for error detection, the decimal equivalent of a binary checksum. It consists of a single digit computed from the other digits in the message.
With a check digit, one can detect simple errors in the input of a series of digits, such as a single mistyped digit or some permutations of two successive digits.
Contents
Design
Check digit algorithms are generally designed to capture human transcription errors. In order of complexity, these include the following:
 single digit errors, such as 1 → 2
 transposition errors, such as 12 → 21
 twin errors, such as 11 → 22
 jump transpositions errors, such as 132 → 231
 jump twin errors, such as 131 → 232
 phonetic errors, such as 60 → 16 ("sixty" to "sixteen")
In choosing a system, a high probability of catching errors is traded off against implementation difficulty; simple check digit systems are easily understood and implemented by humans but do not catch as many errors as complex ones, which require sophisticated programs to implement.
A desirable feature is that leftpadding with zeros should not change the check digit. This allows variable length digits to be used and the length to be changed.
If there is a single check digit added to the original number, the system will not always capture multiple errors, such as two replacement errors (12 → 34) though, typically, double errors will be caught 90% of the time (both changes would need to change the output by offsetting amounts).
A very simple check digit method would be to take the sum of all digits (digital sum) modulo 10. This would catch any singledigit error, as such an error would always change the sum, but does not catch any transposition errors (switching two digits) as reordering does not change the sum.
A slightly more complex method is to take the weighted sum of the digits, modulo 10, with different weights for each number position.
To illustrate this, for example if the weights for a four digit number were 5, 3, 2, 7 and the number to be coded was 4871, then one would take 5×4 + 3×8 + 2×7 + 7×1 = 65, ie 5 modulo 10, and the check digit would be 5, giving 48715.
Systems with weights of 1, 3, 7, or 9, with the weights on neighboring numbers being different, are widely used: for example, 31 31 weights in UPC codes, 13 13 weights in EAN numbers (GS1 algorithm), and the 371 371 371 weights used in United States bank routing transit numbers. This system detects all singledigit errors and around 90% of transposition errors. 1, 3, 7, and 9 are used because they are coprime to 10, so changing any digit changes the check digit; using a coefficient that is divisible by 2 or 5 would lose information (because ) and thus not catch some singledigit errors. Using different weights on neighboring numbers means that most transpositions change the check digit; however, because all weights differ by an even number, this does not catch transpositions of two digits that differ by 5, (0 and 5, 1 and 6, 2 and 7, 3 and 8, 4 and 9), since the 2 and 5 multiply to yield 10.
The ISBN10 code instead uses modulo 11, which is prime, and all the number positions have different weights . This system thus detects all single digit substitution and transposition errors (including jump transpositions), but at the cost of the check digit possibly being 10, represented by "X". (An alternative is simply to avoid using the serial numbers which result in an "X" check digit.) ISBN13 instead uses the GS1 algorithm used in EAN numbers.
More complicated algorithms include the Luhn algorithm (1954), which captures 98% of single digit transposition errors (it does not detect 90 ↔ 09), while more sophisticated is the Verhoeff algorithm (1969), which catches all single digit substitution and transposition errors, and many (but not all) more complex errors. Both these methods use a single check digit and will therefore fail to capture around 10% of more complex errors. To reduce this failure rate, it is necessary to use more than one check digit (for example, the modulo 97 check referred to below, which uses two check digits  for the algorithm, see International Bank Account Number) and/or to use a wider range of characters in the check digit, for example letters plus numbers.
Examples
UPC
The final digit of a Universal Product Code is a check digit computed as follows:^{[1]}
 Add the digits (up to but not including the check digit) in the oddnumbered positions (first, third, fifth, etc.) together and multiply by three.
 Add the digits (up to but not including the check digit) in the evennumbered positions (second, fourth, sixth, etc.) to the result.
 Take the remainder of the result divided by 10 (modulo operation) and subtract this from 10 to derive the check digit.
For instance, the UPCA barcode for a box of tissues is "036000241457". The last digit is the check digit "7", and if the other numbers are correct then the check digit calculation must produce 7.
 Add the odd number digits: 0+6+0+2+1+5 = 14
 Multiply the result by 3: 14 × 3 = 42
 Add the even number digits: 3+0+0+4+4 = 11
 Add the two results together: 42 + 11 = 53
 To calculate the check digit, take the remainder of (53 / 10), which is also known as (53 modulo 10), and subtract from 10. Therefore, the check digit value is 7.
Another example: to calculate the check digit for the following food item "01010101010".
 Add the odd number digits: 0+0+0+0+0+0 = 0
 Multiply the result by 3: 0 x 3 = 0
 Add the even number digits: 1+1+1+1+1 = 5
 Add the two results together: 0 + 5 = 5
 To calculate the check digit, take the remainder of (5 / 10), which is also known as (5 modulo 10), and subtract from 10 i.e. (10  5 modulo 10) = 5. Therefore, the check digit value is 5.
 If the remainder is 0, subtracting from 10 would give 10. In that case, use 0 as the check digit.
ISBN 10
The final character of a ten digit International Standard Book Number is a check digit computed so that multiplying each digit by its position in the number (counting from the right) and taking the sum of these products modulo 11 is 0. The digit the farthest to the right (which is multiplied by 1) is the check digit, chosen to make the sum correct. It may need to have the value 10, which is represented as the letter X. For example, take the ISBN 0201530821. The sum of products is 0×10 + 2×9 + 0×8 + 1×7 + 5×6 + 3×5 + 0×4 + 8×3 + 2×2 + 1×1 = 99 ≡ 0 modulo 11. So the ISBN is valid.
While this may seem more complicated than the first scheme, it can be validated simply by adding all the products together then dividing by 11. The sum can be computed without any multiplications by initializing two variables,
t
andsum
, to 0 and repeatedly performingt = t + digit; sum = sum + t;
(which can be expressed in C assum += t += digit;
). If the finalsum
is a multiple of 11, the ISBN is valid.ISBN 13
ISBN 13 (in use January 2007) is equal to the EAN13 code found underneath a book's barcode. Its check digit is generated the same way as the UPC except that the even digits are multiplied by 3 instead of the odd digits.^{[2]}
EAN (GLN,GTIN, EAN numbers administered by GS1)
EAN (European Article Number) check digits (administered by GS1) are calculated by summing the even position numbers and multiplying by 3 and then by adding the sum of the odd position numbers. The final digit of the result is subtracted from 10 to calculate the check digit (or left as is if already zero). A GS1 check digit calculator and detailed documentation is online at GS1's website.^{[3]}
Other examples of check digits
 The tenth digit of the National Provider Identifier for the US healthcare industry
 The Australian Tax File Number (based on modulo 11)
 The Guatemalan Tax Number (NIT  Número de Identificación Tributaria) based on modulo 11
 The North American CUSIP number
 The final (ninth) digit of the routing transit number, a bank code used in the United States
 The International SEDOL number
 The International Securities Identifying Number (ISIN)
 The International CAS registry number's final digit.
 Modulo 10 check digits in credit card account numbers, calculated with the Luhn algorithm.
 Also used in the Norwegian KID (customer identification number) numbers used in bank giros (credit transfer).
 The final character encoded in a magnetic stripe card is a computed Longitudinal redundancy check
 final digit of a POSTNET code
 final digit of an ISSN code
 final digit of a DUNS number (though this is scheduled to change, such as that the final digit will be chosen freely in new allocations, rather than being a check digit)
 The Spanish fiscal identification number (número de identificación fiscal, NIF), (based on modulo 23).
 The ninth digit of a Vehicle Identification Number (VIN).
 The ninth digit of an Israeli Teudat Zehut (Identity Card) number.
 The 13th digit of Serbian and Former Yugoslav Unique Master Citizen Number (JMBG)
 Last check digit in EAN/UPC serialisation of Global Trade Identification Number (GTIN). It applies to GTIN8, GTIN12, GTIN13 and GTIN14.
 The seventh character of a New Zealand NHI Number.
 The last digit on a New Zealand locomotive's Traffic Monitoring System (TMS) number.
 The last two digits of the 11digit Turkish Identification Number (Turkish: TC Kimlik Numarası).
 The third and fourth digits in an International Bank Account Number (Modulo 97 check).
 The ninth character in the 14character EU cattle passport number (cycles from 1 to 7: see British Cattle Movement Service).
 The ninth digit in an Icelandic Kennitala (national ID number).
 Modulo 97 check digits in a Belgian and Serbian bank account numbers.
 Mayo Clinic patient identification numbers used in Arizona and Florida include a trailing check digit^{[citation needed]}
Algorithms
Notable algorithms include:
 Luhn algorithm (1954)
 Verhoeff algorithm (1969)
See also
 Casting out nines, similar modular sum check
 Check bit, binary equivalent
References
 ^ "GS1 Check Digit Calculator". GS1 US. 2006. http://www.uccouncil.org/ean_ucc_system/education_support/cdc.html. Retrieved 20080521.
 ^ "ISBN Users Manual". International ISBN Agency. 2005. http://www.isbninternational.org/en/download/2005%20ISBN%20Users%27%20Manual%20International%20Edition.pdf. Retrieved 20080521.
 ^ "Check Digit Calculator". GS1. 2005. http://www.gs1.org/barcodes/support/check_digit_calculator. Retrieved 20080521.
External links
 Identification numbers and check digit schemes^{[dead link]} (a mathematical explanation of various check digit schemes)
 GS1 check digit calculator
 Check identity numbers by check digits. (Russian)
Categories:
Wikimedia Foundation. 2010.
Look at other dictionaries:
check digit — ➔ digit * * * check digit UK US noun [C] ► IT a number that is used in a computer program to help to find or prevent errors … Financial and business terms
check digit — noun (computing) A redundant bit in a stored word, used in a self checking procedure such as a parity check • • • Main Entry: ↑check … Useful english dictionary
check digit — A digit often appended at the end of a string of digits; it is used for carrying out a check to verify the accuracy of a sequence or a customer number … Big dictionary of business and management
Check digit — An ISO term. A suffix digit used to test the validity of a number. 1n … International financial encyclopaedia
check digit — A numerical digit added to a number to permit verification of its accuracy by a computerized accounting system … Auditor's dictionary
check digit — /tʃek dɪdʒɪt/ noun the last digit of a string of computerised reference numbers, used to validate the transaction … Dictionary of banking and finance
Check digit — Контрольный разряд … Краткий толковый словарь по полиграфии
check digit — control number … English contemporary dictionary
Check digit weight factor — A method in automated systems to determine valid account and customer numbers. The actual weight factors are customisable by the security officer for the system … International financial encyclopaedia
digit — di‧git [ˈdɪdʒt] noun [countable] one of the written signs that represent the numbers 0 to 9: • Binary code uses the digits 1 and 0. see also double digit ˈcheck ˌdigit COMPUTING a number or series of numbers added to the end of a code (= set of… … Financial and business terms