- Universal Standard Book Code
The increased use of computers in handling bibliographic data, and the accumulation of collections running into millions of items, will mean less and less human involvement in processes such as manual key allocation and quality control. This trend is now established, at least among computer professionals, and it is accepted as an axiom that the more human involvement is eliminated from the internal technical retrieval mechanisms of an information system, the more successful and error-free the system will be. Our interest here is the automatic control of large collections of database records, with particular emphasis on unique identification and quality control.
Today, the identification and control of bibliographic items is primarily based on an arbitrarily allocated key which accompanies the corresponding record throughout its processing history. Typical keys are the ISBN (International Standard Book Number) and the ISSN (International Standard Serial Number). The USBC (Universal Standard Book Code), by contrast, is generated automatically from pertinent bibliographic data elements, independently of centralised bodies such as the SBN (Standard Book Number) agency.
The USBC is an alphanumeric code produced by an algorithm which does not require any a priori information about the bibliographic item. The universality of the code implies that it can be regenerated at any time and anywhere in the world by means of an algorithm which conforms to the following criteria:
# Unique items receive unique codes.
# The algorithm is independent of source input.
# The code is as short as possible.
# The algorithm is easy to implement.
# The code is regenerable so that the same code is derived for the same item at different times.
# The code can be fixed or variable in length, depending on the operational requirements for record identification.
# It is possible to verify the code manually.
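Several of these criteria (independence from source input, regenerability, fixed length, manual verifiability) can be illustrated with a minimal Python sketch. The functions `normalise` and `toy_key`, and the rule for combining author, year and title, are hypothetical and chosen only for illustration; this is not the USBC algorithm, and it makes no attempt to guarantee unique codes for unique items.

```python
import re
import unicodedata


def normalise(text: str) -> str:
    """Reduce a field to upper-case ASCII letters and digits only."""
    # Split accented letters into base letter + combining mark, then drop non-ASCII.
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    # Remove spaces, punctuation and anything else that varies between sources.
    return re.sub(r"[^A-Z0-9]", "", ascii_only.upper())


def toy_key(title: str, author: str, year: str, length: int = 7) -> str:
    """Hypothetical fixed-length derived key (assumption, NOT the USBC rule)."""
    material = normalise(author)[:3] + normalise(year)[-2:] + normalise(title)
    # Truncate or pad with "0" so that every key has exactly `length` characters.
    return material[:length].ljust(length, "0")


# The same item, as it might be keyed by two different sources.
key_a = toy_key("The Art of Computer Programming", "Knuth, D. E.", "1968")
key_b = toy_key("the art of  computer programming.", "KNUTH D E", "1968")
print(key_a, key_b, key_a == key_b)  # KNU68TH KNU68TH True
```

Because the key depends only on the normalised content of the record, it can be regenerated at any time without consulting a central agency, and a cataloguer can check it by hand.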
The theoretical basis for the derivation of the code is sound, since it rests on well-established information theory. More specifically, a principle of information science states that the entropy of a set of symbols is maximised when all symbols are equally probable. The USBC algorithm uses this principle to construct codes (keys) from pertinent fields, in order to locate and retrieve unique records as well as clusters of records with lexically homogeneous information. The derived codes offer a very high discriminating strength of over 98% using only 7 bytes per code, where each byte is selected from the least frequent characters found in pertinent bibliographic fields.
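The entropy principle, and the idea of building a key from the least frequent characters, can be sketched in Python as follows. The `entropy` function is the standard Shannon formula. The function `rare_char_key`, the toy corpus and the tie-breaking and padding rules are assumptions made for illustration only; they are not the published USBC selection procedure.

```python
import math
from collections import Counter


def entropy(probabilities):
    """Shannon entropy in bits: H = -sum(p * log2(p))."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


# Four symbols: the equiprobable distribution has the higher entropy.
print(entropy([0.25, 0.25, 0.25, 0.25]))      # 2.0
print(entropy([0.7, 0.1, 0.1, 0.1]) < 2.0)    # True


def rare_char_key(field, frequencies, length=7):
    """Hypothetical key: the `length` rarest characters of `field`, in original order.

    Characters missing from `frequencies` count as frequency 0 (rarest).
    Ties are broken by position; short fields are padded with "0".
    """
    chars = [c for c in field.upper() if c.isalnum()]
    # Rank character positions from rarest to most common.
    ranked = sorted(range(len(chars)),
                    key=lambda i: (frequencies.get(chars[i], 0), i))
    chosen = sorted(ranked[:length])  # restore reading order
    return "".join(chars[i] for i in chosen).ljust(length, "0")


# Character frequencies estimated from a small, made-up corpus of titles.
corpus = [
    "THE ART OF COMPUTER PROGRAMMING",
    "THE C PROGRAMMING LANGUAGE",
    "INTRODUCTION TO ALGORITHMS",
]
frequencies = Counter(c for title in corpus for c in title if c.isalnum())

print(rare_char_key("Structure and Interpretation of Computer Programs", frequencies))
```

Common characters carry little discriminating power because they occur in almost every title, so a key built from rare characters tends to spread records more evenly over the available code space, which is the situation in which entropy is highest.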
The original research was carried out by Professor [http://www.aueb.gr/Users/yannakoudakis/english/index.htm E. J. Yannakoudakis] at the Postgraduate School of Computer Science, University of Bradford, West Yorkshire, England, between 1975 and 1978. The project has received funding from the British Library, the Ministry of Education, the European Union and several other organisations.
Wikimedia Foundation. 2010.