Document file format


Document file format

A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats.

A rough consensus has been established that XML is to be the basis for future document file formats. Examples of XML-based open standards are DocBook, XHTML, and, more recently, the ISO/IEC standards OpenDocument (ISO 26300:2006) and Office Open XML (ISO 29500:2008).

In 1993, the ITU-T tried to establish a standard for document file formats, known as the Open Document Architecture (ODA) which was supposed to replace all competing document file formats. It is described in ITU-T documents T.411 through T.421, which are equivalent to ISO 8613. It did not succeed.

Page description languages such as PostScript and PDF have become the de facto standard for documents that a typical user should only be able to create and read, not edit. In 2001, PDF became an international ISO/IEC standard (ISO 15930-1:2001, ISO 19005-1:2005, ISO 32000-1:2008).

HTML is the most used and open international standard and it is also used as document file format. It has also become ISO/IEC standard (ISO 15445:2000).

The default binary file format used by Microsoft Word (.doc) has become widespread de facto standard for office documents, but it is a proprietary format and is not always fully supported by other word processors.

Contents

Common document file formats

See also

References

External links


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • File format — A file format is a particular way that information is encoded for storage in a computer file. Since a disk drive, or indeed any computer storage, can store only bits, the computer must have some way of converting information to 0s and 1s and vice …   Wikipedia

  • file format —    A file structure that defines the way information is stored in the file and how the file appears on the screen or on the printer.    The simplest file format is a plain ASCII file. Some of the more complex formats are DCA (Document Content… …   Dictionary of networking

  • SNP File Format — Infobox file format name = Snapshot File icon = extension = .snp mime = owner = Microsoft type code = genre = Access report output, multi page, precise containerfor = EMF (contained pages) containedby = CAB (compression wrapper) extended from =… …   Wikipedia

  • ZIP (file format) — unzip redirects here. For the program, see Info ZIP. ZIP Filename extension .zip .zipx (newer compression algorithms) Internet media type application/zip Uniform Type Identifier com.pkware.zip archive Magic …   Wikipedia

  • UEF (file format) — Infobox file format name = Unified Emulator Format icon = caption = extension = .uef mime = application/octet stream type code = uniform type = magic = UEF File! owner = Thomas Harte released = before 10 August 2000… …   Wikipedia

  • Template (file format) — The term document template when used in the context of file format refers to a common feature of many software applications that define a unique non executable file format intended specifically for that particular application. Template file… …   Wikipedia

  • Free file format — A free file format is a file format whose full specification is freely available and for which there are no restrictions (e.g. legal or technical) on its use. [cite web url=http://www.linfo.org/free file format.html title=Free File Format… …   Wikipedia

  • Audio file format — An audio file format is a file format for storing digital audio data on a computer system. This data can be stored uncompressed, or compressed to reduce the file size. It can be a raw bitstream, but it is usually a container format or an audio… …   Wikipedia

  • tar (file format) — tar GNU tar 1.23 showing three common types of Tarballs (shown in red). Filename extension .tar Internet media type application/x tar …   Wikipedia

  • Class (file format) — In the Java programming language, source files (.java files) are compiled into class files which have a .class extension. Since Java is a platform independent language, source code is compiled into an output file known as bytecode, which it… …   Wikipedia