- Document file format
A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats.
A rough consensus has been established that XML is to be the basis for future document file formats. Examples of XML-based open standards are DocBook, XHTML, and, more recently, the ISO/IEC standards OpenDocument (ISO 26300:2006) and Office Open XML (ISO 29500:2008).
In 1993, the ITU-T tried to establish a standard for document file formats, known as the Open Document Architecture (ODA) which was supposed to replace all competing document file formats. It is described in ITU-T documents T.411 through T.421, which are equivalent to ISO 8613. It did not succeed.
Page description languages such as PostScript and PDF have become the de facto standard for documents that a typical user should only be able to create and read, not edit. In 2001, PDF became an international ISO/IEC standard (ISO 15930-1:2001, ISO 19005-1:2005, ISO 32000-1:2008).
The default binary file format used by Microsoft Word (.doc) has become widespread de facto standard for office documents, but it is a proprietary format and is not always fully supported by other word processors.
Common document file formats
- ASCII, UTF-8 — plain text formats
- .doc for Microsoft Word — Structural binary format developed by Microsoft (specifications available since 2008 under the Open Specification Promise)
- DjVu — file format designed primarily to store scanned documents
- DocBook — an XML format for technical documenation
- HTML (.html, .htm), (open standard, ISO from 2000), in combination with possible image files referred to.
- FictionBook (.fb2) — open XML-based e-book format
- Office Open XML — .docx (XML-based standard for office documents, ISO standard from 2008)
- OpenDocument — .odt (XML-based standard for office documents, ISO standard from 2006)
- OpenOffice.org XML — .sxw (open, XML-based format for office documents)
- OXPS — Open XML Paper Specification
- PalmDoc — Common Handheld document format
- Plucker — Handheld navigable widely used document standard
- .pages for Pages
- PDF — Open standard for documents exchange. ISO standards from 2001, 2005, 2008. It is readable on almost every platform with free or open source readers. Open source PDF creators are also available.
- Rich Text Format (RTF) — meta data format being developed by Microsoft since 1987 for Microsoft products and cross-platform document interchange
- SYmbolic LinK (SYLK)
- TeX — Popular open-source typesetting program and format. First successful mathematical notation language.
- TEI — XML format for digital publication
- Uniform Office Format — Chinese standard
- WordPerfect (.wpd, .wp, .wp7, .doc) (Note: possible confusion with Word format extension)
- List of file formats
- List of document markup languages
- Comparison of document markup languages
- Open format
- ^ "Microsoft Office Binary (doc, xls, ppt) File Formats". 2008-02-15. http://www.microsoft.com/interop/docs/OfficeBinaryFormats.mspx. Retrieved 2010-03-18.
- ^ Microsoft Corporation (2010-07-23). "MS-DOC - Word Binary File Format (.doc) Structure Specification". http://msdn.microsoft.com/en-us/library/cc313153.aspx. Retrieved 2010-08-08.
- ^ "What is DjVu - DjVu.org". DjVu.org. http://djvu.org/resources/whatisdjvu.php. Retrieved 2009-03-05.
- ^ Microsoft Corporation (1999-05). "Rich Text Format (RTF) Specification, version 1.6". http://msdn.microsoft.com/en-us/library/aa140280(office.10).aspx. Retrieved 2010-03-13.
- ^ "4.3 Non-HTML file formats". e-Government Unit. 2002-05. http://archive.cabinetoffice.gov.uk/e-government/resources/handbook/html/4-3.asp. Retrieved 2010-03-13. [dead link]
- ^ http://reference.wolfram.com/mathematica/ref/format/RTF.html
- ^ http://support.microsoft.com/kb/86999
- ^ http://www.techtree.com/India/Reviews/Evolution_of_MS_Word/551-101355-575.html
- ^ Ranjan Parekh, Ranjan (2006). Principles of Multimedia. Tata McGraw-Hill. p. 87. ISBN 0-07-058833-3.
- Lost in Translation: Interoperability Issues for Open Standards - ODF and OOXML as Examples
- Secure document storage
Editable document formats Fixed document formats
Wikimedia Foundation. 2010.
Look at other dictionaries:
File format — A file format is a particular way that information is encoded for storage in a computer file. Since a disk drive, or indeed any computer storage, can store only bits, the computer must have some way of converting information to 0s and 1s and vice … Wikipedia
file format — A file structure that defines the way information is stored in the file and how the file appears on the screen or on the printer. The simplest file format is a plain ASCII file. Some of the more complex formats are DCA (Document Content… … Dictionary of networking
SNP File Format — Infobox file format name = Snapshot File icon = extension = .snp mime = owner = Microsoft type code = genre = Access report output, multi page, precise containerfor = EMF (contained pages) containedby = CAB (compression wrapper) extended from =… … Wikipedia
ZIP (file format) — unzip redirects here. For the program, see Info ZIP. ZIP Filename extension .zip .zipx (newer compression algorithms) Internet media type application/zip Uniform Type Identifier com.pkware.zip archive Magic … Wikipedia
UEF (file format) — Infobox file format name = Unified Emulator Format icon = caption = extension = .uef mime = application/octet stream type code = uniform type = magic = UEF File! owner = Thomas Harte released = before 10 August 2000… … Wikipedia
Template (file format) — The term document template when used in the context of file format refers to a common feature of many software applications that define a unique non executable file format intended specifically for that particular application. Template file… … Wikipedia
Free file format — A free file format is a file format whose full specification is freely available and for which there are no restrictions (e.g. legal or technical) on its use. [cite web url=http://www.linfo.org/free file format.html title=Free File Format… … Wikipedia
Audio file format — An audio file format is a file format for storing digital audio data on a computer system. This data can be stored uncompressed, or compressed to reduce the file size. It can be a raw bitstream, but it is usually a container format or an audio… … Wikipedia
tar (file format) — tar GNU tar 1.23 showing three common types of Tarballs (shown in red). Filename extension .tar Internet media type application/x tar … Wikipedia
Class (file format) — In the Java programming language, source files (.java files) are compiled into class files which have a .class extension. Since Java is a platform independent language, source code is compiled into an output file known as bytecode, which it… … Wikipedia