Standard Generalized Markup Language

Infobox file format
name = Standard Generalized Markup Language
icon =
logo =

extension =
mime = application/sgml, text/sgml
type code =
uniform type = public.xml
magic =
owner = ISO
genre = metalanguage
container for =
contained by =
extended from = GML
extended to = HTML, XML
standard = [ ISO 8879]

The Standard Generalized Markup Language (ISO 8879:1986 SGML) is an ISO Standard metalanguage in which one can define markup languages for documents. SGML is a descendant of IBM's Generalized Markup Language (GML), developed in the 1960s by Charles Goldfarb, Edward Mosher and Raymond Lorie (whose surname initials were used by Goldfarb to make up the term GML [cite web|url=|year=1996|title=The Roots of SGML - A Personal Recollection|author=Charles F. Goldfarb|accessdate=2007-07-07] ).

SGML provides an abstract syntax that can be realized in many different concrete syntaxes. For instance, although it is the norm to use angle brackets as tag delimiters in an SGML document—per the "reference concrete syntax" defined in the standard—it is possible to use other characters instead if a suitable concrete syntax is defined in the document's SGML Declaration. [ [ SGML Declarations] ] GML used a colon to introduce a tag, a period to end it, and 'e' to indicate an end tag: :xmp.thus:exmp., and SGML is flexible enough to accept that grammar, too.

Original uses

SGML was originally designed to enable the sharing of machine-readable documents in large projects in government, law and industry, which have to remain readable for several decades—a very long time in information technology. It has also been used extensively in the printing and publishing industries, but its complexity has prevented its widespread application for small-scale general-purpose use.Primarily intended for text and database publishing, one of its first major applications was the second edition of the Oxford English Dictionary (OED), which was and is wholly marked up using an SGML-like markup.


SGML allows most aspects of a markup language's syntax to be customized.

The default syntax appears similar to this example:

typically something like this

HTML uses this SGML default syntax.

Customization of the syntax for a markup language in SGML is specified by a Document Type Definition, or DTD.

According to the reference syntax, letter case is not distinguished in tag names so the three tags <quote>, <QUOTE>, and <quOtE> are equivalent (a concrete syntax may change this rule through the NAMECASE NAMING declarations).

Whether a tag must be paired (like the above <QUOTE></QUOTE> pair) or occurs singly (like an HTML <HR>) is defined in the DTD for the markup language being defined (as long as the OMITTAG feature is enabled). In this case the XML counterpart would be the specific "empty tag" <hr/>, which is equivalent to the SGML NET-enabling start-tag, introduced in the TC2 (International Standard ISO 8879:1986, Technical Corrigendum 2, Nov. 1999).

SGML markup languages whose concrete syntax enables the SHORTTAG VALUE feature, do not require attribute values containing only alphanumeric characters to be surrounded by quote marks " (LIT) or ' (LITA), so that the above markup could be written:

typically something like this

One feature of SGML markup languages is the "NET" (Null End Tag) construction: <ITALICS/this/ which is structurally equivalent to <ITALICS>this</ITALICS>. Another is the "presumptuous empty tagging", such that the empty tag </> in <ITALICS>this</> "inherits" its value from the nearest previous nonempty tag, which of course is <ITALICS> (in other words, it closes the most recently opened item). The expression is thus another, more concise, equivalent to <ITALICS>this</ITALICS>. A third is the 'text on the same line' feature, which allows an item to be ended by a line-end (especially useful for headings and the like).

Additionally, the SHORTTAG NETENABL IMMEDNET feature allows shortening of tags that surround an empty text value:

can be written as

Where the first "/" stands for the NET-enabling start-tag close (NETSC) and the second one stands for the NET. (Note: XML defines NETSC as "/" and NET as ">" hence, in XML, this construct looks as ).

SGML is an ISO standard: "ISO 8879:1986 Information processing—Text and office systems—Standard Generalized Markup Language (SGML)" which was accepted in October of 1986.



XML is a subset of SGML, designed so as to make the parser much easier to implement than a full SGML parser. A consequence of the ease of implementation is that XML, rather than SGML, is nowadays widely used for deriving document specifications. Contributing to this is also the fact that few SGML-aware programs existed when XML was created. The number of XML applications today is large. XML also has a lightweight internationalization. XML is used for general-purpose applications, such as the Semantic Web, XHTML, SVG, RSS, Atom, XML-RPC and SOAP.


Languages defined using SGML are known as "applications".


The design of HTML was originally inspired by SGML tagging, but since no clear guidelines for expansion were offered, many HTML documents are not proper SGML. HTML was later reformulated (at version 2.0) to be an application of SGML, but only compliant documents can be considered proper SGML, and for a large number of HTML documents, validation was never pursued. The charter for the recently revived World Wide Web Consortium HTML Working Group goes as far as to say, "the Group will not assume that an SGML parser is used for 'classic HTML'". [cite web
title=HTML Working Group Charter
coauthors=Chris Lilley, Tim Berners-Lee


Another markup language originally created as an application of SGML is DocBook, designed for authoring technical documentation. DocBook is now also available as an XML application.


There are also a number of languages that are related in part to SGML and XML, but, because they cannot be parsed or validated or otherwise processed using standard SGML and XML tools, cannot be considered applications of SGML or XML. One example is the Z Format, a language designed for typesetting and documentation.

ee also

*DSSSL - a Scheme-based processing language similar to XSL
*List of general purpose markup languages
*Markup language
*SGML entity


External links

* [ Overview of SGML Resources] at W3C's website.
* [ Introduction and Examples of Software Documentation in SGML]
* [ A gentle introduction to SGML]
* [ SGML Syntax Summary by Charles Goldfarb]

Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Standard Generalized Markup Language — (langage normalisé de balisage généralisé SGML) est un langage de description à balises, de norme ISO (ISO 8879:1986). Sommaire 1 Historique 2 Normalisation 2.1 Document Type Definition …   Wikipédia en Français

  • Standard Generalized Markup Language — Vorlage:Infobox Dateiformat/Wartung/MagischeZahl fehltVorlage:Infobox Dateiformat/Wartung/Entwickler fehltVorlage:Infobox Dateiformat/Wartung/Website fehlt Standard Generalized Markup Language …   Deutsch Wikipedia

  • Standard Generalized Markup Language — SGML (англ. Standard Generalized Markup Language  стандартный обобщённый язык разметки; произносится [эс джи эм эл])  метаязык, на котором можно определять язык разметки для документов. SGML  наследник разработанного в 1969 году в IBM языка GML… …   Википедия

  • Standard Generalized Markup Language — SGML kalba statusas T sritis informatika apibrėžtis Universali dokumentų ↑ženklinimo kalba, kurią apibrėžia tarptautinis standartas ISO 8879. Kalba nustato santykį tarp dokumento turinio ir struktūros. Ja aprašomi dokumentų formatai, indeksavimas …   Enciklopedinis kompiuterijos žodynas

  • Standard Generalized Markup Language —    Abbreviated SGML. A 1986 standard (ISO 8879) for defining the structure and managing the contents of any digital document. The standard specifies a definition for formatting a digital document so that it can be modified, viewed, or output on… …   Dictionary of networking

  • standard generalized markup language — noun (computer science) a standardized language for the descriptive markup of documents; a set of rules for using whatever markup vocabulary is adopted • Syn: ↑SGML • Topics: ↑computer science, ↑computing • Hypernyms: ↑markup …   Useful english dictionary

  • Standard Generalized Markup Language — n. customary language used for writing Internet sites, SGML …   English contemporary dictionary

  • Standard Generalized Markup Language —    A document language that describes embedded data and document structure. Unlike HTML, SGML does not define document formatting; instead it leaves it up to a viewer to define the appearance of the document …   IT glossary of terms, acronyms and abbreviations

  • STANDARD GENERALIZED MARKUP LANGUAGE — (SGML) стандартный язык обобщенной разметки документовоснованный на текстовом представлении информации язык, предназначенный для описания содержимого и структуры цифровых (электронных) документов. Принят Международной организацией по… …   Словарь электронного бизнеса

  • Standard General Markup Language — Standard Generalized Markup Language Standard Generalized Markup Language (langage normalisé de balisage généralisé SGML) est un langage de description à balises, de norme ISO (ISO 8879:1986). Sommaire 1 Historique 2 Normalisation 2.1 DTD …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”