EXPRESS (data modeling language)

EXPRESS is a standard data modelling language for product data. It is formalized in the ISO Standard for the Exchange of Product model data, STEP (ISO 10303), and standardized as ISO 10303-11.

Overview

Data models formally define data objects and the relationships among data objects for a domain of interest. Typical applications of data models include supporting the development of databases and enabling the exchange of data for a particular area of interest. Data models are specified in a data modelling language. [Michael R. McCaleb (1999). "A Conceptual Data Model of Datum Systems" (http://nvl.nist.gov/pub/nistpubs/jres/104/4/html/j44mac.htm#apa). National Institute of Standards and Technology. August 1999.] EXPRESS is a data modelling language defined in ISO 10303-11, the EXPRESS Language Reference Manual. [ISO International Standard 10303-11:1994, Industrial automation systems and integration - Product data representation and exchange - Part 11: Description methods: The EXPRESS language reference manual, International Organization for Standardization, Geneva, Switzerland (1994).]

An EXPRESS data model can be defined in two ways: textually and graphically. For formal verification, and as input for tools such as SDAI, the textual representation within an ASCII file is the most important one. The graphical representation, on the other hand, is often more suitable for human use, such as explanations and tutorials. The graphical representation, called EXPRESS-G, cannot represent all the details that can be formulated in the textual form.

EXPRESS is similar to programming languages such as Pascal. Within a SCHEMA, various datatypes can be defined together with structural constraints and algorithmic rules. A main feature of EXPRESS is the ability to formally validate a population of datatypes, that is, to check it against all the structural and algorithmic rules.

EXPRESS-G is a graphical notation that supports a subset of the EXPRESS language. One advantage of EXPRESS-G over EXPRESS is that the structure of a data model can be presented in a more understandable manner. A disadvantage is that complex constraints cannot be formally specified. Figure 1 is an example: the data model it presents could be used to specify the requirements of a database for an audio compact disc (CD) collection.

Simple example

A simple EXPRESS data model looks like Fig. 2, and the code like this:

    SCHEMA Family;

    ENTITY Person
      ABSTRACT SUPERTYPE OF (ONEOF (Male, Female));
      name: STRING;
      mother: OPTIONAL Female;
      father: OPTIONAL Male;
    END_ENTITY;

    ENTITY Female
      SUBTYPE OF (Person);
    END_ENTITY;

    ENTITY Male
      SUBTYPE OF (Person);
    END_ENTITY;

    END_SCHEMA;

The data model is enclosed within the EXPRESS schema "Family". It contains a supertype entity "Person" with the two subtypes "Male" and "Female". Since "Person" is declared ABSTRACT, only occurrences of its subtypes can exist, and because of ONEOF each occurrence is either a "Male" or a "Female", never both. Every occurrence of a person has a mandatory "name" attribute and, optionally, the attributes "mother" and "father". There is a fixed style of reading for attributes of some entity type:
* a "Female" can play the role of "mother" for a "Person"
* a "Male" can play the role of "father" for a "Person"

EXPRESS Building blocks

Datatypes

EXPRESS offers a series of datatypes, each with a specific symbol in the EXPRESS-G notation:
* Entity data type: This is the most important datatype in EXPRESS. It is covered below in more detail. Entity datatypes can be related in two ways: in a sub-supertype tree and/or by attributes.

* Enumeration data type: Enumeration values are simple strings such as red, green, and blue for an rgb-enumeration. In the case that an enumeration type is declared to be extensible it can be extended in other schemas.

* Defined data type: Defined types further specialize other datatypes. For example, it is possible to define a datatype positive that is of type integer with a value > 0.

* Select data type: Selects define a choice or an alternative between different options. The most commonly used selects are those between different entity_types. Selects that include defined types are rarer. In the case that a select type is declared to be extensible it can be extended in other schemas.

* Simple data type
** String: This is the most often used simple type. EXPRESS strings can be of any length and can contain any character (ISO 10646/Unicode). However it is common practise.
** Binary: This data type is only very rarely used. It covers a number of bits (not bytes). For some implementations the size is limited to 32 bits.
** Logical: Similar to the boolean datatype, a logical has the possible values TRUE and FALSE and, in addition, UNKNOWN.
** Boolean: With the boolean values TRUE and FALSE.
** Number: The number data type is a supertype of both integer and real. Most implementations use a double type to represent a value of this type, even if the actual value is an integer.
** Integer: EXPRESS integers can in principle have any length, but most implementations restrict them to a signed 32-bit value.
** Real: Ideally an EXPRESS real value is unlimited in accuracy and size. In practice, however, a real value is represented by a floating point value of type double.

* Aggregation data type: The possible kinds of aggregation_types are SET, BAG, LIST and ARRAY. While SET and BAG are unordered, LIST and ARRAY are ordered. A BAG may contain a particular value more than once; this is not allowed for a SET. An ARRAY is the only aggregate which may contain unset members; this is not possible for SET, LIST or BAG. The members of an aggregate may be of any other data type.
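
The datatypes listed above can be illustrated with a small schema. The following is only a sketch: all names in it (datatype_sketch, positive, rgb, circle, square, shape, drawing and their attributes) are hypothetical and chosen for illustration, not taken from any STEP schema.

```express
SCHEMA datatype_sketch;

  -- Defined data type: an INTEGER restricted to values greater than 0.
  TYPE positive = INTEGER;
  WHERE
    WR1: SELF > 0;
  END_TYPE;

  -- Enumeration data type with three values.
  TYPE rgb = ENUMERATION OF (red, green, blue);
  END_TYPE;

  -- Two simple entity data types (hypothetical).
  ENTITY circle;
    radius : REAL;
  END_ENTITY;

  ENTITY square;
    side : REAL;
  END_ENTITY;

  -- Select data type: a choice between two entity types.
  TYPE shape = SELECT (circle, square);
  END_TYPE;

  -- Entity using simple, defined, enumeration and aggregation types.
  ENTITY drawing;
    title   : STRING;
    copies  : positive;
    colour  : rgb;
    shapes  : SET [1:?] OF shape;            -- unordered, no duplicates
    layers  : LIST [0:?] OF STRING;          -- ordered
    corners : ARRAY [1:4] OF OPTIONAL REAL;  -- ordered, may contain unset members
  END_ENTITY;

END_SCHEMA;
```

The OPTIONAL keyword in the ARRAY declaration is what permits unset members; without it every position of the array would have to hold a value.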

A few general points about datatypes:
* Constructed datatypes can be defined within an EXPRESS schema. They are mainly used to define entities, and to specify the type of entity attributes and aggregate members.
* Datatypes can be used in a recursive way to build up more and more complex data types. For example, it is possible to define a LIST of an ARRAY of a SELECT of either some entities or other datatypes. Whether it makes sense to define such datatypes is a different question.
* EXPRESS defines a number of rules for how a datatype can be further specialized. This is important for re-declared attributes of entities.
* GENERIC data types can be used for procedures, functions and abstract entities.
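
The recursive construction mentioned above, a LIST of an ARRAY of a SELECT, might look roughly like the following sketch. The schema and all names in it (nesting_sketch, point, label, point_or_label, grid, rows) are hypothetical.

```express
SCHEMA nesting_sketch;

  ENTITY point;
    x, y : REAL;
  END_ENTITY;

  -- A defined type, so that the select below mixes an entity and a defined type.
  TYPE label = STRING;
  END_TYPE;

  TYPE point_or_label = SELECT (point, label);
  END_TYPE;

  ENTITY grid;
    -- A LIST of ARRAYs, each holding three members that are
    -- either a point or a label.
    rows : LIST [1:?] OF ARRAY [1:3] OF point_or_label;
  END_ENTITY;

END_SCHEMA;
```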

Entity-Attribute

Entity attributes make it possible to add "properties" to entities and to relate one entity to another in a specific role. The name of the attribute specifies the role. Most datatypes can directly serve as the type of an attribute, including aggregation types.

There are three different kinds of attributes: explicit, derived and inverse. All of these can be re-declared in a subtype. In addition, an explicit attribute can be re-declared as derived in a subtype. No other change in the kind of an attribute is possible.
* Explicit attributes are those which have direct values visible in a STEP-File.
* Derived attributes get their values from an expression. In most cases the expression refers to other attributes of THIS instance. The expression may also use EXPRESS functions.
* Inverse attributes do not add "information" to an entity, but only name and constrain an explicit attribute to an entity from the other end.
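
The three kinds of attributes can be sketched as follows. This is a minimal illustration under assumed names: the schema attribute_sketch and the entities person and car, with all their attributes, are hypothetical.

```express
SCHEMA attribute_sketch;

  ENTITY person;
    first_name : STRING;   -- explicit attribute
    last_name  : STRING;   -- explicit attribute
  DERIVE
    -- Derived attribute: computed from other attributes of this instance.
    full_name : STRING := first_name + ' ' + last_name;
  INVERSE
    -- Inverse attribute: names the explicit attribute car.owner
    -- as seen from the person's end; it adds no new information.
    owned : SET [0:?] OF car FOR owner;
  END_ENTITY;

  ENTITY car;
    owner : person;        -- explicit attribute relating car to person
  END_ENTITY;

END_SCHEMA;
```

Only first_name, last_name and owner would carry values in an exchange file; full_name and owned are computed from them. The bounds on the inverse attribute also act as a constraint: changing them to [0:1] would state that a person can own at most one car.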

[Figure: specific attribute symbols of the EXPRESS-G notation]

Supertypes and subtypes

An entity can be defined to be a subtype of one or several other entities (multiple inheritance is allowed). A supertype can have any number of subtypes. It is very common practice in STEP to build very complex sub-supertype graphs. Some graphs relate 100 or more entities with each other.

An entity instance can be constructed for either a single entity (if not abstract) or for a complex combination of entities in such a sub-supertype graph. For large graphs the number of possible combinations can become astronomically large. To restrict the possible combinations, special supertype constraints such as ONEOF and TOTAL_OVER were introduced. Furthermore, an entity can be declared abstract to enforce that no instance can be constructed of this entity alone, but only in combination with a non-abstract subtype.
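
A small sub-supertype graph with multiple inheritance might be sketched as follows. The entities (employee, engineer, manager, engineering_manager) are hypothetical. The sketch uses the ANDOR supertype operator, which is part of EXPRESS but is not discussed in the text above; it is shown here as a contrast to ONEOF.

```express
SCHEMA subtype_sketch;

  -- ABSTRACT: no instance may consist of employee alone.
  -- ANDOR: an instance may be an engineer, a manager, or both at once.
  -- With ONEOF instead, the combination of both would be excluded.
  ENTITY employee
    ABSTRACT SUPERTYPE OF (engineer ANDOR manager);
    name : STRING;
  END_ENTITY;

  ENTITY engineer
    SUBTYPE OF (employee);
  END_ENTITY;

  ENTITY manager
    SUBTYPE OF (employee);
  END_ENTITY;

  -- Multiple inheritance: a subtype of two entities.
  ENTITY engineering_manager
    SUBTYPE OF (engineer, manager);
  END_ENTITY;

END_SCHEMA;
```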

Algorithmic constraints

Entities and defined data types may be further constrained with WHERE rules. WHERE rules are also part of global rules. A WHERE rule is an expression which must evaluate to TRUE; otherwise a population of an EXPRESS schema is not valid. Like derived attributes, these expressions may invoke EXPRESS functions, which may in turn invoke EXPRESS procedures. The functions and procedures allow complex statements to be formulated with local variables, parameters and constants, much as in a programming language.

The EXPRESS language can describe local and global rules. For example:

    ENTITY area_unit
      SUBTYPE OF (named_unit);
    WHERE
      WR1: (SELF\named_unit.dimensions.length_exponent = 2) AND
           (SELF\named_unit.dimensions.mass_exponent = 0) AND
           (SELF\named_unit.dimensions.time_exponent = 0) AND
           (SELF\named_unit.dimensions.electric_current_exponent = 0) AND
           (SELF\named_unit.dimensions.thermodynamic_temperature_exponent = 0) AND
           (SELF\named_unit.dimensions.amount_of_substance_exponent = 0) AND
           (SELF\named_unit.dimensions.luminous_intensity_exponent = 0);
    END_ENTITY; -- area_unit

This example states that an area_unit entity must have the dimension of length squared. For this, the attribute dimensions.length_exponent must be equal to 2 and all other exponents of the basic SI units must be 0.

Another example:

    TYPE day_in_week_number = INTEGER;
    WHERE
      WR1: (1 <= SELF) AND (SELF <= 7);
    END_TYPE; -- day_in_week_number

That is, a day_in_week_number value must lie between 1 and 7, inclusive.

Rules of this kind can be attached to entities and defined types. More details on the examples above can be found in ISO 10303-41.
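
Both examples above are local rules. A global rule, together with a function that uses a parameter list and a local variable, might be sketched as follows. This is not taken from ISO 10303-41: the schema rule_sketch, the entity person, the function count_named and the rule at_most_one_unnamed are all hypothetical.

```express
SCHEMA rule_sketch;

  ENTITY person;
    name : STRING;
  END_ENTITY;

  -- Function with two parameters and one local variable:
  -- counts the persons in the given set whose name equals 'wanted'.
  FUNCTION count_named (people : SET OF person; wanted : STRING) : INTEGER;
    LOCAL
      n : INTEGER := 0;
    END_LOCAL;
    REPEAT i := 1 TO SIZEOF(people);
      IF people[i].name = wanted THEN
        n := n + 1;
      END_IF;
    END_REPEAT;
    RETURN (n);
  END_FUNCTION;

  -- Global rule: inside the rule, 'person' stands for the set of all
  -- person instances in the population. At most one of them may have
  -- an empty name.
  RULE at_most_one_unnamed FOR (person);
  WHERE
    WR1: count_named(person, '') <= 1;
  END_RULE;

END_SCHEMA;
```

Unlike a WHERE rule inside an entity, which is checked for each instance separately, the global rule is evaluated once against the whole population.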

See also

;ISO related subjects
* ISO 10303: ISO standard for the computer-interpretable representation and exchange of industrial product data.
* ISO 10303-21: Data exchange form of STEP with an ASCII structure
* ISO 10303-22: Standard data access interface, part of the implementation methods of STEP
* ISO 10303-28: STEP-XML specifies the use of the Extensible Markup Language (XML) to represent EXPRESS schema
* ISO 13399: ISO standard for cutting tool data representation and exchange
* List of STEP (ISO 10303) parts

;Other related subjects
* CAD data exchange
* EDIF: Electronic Design Interchange Format
* Diagram
* General-purpose modeling
* Modeling language
* Wirth syntax notation

Further reading

* ISO 10303-11:2004, Industrial automation systems and integration - Product data representation and exchange - Part 11: Description methods: The EXPRESS language reference manual.
* ISO 10303, the main page for "STEP, the Standard for the Exchange of Product model data"
* Douglas A. Schenck and Peter R. Wilson, "Information Modeling the EXPRESS Way", Oxford University Press, 1993, ISBN13: 978-0-19-508714-7


Wikimedia Foundation. 2010.
