Molecular descriptor

Molecular descriptor

Molecular descriptors play a fundamental role in chemistry, pharmaceutical sciences, environmental protection policy, and health researches, as well as in quality control, being the way molecules, thought of as real bodies, are transformed into numbers, allowing some mathematical treatment of the chemical information contained in the molecule. This was defined by Todeschini and Consonni as:

"The molecular descriptor is the final result of a logic and mathematical procedure which transforms chemical information encoded within a symbolic representation of a molecule into a useful number or the result of some standardized experiment."[1]

By this definition, the molecular descriptors are divided into two main categories: experimental measurements, such as log P, molar refractivity, dipole moment, polarizability, and, in general, physico-chemical properties, and theoretical molecular descriptors, which are derived from a symbolic representation of the molecule and can be further classified according to the different types of molecular representation.

The main classes of theoretical molecular descriptors are: 1) 0D-descriptors (i.e. constitutional descriptors, count descriptors), 1D-descriptors (i.e. list of structural fragments, fingerprints), 2D-descriptors (i.e. graph invariants), 3D-descriptors (such as, for example, 3D-MoRSE descriptors, WHIM descriptors, GETAWAY descriptors, quantum-chemical descriptors, size, steric, surface and volume descriptors), 4D-descriptors (such as those derived from GRID or CoMFA methods, Volsurf).

Contents

Invariance properties of molecular descriptors

The invariance properties of molecular descriptors can be defined as the ability of the algorithm for their calculation to give a descriptor value that is independent of the particular characteristics of the molecular representation, such as atom numbering or labeling, spatial reference frame, molecular conformations, etc. Invariance to molecular numbering or labeling is assumed as a minimal basic requirement for any descriptor.

Two other important invariance properties, translational invariance and rotational invariance, are the invariance of a descriptor value to any translation or rotation of the molecules in the chosen reference frame. These last invariance properties are required for the 3D-descriptors.

Degeneracy of molecular descriptors

This property refers to the ability of a descriptor to avoid equal values for different molecules. In this sense, descriptors can show no degeneracy at all, low, intermediate, or high degeneracy. For example, the number of molecule atoms and the molecular weights are high degeneracy descriptors, while, usually, 3D-descriptors show low or no degeneracy at all.

Basic requirements for optimal descriptors

1 Should have structural interpretation

2 Should have good correlation with at least one property

3 Should preferably discriminate among isomers

4 Should be possible to apply to local structure

5 Should possible to generalize to "higher" descriptors

6 Should be simple

7 Should not be based on experimental properties

8 Should not be trivially related to other descriptors

9 Should be possible to construct efficiently

10 Should use familiar structural concepts

11 Should change gradually with gradual change in structures

12 Should have the correct size dependence, if related to the molecule size

See also

References

  1. ^ Roberto Todeschini and Viviana Consonni, Handbook of Molecular Descriptors, Wiley-VCH, 2000.http://www.moleculardescriptors.eu/books/handbook.htm

Bibliography

Roberto Todeschini and Viviana Consonni, Molecular Descriptors for Chemoinformatics (2 volumes), Wiley-VCH, 2009.

James Devillers and Alexandru T. Balaban (Eds.), Topological indices and related descriptors in QSAR and QSPR. Taylor & Francis, 2000.

Lemont Kier and Lowell Hall, Molecular structure description. Academic Press, 1999.

Alexandru T. Balaban (Ed.), From chemical topology to three-dimensional geometry. Plenum Press, 1997.

External links



Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Descriptor — may refer to file descriptor, an abstract key for accessing a file index term, also known as a descriptor in information retrieval molecular descriptor, which helps characterize a chemical compound segment descriptor, used for memory addressing… …   Wikipedia

  • Docking (molecular) — Docking glossary • Receptor or host or lock – The receiving molecule, most commonly a protein or other biopolymer. • Ligand or guest or key – The complementary partner molecule which binds to the receptor. Ligands are most often small molecules… …   Wikipedia

  • Topological index — For topological index in mathematics, see Atiyah–Singer index theorem. In the fields of chemical graph theory, molecular topology, and mathematical chemistry, a topological index also known as a connectivity index is a type of a molecular… …   Wikipedia

  • Klincewicz method — The Klincewicz method [Klincewicz K. M., Reid R. C., Estimation of Critical Properties with Group Contribution Methods , AIChE Journal, 30(1), 137 142, 1984] is a predictive method based both on group contributions and on a correlation with some… …   Wikipedia

  • Mathematical chemistry — is the area of research engaged in novel applications of mathematics to chemistry; it concerns itself principally with the mathematical modeling of chemical phenomena.[1] Mathematical chemistry has also sometimes been called computer chemistry,… …   Wikipedia

  • Smiles arbitrary target specification — (SMARTS) is a language for specifying substructural patterns in molecules. The SMARTS line notation is expressive and allows extremely precise and transparent substructural specification and atom typing.SMARTS is related to the SMILES line… …   Wikipedia

  • ГОСТ Р ИСО 14644-6-2010: Чистые помещения и связанные с ними контролируемые среды. Часть 6. Термины — Терминология ГОСТ Р ИСО 14644 6 2010: Чистые помещения и связанные с ними контролируемые среды. Часть 6. Термины оригинал документа: 2.136 U дескриптор (U descriptor): Концентрация частиц (2.102) в 1 м3 воздуха, включая ультрамелкие частицы… …   Словарь-справочник терминов нормативно-технической документации

  • Perfluorooctanoic acid — IUPAC name pentadecafluorooctanoic acid …   Wikipedia

  • heredity — /heuh red i tee/, n., pl. heredities. Biol. 1. the transmission of genetic characters from parents to offspring: it is dependent upon the segregation and recombination of genes during meiosis and fertilization and results in the genesis of a new… …   Universalium

  • Molecule mining — This page describes mining for molecules. Since molecules may be represented by molecular graphs this is strongly related to graph mining and structured data mining. The main problem is how to represent molecules while discriminating the data… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”