# Self-consistent mean field (biology)

﻿
Self-consistent mean field (biology)

The self-consistent mean field (SCMF) method is an adaptation of mean field theory used in protein structure prediction to determine the optimal amino acid side chain packing given a fixed protein backboneref|Koehl. It is faster but less accurate than dead-end elimination and is generally used in situations where the protein of interest is too large for the problem to be tractable by DEEref|Voigt.

General principles

Like dead-end elimination, the SCMF method explores conformational space by discretizing the dihedral angles of each side chain into a set of rotamers for each position in the protein sequence. The method iteratively develops a probabilistic description of the relative population of each possible rotamer at each position, and the probability of a given structure is defined as a function of the probabilities of its individual rotamer components.

The basic requirements for an effective SCMF implementation are:
# A well-defined finite set of discrete independent variables
# A precomputed numerical value (considered the "energy") associated with each element in the set of variables, and associated with each binary element pair
# An initial probability distribution describing the starting population of each individual rotamer
# A way of updating rotamer energies and probabilities as a function of the mean-field energy

The process is generally initialized with a uniform probability distribution over the rotamers - that is, if there are $p$ rotamers at the $kth$ position in the protein, then the probability of any individual rotamer $r_\left\{k\right\}^\left\{A\right\}$ is $1/p$. The conversion between energies and probabilities is generally accomplished via the Boltzmann distribution, which introduces a temperature factor (thus making the method amenable to simulated annealing). Lower temperatures increase the likelihood of converging to a single solution, rather than to a small subpopulation of solutions.

Mean-field energies

The energy of an individual rotamer $r_\left\{k\right\}$ is dependent on the "mean-field" energy of the other positions - that is, at every other position, each rotamer's energy contribution is proportional to its probability. For a protein of length $N$ with $p$ rotamers per residue, the energy at the current iteration is described by the following expression. Note that for clarity, the mean-field energy at iteration $i$ is denoted by $M_\left\{i\right\}$, whereas the precomputed energies are denoted by $E$, and the probability of a given rotamer is denoted by $P_\left\{i\right\}\left(r_\left\{k\right\}^\left\{A\right\}\right)$.:$M_\left\{i\right\}\left(r_\left\{k\right\}^\left\{A\right\}\right) = E_\left\{k\right\}\left(r_\left\{k\right\}^\left\{A\right\}\right) + sum_\left\{x=1\right\}^\left\{N\right\} sum_\left\{y=1\right\}^\left\{p\right\} P_\left\{i-1\right\}\left(r_\left\{x\right\}^\left\{y\right\}\right) E_\left\{xy\right\}\left(r_\left\{k\right\}^\left\{A\right\}, r_\left\{x\right\}^\left\{y\right\}\right)$

These mean-field energies are used to update the probabilities through the Boltzmann law:

:$P_\left\{i\right\}\left(r_\left\{k\right\}^\left\{A\right\}\right) = left\left(expleft\left(-frac\left\{M_\left\{i\right\}\left(r_\left\{k\right\}^\left\{A\right\}\right)\right\}\left\{kT\right\} ight\right) ight\right)left\left(sum_\left\{y=1\right\}^\left\{p\right\}expleft\left(-frac\left\{M_\left\{i\right\}\left(r_\left\{k\right\}^\left\{y\right\}\right)\right\}\left\{kT\right\} ight\right) ight\right)^\left\{-1\right\}$where $k$ is the Boltzmann constant and $T$ is the temperature factor.

Energy of the system

Although computing the system energy is not required in carrying out the SCMF method, it is useful to know the overall energies of the converged results. The system energy $M_\left\{sys\right\}$ consists of two sums:

:$M_\left\{sys\right\} = M_\left\{single\right\} + M_\left\{pair\right\}$where the addends are defined as::$M_\left\{single\right\} = sum_\left\{x=1\right\}^\left\{N\right\} sum_\left\{y=1\right\}^\left\{p\right\} P\left(r_\left\{x\right\}^\left\{y\right\}\right)E_\left\{x\right\}\left(r_\left\{x\right\}^\left\{y\right\}\right)$

:$M_\left\{pair\right\} = sum_\left\{x=1\right\}^\left\{N\right\} sum_\left\{y=1\right\}^\left\{p\right\} sum_\left\{a=x+1\right\}^\left\{N\right\} sum_\left\{b=1\right\}^\left\{p\right\} left\left(P\left(r_\left\{x\right\}^\left\{y\right\}\right)P\left(r_\left\{a\right\}^\left\{b\right\}\right)E_\left\{xy\right\}\left(r_\left\{x\right\}^\left\{y\right\}, r_\left\{a\right\}^\left\{b\right\}\right) ight\right)$

Convergence

Perfect convergence for the SCMF method would result in a probability of 1 for exactly one rotamer at each position $k$ in the protein, and a probability of zero for all other rotamers at each position. Convergence to a unique solution requires probabilities close to 1 for exactly one rotamer at each position. In practice, especially when higher temperatures are used, the algorithm instead identifies a small number of high-probability rotamers at each position, allowing the resulting conformations' relative energies to then be enumerated (based on the precomputed energies, not on those derived from the mean-field approximation). One way to improve convergence is to run again at a lower temperature using the probabilities calculated from a previous higher-temperature run.

Accuracy

Unlike dead-end elimination, SCMF is not guaranteed to converge on the optimal solution. However, it is deterministic (as in, it will converge to the same solution every time given the same initial conditions), unlike alternatives that rely on Monte Carlo analysis. By comparison to DEE, which is guaranteed to find the optimal solution, SCMF is faster but less accurate overall; it is significantly better at identifying correct side chain conformations in the protein's core than it is on identifying correct surface conformationsref|Voigt. Geometric packing constraints are less restrictive on the surface and thus provide fewer boundaries to the conformational search.

References

# Koehl P, Delarue M. (1994). Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. "J Mol Biol" 239(2):249-75.
# Voigt CA, Gordon DB, Mayo SL. (2000). Trading accuracy for speed: A quantitative comparison of search algorithms in protein sequence design. "J Mol Biol" 299(3):789-803.

Wikimedia Foundation. 2010.

### Look at other dictionaries:

• Self-consistent mean field — may be one of the following:* Mean field theory, an approach to the many body problem in physics and statistical mechanics * Self consistent mean field (biology), an application of this theory to the problem of protein structure prediction …   Wikipedia

• Polymorphism (biology) — Light morph Jaguar (typical) Dark morph or melanistic Jaguar (about …   Wikipedia

• List of mathematics articles (S) — NOTOC S S duality S matrix S plane S transform S unit S.O.S. Mathematics SA subgroup Saccheri quadrilateral Sacks spiral Sacred geometry Saddle node bifurcation Saddle point Saddle surface Sadleirian Professor of Pure Mathematics Safe prime Safe… …   Wikipedia

• metaphysics — /met euh fiz iks/, n. (used with a sing. v.) 1. the branch of philosophy that treats of first principles, includes ontology and cosmology, and is intimately connected with epistemology. 2. philosophy, esp. in its more abstruse branches. 3. the… …   Universalium

• Experiment — Experimental redirects here. For the musical classification, see Experimental music. For other uses, see Experiment (disambiguation). Even very young children perform rudimentary experiments in order to learn about the world. An experiment is a… …   Wikipedia

• Science — This article is about the general term, particularly as it refers to experimental sciences. For the specific topics of study by scientists, see Natural science. For other uses, see Science (disambiguation) …   Wikipedia

• Modern history — Modern and Modern Age redirect here. For other uses, see Modern (disambiguation) and Modern Age (disambiguation). Human history This box: view · talk · …   Wikipedia

• Time dilation — This article is about a concept in physics. For the concept in sociology, see time displacement. In the theory of relativity, time dilation is an observed difference of elapsed time between two events as measured by observers either moving… …   Wikipedia

• Monte Carlo method — Not to be confused with Monte Carlo algorithm. Computational physics …   Wikipedia

• Theory — The word theory has many distinct meanings in different fields of knowledge, depending on their methodologies and the context of discussion.In science a theory is a testable model of the manner of interaction of a set of natural phenomena,… …   Wikipedia