 Maximum entropy thermodynamics

In physics, maximum entropy thermodynamics (colloquially, MaxEnt thermodynamics) views equilibrium thermodynamics and statistical mechanics as inference processes. More specifically, MaxEnt applies inference techniques rooted in Shannon information theory, Bayesian probability, and the principle of maximum entropy. These techniques are relevant to any situation requiring prediction from incomplete or insufficient data (e.g., image reconstruction, signal processing, spectral analysis, and inverse problems). MaxEnt thermodynamics began with two papers by Edwin T. Jaynes, published in Physical Review in 1957.
Maximum Shannon entropy
Central to the MaxEnt thesis is the principle of maximum entropy. It states that, given certain "testable information" about a probability distribution (for example, particular expectation values) which is not in itself sufficient to determine the distribution uniquely, one should prefer the distribution that maximizes the Shannon information entropy.
This is known as the Gibbs algorithm, having been introduced by J. Willard Gibbs in 1878 to set up statistical ensembles to predict the properties of thermodynamic systems at equilibrium. It is the cornerstone of the statistical mechanical analysis of the thermodynamic properties of equilibrium systems (see partition function).
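For example, maximizing the Shannon entropy S_{I} = - \sum_{i} p_{i} \ln p_{i} subject to normalization and a prescribed mean energy \langle E \rangle recovers the canonical (Gibbs) distribution. Setting the constrained variation to zero with Lagrange multipliers \lambda and \beta,

\frac{\partial}{\partial p_{i}} \left[ - \sum_{j} p_{j} \ln p_{j} - \lambda \left( \sum_{j} p_{j} - 1 \right) - \beta \left( \sum_{j} p_{j} E_{j} - \langle E \rangle \right) \right] = 0 \quad \Rightarrow \quad p_{i} = \frac{e^{-\beta E_{i}}}{Z}, \qquad Z = \sum_{j} e^{-\beta E_{j}},

where the multiplier \beta is fixed by the constraint via \langle E \rangle = - \partial \ln Z / \partial \beta, and is identified physically with 1/k_{B}T.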
A direct connection is thus made between the equilibrium thermodynamic entropy S_{Th}, a state function of pressure, volume, temperature, etc., and the information entropy for the predicted distribution with maximum uncertainty conditioned only on the expectation values of those variables:

S_{Th}(P, V, T, ...) = k_{B} S_{I}

k_{B}, Boltzmann's constant, has no fundamental physical significance here, but is necessary to retain consistency with the previous historical definition of entropy by Clausius (1865) (see Boltzmann's constant).
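To make the inference step concrete, the following minimal Python sketch solves the same kind of problem numerically; the three energy levels and the target mean energy are invented purely for illustration:

```python
import numpy as np
from scipy.optimize import brentq

# Illustrative (invented) three-level system and expectation-value constraint.
E = np.array([0.0, 1.0, 2.0])   # energy levels
E_mean = 0.7                    # "testable information": the measured <E>

def mean_energy(beta):
    """Mean energy of the Gibbs distribution p_i proportional to exp(-beta * E_i)."""
    w = np.exp(-beta * E)
    p = w / w.sum()
    return p @ E

# The MaxEnt distribution is exponential in the constrained quantity; only
# the Lagrange multiplier beta remains to be solved for.
beta = brentq(lambda b: mean_energy(b) - E_mean, -50.0, 50.0)

p = np.exp(-beta * E)
p /= p.sum()
print("beta =", beta)
print("MaxEnt distribution p =", p)
print("Shannon entropy S_I =", -(p * np.log(p)).sum())
```

The same scheme extends to any number of expectation-value constraints, with one Lagrange multiplier per constraint.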
However, the MaxEnt school argues that the MaxEnt approach is a general technique of statistical inference, with applications far beyond this. It can therefore also be used to predict a distribution for "trajectories" Γ "over a period of time", by maximising:

S_{I} = - \sum_{\Gamma} p_{\Gamma} \ln p_{\Gamma}
This "information entropy" does not necessarily have a simple correspondence with thermodynamic entropy. But it can be used to predict features of nonequilibrium thermodynamic systems as they evolve over time.
In the field of near-equilibrium thermodynamics, the Onsager reciprocal relations and the Green–Kubo relations fall out very directly. The approach also creates a solid theoretical framework for the study of far-from-equilibrium thermodynamics, making the derivation of the entropy production fluctuation theorem particularly straightforward. Practical calculations for most far-from-equilibrium systems remain very challenging, however.
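Schematically (prefactors depend on the particular transport coefficient), the content of these results is as follows. The Onsager relations assert the symmetry L_{ij} = L_{ji} of the matrix of linear coefficients relating thermodynamic fluxes J_{i} = \sum_{j} L_{ij} X_{j} to the forces X_{j} driving them, while a Green–Kubo relation expresses a transport coefficient \gamma as the time integral of an equilibrium autocorrelation function of the corresponding flux:

\gamma \propto \int_{0}^{\infty} \langle J(0) \, J(t) \rangle_{eq} \, dt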
Technical note: For the reasons discussed in the article differential entropy, the simple definition of Shannon entropy ceases to be directly applicable for random variables with continuous probability distribution functions. Instead the appropriate quantity to maximise is the "relative information entropy",

H_{c} = - \int p(x) \log \frac{p(x)}{m(x)} \, dx
H_{c} is the negative of the Kullback–Leibler divergence, or discrimination information, of m(x) from p(x), where m(x) is a prior invariant measure for the variable(s). The relative entropy H_{c} is never greater than zero, and can be thought of as (the negative of) the number of bits of uncertainty lost by fixing on p(x) rather than m(x). Unlike the Shannon entropy, the relative entropy H_{c} has the advantage of remaining finite and well-defined for continuous x, and invariant under one-to-one coordinate transformations. The two expressions coincide (up to an additive constant) for discrete probability distributions, if one can make the assumption that m(x_{i}) is uniform, i.e. the principle of equal a priori probability, which underlies statistical thermodynamics.
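To see the discrete reduction explicitly: with n outcomes and the uniform measure m(x_{i}) = 1/n,

H_{c} = - \sum_{i} p_{i} \log (n p_{i}) = - \sum_{i} p_{i} \log p_{i} - \log n

so H_{c} differs from the Shannon entropy only by the constant \log n, and maximizing one is equivalent to maximizing the other.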
Philosophical implications
Adherents to the MaxEnt viewpoint take a clear position on some of the conceptual/philosophical questions in thermodynamics. This position is sketched below.
The nature of the probabilities in statistical mechanics
Jaynes (1985,^{[1]} 2003,^{[2]} et passim) discussed the concept of probability. According to the MaxEnt viewpoint, the probabilities in statistical mechanics are determined jointly by two factors: by a specified model for the underlying state space (e.g. Liouvillian phase space), and by a specified partial description of the system (the macroscopic description of the system used to constrain the MaxEnt probability assignment). The probabilities are objective in the sense that, given these inputs, a uniquely defined probability distribution results, independent of the subjectivity or arbitrary opinion of particular persons. The probabilities are epistemic in the sense that they are defined in terms of specified data and derived from those data by definite and objective rules of inference. Here the word epistemic, which refers to objective and impersonal scientific knowledge, is used in the sense that contrasts it with opiniative, which refers to the subjective or arbitrary beliefs of particular persons; this contrast was drawn by Plato and Aristotle, and it remains serviceable today.
The probabilities represent both the extent and the limits of the information in the data and in the model used for the analyst's macroscopic description of the system, and also what those data say about the nature of the underlying reality.
The fitness of the probabilities depends on whether the constraints of the specified macroscopic model are a sufficiently accurate and/or complete description of the system to capture all of the experimentally reproducible behaviour. This cannot be guaranteed a priori. For this reason MaxEnt proponents also call the method predictive statistical mechanics. The predictions can fail. But if they do, this is informative, because it signals the presence of new constraints needed to capture reproducible behaviour in the system, constraints which had not been taken into account.
Is entropy "real"?
The thermodynamic entropy (at equilibrium) is a function of the state variables of the model description. It is therefore as "real" as the other variables in the model description. If the model constraints in the probability assignment are a "good" description, containing all the information needed to predict reproducible experimental results, then that includes all of the results one could predict using the formulae involving entropy from classical thermodynamics. To that extent, the MaxEnt S_{Th} is as "real" as the entropy in classical thermodynamics.
Of course, in reality there is only one real state of the system. The entropy is not a direct function of that state. It is a function of the real state only through the (subjectively chosen) macroscopic model description.
Is ergodic theory relevant?
The Gibbsian ensemble idealises the notion of repeating an experiment again and again on different systems, not again and again on the same system. So long-term time averages and the ergodic hypothesis, despite the intense interest in them in the first part of the twentieth century, are, strictly speaking, not relevant to the probability assignment for the state in which one might find the system.
However, this changes if there is additional knowledge that the system is being prepared in a particular way some time before the measurement. One must then consider whether this gives further information which is still relevant at the time of measurement. The question of how "rapidly mixing" the different properties of the system are then becomes of considerable interest. Information about some degrees of freedom of the combined system may become unusable very quickly; information about other properties of the system may go on being relevant for a considerable time.
If nothing else, the medium- and long-run time-correlation properties of the system are interesting subjects for experimentation in themselves. Failure to accurately predict them is a good indicator that relevant macroscopically determinable physics may be missing from the model.
The Second Law
According to Liouville's theorem for Hamiltonian dynamics, the hypervolume of a cloud of points in phase space remains constant as the system evolves. Therefore, the information entropy must also remain constant, if we condition on the original information and then follow each of those microstates forward in time:

S_{I}^{(2)} = S_{I}^{(1)}
However, as time evolves, that initial information we had becomes less directly accessible. Instead of being easily summarisable in the macroscopic description of the system, it increasingly relates to very subtle correlations between the positions and momenta of individual molecules. (Compare to Boltzmann's H-theorem.) Equivalently, it means that the probability distribution for the whole system, in 6N-dimensional phase space, becomes increasingly irregular, spreading out into long thin fingers rather than the initial tightly defined volume of possibilities.
Classical thermodynamics is built on the assumption that entropy is a state function of the macroscopic variables, i.e. that none of the history of the system matters, so that it can all be ignored.
The extended, wispy, evolved probability distribution, which still has the initial Shannon entropy S_{I}^{(1)}, should reproduce the expectation values of the observed macroscopic variables at time t_{2}. However, it will no longer necessarily be a maximum entropy distribution for that new macroscopic description. On the other hand, the new thermodynamic entropy S_{Th}^{(2)} assuredly will measure the maximum entropy distribution, by construction. Therefore, we expect:

S_{Th}^{(2)} ≥ S_{Th}^{(1)}
At an abstract level, this result simply means that some of the information we originally had about the system has become "no longer useful" at a macroscopic level. At the level of the 6N-dimensional probability distribution, this result represents coarse graining, i.e. information loss by smoothing out very fine-scale detail.
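A minimal numerical sketch of this coarse graining (the choice of map, grid size, and initial region is arbitrary and purely for illustration): points sampled from a small initial volume are evolved under Arnold's cat map, an area-preserving chaotic map standing in for Hamiltonian flow. The fine-grained (Liouville) entropy is conserved by construction, but the Shannon entropy of the coarse-grained histogram climbs toward its maximum value ln 400 ≈ 6.0:

```python
import numpy as np

rng = np.random.default_rng(0)

# A tightly defined initial volume of possibilities: a small square.
x = rng.uniform(0.0, 0.1, 100_000)
y = rng.uniform(0.0, 0.1, 100_000)

def coarse_entropy(x, y, bins=20):
    """Shannon entropy of the coarse-grained (histogrammed) distribution."""
    h, _, _ = np.histogram2d(x, y, bins=bins, range=[[0, 1], [0, 1]])
    p = h.ravel() / h.sum()
    p = p[p > 0]
    return -(p * np.log(p)).sum()

for t in range(6):
    print(f"t={t}: coarse-grained S = {coarse_entropy(x, y):.3f}")
    # Arnold's cat map: area-preserving, so the fine-grained entropy stays
    # constant, but the cloud stretches into long thin fingers over the square.
    x, y = (x + y) % 1.0, (x + 2.0 * y) % 1.0
```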
Caveats with the argument
Some caveats should be considered with the above.
1. Like all statistical mechanical results according to the MaxEnt school, this increase in thermodynamic entropy is only a prediction. It assumes in particular that the initial macroscopic description contains all of the information relevant to predicting the later macroscopic state. This may not be the case, for example if the initial description fails to reflect some aspect of the preparation of the system which later becomes relevant. In that case the "failure" of a MaxEnt prediction tells us that there is something more which is relevant that we may have overlooked in the physics of the system.
It is also sometimes suggested that quantum measurement, especially in the decoherence interpretation, may give an apparently unexpected reduction in entropy per this argument, as it appears to involve macroscopic information becoming available which was previously inaccessible. (However, the entropy accounting of quantum measurement is tricky, because to get full decoherence one may be assuming an infinite environment, with an infinite entropy).
2. The argument so far has glossed over the question of fluctuations. It has also implicitly assumed that the uncertainty predicted at time t_{1} for the variables at time t_{2} will be much smaller than the measurement error. But if the measurements do meaningfully update our knowledge of the system, our uncertainty as to its state is reduced, giving a new S_{I}^{(2)} which is less than S_{I}^{(1)}. (Note that if we allow ourselves the abilities of Laplace's demon, the consequences of this new information can also be mapped backwards, so our uncertainty about the dynamical state at time t_{1} is now also reduced from S_{I}^{(1)} to S_{I}^{(2)} ).
We know that S_{Th}^{(2)} ≥ S_{I}^{(2)}; but we can now no longer be certain that it is greater than S_{Th}^{(1)} = S_{I}^{(1)}. This leaves open the possibility of fluctuations in S_{Th}: the thermodynamic entropy may go "down" as well as up. A more sophisticated analysis is given by the entropy fluctuation theorem, which can be established as a consequence of the time-dependent MaxEnt picture.
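In schematic form (with entropy production measured in units of k_{B}), the fluctuation theorem states that over an observation time t the probabilities of seeing a time-averaged entropy production rate of magnitude A with each sign satisfy

\frac{P(\bar{\sigma}_{t} = A)}{P(\bar{\sigma}_{t} = -A)} = e^{A t}

so entropy-consuming fluctuations are possible, but become exponentially suppressed as the observation time and the magnitude of the fluctuation grow.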
3. As just indicated, the MaxEnt inference runs equally well in reverse. So given a particular final state, we can ask what we can "retrodict" to improve our knowledge about earlier states. However, the Second Law argument above also runs in reverse: given macroscopic information at time t_{2}, we should expect it too to become less useful. The two procedures are time-symmetric. But now the information will become less and less useful at earlier and earlier times. (Compare with Loschmidt's paradox.) The MaxEnt inference would predict that the most probable origin of a currently low-entropy state would be a spontaneous fluctuation from an earlier high-entropy state. But this conflicts with what we know to have happened, namely that entropy has been increasing steadily, even back in the past.
The MaxEnt proponents' response to this would be that such a systematic failure in the prediction of a MaxEnt inference is a "good" thing.^{[3]} It means that there is thus clear evidence that some important physical information has been missed in the specification of the problem. If it is correct that the dynamics "are" time-symmetric, it appears that we need to put in by hand a prior probability that initial configurations with a low thermodynamic entropy are more likely than initial configurations with a high thermodynamic entropy. This cannot be explained by the immediate dynamics. Quite possibly, it arises as a reflection of the evident time-asymmetric evolution of the universe on a cosmological scale (see arrow of time).
Criticisms
Maximum entropy thermodynamics has generally failed to win acceptance from the majority of scientists, with mainstream thermodynamicists regarding Jaynes' work as an unfounded mathematical contrivance. This is in part because of the relative paucity of published results from the MaxEnt school, especially with regard to new testable predictions far from equilibrium.^{[4]}
The theory has also been criticized on the grounds of internal consistency. For instance, Radu Balescu provides a concise but strong criticism of the MaxEnt school and of Jaynes' work. Balescu argues that the theory of Jaynes and coworkers is based on a non-transitive evolution law that produces ambiguous results. Although some difficulties of the theory can be cured, he concludes that the theory "lacks a solid foundation" and "has not led to any new concrete result".^{[5]}
References
 ^ Jaynes, E.T. (1985). "Some random observations". Synthese 63: 115–138. doi:10.1007/BF00485957.
 ^ Jaynes, E.T. (2003). Bretthorst, G.L. (ed.). Probability Theory: The Logic of Science. Cambridge: Cambridge University Press. ISBN 0521592712.
 ^ Jaynes, E.T. (1979). "Where do we stand on maximum entropy?". In Levine, R.; Tribus, M. (eds.). The Maximum Entropy Formalism. MIT Press. ISBN 0262120801. http://bayes.wustl.edu/etj/articles/stand.on.entropy.pdf.
 ^ Kleidon, Axel; Lorenz, Ralph D. (2005). Non-equilibrium Thermodynamics and the Production of Entropy: Life, Earth, and Beyond. Springer. pp. 42–. ISBN 9783540224952. http://books.google.com/books?id=YRjfuEP_QycC&pg=PA42.
 ^ Balescu, Radu (1997). Statistical Dynamics: Matter out of Equilibrium. London: Imperial College Press.
Further reading
 Bajkova, A.T. (1992). "The generalization of maximum entropy method for reconstruction of complex functions". Astronomical and Astrophysical Transactions 1 (4): 313–320. Bibcode 1991A&AT....1..313B. doi:10.1080/10556799208230532.
 Dewar, R.C. (2003). "Information theory explanation of the fluctuation theorem, maximum entropy production and self-organized criticality in non-equilibrium stationary states". J. Phys. A: Math. Gen. 36 (3): 631–41. arXiv:cond-mat/0005382. Bibcode 2003JPhA...36..631D. doi:10.1088/0305-4470/36/3/303. http://iopscience.iop.org/03054470/36/3/303.
 Grinstein, G.; Linsker, R. (2007). "Comments on a derivation and application of the 'maximum entropy production' principle". J. Phys. A: Math. Theor. 40 (31): 9717–20. Bibcode 2007JPhA...40.9717G. doi:10.1088/1751-8113/40/31/N01. http://www.iop.org/EJ/abstract/17518121/40/31/N01/. Shows the invalidity of Dewar's derivations (a) of maximum entropy production (MaxEP) from the fluctuation theorem for far-from-equilibrium systems, and (b) of a claimed link between MaxEP and self-organized criticality.
 Grandy, W.T. (1987). Foundations of Statistical Mechanics. Vol. 1: Equilibrium Theory; Vol. 2: Nonequilibrium Phenomena. Dordrecht: D. Reidel. Vol. 1: ISBN 902772489X. Vol. 2: ISBN 9027726493.
 Gull, S.F. (1991). "Some misconceptions about entropy". In Buck, B.; Macaulay, V.A. (eds.). Maximum Entropy in Action. Oxford University Press. ISBN 0198539630. http://www.ucl.ac.uk/~ucesjph/reality/entropy/text.html.
 Jaynes, E.T. (1979). "Where do we stand on maximum entropy?" (see reference 3 above).
 Extensive archive of further papers by E.T. Jaynes on probability and physics. Many are collected in Rosenkrantz, R.D. (ed.) (1983). E.T. Jaynes: Papers on Probability, Statistics and Statistical Physics. Dordrecht, Netherlands: D. Reidel. ISBN 9027714487.
 Lorenz, R. (2003). "Full steam ahead — probably" (PDF). Science 299 (5608): 837–8. doi:10.1126/science.1081280. http://www.lpl.arizona.edu/~rlorenz/fullsteamahead.pdf.
 Rau, Jochen (1998). "Statistical Mechanics in a Nutshell". arXiv:physics/9805024 [physics.ed-ph].