Total variation

As the green ball travels on the graph of the given function, the length of the path travelled by that ball's projection on the y-axis, shown as a red ball, is the total variation of the function.

In mathematics, the total variation identifies several slightly different concepts, related to the (local or global) structure of the codomain of a function or a measure. For a real-valued continuous function $f$ , defined on an interval [a, b] ⊂ ℝ, its total variation on the interval of definition is a measure of the one-dimensional arclength of the curve with parametric equation x → $f$ (x), for x ∈ [a,b].

Historical notice

The concept of total variation for functions of one real variable was first introduced by Camille Jordan in the paper (Jordan 1881).^[1] He used the new concept in order to prove a convergence theorem for Fourier series of discontinuous periodic functions whose variation is bounded. The extension of the concept to functions of more than one variable however is not simple for some reasons.

Definitions

Total variation for functions of one real variable

Definition 1.1. The total variation of a real-valued (or more generally complex-valued) function $f$ , defined on an interval $[a, b]$ ⊂ℝ is the quantity

$V^a_b(f)=\sup_P \sum_{i=0}^{n_P-1} | f(x_{i+1})-f(x_i) |, \,$

where the supremum runs over the set of all partitions $\scriptstyle \mathcal{P} =\left\{P=\{ x_0, \dots , x_{n_P}\}|P\text{ is a partition of } [a,b] \right\}$ of the given interval.

Total variation for functions of n>1 real variables

Definition 1.2. Let $Ω$ be an open subset of ℝⁿ. Given a function $f$ belonging to $L 1 (Ω)$ , the total variation of $f$ in is defined as

$V(f,\Omega):=\sup\left\{\int_\Omega f(x)\mathrm{div}\phi(x)\,\mathrm{d}x\colon \phi\in C_c^1(\Omega,\mathbb{R}^n),\ \Vert \phi\Vert_{L^\infty(\Omega)}\le 1\right\},$

where $\scriptstyle C_c^1(\Omega,\mathbb{R}^n)$ is the set of continuously differentiable vector functions of compact support contained in $Ω$ , and $\scriptstyle \Vert\;\Vert_{L^\infty(\Omega)}$ is the essential supremum norm. Note that this definition does not require that the domain $Ω$ ⊆ℝⁿ of the given function is a bounded set.

Total variation in measure theory

Following Saks (1937, p. 10), consider a signed measure $μ$ on a measurable space $(X,Σ)$ : then it is possible to define two set functions $\scriptstyle\overline{\mathrm{W}}(\mu,\cdot)$ and $\scriptstyle\underline{\mathrm{W}}(\mu,\cdot)$ , respectively called upper variation and lower variation, as follows

$\overline{\mathrm{W}}(\mu,E)=\sup\left\{\mu(A)|A\in\Sigma\text{ and }A\subset E \right\}\qquad\forall E\in\Sigma$

$\underline{\mathrm{W}}(\mu,E)=\inf\left\{\mu(A)|A\in\Sigma\text{ and }A\subset E \right\}\qquad\forall E\in\Sigma$

clearly

$\overline{\mathrm{W}}(\mu,E)\geq 0\geq \underline{\mathrm{W}}(\mu,E)\qquad\forall E\in\Sigma$

Definition 1.3. The variation (also called absolute variation) of the signed measure $μ$ is the set function

$|\mu|(E)=\overline{\mathrm{W}}(\mu,E)+\left|\underline{\mathrm{W}}(\mu,E)\right|\qquad\forall E\in\Sigma$

and its total variation is defined as the value of this measure on the whole space of definition, i.e.

$\|\mu\|=|\mu|(X)$

Saks (1937, p. 11) uses upper and lower variations to prove the Hahn–Jordan decomposition: according to his version of this theorem, the upper and lower variation are respectively a non-negative and a non-positive measure. Using a more modern notation, define

$\mu^+(\cdot)=\overline{\mathrm{W}}(\mu,\cdot)\,,$

$\mu^-(\cdot)=-\underline{\mathrm{W}}(\mu,\cdot)\,,$

Then $μ +$ and $μ -$ are two non-negative measures such that

$\mu=\mu^+-\mu^-\,$

$|\mu|=\mu^++\mu^-\,$

The last measure is sometimes called, by abuse of notation, total variation measure.

If the measure $μ$ is complex-valued i.e. is a complex measure, its upper and lower variation cannot be defined and the Hahn–Jordan decomposition theorem can only be applied to its real and imaginary parts. However, it is possible to follow Rudin (1966, pp. 137–139) and define the total variation of the complex-valued measure $μ$ as follows

Definition 1.4. The variation of the complex-valued measure $μ$ is the set function

$|\mu|(E)=\sup_\pi \sum_{A\isin\pi} |\mu(A)|\qquad\forall E\in\Sigma$

where the supremum is taken over all partitions $π$ of a measurable set $E$ into a finite number of disjoint measurable subsets.

The variation so defined is a positive measure (see Rudin (1966, p. 139)) and coincides with the one defined by 1.3 when $μ$ is a signed measure: its total variation is defined as above. This definition works also if $μ$ is a vector measure: the variation is then defined by the following formula

$|\mu|(E) = \sup_\pi \sum_{A\isin\pi} \|\mu(A)\|\qquad\forall E\in\Sigma$

where the supremum is as above. Note also that this definition is slightly more general than the one given by Rudin (1966, p. 138) since it requires only to consider finite partitions of the space $X$ : this implies that it can be used also to define the total variation on finitely-additive measures.

Total variation of probability measures

Main article: Total variation distance of probability measures

The total variation of any probability measure is exactly one, therefore it is not interesting as a means of investigating the properties of such measures. However, when μ and ν are probability measure, the total variation distance of probability measures can be defined as

$d(\mu,\nu) = \sup\left\{\,\left|\mu(A)-\nu(A)\right| : A\in \Sigma\,\right\}$

and its values are non-trivial. Informally, this is the largest possible difference between the probabilities that the two probability distributions can assign to the same event. For a categorical distribution it is possible to write the total variation distance as follows

$\delta(\mu,\nu) = \frac 1 2 \sum_x \left| \mu(x) - \nu(x) \right|\;.$

The total variational distance for categorical probability distributions is called statistical distance: sometimes, in the definition of this distance, the factor $\scriptstyle\frac 1 2$ is omitted.

Basic properties

Total variation of differentiable functions

The total variation of a differentiable function $f$ can be expressed as an integral involving the given function instead of as the supremum of the functionals of definitions 1.1 and 1.2.

The form of the total variation of a differentiable functions of one variable

Theorem 1. The total variation of a differentiable function $f$ , defined on an interval $[a, b]$ ⊂ℝ, has the following expression

$V^a_b(f) = \int _a^b |f'(x)|\mathrm{d}x$

The form of the total variation of a differentiable functions of several variables

Theorem 2. Given a differentiable function $f$ defined on a bounded open set $Ω$ ⊆ℝⁿ, the total variation of $f$ has the following expression

$V(f,\Omega) = \int\limits_\Omega\left|\nabla f(x)\right|\mathrm{d}x$

Proof

The first step in the proof is to first prove an equality which follows from the Gauss-Ostrogradsky theorem.

Lemma

Under the conditions of the theorem, the following equality holds:

$\int\limits_\Omega f\,\mathrm{div}\varphi = -\int_\Omega\nabla f\cdot\varphi$

Proof of the lemma

From the Gauss-Ostrogradsky theorem:

$\int\limits_\Omega \text{div}\mathbf R = \int\limits_{\partial\Omega}\mathbf R\cdot \mathbf n$

by subtituting $\mathbf R:= f\mathbf\varphi$ , we have:

$\int\limits_\Omega\text{div}\left(f\mathbf\varphi\right) = \int\limits_{\partial\Omega}\left(f\mathbf\varphi\right)\cdot\mathbf n$

where $\mathbf\varphi$ is zero on the border of $Ω$ by definition:

$\int\limits_\Omega\text{div}\left(f\mathbf\varphi\right)=0$

$\int\limits_\Omega \partial_{x_i} \left(f\mathbf\varphi_i\right)=0$

$\int\limits_\Omega \mathbf\varphi_i\partial_{x_i} f + f\partial_{x_i}\mathbf\varphi_i=0$

$\int\limits_\Omega f\partial_{x_i}\mathbf\varphi_i = - \int\limits_\Omega \mathbf\varphi_i\partial_{x_i} f$

$\int\limits_\Omega f\text{div} \mathbf\varphi = - \int\limits_\Omega \mathbf\varphi\cdot\nabla f$

Proof of the equality

Under the conditions of the theorem, from the lemma we have:

$\int\limits_\Omega f\text{div} \mathbf\varphi = - \int\limits_\Omega \mathbf\varphi\cdot\nabla f \leq \left| \int\limits_\Omega \mathbf\varphi\cdot\nabla f \right|\leq \int\limits_\Omega \left|\mathbf\varphi\right|\cdot\left|\nabla f\right|\leq \int\limits_\Omega \left|\nabla f\right|$

in the last part $\mathbf\varphi$ could be omitted, because by definition it's considerate supremum is at most one.

On the other hand we consider $\theta_n:=\mathbb I_{\left[-N,N\right]}\frac{\nabla f}{\left|\nabla f\right|}$ and $\theta^*_n$ which is the up to $ε$ approximation of $θ$ in $C^1_c$ with the same integral. We can do this hence $C^1_c$ is dense in $L 1$ . Now again substituting into the lemma:

$\lim\limits_{N\rightarrow\infty}\int\limits_\Omega f\text{div}\theta^*_n = \lim\limits_{N\rightarrow\infty}\int\limits_\Omega\mathbb I_{\left[-N,N\right]}\nabla f\cdot\frac{\nabla f}{\left|\nabla f\right|}= \lim\limits_{N\rightarrow\infty}\int\limits_{\mathbb I_{\left[-N,N\right]}} \nabla f\cdot\frac{\nabla f}{\left|\nabla f\right|} = \int\limits_\Omega\left|\nabla f\right|$

This means we^[who?] have a convergent sequence of $\int\limits_\Omega f\text{div}\mathbf\varphi$ that tends to $\int\limits_\Omega\left|\nabla f\right|$ as well as we know that $\int\limits_\Omega f\text{div}\mathbf\varphi \leq \int\limits_\Omega\left|\nabla f\right|$ . q.e.d.

It can be seen from the proof that the supremum is attained when

$\varphi\to \frac{-\nabla f}{\left|\nabla f\right|}.$

The function $f$ is said to be of bounded variation precisely if its total variation is finite.

Total variation of a measure

The total variation is a norm defined on the space of measures of bounded variation. The space of measures on a σ-algebra of sets is a Banach space, called the ca space, relative to this norm. It is contained in the larger Banach space, called the ba space, consisting of finitely additive (as opposed to countably additive) measures, also with the same norm. The distance function associated to the norm gives rise to the total variation distance between two measures μ and ν.

For finite measures on ℝ, the link between the total variation of a measure μ and the total variation of a function, as described above, goes as follows. Given μ, define a function $\scriptstyle\varphi\colon \mathbb{R}\to \mathbb{R}$ by

$\varphi(t) = \mu((-\infty,t])~.$

Then, the total variation of the signed measure μ is equal to the total variation, in the above sense, of the function φ. In general, the total variation of a signed measure can be defined using Jordan's decomposition theorem by

$\|\mu\|_{TV} = \mu_+(X) + \mu_-(X)~,$

for any signed measure μ on a measurable space $(X,Σ)$ .

Applications

Total variation can be seen as a non-negative real-valued functional defined on the space of real-valued functions (for the case of functions of one variable) or on the space of integrable functions (for the case of functions of several variables). As a functional, total variation finds applications in several branches of mathematics and engineering, like optimal control, numerical analysis, and calculus of variations, where the solution to a certain problem has to minimize its value. As an example, use of the total variation functional is common in the following two kind of problems

Numerical analysis of differential equations: it is the science of finding approximate solutions to differential equations. Applications of total variation to this problems are detailed in the article "total variation diminishing"

Image denoising: in image processing, denoising is a collection of methods used to reduce the noise in an image reconstructed from data obtained by electronic means, for example data transmission or sensing. Total variation denoising is the name for the application of total variation to image noise reduction; further details can be found in the paper (Caselles, Chambolle & Novaga 2007).

Notes

^ According to Golubov & Vitushkin (2001).

Bibliography

Arzelà, Cesare (7 maggio 1905), "Sulle funzioni di due variabili a variazione limitata (On functions of two variables of bounded variation)" (in Italian), Rendiconto delle sessioni della Reale Accademia delle scienze dell'Istituto di Bologna, Nuova serie IX (4): 100–107, JFM 36.0491.02, archived from the original on 2007-08-07, http://www.archive.org/stream/rendicontodelle04bologoog#page/n121/mode/2up .
Jordan, Camille (1881), "Sur la série de Fourier" (in French), Comptes rendus hebdomadaires des séances de l'Académie des sciences 92: 228–230, JFM 13.0184.01, http://gallica.bnf.fr/ark:/12148/bpt6k7351t/f227 (available at Gallica). This is, according to Boris Golubov, the first paper on functions of bounded variation.
Hahn, Hans (1921) (in German), Theorie der reellen Funktionen, Berlin: Springer Verlag, pp. VII+600, JFM 48.0261.09, archived from the original on 2008-12-31, http://www.archive.org/details/theoriederreelle01hahnuoft .
Vitali, Giuseppe (1908) [17 dicembre 1907], "Sui gruppi di punti e sulle funzioni di variabili reali (On groups of points and functions of real variables)" (in Italian), Atti dell'Accademia delle Scienze di Torino 43: 75–92, JFM 39.0101.05, archived from the original on 2009-03-31, http://www.archive.org/stream/attidellarealeac43real#page/228/mode/2up . The paper containing the first proof of Vitali covering theorem.

References

Adams, C. Raymond; Clarkson, James A. (1933), "On definitions of bounded variation for functions of two variables", Transactions of the American Mathematical Society 35: 824–854, doi:10.1090/S0002-9947-1933-1501718-2, JFM 59.0285.01, MR 1501718, Zbl 0008.00602, http://www.ams.org/journals/tran/1933-035-04/S0002-9947-1933-1501718-2/home.html .
Cesari, Lamberto (1936), "Sulle funzioni a variazione limitata (On the functions of bounded variation)" (in Italian), Annali della Scuola Normale Superiore, II 5 (3-4): 299–313, JFM 62.0247.03, MR 1556778, Zbl 0014.29605, http://www.numdam.org/item?id=ASNSP_1936_2_5_3-4_299_0 . Available at Numdam.
Saks, Stanisław (1937), Theory of the Integral, Monografie Matematyczne, 7 (2nd ed.), Warszawa-Lwów: G.E. Stechert & Co., pp. VI+347, JFM 63.0183.05, MR 1556778, Zbl 0017.30004, http://matwbn.icm.edu.pl/kstresc.php?tom=7&wyd=10&jez=pl . (available at the Polish Virtual Library of Science). English translation from the original French by Laurence Chisholm Young, with two additional notes by Stefan Banach.
Rudin, Walter (1966), Real and Complex Analysis, McGraw-Hill Series in Higher Mathematics (1st ed.), New York: McGraw-Hill, pp. xi+412, MR 210528, Zbl 0142.01701 .

External links

Theory

One variable

Golubov, Boris I.; Vitushkin, Anatolii G. (2001), "Variation of a function", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/V/v096110.htm
"Total variation" on Planetmath.

Several variables

Final comments of Anatolii Georgievich Vitushkin on the paper Golubov, Boris I.; Vitushkin, Anatolii G. (2001), "Variation of a function", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/V/v096110.htm .
Golubov, Boris I. (2001), "Arzelà variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/a/a013470.htm .
Golubov, Boris I. (2001), "Fréchet variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/f/f041400.htm .
Golubov, Boris I. (2001), "Hardy variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/h/h046400.htm .
Golubov, Boris I. (2001), "Pierpont variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/p/p072720.htm .
Golubov, Boris I. (2001), "Vitali variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/h/h046400.htm .
Golubov, Boris I. (2001), "Tonelli plane variation", in Hazewinkel, Michiel, Encyclopaedia of Mathematics, Springer, ISBN 978-1556080104, http://eom.springer.de/t/t092990.htm .

Measure theory

Rowland, Todd, "Total Variation" from MathWorld..
Jordan decomposition at PlanetMath..

Probability theory

M. Denuit and S. Van Bellegem "On the stop-loss and total variation distances between random sums", discussion paper 0034 of the Statistic Institute of the "Université Catholique de Louvain".

Applications

Caselles, Vicent; Chambolle, Antonin; Novaga, Matteo (2007), The discontinuity set of solutions of the TV denoising problem and some extensions, SIAM, Multiscale Modeling and Simulation, vol. 6 n. 3,, http://cvgmt.sns.it/papers/caschanov07/ (a work dealing with total variation application in denoising problems for image processing).

Tony F. Chan and Jackie (Jianhong) Shen (2005), Image Processing and Analysis - Variational, PDE, Wavelet, and Stochastic Methods, SIAM, ISBN 0-89871-589-X (with in-depth coverage and extensive applications of Total Variations in modern image processing, as started by Rudin, Osher, and Fatemi).

Categories:

Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

Total variation diminishing — In systems described by partial differential equations, such as the following hyperbolic advection equation,:frac{part u}{part t} + afrac{part u}{part x} = 0, the total variation (TV) is given by,:TV = int left| frac{part u}{part x} ight| dx ,and … Wikipedia
Total organic carbon — (TOC) is the amount of carbon bound in an organic compound and is often used as a non specific indicator of water quality or cleanliness of pharmaceutical manufacturing equipment. A typical analysis for TOC measures both the total carbon present… … Wikipedia
Total Quality Management — (TQM) is a business management strategy aimed at embedding awareness of quality in all organizational processes. TQM has been widely used in manufacturing, education, call centers, government, and service industries, as well as NASA space and… … Wikipedia
Total dissolved solids — (often abbreviated TDS) is an expression for the combined content of all inorganic and organic substances contained in a liquid which are present in a molecular, ionized or micro granular (colloidal sol) suspended form. Generally the operational… … Wikipedia
Total indicator variation — Total indicator variation. См. Общее изменение индикатора. (Источник: «Металлы и сплавы. Справочник.» Под редакцией Ю.П. Солнцева; НПО Профессионал , НПО Мир и семья ; Санкт Петербург, 2003 г.) … Словарь металлургических терминов
variation — first order lateral force variation first order radial force variation lateral force variation peak to peak lateral force variation peak to peak radial force variation radial force variation total lateral force variation total radial force… … Mechanics glossary
Variation de la pression atmosphérique avec l'altitude — Formule du nivellement barométrique La formule du nivellement barométrique décrit la répartition verticale des molécules de gaz dans l atmosphère de la Terre, et donc la variation de la pression en fonction de l altitude. On parle ainsi d un… … Wikipédia en Français
Total fertility rate — Not to be confused with birth rate. A world map showing countries by fertility rate, 2005 2010 … Wikipedia
Total Harmonic Distorsion — Taux de distorsion harmonique Le taux de distorsion harmonique (abrégé THD, total harmonic distortion en anglais) représente la variation d un signal par rapport à une référence. Sommaire 1 Théorie 1.1 THD F 1.2 THD G 2 … Wikipédia en Français
Total Harmonic Distortion — Taux de distorsion harmonique Le taux de distorsion harmonique (abrégé THD, total harmonic distortion en anglais) représente la variation d un signal par rapport à une référence. Sommaire 1 Théorie 1.1 THD F 1.2 THD G 2 … Wikipédia en Français

Academic Dictionaries and Encyclopedias

Total variation

Contents

Historical notice

Definitions

Total variation for functions of one real variable