- Khmaladze transformation
Consider empirical distribution function based on asequence of i.i.d random variables, .Suppose is a hypothetical distribution function ofeach . To test whether the choice of is correct or not, statisticians use the normalized difference,
:
This , as a random process in , is called empirical process. Various functional from are used as test statistics. The change of the time , , transforms to the so-called uniform empirical process . The latter is an empirical processes based on independent random variables , which are uniformly distributed on if s indeed have distribution function .
This fact, discovered and first utilized by Kolmogorov(1933), Wald and Wolfowitz(1936), Smirnov(1937), especially after Doob(1949) and Anderson and Darling(1952) led to the standard rule to choose test statistics from : these are statistics which possibly depends , in such a way that there exists another statistic from the uniform empirical process, such that . Examples are
:
and
:
For all such functionals, their "null" distribution (under the hypothetical ) does not depend on , and can be calculated once and then used to test any .
However, it is only rarely that one needs to test simple hypothesis, when a fixed as a hypothesis is given. Much more often, one needs to verify parametric hypothesis when hypothetical , depends on some parameter , which the hypothesis does not specify and which has to be estimated from the sample itself.
Although the estimators , most commonly converge to true value of , it was discovered, Kac, Kiefer and Wolfowitz(1955) and Gikhman(1954), that the parametric, or estimated, empirical process
:
differs significantly from and the time transformed process , has the distribution, and the limit distribution as , dependent on parametric form of on and, in general, within one
parametric family , on the value of .From mid-50's to the late-80's, much work was done to clarify the situation and understand the nature of the process .
In 1981, and then 1987 and 1993, E. V. Khmaladze suggested to replace the parametric empirical process by its martingale part only.
:
where is the compensator of . Then the following properties of were established:
* Although the form of , and therefore, of , depends on , as a function of both and , the limit distribution of the time transformed process
:
is that of standard Brownian motion on , i.e., isagain standard and independent of the choice of.
* The relationship between and and between their limits, is one to one, so that the statistical inference based on or on are equivalent, and in , nothing is lost compared to .
* The construction of innovation martingale could be carried over to the case of vector-valued , giving rise to the definition of the so-called scanning martingales in .
For a long time the transformation was, although known, still not used. Later, the work of researchers like R. Koenker, W. Stute, J. Bai, H. Koul, A. Koening, ... and others made it popular in econometrics and other fields of statistics.
Wikimedia Foundation. 2010.