|
Cruise Scientific Visual Statistics Studio Measurement and Scaling |
Using matrix algebra, variance can be measured by computing all possible differences between elements of a population. Consider that a major difference of a vector results in a skew-symmetric matrix with elements describing all possible differences between its values.
Skew symmetric matrices are redundant, as the negative values can be guessed from the symmetric positive values. Removing this redundancy, a skew-asymmetric matrix can be defined as
Variance can be obtained from the sum of the squared elements of skew-asymmetric matrices divided by the square of their order. Thus
and for the example (12 + 22 +12 + 32 + 22 + 12 ) / 42 = 1.25.
The above definition of variance in terms of differences contained by the data does not involve the arithmetic mean. It seems plausible to assume that the information contained in the above matrix could have been also obtained from a matrix of all possible differences between the data elements and a scalar vector M
which might be as well written as
since the variance can be computed either as
(for the example as equal to 1.25) or as
For the example
Thus, using the matrix algebra notation, variance can be computed as an index of differences between elements of a variable
or, by using algebraic notation, as an average of the squared differences between values of a variable and their mean
These observations facilitate the understanding the definition of variance as an index of the magnitude of differences between the values of a variable.
Transformation of the vector x into its adjacent implicatic matrix as
and subtraction of the transpose of this matrix from itself
results in the same skew symmetric matrix as that of the major difference of the vector x.
Transformation of the above matrix from its skew symmetric to the skew asymmetric form,
can provide information about the number of bits contained within the columns of the data vector, as defined within the mathematical theory of information.
The concept of variance in terms of all possible differences between values of a variable was introduced by von Andrae (1872) and Helmert (1876) in a series of articles to Astronomische Nachtrichten and the convention to use the Greek lowercase character σ for the standard deviation was coined by Karl Pearson in a series of articles published in Philosophical Transactions and Biometrika between 1896 and 1906. Ronald Fisher can be credited with popularization of degrees of freedom and using the expression variance in lieu of that of variability, but not with the analytical conceptualization of variante eventum. That was done nearly a century earlier within the framework of astronomical observations.
Using all possible differences between values of a
variable as a foundation of statistical theory was contemplated by ,
as
![]()
For the discontinuous infinite case, the above equation can be written as
![]()
and for the finite case as

where the summed term in the above equation is a vector of all possible
differences between elements of variable x. Pointing out that the value of the u
coefficient is dependent on the spread of the variate-values among
themselves and not on the deviations from some central value, Kendall
(1943, p.47) shows that u
= 2σ
,
concludes that the initial defining formula is nothing but twice the
variance, and abandons the idea. One can only wonder which direction
statistics could have taken if
The unbiased variance can be expressed in the matrix algebra notation as
and in the algebraic notation as