Canonical analysis is an extension of multiple regression analysis from one criterion variable to a set of criterion variables. Using multiple regression, we analyzed relationship of a set of predictors to a single criterion variable
|
|
|
with the correlation supermatrix corresponding to the above data matrix
|
|
|
I canonical analysis we analyze relationship of a set of predictor variables to a set of criterion variables
|
|
|
With the correlation supermatrix corresponding to the above data matrix
|
|
|
The above supermatrix consists of four submatrices:
the submatrix of inter-correlations between predictor variables
,
the submatrices of the cross-correlations between predictor and criterion
variables
and
its transpose
,
and the submatrix of inter-correlations between criterion variables
.
The fundamental equation of the multiple regression analysis is
|
|
|
The fundamental equation of the canonical analysis is
|
|
|
While in multiple regression analysis the expression on the left side of its fundamental equation is a scalar number, the coefficient of multiple determination, R2, the expression on the left side of the fundamental equation of canonical analysis is an asymmetric matrix M. As the extant algorithms for eigenanalysis can extract eigenvalues only from symmetric matrices, the canonical matrix M is typically defined as
where A, a symmetric matrix, equals
and B, also a symmetric matrix, equals
A typical algorithm for eigenanalysis of asymmetric canonical matrices extracts eigenvalues from both A and B matrices separately, inverts the A matrix, and recombines the extracted eigenvalues. Next, it extracts and normalizes the right eigenvectors, D, i.e., the eigenvectors corresponding to the criterion set of variables. Imposing restrictions on the structural solution as to conform to the equation
normalizes the right eigenvectors D. The left eigenvectors, C, are then obtained from the equation
The matrices of standard canonical weights C and D are analogous to a vector of standard partial regression coefficients beta or to the matrix of factor score coefficients, B. Thus the notation designating the right and left sets of standard canonical weights as C and D observes the alphabetic progression B, C, and D.
As in the principal components analysis where the expression
makes the data vectors orthogonal, the definitions of the canonical weights within the canonical analysis as
and
results in two sets of mutually orthogonal canonical variates U and V.
At this point, let us introduce a numerical example based on a hypothetical data matrix presented in the following table.
|
|
|
As a first step, let us standardize the above matrix
and refer all subsequent discussion to this standardized data matrix. To
simplify the notation, instead of using the "z form" for the standard
scores, let us use the "obtained scores" form when referring to
standard scores. The
thus becomes X,
becomes Y, etc., as shown in the following
table.
|
|
|
The matrix of Pearson's product-moment correlations corresponding to the above data is
|
|
|
The canonical analysis commences by computing A and B components of the canonical matrix M. For the example, the matrix A equals
and the matrix B equals
The matrix M can be computed as
which equals
For the example, the matrix M is markedly asymmetric. There is another canonical matrix, N, defined as
Although matrices M and N are different, their eigenvalues are identical. For our example the matrix N happens to be close to symmetric, as
Thus, the eigenvalues of the matrix N can be approximated as
The above equation, for the example, equals
|
|
|
Computation of the determinant results in quadratic equation
|
|
|
which can be solved by the quadratic formula as
|
|
|
where
|
|
|
and
|
|
|
The canonical correlations
and
are just the square roots of their
corresponding eigenvalues
and
.
Thus, for the example, the first canonical correlation
equals .99 and the second canonical
correlation
equals .22. The correct values of the
eigenvalues obtained from the asymmetric eigenanalysis are .99016 and .04734
with the corresponding canonical correlations equal to .995 and .218.
Summarized, the results obtained so far can be presented as in a typical output from a program for canonical analysis, are presented in the following table.
|
|
|
The canonical correlations in the above table are
analogous to multiple R of the regression analysis. While the
indicates the predictability of the criterion
variable from the predictor set of variables, the canonical correlations
indicate the predictability of each of the extracted component of the criterion
set of variables from the corresponding components, extracted from the
predictor set of variables. Eigenvalues of the canonical solution indicate the
proportion of variance in the predictor set shared with the criterion set of
variables on every canonical variate (dimension) extracted. In general, there
will be as many canonical variates (dimensions) extracted, as there are
variables in the smaller set of either predictor or criterion variables.
The significance of the canonical correlations can be tested by Wilk's lambda
Wilk's lambda is computed as a continued product of coefficients of alienation, i.e., for our example
![]()
and
![]()
The comparison of the obtained and tabulated values for the chi-square indicates that only the first canonical variate is significant.
Following the extraction of eigenvalues, the characteristic equation
![]()
will allow us to solve for the characteristic
vectors (eigenvectors) associated with each characteristic root (eigenvalue).
Thus, for the first eigenvalue (
)
the characteristic equation can be written for the example as
![]()
Subtracting the value of the first eigenvalue from the elements located in the principal diagonal of the matrix M
![]()
and postmultiplying with the vector c
![]()
![]()
allows us to obtain the structural equation for the first component of the predictor set of variables. It should not matter whether you use the top or the bottom equation from the above set of two equations to compute the eigenvectors. Thus, using the top equation
![]()
the homogenous equation relating the
and
coefficients was obtained, as listed below
![]()
The first eigenvector thus can be written in a structural form as
![]()
Using the bottom equation, the
should be identical to the value obtained from
the equation on the top
![]()
and to the homogenous equation relating the
and
coefficients as
![]()
The structural form of the eigenvector is thus confirmed to be
![]()
Since this is a homogeneous set of equations, the C matrices will initially contain a universal set of weights, which as in factor analysis, can be normalized.
The predictor eigenvectors of canonical analysis are analogous to vectors of beta weights of the regression analysis. In regression analysis we can write Y = bX. In canonical analysis the analogous equation is U = cX. If normalized, the variance of the first predictor eigenvector should be unity. Thus
![]()
or alternatively, realizing that
,
as
![]()
However, this is obviously not true of our non-standard eigenvector, since
![]()
In general, the ipsative products of non-standard
canonical eigenvectors will be equal to some number
,
as
![]()
If normalized, theta should equal 1, as
![]()
Forming a ratio of the above both equations as
![]()
the normalizing constant k can be obtained as
![]()
For the example,
![]()
Normalizing the structural solution for the first canonical variate as
![]()
gives us a vector of standard canonical weights
![]()
which is identical to the column of canonical weights for the first predictor variate, obtained from a program for canonical analysis, and listed further in our discussion.