PCA / FA example 4: Davis, and almost everyone else

I would like to revisit the work we did in Davis (example 4). For one thing, I did a lot of calculations with that example, and despite the compare-and-contrast posts towards the end, I fear it may be difficult to sort out what I finally came to.

In addition, my notation has settled down a bit since then, and I would like to recast the work using my current notation.

The original (“raw”) data for example 4 was (p. 502, and columns are variables):

X_r = \left(\begin{array}{lll} 4 & 27 & 18 \\ 12 & 25 & 12 \\ 10 & 23 & 16 \\ 14 & 21 & 14\end{array}\right)
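Since everything that follows uses the column-centered version of this data, here is a minimal NumPy sketch (my own, not Davis') showing that subtracting the column means of X_r gives the centered X that appears in the posts below:

import numpy as np

# raw Davis data, columns are variables
Xr = np.array([[ 4., 27., 18.],
               [12., 25., 12.],
               [10., 23., 16.],
               [14., 21., 14.]])

X = Xr - Xr.mean(axis=0)   # subtract each column's mean
print(X)                   # matches the centered X used below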

PCA / FA example 4: Davis. Summary.

Considering how much I learned from Davis’ little example, I almost hate to summarize it. The following just does not do justice to it. We’ll see if the passage of time – coming back to Davis with fresh eyes after doing the next example – improves the summary.

OTOH, an awful lot of what I got out of Davis came from bringing linear algebra to it, and just plain mucking about with the matrices. And I discovered, having picked up the two chemistry books since I drafted this post, that they made a lot more sense than when I first looked at them. I have learned a lot from working with Davis’ example.

Here’s what I have. My own strong preference (though I’d call it a personal choice) is to use the SVD of the data matrix X:

X = u\ w\ v^T

and compute the A’s and S’s (R-mode and Q-mode loadings and scores):

A^R = v\ w^T.

A^Q = u\ w.

S^R = X\ A^R = u\ w\ w^T = A^Q\ w^T.

S^Q = X^T\ A^Q = v\ w^T\ w = A^R\ w.

Those equations are worth having as a conceptual basis even if one chooses to do eigendecompositions of X^T\ X or X\ X^T instead of the SVD.
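For anyone who wants to follow along numerically, here is a minimal sketch of that recipe in NumPy (my tool here, not necessarily yours), applied to Davis’ centered data. With the thin SVD, w is square, so w^T = w, but I keep the transposes to match the formulas above.

import numpy as np

X = np.array([[-6.,  3.,  3.],
              [ 2.,  1., -3.],
              [ 0., -1.,  1.],
              [ 4., -3., -1.]])

u, s, vT = np.linalg.svd(X, full_matrices=False)   # X = u w v^T (thin SVD)
w = np.diag(s)
v = vT.T

Ar = v @ w.T            # A^R = v w^T   (R-mode loadings)
Aq = u @ w              # A^Q = u w     (Q-mode loadings)
Sr = X @ Ar             # S^R = X A^R   (R-mode scores)
Sq = X.T @ Aq           # S^Q = X^T A^Q (Q-mode scores)

# the identities quoted above
print(np.allclose(Sr, Aq @ w.T))   # True: S^R = A^Q w^T
print(np.allclose(Sq, Ar @ w))     # True: S^Q = A^R w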

PCA / FA example 4: Davis. Davis & Jolliffe.

What tools did Jolliffe give us (that I could confirm)?

  1. Z = X A, with A orthogonal and X the old data (variables in columns)
  2. serious rounding
  3. plot data wrt first two PCs
  4. how many variables to keep?

But Jolliffe would have used the correlation matrix. Before we do anything else, let’s get the correlation matrix for Davis’ example. Recall the centered data X:

X = \left(\begin{array}{lll} -6 & 3 & 3 \\ 2 & 1 & -3 \\ 0 & -1 & 1 \\ 4 & -3 & -1\end{array}\right)

I compute its correlation matrix:

\left(\begin{array}{lll} 1. & -0.83666 & -0.83666 \\ -0.83666 & 1. & 0.4 \\ -0.83666 & 0.4 & 1.\end{array}\right)

Now get an eigendecomposition of the correlation matrix; I called the eigenvalues \lambda and the eigenvector matrix A. Here’s A:

A = \left(\begin{array}{lll} -0.645497 & 0 & -0.763763 \\ 0.540062 & -0.707107 & -0.456435 \\ 0.540062 & 0.707107 & -0.456435\end{array}\right)

If we compute A^T\ A and get an identity matrix, then A is orthogonal. (It is.)
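Here is the same computation as a NumPy sketch, with one caveat: np.linalg.eigh returns the eigenvalues in ascending order, so the columns of its eigenvector matrix may come out in a different order, and with different signs, than the A printed above.

import numpy as np

X = np.array([[-6.,  3.,  3.],
              [ 2.,  1., -3.],
              [ 0., -1.,  1.],
              [ 4., -3., -1.]])

corr = np.corrcoef(X, rowvar=False)   # 3x3 correlation matrix of the columns
lam, A = np.linalg.eigh(corr)         # eigenvalues (ascending) and eigenvectors

print(np.round(corr, 5))
print(np.round(lam, 5))
print(np.allclose(A.T @ A, np.eye(3)))   # True: A is orthogonal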

PCA / FA example 4: Davis. Reciprocal basis 4.

(This has nothing to do with the covariance matrix of X; we’re back with A^R from the SVD of X.)

In the course of computing the reciprocal basis for my cut-down A^R

\left(\begin{array}{ll} 7.48331 & 0. \\ -3.74166 & -2.44949 \\ -3.74166 & 2.44949\end{array}\right)

I came up with the following matrix:

\beta = \left(\begin{array}{ll} 0.0445435 & 0 \\ -0.0890871 & -0.204124 \\ -0.0890871 & 0.204124\end{array}\right)

Now, \beta is very well behaved. Its two columns are orthogonal to each other.
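A one-line NumPy check of that claim, using the \beta just printed:

import numpy as np

beta = np.array([[ 0.0445435,  0.      ],
                 [-0.0890871, -0.204124],
                 [-0.0890871,  0.204124]])

print(np.dot(beta[:, 0], beta[:, 1]))   # essentially 0: the two columns are orthogonal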

PCA / FA example 4: Davis. Davis & Harman 3.

Hey, since we have the reciprocal basis, we can project onto it to get the components wrt the A^R basis. After all that work to show that the S^R are the components wrt the reciprocal basis, we ought to find the components wrt the A^R basis. And now we know that that’s just a projection onto the reciprocal basis: we want to compute X B.

Recall X:

X = \left(\begin{array}{lll} -6 & 3 & 3 \\ 2 & 1 & -3 \\ 0 & -1 & 1 \\ 4 & -3 & -1\end{array}\right)

Recall B:

B = \left(\begin{array}{ll} 0.0890871 & 0. \\ -0.0445435 & -0.204124 \\ -0.0445435 & 0.204124\end{array}\right)

The product is:

X\ B = \left(\begin{array}{ll} -0.801784 & 0 \\ 0.267261 & -0.816497 \\ 0 & 0.408248 \\ 0.534522 & 0.408248\end{array}\right)

What are the column variances?

\{0.333333,0.333333\}

Does that surprise you?
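For the record, here is a NumPy sketch of the computation. I’m assuming the standard formula B = A (A^T A)^{-1} for the reciprocal basis of the cut-down A^R, and it does reproduce the B quoted above.

import numpy as np

X = np.array([[-6.,  3.,  3.],
              [ 2.,  1., -3.],
              [ 0., -1.,  1.],
              [ 4., -3., -1.]])

A = np.array([[ 7.48331,  0.     ],     # the cut-down A^R
              [-3.74166, -2.44949],
              [-3.74166,  2.44949]])

B = A @ np.linalg.inv(A.T @ A)          # reciprocal basis: B^T A = I

XB = X @ B                              # components wrt the A^R basis
print(np.round(XB, 6))
print(XB.var(axis=0, ddof=1))           # column variances: 1/3 and 1/3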

PCA / FA example 4: Davis. Reciprocal basis 2 & 3.

The reciprocal basis for A^R using explicit bases.

Here I go again, realizing that I’m being sloppy. I call

\{2,1,-3\}

the second data vector, but of course those are the components of the second data vector. A lot of us blur the distinction between a vector and its components much of the time. So long as we work only with components, this isn’t an issue, but we’re about to write vectors. I wouldn’t go so far as to say that the whole point of matrix algebra is to let us blur that distinction, but it’s certainly a major reason why we do matrix algebra. For now, though, let’s look at the linear algebra, distinguishing between vectors and their components.

I need a name for the second data vector; let’s call it s. We will write it two ways, with respect to two bases, and show that the two ways are equivalent.
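To preview the punch line in NumPy: the components of s are (2, 1, -3) wrt the standard basis, and (0.267261, -0.816497), the second row of X B from the previous post, wrt the two columns of the cut-down A^R. Both describe the same vector.

import numpy as np

A = np.array([[ 7.48331,  0.     ],     # the cut-down A^R: basis vectors as columns
              [-3.74166, -2.44949],
              [-3.74166,  2.44949]])

s_standard = np.array([2., 1., -3.])         # components of s wrt the standard basis
s_Ar = np.array([0.267261, -0.816497])       # components of s wrt the A^R basis

print(np.allclose(A @ s_Ar, s_standard, atol=1e-4))   # True: same vector, two bases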

PCA / FA example 4: Davis. Scores & reciprocal basis 1.

The reciprocal basis for A^R, using matrix multiplication.

I keep emphasizing that A^Q is the new data wrt the orthogonal eigenvector matrix; and I hope I’ve said that the R-mode scores S^R = X\ A^R are not the new data wrt the weighted eigenvector matrix A^R. In a very real sense, the R-mode scores S^R do not correspond to the R-mode loadings A^R. This is important enough that I want to work out the linear algebra. In fact, I’ll work it out thrice. (And I’m going to do it differently from the original demonstration that A^Q is the new data wrt v.)

Of course, people will expect us to compute the scores S^R, and I’ll be perfectly happy to. I just won’t let anyone tell me they correspond to the A^R.

First off, we have been viewing the original data as vectors wrt an orthonormal basis. Recall the design matrix:

\left(\begin{array}{lll} -6 & 3 & 3 \\ 2 & 1 & -3 \\ 0 & -1 & 1 \\ 4 & -3 & -1\end{array}\right)

Let’s take just the second observation:

X_2 = (2,\ 1,\ -3)

Those are the components of a vector.
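Before going further, here is a small NumPy check of the first claim above, that A^Q = u w really is the data wrt the orthonormal eigenvector basis v, i.e. X v = A^Q:

import numpy as np

X = np.array([[-6.,  3.,  3.],
              [ 2.,  1., -3.],
              [ 0., -1.,  1.],
              [ 4., -3., -1.]])

u, s, vT = np.linalg.svd(X, full_matrices=False)
Aq = u @ np.diag(s)                    # A^Q = u w

print(np.allclose(X @ vT.T, Aq))       # True: X v = u w = A^Q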

PCA / FA example 4: Davis. Davis & Harman 2.

What about redistributing the variance?

We learned from Harman that if we use the orthogonal eigenvector matrix v to compute new data, the result has redistributed the original variance according to the eigenvalues; and we also learned that if we use the weighted eigenvector matrix instead, we get new variables of unit variance.

(I had thought this required us to start with the correlation matrix; it does not.)

Let’s see this again for Davis’ centered data X:

\left(\begin{array}{lll} -6&3&3\\ 2&1&-3\\ 0&-1&1\\ 4&-3&-1\end{array}\right)

We have variances…

\{\frac{56}{3},\ \frac{20}{3},\ \frac{20}{3}\}

or…

\{18.6667,\ 6.66667,\ 6.66667\}

and the sum of the variances is…

32.
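Here is that bookkeeping as a NumPy sketch: the column variances of X sum to 32, and the new data X v has column variances equal to the eigenvalues of the covariance matrix (28, 4, and 0), which still sum to 32.

import numpy as np

X = np.array([[-6.,  3.,  3.],
              [ 2.,  1., -3.],
              [ 0., -1.,  1.],
              [ 4., -3., -1.]])

print(X.var(axis=0, ddof=1))           # 18.6667, 6.66667, 6.66667
print(X.var(axis=0, ddof=1).sum())     # 32

u, s, vT = np.linalg.svd(X, full_matrices=False)
Z = X @ vT.T                           # new data wrt the orthogonal eigenvector matrix v
print(Z.var(axis=0, ddof=1))           # 28, 4, 0: the eigenvalues of the covariance matrix
print(Z.var(axis=0, ddof=1).sum())     # still 32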


PCA / FA example 4: Davis. Davis & Harman 1.

Let us now recall what we saw Harman do. Does he have anything to add to what we saw Davis do? If we’re building a toolbox for PCA / FA, we need to see whether Harman gave us any additional tools.

Here’s what Harman did, or got us to do:

  • a plot of old variables in terms of the first two new ones
  • redistribute the variance of the original variables
  • his model, written Z = A F, was (old data, variables in rows) = (some eigenvector matrix) × (new data)

We should remind ourselves, however, that Harman used an eigendecomposition of the correlation matrix; and the eigenvectors of the correlation matrix are not linearly related to the eigenvectors of the covariance matrix (or, equivalently, of X^T\ X). Having said that, I’m going to go ahead and apply Harman’s tools to the Davis results. I will work from the correlation matrix when we get to Jolliffe.

What exactly was that graph of Harman’s? He took the first two columns of the weighted eigenvector matrix, then plotted each row as a point. He had 5 original variables. His plot shows that the old 1st and 3rd variables, for example, are defined very similarly in terms of the first two new variables. Let me emphasize that this is a description of how the variables are defined, not a display of the resulting data. Simple counting can help with the distinction, e.g. 3 variables but 4 observations.
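For concreteness, here is a small matplotlib sketch of that kind of plot, applied not to Harman’s five variables but to Davis’ cut-down A^R (three old variables, two new ones); the axis labels are my own wording.

import numpy as np
import matplotlib.pyplot as plt

Ar = np.array([[ 7.48331,  0.     ],   # the cut-down A^R: one row per old variable
               [-3.74166, -2.44949],
               [-3.74166,  2.44949]])

plt.scatter(Ar[:, 0], Ar[:, 1])
for i, (x, y) in enumerate(Ar, start=1):
    plt.annotate(f"old variable {i}", (x, y))
plt.xlabel("first new variable")
plt.ylabel("second new variable")
plt.show()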


PCA / FA example 4: Davis review (3)

Let’s talk about centering the data. Recall the questions:

  • is there any chance that we can always center both the columns and the rows?
  • is there any chance that we can always standardize both the columns and the rows?
  • if we can’t have our cake and eat it too, should we give up the duality between R-mode and Q-mode?
  • what if rows of the data matrix add up to 1 or 100 (i.e. the variables are percentages)?

(My third question is poorly phrased. We always have the duality between A’s and S’s; what I feared losing was the idea that Q-mode is just R-mode applied to the transpose X^T; but what if X doesn’t have row-centered data?)

The third question is also a red herring. As for the first question: we can always make both the columns and the rows add up to zero. Proving it isn’t too hard; some convenient notation might simplify things. If the data is

x_{ij}

then

x_{\cdot j} and x_{i\cdot}

are good symbols for the column means and row means, respectively. The grand mean (the mean of all the matrix values = mean of column means = mean of row means) would be denoted

x_{\cdot\cdot}
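With that notation in hand, one way to see the claim (and presumably where the notation is headed) is the usual double-centering x_{ij} - x_{i\cdot} - x_{\cdot j} + x_{\cdot\cdot}; here’s a NumPy sketch on the raw Davis data from the top of the page.

import numpy as np

Xr = np.array([[ 4., 27., 18.],
               [12., 25., 12.],
               [10., 23., 16.],
               [14., 21., 14.]])

row_means = Xr.mean(axis=1, keepdims=True)   # x_i.
col_means = Xr.mean(axis=0, keepdims=True)   # x_.j
grand = Xr.mean()                            # x_..

Xc = Xr - row_means - col_means + grand      # double-centered data

print(np.allclose(Xc.sum(axis=0), 0))        # True: every column sums to zero
print(np.allclose(Xc.sum(axis=1), 0))        # True: every row sums to zero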
