Exercise 10.6

(a)

When we say that a variable 'explains 10% of the variation', we are talking about the variable's capacity to distinguish different observations.

Imagine that you want to move out of your parents house and you're trying to buy a new house. You go to the real estate agency and they show you a set of houses that you could like. Since we live in a capitalist world, the first thing that you'll try to do is to determine the price of each house. Imagine that each house is described by it's location, condition and area. For example, house H1 is in location A, its condition is 'Good', and it has 50 m2 (H1=(A,'Good',50)). The remaining ones can be described like this: H2=(A,'Bad',500) and H3=(A,'Good',5000).

In this case, location would be a variable that doesn't allow us to distinguish the houses. All houses have the same exact location, so that it's not possibile to distinguish, for example, H1 from H2 just based on the location. Thus, the location can't explain why prices are different. This means that its explanatory power is reduced (it doesn't explains the variation that we have between the different observations).

In contrast, since all the houses have different areas, area could be a good explanatory variable. If someone asks you which house has 50 m2, you can confidently say 'H1!'. This means that area gives you enough information to distinguish the observations. Accordingly, we can say that it is able to explain the variation.

To conclude, just remember that the first principal component considered in this exercise explains 10% of the variation. There are still 90% of the variation to be explained or, by other words, 90% of the information is missing if we just consider the first principal component.

References

http://stats.stackexchange.com/questions/2691/making-sense-of-principal-component-analysis-eigenvectors-eigenvalues/140579#140579 (nice storytelling and figures to understand PCA)