看看我的老贴子,呵呵。。
不过,这个我最终还是没有完全解决。
只能说,二者都是对数据协方差矩阵的逼近!(《实用多元统计分析》中说的。)
[此贴子已经被作者于2006-4-17 23:09:18编辑过]
就我个人目前的理解来看,两者似乎是恰恰方向相反,主成分分析得到的因子是每个原变量的线性组合,而因子分析却是把各个原变量分解为公共因子的线性组合,方向相反。另外主成分分析提出85%以上的贡献率的若干因子就可以了,而因子分析是对每个原变量都要解释.概括的说,前者是要从原变量众提炼出公共因子,而后者是已知公共因子的前提下对原变量的解释.不知道理解的对不对,请各位多指教.
[此贴子已经被作者于2006-4-18 9:52:59编辑过]
楼上推荐的的确是一份好东东,应该能澄清我们的疑惑,呵呵。
刚下了,多谢。。[em07]
[em10]
有空慢慢看
Best Practices in Exploratory Factor Analysis: Four Recommendations
For Getting the Most From Your Analysis
Anna B. Costello and Jason W. Osborne North Carolina State University
PCA ( principal components analysis) is the default method of extraction in many popular statistical software packages, including SPSS and SAS, which likely contributes to its popularity. However, PCA is not a true method of factor analysis and there is disagreement among statistical theorists about when it should be used, if at all. Some argue for severely restricted use of components analysis in favor of a true factor analysis method ( Bentler & Kano, 1990; Floyd & Widaman, 1995; Ford, MacCallum & Tait, 1986; Gorsuch, 1990; Loehlin, 1990; MacCallum & Tucker, 1991; Mulaik, 1990; Snook & Gorsuch, 1989; Widaman, 1990, 1993). Others disagree, and point out either that there is almost no difference between principal components and factor analysis, or that PCA is preferable ( Arrindell & van der Ende, 1985; Guadagnoli and Velicer, 1988; Schoenmann, 1990; Steiger, 1990; Velicer & Jackson, 1990). We suggest that factor analysis is preferable to principal components analysis. Components analysis is only a data reduction method. It became common decades ago when computers were slow and expensive to use; it was a quicker, cheaper alternative to factor analysis ( Gorsuch, 1990). It is computed without regard to any underlying structure caused by latent variables; components are calculated using all of the variance of the manifest variables, and all of that variance appears in the solution ( Ford et al., 1986). However, researchers rarely collect and analyze data without an a priori idea about how the variables are related ( Floyd & Widaman, 1995). The aim of factor analysis is to reveal any latent variables that cause the underlying factor structure; only shared variance appears in the solution. Principal components analysis does not discriminate between shared and unique variance. When the factors are uncorrelated and communalities are moderate it can produce inflated values of variance accounted for by the components ( Gorsuch, 1997; McArdle, 1990). Since factor analysis only analyzes shared variance, factor analysis should yield the same solution ( all other things being equal) while also avoiding the inflation of estimates of variance accounted for.
[此贴子已经被作者于2006-4-18 9:54:31编辑过]
Similarities
[此贴子已经被作者于2006-4-18 6:49:24编辑过]
Differences
Principal Component Analysis | Exploratory Factor Analysis |
Principal Components retained account for a maximal amount of variance of observed variables | Factors account for common variance in the data |
Analysis decomposes correlation matrix | Analysis decomposes adjusted correlation matrix |
Diagonals of the correlation matrix | Diagonals of correlation matrix adjusted with unique factors |
Minimizes sum of squared perpendicular distance to the component axis | Estimates factors which influence responses on observed variables |
Component scores are a linear combination of the observed variables weighted by eigenvectors | Observed variables are linear combinations of the underlying and unique factors |
[此贴子已经被作者于2006-4-18 6:48:32编辑过]
Methods of Multivariate Analysis
Second Edition
ALVIN C. RENCHER: Brigham Young University
13.8 THE RELATIONSHIP OF FACTOR ANALYSIS TO PRINCIPAL COMPONENT ANALYSIS
Both factor analysis and principal component analysis have the goal of reducing dimensionality. Because the objectives are similar, many authors discuss principalcomponent analysis as another type of factor analysis. This can be confusing, and we wish to underscore the distinguishing characteristics of the two techniques. Two of the differences between factor analysis and principal component analysis were mentioned in Section 13.1:
If finding and describing some underlying factors is the goal, factor analysis may prove more useful than principal components; we would prefer factor analysis if the factor model fits the data well and we like the interpretation of the rotated factors. On the other hand, if we wish to define a smaller number of variables for input into another analysis, we would ordinarily prefer principal components, although this can sometimes be accomplished with factor scores. Occasionally, principal components are interpretable, as in the size and shape components in Example 12.8.1.
[此贴子已经被作者于2006-4-18 9:11:59编辑过]
Why principal component analysis is not an appropriate feature extraction method for hyperspectral data
Cheriyadat, A. Bruce, L.M.
Dept. of Electr. & Comput. Eng., Mississippi State Univ., MS, USA;
This paper appears in: Geoscience and Remote Sensing Symposium, 2003. IGARSS '03. Proceedings. 2003 IEEE International
Publication Date: 21-25 July 2003
Volume: 6, On page(s): 3420- 3422 vol.6
ISSN:
ISBN: 0-7803-7929-2
Abstract
It is a popular practice in the remote sensing community to apply principal component analysis (PCA) on a high dimensional feature space to achieve dimensionality reduction. Typically, there are two primary goals for dimensionality reduction: (i) data compression and (ii) feature extraction for classification purposes. While PCA has been proven to be an optimal method for data compression, it is not necessarily an optimal method for feature extraction, particularly when the features are used in a supervised classifier. This paper addresses the issue of using PCA on hyperspectral data, specifically why PCA is not optimal for dimensionality reduction in target detection and classification applications. The authors provide theoretical and experimental analysis of PCA to demonstrate why and when PCA is not appropriate. There are variations of the Karhunen-Loeve transform that outperform PCA in a supervised classification scheme, and some of these alternative approaches are discussed in this paper.
印象中主成分分析是寻找椭圆主轴的过程,而因子分析首先得经过因子旋转,并且因子的个数和变量的个数不一定一致,而且主成分分析可以看作是因子分析的一个特例。说的不准确一点,这两种分析的核心思想都是降维。
[此贴子已经被作者于2006-4-18 14:30:57编辑过]
这东西竟又被翻了出来。
二者都是基本但十分重要的降维手段;
主成分分析其实是对数据做一个旋转,并不能严格表为一个数学模型;
主因子法则是一个模型,最简单的正交因子模型更是一种线性模型;
主因子法是主成分法的推广,但主成分法并不能被说成是主因子法的特例,即使在最接近之时二者仍有细微的差距。