Component retention in principal component analysis with application to cDNA microarray data

Richard Cangelosi^a (Author), Alain Goriely^a (Author)

^a University of Arizona

Research output: Contribution to journal, Article, Peer-reviewed

Open access

Abstract

Shannon entropy is used to provide an estimate of the number of interpretable components in a principal component analysis. In addition, several ad hoc stopping rules for dimension determination are reviewed, and a modification of the broken stick model is presented. The modification incorporates a test for the presence of an "effective degeneracy" among the subspaces spanned by the eigenvectors of the correlation matrix of the data set, and then allocates the total variance among those subspaces. A summary of the performance of the methods, applied both to published microarray data sets and to simulated data, is given.
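
For orientation, the two ideas named in the abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' implementation: it shows the classical (unmodified) broken stick rule, in which the k-th of p components is retained while its share of the total variance exceeds b_k = (1/p) * sum_{i=k}^{p} 1/i, and it assumes that an entropy-based component count can be taken as exp(H), where H is the Shannon entropy of the normalized eigenvalues. The abstract does not state the exact estimator or the details of the modified broken stick model, so the function names and the exp(H) choice are assumptions.

```python
import numpy as np


def broken_stick_thresholds(p):
    """Expected share of the k-th longest piece of a stick broken at random into p pieces."""
    inv = 1.0 / np.arange(1, p + 1)
    # Reverse cumulative sum gives sum_{i=k}^{p} 1/i for k = 1..p.
    return np.cumsum(inv[::-1])[::-1] / p


def broken_stick_count(eigvals):
    """Classical broken stick rule: count leading components whose variance
    share exceeds the corresponding broken stick threshold."""
    lam = np.sort(np.asarray(eigvals, dtype=float))[::-1]
    share = lam / lam.sum()
    exceeds = share > broken_stick_thresholds(len(lam))
    if exceeds.all():
        return len(lam)
    # Index of the first component that fails the test.
    return int(np.argmin(exceeds))


def entropy_effective_count(eigvals):
    """Assumed entropy-based estimate: exp(H) of the normalized eigenvalues."""
    lam = np.asarray(eigvals, dtype=float)
    prob = lam / lam.sum()
    prob = prob[prob > 0]  # treat 0 * log(0) as 0
    entropy = -np.sum(prob * np.log(prob))
    return float(np.exp(entropy))


if __name__ == "__main__":
    # Hypothetical eigenvalues of a 3 x 3 correlation matrix.
    eigvals = [2.5, 0.3, 0.2]
    print(broken_stick_count(eigvals))
    print(entropy_effective_count(eigvals))
```

Under the exp(H) assumption, equal eigenvalues give a count equal to the number of variables, and a single nonzero eigenvalue gives a count of one, so the value behaves like an effective number of components rather than a hard cutoff.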

Access to documents

10.1186/1745-6150-2-2