National Science & Technology Development Agency - Thailand; National Electronics & Computer Technology Center (NECTEC)
Type
Article; Early Access
Source Title
ARTIFICIAL INTELLIGENCE REVIEW
ISSN
0269-2821
Year
2023
Issue
1
Open Access
hybrid
Publisher
SPRINGER
DOI
10.1007/s10462-022-10297-z
Format
PDF
Abstract
Principal Component Analysis (PCA) is one of the most widely used data analysis methods in machine learning and AI. This manuscript focuses on the mathematical foundation of classical PCA and its application to a small-sample-size scenario and a large dataset in a high-dimensional space scenario. In particular, we discuss a simple method that can be used to approximate PCA in the latter case. This method can also help approximate kernel PCA or kernel PCA (KPCA) for a large-scale dataset. We hope this manuscript will give readers a solid foundation on PCA, approximate PCA, and approximate KPCA.