What is PCA?

Original

不會停的蝸牛

2019-07-29 01:36

The figures are cited from the following article, which I recommend reading: A step by step explanation of Principal Component Analysis

PCA (Principal Component Analysis) is a dimensionality-reduction method.
It reduces the number of variables in a data set by using one or more components to represent the original data.

Principal components are constructed as linear combinations of the initial variables.

Geometrically speaking, principal components are new axes along which the projections of the data points are most spread out.

The more spread out the projections are, the more variance the axis carries and the more information it keeps. This is why PCA can reduce the dimensionality while preserving as much information as possible.
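To make the "most spread out projection" idea concrete, here is a minimal sketch using NumPy. The toy data set, the helper `projected_variance`, and the 1-degree angle grid are all assumptions made for illustration; they are not from the article cited above. The sketch projects 2-D points onto many candidate axes and checks which axis gives the largest variance.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2-D data, stretched roughly along the diagonal.
t = rng.normal(size=500)
X = np.column_stack([t, t + 0.3 * rng.normal(size=500)])
X = X - X.mean(axis=0)  # center the data


def projected_variance(X, angle):
    """Variance of the data projected onto the unit vector at `angle` (radians)."""
    direction = np.array([np.cos(angle), np.sin(angle)])
    return np.var(X @ direction, ddof=1)


# Try candidate axes from 0 to 180 degrees in 1-degree steps.
angles = np.linspace(0.0, np.pi, 181)
variances = np.array([projected_variance(X, a) for a in angles])
best_angle = angles[np.argmax(variances)]

print("most spread out axis (degrees):", np.degrees(best_angle))
print("variance along that axis:", variances.max())
print("variance along the x axis:", variances[0])
```

The axis found by this brute-force search should agree, up to the grid resolution, with the first principal component that PCA computes directly.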

Step 1: Standardization

This step transforms all the variables to the same scale. PCA is quite sensitive to the variances of the initial variables: without standardization, variables with larger ranges would dominate the components.
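A minimal sketch of this step with NumPy is shown below. The toy matrix `X` and the function name `standardize` are made up for illustration, and using the sample standard deviation (`ddof=1`) is one common choice, not the only one.

```python
import numpy as np

# Hypothetical toy data: 5 samples, 2 variables on very different scales.
X = np.array([
    [1.0, 200.0],
    [2.0, 240.0],
    [3.0, 310.0],
    [4.0, 390.0],
    [5.0, 460.0],
])


def standardize(X):
    """Center each column to mean 0 and scale it to unit standard deviation."""
    mean = X.mean(axis=0)
    std = X.std(axis=0, ddof=1)  # sample standard deviation (an assumption)
    return (X - mean) / std


Z = standardize(X)
print(Z.mean(axis=0))         # approximately 0 for every column
print(Z.std(axis=0, ddof=1))  # approximately 1 for every column
```

After this transformation both columns contribute on an equal footing, regardless of their original units.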

Step 2: Compute the Covariance Matrix

This matrix reflects the relationships between every pair of variables. Highly correlated variables carry redundant information.
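As a rough illustration, the covariance matrix of standardized data can be computed as below. The three synthetic variables are assumptions chosen so that `x2` is nearly a copy of `x1` (redundant) while `x3` is unrelated.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: x2 almost duplicates x1, x3 is independent noise.
x1 = rng.normal(size=200)
x2 = x1 + 0.1 * rng.normal(size=200)
x3 = rng.normal(size=200)
X = np.column_stack([x1, x2, x3])

# Standardize first (Step 1), then compute the covariance matrix.
Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
n = Z.shape[0]
C = Z.T @ Z / (n - 1)  # same result as np.cov(Z, rowvar=False)

print(np.round(C, 2))
```

The entry for `x1` and `x2` comes out close to 1, which signals redundancy, while the entries involving `x3` stay near 0. Because the data was standardized, the diagonal is all ones and the matrix is also the correlation matrix.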

Step 3: Compute the eigenvectors and eigenvalues of the covariance matrix

The eigenvectors of the covariance matrix give the directions of the principal components, the axes along which the data varies the most. Each eigenvalue is the amount of variance carried by the corresponding principal component.
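Here is a small sketch of this step using `np.linalg.eigh`, which is designed for symmetric matrices such as a covariance matrix. The 2x2 matrix `C` is a made-up example, not data from the article.

```python
import numpy as np

# Hypothetical covariance matrix of two standardized, correlated variables.
C = np.array([
    [1.0, 0.8],
    [0.8, 1.0],
])

# eigh returns eigenvalues in ascending order, with eigenvectors as columns.
eigenvalues, eigenvectors = np.linalg.eigh(C)

# Reorder so that the component with the most variance comes first.
order = np.argsort(eigenvalues)[::-1]
eigenvalues = eigenvalues[order]
eigenvectors = eigenvectors[:, order]

print(eigenvalues)         # variance carried by PC1, PC2
print(eigenvectors[:, 0])  # direction of PC1 (its sign is arbitrary)
```

For this matrix the eigenvalues are 1.8 and 0.2, and PC1 points along the diagonal, which matches the intuition that two positively correlated variables vary mostly together.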

Step 4: Keep p components

Rank the eigenvalues from highest to lowest. For example, PC1 may carry 95% of the variance and PC2 the remaining 5%. We can keep all the components, or keep only the top p and discard the less significant ones.
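One possible way to pick p is to keep the smallest number of components whose cumulative share of variance reaches a threshold. The sketch below assumes that rule; the helper names `explained_variance_ratio`, `choose_p` and `project`, the 90% threshold, and the toy numbers are all hypothetical and only mirror the 95% / 5% example above.

```python
import numpy as np


def explained_variance_ratio(eigenvalues):
    """Share of the total variance carried by each component, largest first."""
    eigenvalues = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]
    return eigenvalues / eigenvalues.sum()


def choose_p(eigenvalues, threshold=0.9):
    """Smallest p whose top-p components reach `threshold` of the variance."""
    cumulative = np.cumsum(explained_variance_ratio(eigenvalues))
    p = int(np.searchsorted(cumulative, threshold)) + 1
    return min(p, len(cumulative))  # guard against rounding at threshold=1.0


def project(Z, eigenvectors, p):
    """Project standardized data onto the first p eigenvectors (columns)."""
    return Z @ eigenvectors[:, :p]


# Made-up eigenvalues matching the 95% / 5% example.
eigenvalues = np.array([1.9, 0.1])
ratios = explained_variance_ratio(eigenvalues)
p = choose_p(eigenvalues, threshold=0.9)
print(ratios, p)

# Made-up standardized samples and eigenvectors sorted by eigenvalue.
eigenvectors = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2)
Z = np.array([[1.0, 1.0], [-1.0, -1.0], [0.5, -0.5]])
Z_reduced = project(Z, eigenvectors, p)
print(Z_reduced.shape)  # one column per kept component
```

With these numbers a single component already explains 95% of the variance, so the 2-D data is reduced to one column. A stricter threshold such as 0.99 would keep both components.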
