目錄
1. Motivation:
A. Explosive growth of data:
Source of abundant data: Business、Science、Society and Everyone.
B. Turn Data into Values and Knowledge:
User Opinions:Blog、Social Network、Query logs
Health Status:Body Temperature、Body Weight、Age、Gender
System Diagnosis:Network Traffic、Software logs、CPU Usage、Power Consumption
diagnosis [ˌdaɪəɡˈnəʊsɪs] 診斷
consumption [kənˈsʌmpʃn] 消耗,消費
2. Definition and Procedure:
A. Definition:
Non-trivial Extraction of Implicit,previously unknown and potentially userful imformation from data.
Definition [ˌdefɪˈnɪʃn] 定義
Trival [ˈtrɪviəl] 瑣碎的,不重要的
Non - Trival 無法輕易就能實現,有一定複雜度的
Extraction [ɪkˈstrækʃn] 提取, 抽取
Implicit [ɪmˈplɪsɪt] 內含的
B. Procedure:
數據源 -> 數據預處理 -> 數據勘探 -> 數據挖掘 -> 數據可視化 -> 決策
intergration 整合
Data Warehouse 數據倉庫
3. What we are going to learn:
A. Simple Introdution to Data Exploration:
B. Association to Rule Mining:
C. Clustering:
D. Classification:
E. Anomaly Detection:
F. Link Analysis:
G. Recommendation Systems:
H. Decision Support
I. Evaluation of Knowledge
Anomaly [əˈnɒməli] 異常事物
Link Analysis 鏈接分析
Evaluation [ɪˌvæljuˈeɪʃn] 估值,評價