Machine Learning and Data Mining Algorithms
1. Implement the ID3 algorithm in code and use it to build a decision tree from the data in the table below.
| ID | color | size | act | age | inflated |
|----|--------|-------|---------|-------|----------|
| 1 | YELLOW | SMALL | STRETCH | ADULT | T |
| 2 | YELLOW | SMALL | STRETCH | CHILD | T |
| 3 | YELLOW | SMALL | DIP | CHILD | F |
| 4 | YELLOW | LARGE | STRETCH | ADULT | T |
| 5 | YELLOW | LARGE | DIP | ADULT | T |
| 6 | YELLOW | LARGE | DIP | CHILD | F |
| 7 | PURPLE | SMALL | STRETCH | CHILD | T |
| 8 | PURPLE | SMALL | DIP | ADULT | T |
| 9 | PURPLE | SMALL | DIP | CHILD | F |
| 10 | PURPLE | LARGE | STRETCH | CHILD | T |
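Hint: You may design your own data file format, for example by encoding the values of the color attribute as YELLOW:0, PURPLE:1, and so on. The program should read the training set from a specified data file.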
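One possible file format, given here only as a sketch: each line holds one example as five space-separated integer codes in the order color, size, act, age, inflated, with the ID column left out and `#` starting a comment. Only YELLOW:0 and PURPLE:1 come from the hint; every other code, the column order, and the names `ENCODING`, `SAMPLE_FILE`, and `load_training_set` are assumptions made for illustration.

```python
import io

# Column order assumed for this sketch; the ID column is left out of the file.
COLUMNS = ["color", "size", "act", "age", "inflated"]

# Only YELLOW:0 and PURPLE:1 come from the hint; all other codes are assumptions.
ENCODING = {
    "color": {"YELLOW": 0, "PURPLE": 1},
    "size": {"SMALL": 0, "LARGE": 1},
    "act": {"STRETCH": 0, "DIP": 1},
    "age": {"ADULT": 0, "CHILD": 1},
    "inflated": {"F": 0, "T": 1},
}

# Stand-in for the contents of a data file: the ten table rows, encoded.
SAMPLE_FILE = """\
# color size act age inflated
0 0 0 0 1
0 0 0 1 1
0 0 1 1 0
0 1 0 0 1
0 1 1 0 1
0 1 1 1 0
1 0 0 1 1
1 0 1 0 1
1 0 1 1 0
1 1 0 1 1
"""


def load_training_set(stream):
    """Read integer-coded examples; return a list of (features, label) pairs."""
    examples = []
    for line_no, raw in enumerate(stream, start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        codes = [int(field) for field in line.split()]
        if len(codes) != len(COLUMNS):
            raise ValueError(f"line {line_no}: expected {len(COLUMNS)} fields")
        for name, code in zip(COLUMNS, codes):
            if code not in ENCODING[name].values():
                raise ValueError(f"line {line_no}: bad code {code} for {name}")
        features = dict(zip(COLUMNS[:-1], codes[:-1]))
        examples.append((features, codes[-1]))
    return examples


examples = load_training_set(io.StringIO(SAMPLE_FILE))
print(len(examples), examples[0])
```

To read from an actual file instead of the in-memory sample, pass an open file object, e.g. `with open(path) as f: examples = load_training_set(f)`, where `path` is whatever data file you specify.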
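Extension: The program should display how the information gain of each attribute is computed and how the decision tree is built step by step.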
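A minimal sketch of how the gain computation and tree construction could be traced, not a reference solution. It keeps the table rows inline as strings so the block runs on its own; the names `DATA`, `entropy`, `partition`, `information_gain`, and `id3`, the indented trace format, and the tie-breaking rule (first attribute listed wins) are all assumptions of this sketch.

```python
import math
from collections import Counter

ATTRIBUTES = ["color", "size", "act", "age"]

# Rows copied from the table: (color, size, act, age, inflated).
DATA = [
    ("YELLOW", "SMALL", "STRETCH", "ADULT", "T"),
    ("YELLOW", "SMALL", "STRETCH", "CHILD", "T"),
    ("YELLOW", "SMALL", "DIP", "CHILD", "F"),
    ("YELLOW", "LARGE", "STRETCH", "ADULT", "T"),
    ("YELLOW", "LARGE", "DIP", "ADULT", "T"),
    ("YELLOW", "LARGE", "DIP", "CHILD", "F"),
    ("PURPLE", "SMALL", "STRETCH", "CHILD", "T"),
    ("PURPLE", "SMALL", "DIP", "ADULT", "T"),
    ("PURPLE", "SMALL", "DIP", "CHILD", "F"),
    ("PURPLE", "LARGE", "STRETCH", "CHILD", "T"),
]


def entropy(rows):
    """Shannon entropy (in bits) of the class label, the last field of each row."""
    counts = Counter(row[-1] for row in rows)
    total = len(rows)
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def partition(rows, index):
    """Group rows by the value found in column `index`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[index], []).append(row)
    return groups


def information_gain(rows, index):
    """Entropy of `rows` minus the weighted entropy after splitting on `index`."""
    remainder = sum(
        len(group) / len(rows) * entropy(group)
        for group in partition(rows, index).values()
    )
    return entropy(rows) - remainder


def id3(rows, attr_indices, depth=0, trace=print):
    """Build the tree recursively, reporting every gain and split via `trace`."""
    indent = "  " * depth
    labels = [row[-1] for row in rows]
    if len(set(labels)) == 1:  # pure node: stop with a leaf
        trace(f"{indent}leaf: {labels[0]}")
        return labels[0]
    if not attr_indices:  # attributes exhausted: fall back to the majority label
        majority = Counter(labels).most_common(1)[0][0]
        trace(f"{indent}leaf (majority): {majority}")
        return majority
    gains = {i: information_gain(rows, i) for i in attr_indices}
    for i, gain in gains.items():
        trace(f"{indent}gain({ATTRIBUTES[i]}) = {gain:.4f}")
    best = max(attr_indices, key=lambda i: gains[i])  # ties: first listed wins
    trace(f"{indent}split on {ATTRIBUTES[best]}")
    remaining = [i for i in attr_indices if i != best]
    branches = {}
    for value, subset in partition(rows, best).items():
        trace(f"{indent}{ATTRIBUTES[best]} = {value}:")
        branches[value] = id3(subset, remaining, depth + 1, trace)
    return {ATTRIBUTES[best]: branches}


tree = id3(DATA, [0, 1, 2, 3])
print(tree)
```

On this table the sketch splits first on act (the STRETCH branch is pure) and then on age within the DIP branch. Passing a different `trace` callable, such as `lines.append` for some list `lines`, collects the steps instead of printing them, which may be convenient if the demonstration is shown in a GUI or written to a log.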