7
a) Error rate data without partitioning
50 50 50
𝐸𝑜𝑟𝑖𝑔 = 1 − max ( , )=
100 100 100
b) The A=T child node is pure so no need for splitting, in A=F child node, the distribution
of training instances:
Class label
B C
+ -
T T 0 20
F T 0 5
T F 25 0
F F 0 25
Therefore, A is chosen as splitting attribute. For C=F child, the rate of error splitting is
25
𝐸𝑜𝑟𝑖𝑔 =
50
1. Splitting on attribute A, error rate is
A=T A=F
+ 0 25
- 0 25
𝐸𝐴=𝑇 = 0
25
𝐸𝐴=𝐹 =
50
∆𝐴 = 0
𝐸𝐵=𝑇 = 0
𝐸𝐵=𝐹 = 0
25
∆𝐵 =
50
B is used as the splitting attribute. The overall error rate of the induced tree is 0.
e) The greedy heuristic does not necessarily lead to the best tree.