CSCI 374: Homework Assignment #4

Decision Trees
Due: 11:59 PM on Friday, November 11

You can download the assignment instructions by clicking on this link

Instructions for using GitHub for our assignments can be found on the Resources page of the class website, as well as using this link.

Example Accuracies

Here are the test set accuracies (and runtimes) that my implementation achieved on the four data sets with a training set size of 75% and the random seed 12345. Your answers might be slightly different because of different ways of implementing the solution and different random seeds.

Data Set Accuracy (Numeric = False) Learning Time (Numeric = False) Accuracy (Numeric = True) Learning Time (Numeric = True)
monks1.csv 0.9444 0.002 sec N/A N/A
penguins.csv 0.6744 0.005 sec 0.9651 0.03 sec
occupancy.csv 0.9531 0.40 sec 0.9911 39.05 sec
opticalDigit.csv 0.5680 0.44 sec 0.9132 8.97 sec