Due
November 18
Part
I: Consider
the (crisp) K-Nearest Neighbor Algorithm discussed in class.
Using the 2 class, 4 dimensional
dataset from project #1, experiment with different values of K and report the
classification rates in a confusion matrix. Test the algorithm using features 1
and 2 only and using features 3 and 4 only. (That is, you do NOT have to use
all 4 features together.) To simplify the testing, reserve the first 10 % of
each class data for testing and use the remaining data as the training set. (If
you are feeling ambitious, you can test using 10 fold cross validation; as
before, the first fold uses the first 10% of each class data for testing and
the remaining data as the training set.) Report the confusion matrix for each
test.
Part II: Now consider the Fuzzy
K-Nearest Neighbor Algorithm discussed in class. Again using the 2 class, 4 dimensional dataset, repeat part I above using the
Fuzzy K-NN algorithm. Experiment with different values of K. Try using crisp
memberships of the training data and also setting the fuzzy memberships of the
training data as we discussed in class. Report the confusion matrix for each
test.