I have a school assignment on k-Nearest Neighbors (k-NN) that I need help finishing. It should not take long if you are an expert. The project is expected to be done within 2 hours of the time you receive all the information you need.
The assignment's questions are:
What is a choice of k that balances between overfitting and ignoring the predictor information?
Show the classification matrix for the validation data that results from using the best k.
Classify the customer using the best k.
Repartition the data, this time into training, validation, and test sets (50% : 30% : 20%). Apply the k-NN method with the k chosen above. Compare the classification matrix of the test set with that of the training and validation sets. Comment on the differences and their reason.
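To give an idea of the workflow these questions describe, here is a minimal sketch in Python with scikit-learn. It is not the assignment's solution, only an illustration under stated assumptions: the data is synthetic (`make_classification`) because the real dataset is not included in this post, and the 60/40 split for the first part, the range of k from 1 to 20, the helper name `fit_knn`, and the use of the first row as the "customer" are all placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (assumption: the real assignment data is not shown here).
X, y = make_classification(n_samples=1000, n_features=6, n_informative=4,
                           random_state=1)


def fit_knn(X_train, y_train, k):
    """Hypothetical helper: scale on the training set only, then fit k-NN."""
    scaler = StandardScaler().fit(X_train)
    model = KNeighborsClassifier(n_neighbors=k)
    model.fit(scaler.transform(X_train), y_train)
    return scaler, model


# Part 1: training/validation split (60/40 is an assumed ratio).
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.4, random_state=1)

# Try a range of k values and record validation accuracy for each.
val_acc = {}
for k in range(1, 21):
    scaler, model = fit_knn(X_tr, y_tr, k)
    val_acc[k] = accuracy_score(y_val, model.predict(scaler.transform(X_val)))

# Pick the k with the highest validation accuracy.
best_k = max(val_acc, key=val_acc.get)

# Part 2: classification (confusion) matrix on the validation data.
scaler, model = fit_knn(X_tr, y_tr, best_k)
cm_val = confusion_matrix(y_val, model.predict(scaler.transform(X_val)))

# Part 3: classify one customer (placeholder: the first row of X).
new_customer = X[:1]
customer_class = model.predict(scaler.transform(new_customer))[0]

# Part 4: repartition into 50% training, 30% validation, 20% test.
X_tr2, X_rest, y_tr2, y_rest = train_test_split(X, y, train_size=0.5,
                                                random_state=1)
# 60% of the remaining half is 30% of the data; the other 40% is 20%.
X_val2, X_te2, y_val2, y_te2 = train_test_split(X_rest, y_rest, train_size=0.6,
                                                random_state=1)

scaler2, model2 = fit_knn(X_tr2, y_tr2, best_k)
matrices = {
    name: confusion_matrix(y_part, model2.predict(scaler2.transform(X_part)))
    for name, (X_part, y_part) in {
        "training": (X_tr2, y_tr2),
        "validation": (X_val2, y_val2),
        "test": (X_te2, y_te2),
    }.items()
}

print("best k:", best_k)
print("validation matrix:\n", cm_val)
for name, cm in matrices.items():
    print(name, "matrix:\n", cm)
```

In a run like this, the training matrix usually looks better than the validation and test matrices, because each training record counts itself among its own nearest neighbors. The validation result can also be slightly optimistic, since k was chosen to do well on that set, whereas the test set was not used for any decision.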
Before you bid, please make sure that you will be able to finish in 2 hours and that you will be able to answer any questions I have about the topic. I am familiar with the topic, but I need help understanding it.