Encerrado

data mining using R

Data Mining

This project is in two parts, A and B.

Part A Association Rule Mining

In Part A, the purpose is to mine the Zoo dataset to obtain association rules that contain a mix of positive and negative terms. For this purpose, we will build on Tutorial 3 that mined positive rules. Tutorial 3 also laid the foundation for negative rules by using the factor concept to identify items which do not occur in the dataset such as venomous=false.

association rule mining and Tutorial 3, you are required to accomplish the following tasks:

Task 1

Describe, without the use of code how a rule base can be generated which will not exceed a given size specified by a maxrules parameter. A suggested value of maxrules is 20.

Your description can be stated in pseudo code form that is iterative in nature. The process starts off with a high minsup value of 0.9 for frequent itemset generation. At each iteration the minsup threshold is reduced (say in decrements of 0.1) and the rules generated are examined. The rules that survive the quality check process (what measures would you use for quality control? – see class notes) are then compared with the maxrules threshold and the process is repeated as long as the number of quality rules do not exceed the threshold.

Task 2

Implement the process described in Task 1 and produce the R code needed. Note the rules generated should contain a mix of positive and negative terms. In your R code, clearly indicate the parameter values for each threshold that you used.

Task 3

Visualize the rules produced in Task 2 by using appropriate R code.

Part B Clustering

In this part you will use two clustering algorithms, Kmeans and DBscan on the Wholesale customers dataset UCI Machine Learning Repository: Wholesale customers Data Set. Apply the two algorithms, visualize the results, and evaluate the results using the silhouette cluster quality measure.

Task 4

Hand in the R code for each of the two algorithms. Please ensure that all parameter values are clearly indicated in your R code and documentation. For the DBSCAN algorithm use the kth nearest neighbor method discussed in class by implementing the R kNNdistplot() function.

Task 5

Using the visualization and cluster quality measures you used in Task 4 which of the two algorithms would you consider to be better? Explain your answer.

End of project specification

Note:

All code must meet good programming practices such as naming variables, modularity (using functions for repetitive tasks), and adequate comments at key points in the code.

Your code in a Google Colab sheet

A copy of your code in pdf form. This document must contain the name(s) of the persons who have undertaken the assignment. In addition, it must have a brief description of how the workload was distributed amongst the project partners if work was done in group mode.

A separate pdf document that provides written answers (not code) to the questions asked in this project specification.

Habilidades: Extração de Dados, Freelance, Processamento de Dados

Sobre o Cliente:
( 0 comentários ) Denton, United States

ID do Projeto: #34263251

27 freelancers estão ofertando em média $174 nesse trabalho

(348 Comentários)
8.1
(78 Comentários)
7.1
(11 Comentários)
6.4
merinsinha

Senior R expert. I can do it. As 9+ years experiences in these field. I can give good quality work. I have read the guidelines of your work.I believe that i can provide you the best quality works you are anticipating Mais

$200 USD in 4 dias
(119 Comentários)
5.4
liveexperts123

Hi there,I'm biddin on your project "data mining using R" I have read your project description and i'm an expert in Machine learning/Python/C++/Java and Data science therefore i can do this project for you perfectly.I Mais

$250 USD in 6 dias
(2 Comentários)
5.0
suyashdhoot

Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp Mais

$250 USD in 7 dias
(24 Comentários)
5.6
(6 Comentários)
4.5
sramireddy065

☀️☀️☀️MACHINE LEARNING && NLP, COMPUTER VISION EXPERT☀️☀️☀️ Hello , I hope you are safe and Doing well I have seen your project requirements , I am looking to discuss further with you Hope we Mais

$140 USD in 7 dias
(6 Comentários)
4.1
braincenter

Hello, Hope this message finds you well, I checked your details and I believe that my experience is what you are looking 4. I have been working on similar projects for the past 8 years, and I have the essential skills Mais

$100 USD in 3 dias
(2 Comentários)
3.2
freeproject800

Hello sir ! I have an engineer degree in Data science with more than 5 years of professional experience in R . I can start now Regards

$95 USD em 1 dia
(3 Comentários)
3.2
Jack201307

Dear client, I am Jack201307. Thank you very much for posting the project “data mining using R”. I am using both R and SAS to analyze data every day. Recently I successfully finished more than 200 projects relevant to Mais

$80 USD in 3 dias
(2 Comentários)
3.0
gracew20

R PROGRAMMING EXPERT , I am best in statistics, R programming analysis of data, SPSS, STATA, MINITAB, R language, Forecasting, Statistical Quality Control, Spatial Data Analysis, Structural Equation Modeling,Data minin Mais

$500 USD in 7 dias
(2 Comentários)
2.7
mykhailodew

Master in R Programming and Mathematics Hello, I hope you are safe and Doing well I have seen your project requirements , I am looking to discuss further with you Hope we will meet soon to discuss further Coming to me, Mais

$150 USD in 3 dias
(1 Comentário)
1.4
alamineee

Hi, Dear Employer, I have read the instructions carefully and I clearly understand what is required of the project. I am expert in this field. That's why I placed my bid on this project. I'm a professional in ✔Excel fi Mais

$50 USD em 1 dia
(2 Comentários)
1.4
anenkovakateryna

Hi I've read the project description carefully. I'm an expert in R programming. It would be a great pleasure for me to have the opportunity working with you. ✓ Looking forward to hearing more about your project via ch Mais

$50 USD em 1 dia
(1 Comentário)
1.0
dilaraaydin4

Firstly I hope you are being well.I am an university student who deals with computer stuff.I am school holiday right now so I do not have anything to deal with or any job that makes your project delay.I can completely Mais

$140 USD in 7 dias
(1 Comentário)
0.4
jake66405

MASTERS IN COMPUTER SCIENCE AND SOFTWARE ARCHITECTURE DATA MINING EXPERT Hi there, I have carefully gone through your project description and I would like to help you with this. Let me know if you have any more info t Mais

$250 USD in 7 dias
(0 Comentários)
0.0
dizaardnsyh

I can do the entry data as soon as possible according the time given, You can catch me everytime, and I will reply as soon as possible in the afternoon or night hour

$100 USD in 7 dias
(0 Comentários)
0.0
Harshitshakya820

Because I'm abel to do this work i am do this work very hard and fast try to this work perfact and so on

$140 USD in 7 dias
(0 Comentários)
0.0
josephwriter1996

Hi, Greetings and hoping you are doing well, i welcome you to my profile where quality and client satisfaction is the Priority. I am Expert Joseph and i hope to cooperate with you on your project . CERTIFIED EXPERT I Mais

$250 USD in 2 dias
(0 Comentários)
0.0