Need to perform data mining using a spreadsheet

Closed. Posted 1 year ago. Paid on delivery.

The sinking of the Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the RMS Titanic, widely considered “unsinkable”, sank after colliding with an iceberg. There weren’t enough lifeboats for everyone on board, resulting in the death of 1502 of the 2224 passengers and crew. While some element of luck was involved in surviving, some groups of people seem to have been more likely to survive than others. We want to build a predictive model that identifies which people were more likely to survive the sinking of the Titanic. The data is grouped according to whether or not a person survived (1=significant, 0=insignificant). Download the data from D2L and follow the steps below, which will guide you through building a data mining model:

A. For any dataset, we need to clean the data before doing any analysis. We discussed 8 steps in Topic 1 – Data Preparation. To simplify this step, we will perform only data transformation and missing-data cleaning:

i. Four variables in the data set have missing values. Variable ‘Cabin’ is missing 80% of its values, so we won’t use it in our models. Replace the missing values of variable ‘Embarked’ with the most common value, and the missing values of variables ‘Age’ and ‘Price’ with their average values (0.5 point).

ii. Since variables ‘Sex’ and ‘Embarked’ are categorical, we need to transform them. Transform variable ‘Sex’ into a dummy variable (values 0 and 1), and variable ‘Embarked’ into a numeric variable (values 1, 2, and 3) (0.5 point).
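
For anyone checking the cleaning rules in step A outside the spreadsheet, here is a minimal sketch in plain Python. It is only an illustration, not the Analytic Solver workflow the assignment asks for. The sample rows, the port letters S/C/Q, and the particular 0/1 and 1/2/3 codings are assumptions; the brief does not say which category gets which number.

```python
from collections import Counter
from statistics import mean

# Hypothetical sample rows; column names follow the assignment text.
# 'Cabin' is left out entirely because the brief says not to use it.
rows = [
    {"Sex": "male", "Age": 22.0, "Price": 7.25, "Embarked": "S"},
    {"Sex": "female", "Age": None, "Price": 71.28, "Embarked": "C"},
    {"Sex": "female", "Age": 26.0, "Price": None, "Embarked": None},
    {"Sex": "male", "Age": 36.0, "Price": 8.05, "Embarked": "S"},
]

SEX_CODE = {"male": 0, "female": 1}   # assumed dummy coding
PORT_CODE = {"S": 1, "C": 2, "Q": 3}  # assumed numeric coding


def clean(rows):
    """Return new rows with missing values imputed and categories encoded."""
    # Averages are computed from the non-missing values only.
    age_mean = mean(r["Age"] for r in rows if r["Age"] is not None)
    price_mean = mean(r["Price"] for r in rows if r["Price"] is not None)
    # Most common non-missing port.
    ports = Counter(r["Embarked"] for r in rows if r["Embarked"] is not None)
    port_mode = ports.most_common(1)[0][0]

    cleaned = []
    for r in rows:
        port = r["Embarked"] if r["Embarked"] is not None else port_mode
        cleaned.append({
            "Sex": SEX_CODE[r["Sex"]],
            "Age": r["Age"] if r["Age"] is not None else age_mean,
            "Price": r["Price"] if r["Price"] is not None else price_mean,
            "Embarked": PORT_CODE[port],
        })
    return cleaned


cleaned = clean(rows)
```

In a spreadsheet the same rules map to an AVERAGE over the non-blank cells for ‘Age’ and ‘Price’, and a lookup of the most frequent ‘Embarked’ value.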

B. Next, we need to set up cross-validation by partitioning the data. Use Analytic Solver’s standard data partition command to split the data into a training set (50% of the observations), a validation set (30% of the observations), and a test set (20% of the observations), using the default seed of 12345. (1 point)
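
The 50/30/20 split in step B can be sketched in plain Python as follows. The `partition` helper is hypothetical, and although it uses the same seed value, Python's random generator is not Analytic Solver's, so the rows assigned to each set will differ from what the add-in produces.

```python
import random


def partition(n_rows, seed=12345, train=0.5, valid=0.3):
    """Shuffle row indices reproducibly and split them into three sets."""
    indices = list(range(n_rows))
    # A dedicated Random instance keeps the shuffle reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    n_train = int(n_rows * train)
    n_valid = int(n_rows * valid)
    train_idx = indices[:n_train]
    valid_idx = indices[n_train:n_train + n_valid]
    test_idx = indices[n_train + n_valid:]  # whatever remains (about 20%)
    return train_idx, valid_idx, test_idx


train_idx, valid_idx, test_idx = partition(100)
```

Giving the test set the remainder guarantees that every row lands in exactly one partition even when the row count does not divide evenly.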

C. Perform discriminant analysis, logistic regression, k-nearest neighbors (with normalized inputs), a single classification tree (with normalized inputs and at least 4 observations per terminal node), and a manual neural network (with normalized inputs and a single hidden layer with 3 nodes) to create a classifier for this data. How accurate is each procedure on the training, validation, and test data sets? (1 point).
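
To make "normalized inputs" and "accuracy on each data set" in step C concrete, here is a small plain-Python sketch of just one of the five methods, k-nearest neighbors. It is an illustration under assumptions: the toy feature rows ([Sex, Age]), the labels, and k=3 are made up, and z-score scaling is one common normalization, not necessarily the one Analytic Solver applies.

```python
from collections import Counter
from math import dist
from statistics import mean, pstdev


def fit_scaler(X):
    """Compute (mean, std) per column from the training rows only."""
    return [(mean(col), pstdev(col) or 1.0) for col in zip(*X)]


def scale(X, params):
    """Apply z-score normalization using previously fitted parameters."""
    return [[(v - m) / s for v, (m, s) in zip(row, params)] for row in X]


def knn_predict(X_train, y_train, X_new, k=3):
    """Predict each new row's label by majority vote of its k nearest rows."""
    preds = []
    for x in X_new:
        nearest = sorted(zip(X_train, y_train), key=lambda p: dist(p[0], x))[:k]
        votes = Counter(label for _, label in nearest)
        preds.append(votes.most_common(1)[0][0])
    return preds


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data: [Sex, Age] with Survived as the label.
X_train = [[1, 20], [1, 30], [1, 40], [0, 25], [0, 35], [0, 45]]
y_train = [1, 1, 1, 0, 0, 0]
X_valid = [[1, 28], [0, 38]]
y_valid = [1, 0]

params = fit_scaler(X_train)           # fit on training data only
Xs_train = scale(X_train, params)
Xs_valid = scale(X_valid, params)      # reuse the training parameters

train_acc = accuracy(y_train, knn_predict(Xs_train, y_train, Xs_train))
valid_acc = accuracy(y_valid, knn_predict(Xs_train, y_train, Xs_valid))
```

The same `accuracy` comparison would be repeated for the test partition and for each of the other classifiers. Training accuracy is usually optimistic for k-NN because each training row counts itself among its own neighbors.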

Skills: Data Mining, Data Science, Big Data, Sales, Excel, Python

Project ID: #34322682

About the project

9 proposals. Remote project. Active 1 year ago.

9 freelancers are bidding an average of $19 for this job

jake66405

MASTERS IN COMPUTER SCIENCE AND SOFTWARE ARCHITECT EXCEL EXPERT Hi there, I have carefully gone through your project description and I would like to help you with this. Let me know if you have any more info that may h…

$20 USD in 7 days
(1 review)
1.1
jamesohoff

Hi. Thank you for your title and I feel I am ready for your project right now. I saw your title for a position as the developer for your project. I have experienced with 12+ years of website development using HTML, CSS…

$20 USD in 7 days
(1 review)
0.8
Hujaifa007

As I am working in shifts in my current job, I can give proper time to this project. Currently I am working as an SAP Operator, so I have work experience of 5 years.

$20 USD in 7 days
(0 reviews)
0.0
sonuvermaharidw5

am a hard-working and driven individual who isn't afraid to face a challenge. I'm passionate about my work and I know how to get the job done. I would describe myself as an open and honest person who doesn't believe in…

$20 USD in 7 days
(0 reviews)
0.0
tituofficial7

Hello, I will do your task in the next few hours. I will provide a sample. kindly award the project. Thank you

$20 USD in 7 days
(0 reviews)
0.0
salvomilani18

Dear Client My name is Salvo and I am Python&Excel Expert. I read your description carefully and I think it is just fixed job for me. I have 6 years of experience and finished many kinds of Python&Excel project. I have…

$20 USD in 7 days
(0 reviews)
0.0
nithinbijue3

I am looking for work as a freelancer. Doing graphic design, data entry, content writing, etc... You can trust me 100%….I will do the work you assign with great responsibility.

$20 USD in 7 days
(0 reviews)
0.0
AhmedElabidy

Hello! I am Ahmed, I saw your project and I can help you as a Data Entry to finish your project. I will be happy if you choose me to work with you. Thanks

$10 USD in 1 day
(0 reviews)
0.0