Consider the time taken to verify a single bug is a minimum two hours where the bugs can be from the field or are found in-house. If a test team verifies hundreds of bugs per year, for 250 defects, that is 100 days of work on nothing else but this. Create a training model that will predict which defects were incorrectly fixed based on previous data (i.e., previously verified defects).
The bug prediction dataset contains data about the following software systems:
- Eclipse JDT Core
- Eclipse PDE UI
- Equinox Framework
Justify whether to use supervised/unsupervised/reinforcement learning for the task.
1. Import Libraries/Dataset
a. Download the dataset
b. Import the required libraries
2. Data Visualization and Exploration
a. Print at least 5 rows for sanity check to identify all the features present in the dataset and if the target matches with them.
b. Print the description and shape of the dataset.
c. Provide appropriate visualization to get an insight about the dataset.
d. Try exploring the data and see what insights can be drawn from the dataset.
3. Data Pre-processing and cleaning
a. Do the appropriate preprocessing of the data like identifying NULL or Missing Values if any, handling of outliers if present in the dataset, skewed data etc. Apply appropriate feature engineering techniques for them.
b. Apply the feature transformation techniques like Standardization, Normalization, etc. You are free to apply the appropriate transformations depending upon the structure and the complexity of your dataset.
c. Do the correlational analysis on the dataset. Provide a visualization for the same.
4. Data Preparation
a. Do the final feature selection and extract them into Column X and the class label into Column into Y.
b. Split the dataset into training and test sets.
5. Model Building
a. Perform Model Development using at least three models, separately. You are free to apply any Machine Learning Models on the dataset. Deep Learning Models are strictly not allowed.
b. Train the model and print the training accuracy and loss values.
6. Performance Evaluation
a. Print the confusion matrix. Provide appropriate analysis for the same.
b. Do the prediction for the test data and display the results for the inference.
27 freelancers estão ofertando em média ₹8481 nesse trabalho
Hi, I am Ibrahim, and I am a data scientist, I can help you train and evaluate ML models, please share the dataset as well. Regards, Ibrahim Anjum
Hello, I am an independent, experienced Machine learning expert. I can help with this task with a quick turn-around. Looking to hearing from you. Kind regards.
Hi , Am a Data Scientist working in a Big MNC Training models is my everyday job, Also a freelancer during free time , I can do this for win win amount which I have quoted, Ping back for more details
Hello, I have gone through your problem statement, and i would like to tell you that i can do this. I have done various type of projects related to that. So, if you want i can do this. Thank You!