Dear Learner, Please submit your final report here.
Review Parameters (Review Points):
1. Introduction (3 points)
- Brief introduction about the problem statement and the need of solving it.
2. EDA and Business Implication (5 points)
- Uni-variate / Bi-variate / Multi-variate analysis to understand relationship b/w variables. How your analysis is impacting the business?
- Both visual and non-visual understanding of the data.
3. Data Cleaning and Pre-processing (8 points)
- Approach used for identifying and treating missing values and outlier treatment (and why)
- Need for variable transformation (if any)
- Variables removed or added and why (if any)
4. Model building (8 points)
- Clear on why was a particular model(s) chosen.
- Effort to improve model performance.
5. Model validation (8 points)
- How was the model validated? Just accuracy, or anything else too?
6. Final interpretation / recommendation (8 points)
- Detailed recommendations for the management/client based on the analysis done.
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Business Problem:
Health care is a very important domain in the market. It is directly linked to the life
of the individual, so we must always be proactive in this domain. Money plays a
major role here, because treatment can sometimes be very costly, and an individual who is
not covered by insurance can end up in a tough financial situation.
Medical insurance companies also want to reduce their risk by optimizing the
insurance cost, because a healthy body is largely in the hands of the individual: if an individual
eats healthily and exercises properly, the chance of getting ill is drastically reduced.
Goal & Objective: The objective of this exercise is to build a model, using the data, that provides the
optimum insurance cost for an individual. You have to use the health- and habit-related parameters to
estimate the cost of insurance.
File: Data.csv
Target variable: insurance_cost
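Since the target is a continuous cost, one possible starting point is a plain regression baseline. The sketch below is only an illustration under stated assumptions, not the expected solution: it assumes pandas and scikit-learn are available, and the function name `fit_baseline`, the 25% hold-out split, and the median imputation are hypothetical choices made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split


def fit_baseline(df: pd.DataFrame, target: str = "insurance_cost", seed: int = 42):
    """Fit a linear-regression baseline and report hold-out MAE and R^2.

    Hypothetical helper for illustration; not part of the assignment brief.
    """
    # applicant_id is an identifier, not a predictor, so drop it if present.
    features = df.drop(columns=[target, "applicant_id"], errors="ignore")
    # One-hot encode text columns (for example Occupation or Gender).
    X = pd.get_dummies(features, drop_first=True, dtype=float)
    # Placeholder imputation so the baseline can run; a real report should
    # justify its own missing-value treatment.
    X = X.fillna(X.median())
    y = df[target]

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed
    )
    model = LinearRegression().fit(X_train, y_train)
    pred = model.predict(X_test)
    scores = {
        "mae": mean_absolute_error(y_test, pred),
        "r2": r2_score(y_test, pred),
    }
    return model, scores
```

With the assignment file, this could be called as `fit_baseline(pd.read_csv("Data.csv"))`. The hold-out scores give a reference point against which any more elaborate model can be compared.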
Data dictionary:
| Variable | Business Definition |
| applicant_id | Applicant unique ID |
| years_of_insurance_with_us | Number of years the customer has held a policy with the same company |
| regular_checkup_lasy_year | Number of times the customer has had a regular health check-up in the last one year |
| adventure_sports | Customer is involved with adventure sports like climbing, diving etc. |
| Occupation | Occupation of the customer |
| visited_doctor_last_1_year | Number of times customer has visited doctor in last one year |
| cholesterol_level | Cholesterol level of the customers while applying for insurance |
| daily_avg_steps | Average daily steps walked by customers |
| age | Age of the customer |
| heart_decs_history | Any past heart diseases |
| other_major_decs_history | Any past major diseases apart from heart like any operation |
| Gender | Gender of the customer |
| avg_glucose_level | Average glucose level of the customer when applying for insurance |
| bmi | BMI of the customer when applying for insurance |
| smoking_status | Smoking status of the customer |
| Year_last_admitted | When the customer was last admitted to the hospital |
| Location | Location of the hospital |
| weight | Weight of the customer |
| covered_by_any_other_company | Customer is covered by any other insurance company |
| Alcohol | Alcohol consumption status of the customer |
| exercise | Regular exercise status of the customer |
| weight_change_in_last_one_year | How much the customer's weight has varied in the last year |
| fat_percentage | Fat percentage of the customer when applying for insurance |
| insurance_cost | Total Insurance cost |
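Using the column names from the data dictionary above, the cleaning step could be sketched roughly as follows. This is an illustration under assumptions, not a prescribed method: the helper name `clean`, the median/mode imputation, the 1.5 × IQR capping rule, and the choice of which columns count as continuous are all hypothetical and should be checked against the actual data.

```python
import numpy as np
import pandas as pd

# Assumption: these dictionary columns are continuous measurements.
# Binary flags such as heart_decs_history are deliberately left uncapped,
# because an IQR of zero would clip every rare value away.
CONTINUOUS = ("bmi", "avg_glucose_level", "daily_avg_steps", "weight", "fat_percentage")


def clean(df: pd.DataFrame, target: str = "insurance_cost",
          cap_cols=CONTINUOUS) -> pd.DataFrame:
    """Impute missing values and cap outliers (hypothetical helper)."""
    # The applicant ID carries no predictive information.
    out = df.drop(columns=["applicant_id"], errors="ignore").copy()
    numeric = out.select_dtypes(include="number").columns.drop(target, errors="ignore")
    categorical = out.select_dtypes(exclude="number").columns

    for col in numeric:
        # Median imputation is robust to skewed distributions.
        out[col] = out[col].fillna(out[col].median())
        if col in cap_cols:
            # Cap values outside 1.5 * IQR instead of dropping rows.
            q1, q3 = out[col].quantile([0.25, 0.75])
            iqr = q3 - q1
            out[col] = out[col].clip(q1 - 1.5 * iqr, q3 + 1.5 * iqr)

    for col in categorical:
        # Fill missing categories with the most frequent value.
        mode = out[col].mode(dropna=True)
        if not mode.empty:
            out[col] = out[col].fillna(mode.iloc[0])
    return out
```

Capping rather than deleting outliers keeps every applicant in the data. Whether that is appropriate depends on whether the extreme values are errors or genuine high-risk cases, which the report should argue explicitly.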

Please download the answer file using this link:
https://drive.google.com/file/d/1dtbJaMTg4c7suTupbH3S8ioW5mK4gN-3/view?usp=sharing

