question archive Dear Learner, Please submit your final report here
Subject:BusinessPrice:32.99 Bought5
Dear Learner, Please submit your final report here. Review Parameters Review Points 1. Introduction 3 - Brief introduction about the problem statement and the need of solving it. 2. EDA and Business Implication 5 - Uni-variate / Bi-variate / Multi-variate analysis to understand relationship b/w variables. How your analysis is impacting the business? - Both visual and non-visual understanding of the data. 3. Data Cleaning and Pre-processing 8 - Approach used for identifying and treating missing values and outlier treatment (and why) - Need for variable transformation (if any) - Variables removed or added and why (if any) 4. Model building 8 - Clear on why was a particular model(s) chosen. - Effort to improve model performance. 5. Model validation 8 - How was the model validated? Just accuracy, or anything else too? 6. Final interpretation / recommendation 8 - Detailed recommendations for the management/client based on the analysis done.
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Business Problem:
We all know that Health care is very important domain in the market. It is directly linked with the life
of the individual; hence we have to be always be proactive in this particular domain. Money plays a
major role in this domain, because sometime treatment becomes super costly and if any individual is
not covered under the insurance then it will become a pretty tough financial situation for that
individual. The companies in the medical insurance also want to reduce their risk by optimizing the
insurance cost, because we all know a healthy body is in the hand of the individual only. If individual
eat healthy and do proper exercise the chance of getting ill is drastically reduced.
Goal & Objective: The objective of this exercise is to build a model, using data that provide the
optimum insurance cost for an individual. You have to use the health and habit related parameters for
the estimated cost of insurance
File: Data.csv
Target variable: insurance_cost
Data dictionary:
Variable | Business Definition |
applicant_id | Applicant unique ID |
years_of_insurance_with_u s |
Since how many years customer is taking policy from the same company only |
regular_checkup_lasy_year | Number of times customers has done the regular health check up in last one year |
adventure_sports | Customer is involved with adventure sports like climbing, diving etc. |
Occupation | Occupation of the customer |
visited_doctor_last_1_year | Number of times customer has visited doctor in last one year |
cholesterol_level | Cholesterol level of the customers while applying for insurance |
daily_avg_steps | Average daily steps walked by customers |
age | Age of the customer |
heart_decs_history | Any past heart diseases |
other_major_decs_history | Any past major diseases apart from heart like any operation |
Gender | Gender of the customer |
avg_glucose_level | Average glucose level of the customer while applying the insurance |
bmi | BMI of the customer while applying the insurance |
smoking_status | Smoking status of the customer |
Year_last_admitted | When customer have been admitted in the hospital last time |
Location | Location of the hospital |
weight | Weight of the customer |
covered_by_any_other_co mpany |
Customer is covered from any other insurance company |
Alcohol | Alcohol consumption status of the customer |
exercise | Regular exercise status of the customer |
weight_change_in_last_one _year |
How much variation has been seen in the weight of the customer in last year |
fat_percentage | Fat percentage of the customer while applying the insurance |
insurance_cost | Total Insurance cost |
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Purchased 5 times