Medical Insurance Cost Prediction — Machine Learning Case Study

Charan H U
2 min read · Aug 11, 2022


This case study predicts medical insurance costs using machine learning in Python. For this project, I used a Linear Regression model.

Table of Contents:

  1. Insurance Cost Data Collection link
  2. Data Analysis
  3. Data Preprocessing
  4. Train Test Split
  5. Linear Regression Model
  6. Trained Linear Regression Model
  7. Test using New Data
  8. Prediction

1. Insurance Cost Data Collection link
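
As a rough sketch of this step, the data could be loaded with pandas along the following lines. The column names are an assumption based on the features analysed in the next section, and the three sample rows are made up so that the snippet runs without the real file; in practice a path to the downloaded CSV would be passed instead.

```python
import io

import pandas as pd

# Hypothetical sample rows standing in for the real CSV file.
# Column names are assumed from the features analysed in this post.
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
25,male,26.2,0,no,northeast,3000.0
40,female,30.1,2,yes,southeast,21000.0
33,male,22.7,1,no,northwest,4500.0
"""


def load_insurance_data(source):
    """Read the insurance data from a file path or file-like object."""
    return pd.read_csv(source)


insurance = load_insurance_data(io.StringIO(SAMPLE_CSV))
print(insurance.shape)           # (rows, columns)
print(insurance.isnull().sum())  # missing values per column
```

Checking the shape and the missing-value counts first is a cheap way to confirm the file was read as expected before any analysis.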

2. Data Analysis

  1. Age Distribution
  2. Gender Distribution
  3. BMI Distribution
  4. Children Distribution
  5. Smoker Distribution
  6. Region Distribution
  7. Charges Distribution
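
The distributions listed above can also be inspected numerically. This is a minimal sketch, assuming the same column names as before and using made-up rows, so the numbers it prints say nothing about the real dataset.

```python
import pandas as pd

# Made-up rows for illustration only; column names are assumptions.
insurance = pd.DataFrame({
    "age": [19, 25, 33, 40, 52, 61],
    "sex": ["female", "male", "male", "female", "male", "female"],
    "bmi": [27.9, 26.2, 22.7, 30.1, 33.4, 29.0],
    "children": [0, 0, 1, 2, 3, 0],
    "smoker": ["yes", "no", "no", "yes", "no", "no"],
    "region": ["southwest", "northeast", "northwest",
               "southeast", "southeast", "northwest"],
    "charges": [16000.0, 3000.0, 4500.0, 21000.0, 11000.0, 13000.0],
})

numeric_cols = ["age", "bmi", "charges"]
count_cols = ["sex", "children", "smoker", "region"]

# Numeric features: count, mean, spread, and quartiles.
numeric_summary = insurance[numeric_cols].describe()
print(numeric_summary)

# Categorical and discrete features: how many rows fall in each value.
category_counts = {col: insurance[col].value_counts() for col in count_cols}
for col, counts in category_counts.items():
    print(f"\n{col}\n{counts}")
```

A histogram for each numeric column and a count plot for each categorical column would show the same information visually.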

3. Data Preprocessing

Encoding categorical features

After encoding
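
One possible way to do this encoding is to map each category to an integer. The specific codes below are an assumption chosen for illustration, as are the column names and sample rows.

```python
import pandas as pd

# Made-up rows for illustration only; column names are assumptions.
insurance = pd.DataFrame({
    "sex": ["female", "male", "male"],
    "smoker": ["yes", "no", "no"],
    "region": ["southwest", "northeast", "southeast"],
})

# Assumed integer codes; any consistent mapping works, as long as the
# same one is used for training and for later predictions.
ENCODING = {
    "sex": {"male": 0, "female": 1},
    "smoker": {"yes": 0, "no": 1},
    "region": {"southeast": 0, "southwest": 1, "northeast": 2, "northwest": 3},
}

encoded = insurance.copy()
for column, mapping in ENCODING.items():
    encoded[column] = encoded[column].map(mapping)

print(encoded)
```

Integer codes put an artificial order on a column like region. For a linear model, one-hot encoding (for example with `pd.get_dummies`) is a common alternative that avoids this.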

4. Splitting the Data into Train and Test
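
A minimal sketch of the split using scikit-learn's `train_test_split`. The arrays are random stand-ins for the encoded features and the charges column, and the `test_size` and `random_state` values are illustrative assumptions rather than the settings used in the original notebook.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the encoded feature matrix and the charges column.
rng = np.random.default_rng(0)
X = rng.random((100, 6))   # 100 rows, 6 features
y = rng.random(100)        # target values

# test_size and random_state are illustrative choices, not the original ones.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=2
)

print(X.shape, X_train.shape, X_test.shape)
```

Fixing `random_state` makes the split reproducible, so the train and test scores can be compared across runs.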

5. Model training
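
Training can be sketched with scikit-learn's `LinearRegression`. The data here is synthetic and noise-free, generated from a known linear rule, so that the fitted coefficients are easy to check; it is not the insurance data.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data with a known linear relationship, for illustration only:
# y = 3*x0 - 2*x1 + 5
rng = np.random.default_rng(0)
X_train = rng.random((50, 2))
y_train = 3.0 * X_train[:, 0] - 2.0 * X_train[:, 1] + 5.0

# Ordinary least squares fit.
regressor = LinearRegression()
regressor.fit(X_train, y_train)

print(regressor.coef_)       # close to [3, -2]
print(regressor.intercept_)  # close to 5
```

On the real dataset, `X_train` and `y_train` would come from the train/test split in the previous step.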

6. Model prediction

R squared value of Train data:  0.751505643411174
R squared value of Test:  0.7447273869684077
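
R squared is the share of variance in the target that the model explains, so a value near 0.75 means roughly three quarters of the variation in charges is captured. The train and test scores being close suggests the model is not overfitting. The sketch below computes the metric from its definition on made-up numbers; the helper name `r_squared` is hypothetical.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up values for illustration only.
actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 3.0, 3.5]

score = r_squared(actual, predicted)
print(score)
```

scikit-learn's `sklearn.metrics.r2_score` computes the same quantity and is the usual choice in practice.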

7. Building a Predictive System

The insurance cost is USD  3760.080576496055
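
A predictive system like this usually takes one person's features, reshapes them into a single-row array, and passes that row to the trained model. The sketch below assumes six already-encoded features and trains on synthetic data; the helper `predict_cost` and the input values are hypothetical.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic training data for illustration only. Six columns stand in for
# age, sex, bmi, children, smoker, region (already encoded as numbers).
rng = np.random.default_rng(0)
X_train = rng.random((60, 6))
weights = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y_train = X_train @ weights

regressor = LinearRegression().fit(X_train, y_train)


def predict_cost(model, features):
    """Predict the cost for one person given a flat sequence of features."""
    # scikit-learn expects a 2D array: one row per sample.
    row = np.asarray(features, dtype=float).reshape(1, -1)
    return float(model.predict(row)[0])


new_person = (0.5, 1.0, 0.2, 0.0, 1.0, 0.3)  # hypothetical encoded input
cost = predict_cost(regressor, new_person)
print("The insurance cost is USD ", cost)
```

The input must use the same feature order and the same category encoding as the training data, otherwise the prediction is meaningless.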

GitHub Repo link:
