marta.github.io

Insurance Costs Analysis

This project explores the β€œU.S. Medical Insurance dataset” to understand which factors influence individual insurance pricing. The analysis includes data exploration, visualization, and a multiple linear regression model using statsmodels.


πŸ“Š Dataset

The dataset contains 1,338 observations with the following variables:


πŸ”Ž Exploratory Data Analysis

Example visualization:

Scatterplot BMI vs Charges


πŸ“ˆ Regression Model

I built a multiple linear regression model including all variables (categorical variables converted to dummies).

Key results:


πŸ›  Tools & Libraries


πŸ“Œ Key Learnings


πŸš€ Next Steps