This project analyzes the chemical properties of Portuguese red Vinho Verde wines and their impact on perceived wine quality. Using a combination of statistical and machine learning techniques, the goal was to identify key predictors of wine quality and build models that allow restaurant managers to select high-quality wines without relying solely on sensory tests.
The analysis focuses on 1,599 wine samples and evaluates models that can predict Low, Medium, and High quality classes based on chemical composition, providing actionable insights for business decisions.
The dataset contains 1,599 observations with the following variables:
Physicochemical data
Fixed acidity
Volatile acidity
Citric acid
Residual sugar
Chlorides
Free sulfur dioxide
Total sulfur dioxide
Density
pH
Sulphates
Alcohol
Quality
Significant Chemical Properties:
Model Performance:
Business Impact:
1. Data Cleaning & Preprocessing: Outlier removal, normalization, and feature selection
2. Exploratory Analysis: Correlation matrices, chemical property distributions, and class-specific insights
3. Modeling Approaches:
4. Evaluation & Insights: Accuracy metrics, confusion matrices, Chi-Square tests
5. Actionable Recommendations: Translating results into clear guidelines for restaurant managers
Report: Project report.pdf – Analysis report written in accessible language for both technical and non-technical stakeholders
Presentation: Presentation.pdf – Summarizes insights in accessible language and visualilzations for both technical and non-technical stakeholders