Welcome to our Restaurant Tips Optimization Project! This project aims to enhance restaurant services and optimize earnings through data analysis and machine learning. Below is a comprehensive guide to the project, highlighting key findings, conclusions, and the modeling process.
- Source: tip.csv file
- Contains 244 rows/samples of tipping data
total_bill
: the total bill amounttip
: tip amount paidsex
: gender of bill payer (Male/Female)smoker
: whether the bill payer is a smoker (Yes/No)day
: day of weektime
: meal timesize
: size of dining party
- Provides detailed tipping data across different parties, days, and meal times
- Can analyze differences in tipping behavior by party size, gender, smoking status, and time/day
- Useful for food service business insights and modeling tip percentage prediction
-
Data Analysis:
- Explore restaurant tips data.
- Uncover insights to optimize restaurant processes.
-
Machine Learning Model:
- Develop a model predicting tips based on various factors.
- Support restaurant owners and staff in improving services and earnings.
-
Customer Demographics:
- The restaurant attracts more male customers than females.
- Non-smokers significantly outnumber smokers.
-
Day-wise Analysis:
- Saturdays, Sundays, and Thursdays observe higher customer turnout.
- Fridays have lower attendance, while Mondays, Tuesdays, and Wednesdays are missing data.
-
Meal Preference:
- Dinner is more popular than lunch.
- Right-skewed total_bill distribution suggests a diverse customer base.
-
Tip Distribution:
- Right skewness in tip amounts implies potential for tip pooling policies.
- Majority of tables host around 2 customers.
-
Tipping Behavior:
- Male customers tend to be higher tippers.
- High tippers prefer dinner service.
-
Smoking Customers:
- Smoking customers do not significantly impact tip amounts.
-
Tip Policy Recommendation:
- Suggested tip range: $1 to $10, with an average of $3.
-
Dining Timing:
- Male customers prefer early-week dining.
-
Customized Menu Consideration:
- Customer groups mainly consist of dinner companions, suggesting a need for a tailored menu.
-
Total Bill and Table Size Influence:
- Both total bill and table size positively impact tip amounts.
- Assumption of data representativeness.
- Ignored limitations, e.g., missing days and assumed absence of breakfast menu.
- Log-transforming tip data to mitigate extreme tipping behavior's impact.
- Feature scaling to ensure equal influence of variables on the model.
-
Baseline Model (The mean):
- MAE: 0.8184
-
RandomForestRegressor Model:
- MAE: 0.5788
- Outperforms the baseline, showcasing its ability to capture complex relationships.
-
Baseline Model:
- Predicts tips based on the mean tip value.
- MAE: 0.8184
-
RandomForestRegressor Model:
- Outperforms the baseline.
- MAE: 0.5788
The RandomForestRegressor model demonstrates superior predictive accuracy, making it a valuable tool for optimizing restaurant tips.
This project provides actionable insights and a robust machine learning model to optimize restaurant tips, benefiting both owners and staff. Future improvements may involve refining data assumptions and exploring additional features for model enhancement.