@brian_pk_chan
Home Projects Work About

Foodhub Dataset: Data Analysis to Improve Business Outcomes

Published on January 26, 2024

FoodHub is a mobile app that provides food delivery services (Similar to UberEats). When an order is placed by a customer for a specific restaurant, the app will assign an delivery person to receive the food and delivery it to the customer. They will follow the map to reach the restaurant, pick up the food, and follow the map again to delivery to the customer. Once food is delivered, the customer will confirm reception and be allowed to review the order. The app is able to earn money by collecting a fixed margin per order.

Aim

My aim in analyzing this data is to understand customer order trends, and provide informed recommendations to improve business.

Data Cleaning

To clean the data, multiple steps were taken. Firstly, I had looked at the datatypes to see if they were accurate (ie. is the cuisine_type a categorical value?). Secondly, I removed all empty entries. Finally, I also edited the rating column for orders with a “Not Given” value, this was made NaN. Final table can be see in Table 1. In total, there are 1898 entries and 8 columns of data.

Table 1 - Cleaned Data Example

order_idcustomer_idrestaurant_namecuisine_typecost_of_the_orderday_of_the_weekratingfood_preparation_timedelivery_time
1477147337525HangawiKorean30.75WeekendNaN2520
1477685358141Blue Ribbon Sushi IzakayaJapanese12.08WeekendNaN2523
147707066393Cafe HabanaMexican12.23Weekday52328
1477334106968Blue Ribbon Fried ChickenAmerican29.20Weekend32515
147824976942Dirty Bird to GoAmerican11.59Weekday42524

Table 2 - Summarized Data analytics

countmeanstdmin25%50%75%max
cost_of_the_order1898.016.47.54.512.114.122.335.4
rating1162.04.30.73.04.05.05.05.0
food_preparation_time1898.027.34.620.023.027.03135.0
delivery_time1898.024.15.015.020.025.02.833.0

Data Exploration

Univariable Analysis

To understand the data better, I had a look at each variable to understand customer behaviours. To do this, I made histograms and boxplots of each column using Code 1 and Code 2 as an example.

Code 1:

Example Code for histogram and boxplots

Code 2:

Example Code for Delivery Time Plot

Using the above code, I was able to obtain a couple of important takeaways about customer behaviour.

For delivery time (Figure 1A), average delivery time was 24.1 minutes, where a majority of orders were under 25 minutes. In comparison, food preparation time (figure 1B) was a bit longer with an average of 27.3 minutes, where majority of orders took under 27 minutes. Finally, average price (figure 1C) was 16.4 dollars, such that majority were under 14.1. The average rating of all orders was 4.3 (Figure 1D).

With this data,

Figure 1

A) Delivery Time

Delivery Time

B) Food Preparation Time

Food Preparation Time

C) Order Cost

Orders Cost

D) Customer Rating

Customer Rating

The data reveals that American and Japanese cuisines are the most popular (Figure 2A), accounting for over half of the total orders, while Vietnamese, Southern, Spanish, and French cuisines are the least preferred. Additionally, 71.18% of orders occur on weekends (Figure 2B), indicating a strong preference for weekend dining, which businesses could leverage with targeted promotions. Lastly, most customers place only one order (Figure 2C), though a small subset of frequent customers makes multiple purchases. To maximize growth, businesses should focus on popular cuisines, weekend marketing, and implementing loyalty programs to retain and engage customers.

Figure 2

A) Orders per Cuisine

Orders Per Cuisine

B) Orders on weekdays vs. weekends

Orders per weekday

C) Orders per customer ID

Customer Orders

Multivariable Analysis

Based on the multivariate analysis in figure 3A and 3B, there doesn’t seem to be a significant trend between delivery time, food preparation time, price, and rating.

Figure 3

A) Heatmap

Multivariate Heatmap

B) Pairplot

Multivariate Pairplot

In my analysis of the delivery service data, I found consistent patterns across various metrics. Cost and preparation time remain remarkably stable, showing no significant variations across different cuisine types, days of the week, or order characteristics (Figure 4A, 4C). This suggests a level of operational consistency in pricing and food preparation. However, delivery time emerges as the standout metric, with a notable distinction between weekday and weekend performance (Figure 4B). Weekend deliveries are notably more efficient, averaging 22.4 minutes compared to weekday deliveries which stretch to 28.3 minutes. This difference could indicate potential operational challenges during weekdays, such as traffic, staffing constraints, or other logistical issues that impact the speed of food delivery. Despite the variations in delivery times, the underlying data reveals the service’s performance across different metrics.

Figure 4

A) Delivery Time

Delivery Subplots 1

_

Delivery Subplots 2

B) Food Preparation Time

Food Prep Subplots 1

_

Delivery Subplots 2

C) Order Cost

Order Costs Subplots 1

_

Order Costs Subplots 2

Conclusion and recommendations

As I dig into the apps performance, some patterns emerge that might surprise both the company and its customers. Weekend deliveries dominate the landscape, but there’s a difference in how long it takes to get your food depending on the day of the week. It seems like something’s going on with the delivery team – maybe they’re stretched thin during weekdays or struggling to keep up with demand. Even more concerning is that many customers try the service once and never come back, which is a red flag for customer satisfaction. On a brighter note, the business is finding its sweet spot with orders over $20, proving that customers are willing to spend a bit more. Perhaps most interestingly, this analysis showed that factors like delivery time and preparation don’t seem to directly impact customer ratings – a finding that could lead to improvements in how the service operates.

Based on these insights, I would propose a strategic approach to addressing the delivery service’s challenges. To combat the low customer retention rate, implementing targeted promotional campaigns that encourage customers to return after their initial order, such as providing attractive discount coupons for repeat users would help. Recognizing the significant drop in weekday orders, I suggest introducing compelling incentives like 10% discounts to motivate customers to place orders during traditionally slower weekdays. To boost the average order value, the company could offer 20% discounts for orders exceeding $30, which could not only increase revenue but also provide additional value to customers. Lastly, to address potential staffing and performance issues during weekdays, I recommend creating a compensation structure that offers higher salaries for delivery drivers working on weekdays, which could improve worker morale and help balance the current weekend-heavy delivery model. These targeted recommendations aim to transform the current operational challenges into opportunities for growth and improved customer satisfaction.

In the end, this was just an excuse for me to practice my data skills so I don’t forget them forever. Thanks for reading! 🫰