Elist is a sample e-commerce company that sells popular electronics to customers around the world.
In this project, I analyzed a dataset to investigate trends and growth rates in revenue, average order value (AOV), product popularity, marketing channels, refund rates, and shipping efficiency. Additionally, I closely examined their recently launched loyalty program to assess its effectiveness in overall sales, AOV, and total orders.
This project consists of four parts:
Part 1: Trends
~ Using Excel, I take a high-level look at some of the broad trends within the data to see what notable insights can be pulled out for the finance and product team.
Part 2: Targeted Insights
~ With the aid of SQL, I pull out more targeted insights for the finance and marketing team, highlighting things like MacBook sales, refund rates, and best performing marketing channels.
Part 3: Visualizations
~ Leveraging Tableau, I create a dashboard for the finance and product teams (as well as the sales and operations teams to a lesser extent) to monitor metrics of interest on an ongoing basis.
Part 4: Recommendations & Next Step
~ Suggestions on things to take a look at going forward.
The data I'll be using is spread out across four tables and consists of information on orders, order statuses, customers, and geographic information.
Interestingly, the first year of the pandemic (2020) saw the highest average order value of $298.
Total orders were down to 19K in 2022 (from 31k in 2021).
Revenue was down 44% year-over-year in 2022.
The concerning sales numbers of 2022 appear to reflect broader trends in the e-commerce industry. Factors such as the economic recession, an oversaturated market, decreased effectiveness of marketing campaigns, and supply chain issues were identified by Statista as contributors to an unprecedented forecasted shrink in e-commerce revenue for 2022. Particularly notable were changes in digital marketing practices: the privacy update in Apple's iOS 14.5, alongside a 47% increase in Facebook advertising costs, seems to have had a profound impact on e-commerce merchants.
Seasonality
Winter and spring tend to perform better, most likely due to holiday sales and special promotions.
There's a noticeable sales uptick in the first half of 2020, most likely due to the pandemic.
Products
Airpods have consistently been our best-selling product in terms of total orders.
In 2022, Airpods and a gaming monitor made up a whopping 68% of all orders.
Bose Soundsport Headphones have consistently performed poorly. We've only sold 25 pairs over 3 years.
Loyalty Program
We recommend continuing to expand this. When the program started in 2019, members were spending an average of $29 less than non-members. In 2022, loyalty program members now spend an average of $34 more than non-members per purchase.
By the end of 2022, loyalty program members generated 27% more revenue compared to non-members. This is largely attributed to their higher receptiveness to our marketing efforts. The fact that they are loyalty members makes them more open to receiving emails from us, as demonstrated by spending 217% more than non-members through that channel in 2022. It's worth noting that email marketing boasts a 47x ROI and has remained relatively unaffected by changes in the marketing landscape.
Technical Analysis: For this section, I used Pivot Tables, conditional formatting, aggregation functions, and statistical analysis to clean, analyze, and summarize my insights for the finance and product teams.
Here is an example of the pivot table used for seasonality insights:
In 2021, the lowest amount of returns for Apple products was 6 (in November). The highest was 33 (in March).
Across all years, Macbook Airs had the highest refund rate at 4.2% followed by ThinkPads (3.8%) and iPhones (3.5%).
Account creation methods (Jan-Feb 2022)
Accounts created on tablets had the highest average order value at $287 (but only 25 purchases were made on tablets).
Desktop was the account creation method that led to the most new customers, with 2,487, which is more than three times the number of new customers generated by the next closest method, mobile, with 701.
Time to purchase (all years)
Average of 51 days between account creation and first purchase.
Marketing channels (all years)
Direct - highest number of orders.
Emails - second highest number of orders.
Technical Analysis: For this analysis, I used SQL and BigQuery. In regards to SQL, I used aggregation functions, window functions, joins, filtering, common table expressions (CTEs), and in a couple instances the QUALIFY clause to use row_number() to filter results.
Here is an example of one query result that used the aforementioned qualify clause. It's the result of a query that first creates a brand category and totals the amount of refunds per month for each brand, filtering for the year 2020. After that it returns the month with the most refunds and its corresponding number of refunds:
Airpods, gaming monitors, and charging packs have accounted for over 80% of all orders from 2019-2022.
Every product except webcams saw its total orders peak around the start of the pandemic. Paradoxically, total orders for webcams rose in the following years. This is something we should investigate further.
Shipping times
iPhones and Bose headphones have, relative to all other products, incredibly high variability when it comes to shipping times. One might wonder if this is a chicken and egg situation, considering their sales. Are the shipping times for iPhones and Bose headphones all over the place because we rarely sell them (and consequently don't have much stock on hand)? Or do we rarely sell iPhones and Bose headphones because customers find our shipping times to be too unpredictable?
Sales
From 2019-2022, gaming monitors have consistently generated the highest total sales revenue, with the exception of a brief period at the end of 2020.
Even though charging packs consistently make up 15-24% of all orders, they’ve never accounted for more than 3% of sales in terms of total dollar value.
Apart from Airpods, computer hardware accounts for the majority of our revenue.
Technical Analysis: In this section, I primarily used Tableau. SQL and BigQuery were also used to create a dataset for Tableau. My Tableau dashboard incorporates filters, tables, line graphs, and area charts.
You can find the SQL code for the dataset I created in BigQuery here.
Here is a peek of what the Tableau dashboard for this part of my analysis looks like:
An interactive version of the above Tableau dashboard can be found here.
Prioritize Expansion of the Loyalty Program: Due to the current difficulty of marketing through channels outside of email, it's essential that this becomes a high priority. One retailer whose program we should look at is Altitude Sports. Altitude Sports typically employs a strategy of using very competitive pricing during high sales seasons like Black Friday to attract new potential customers. They then promote their membership to them by offering it for half price and doubling its discount for a limited time period. Additionally, throughout the year, they give members other benefits like member-only discounts and store credits to encourage them to spend money during non-peak seasons.
Focus on Broadening Our Reach into Revenue-Generating Product Categories: Since computer hardware represents such an outsized portion of total sales in terms of dollars, consider exploring the possibility of adding additional computer hardware to our product assortment.
Integrate Key Financial Metrics into Analysis: Incorporate customer acquisition costs and wholesale or costs of goods sold into analysis to get an understanding of customer lifetime value and what products perform best in terms of net profits. Insights from the latter point can guide us in identifying products that could help reverse our declining sales.
Take a Closer Look at Shipping Fluctuations: Investigate why shipping times are so volatile for iPhones and Bose headphones. Additionally, consider if the volatility, combined with poor sales numbers, means it's better to replace those products with something that might appeal more to our most valuable customer segments.
Please note that as a part of the data cleaning stage, 15,200 entries were identified as duplicates in the initial Excel data. To maintain the integrity and reliability of my insights, these duplicates were removed from the final dataset used for my analysis. Therefore, the original list of entries, which initially consisted of 108,127 entries, was reduced to 92,927 after removing the identified duplicates. This decision was essential to ensure the validity of the observations made for this project.