Menu

My Amazon Spending

Analysis of personal Amazon purchases made in 2020, utilizing a Tableau dashboard to explore and visualize various aspects of the data.

Skills Used

Challenge

What began as a seemingly straightforward data acquisition task quickly revealed itself as a multi-faceted challenge. I initially expected Amazon's data export feature to provide instant access to my purchasing history. Instead, I discovered that requesting comprehensive historical data required a formal process that took approximately two weeks to complete.

This delay highlighted the complexity of working with real-world data from major platforms that handle vast amounts of user information. Once I finally received the data, I encountered another significant obstacle: Amazon's complex nested subcategory classification system rendered the 'Product Category' column largely ineffective for meaningful analysis. Each product was categorized using Amazon's internal hierarchy, which didn't align with the intuitive groupings needed for insightful visualization.

To overcome this limitation, I developed a systematic approach to manually recategorize each product purchase, creating a custom taxonomy that better served my analytical objectives. This meticulous data cleaning process, while time-consuming, was essential to ensure the dataset would yield valuable insights when visualized in Tableau.

After cleaning and restructuring the data, I employed Tableau to create interactive visualizations that revealed spending patterns across different product categories, time periods, and price ranges. I focused on designing dashboards that would tell a coherent story about collective purchasing behaviors while maintaining an aesthetically pleasing and intuitive user interface.

Results

The analysis yielded several interesting insights about shared Amazon usage patterns. It's important to note that the account I analyzed is a shared Amazon Prime subscription used by 11 people in total (myself included). The cumulative spending of approximately $4,500 represents our collective purchasing behavior during an unusual period - the COVID-19 lockdown - when online shopping became a primary means of acquiring goods for many households.

The visualization revealed distinct usage patterns among the account members, with some friends utilizing the shared account significantly more than others, including myself. This dynamic created an interesting social dimension to the data analysis.

The project timeline coincided with my first custom PC build, which is clearly reflected in the 'Most Popular Categories' chart where computer components and peripherals feature prominently. This personal context adds another layer to understanding the spending distribution.

Process

Process

Process

01

Understanding Requirements

Given that I had such a large dataset, it was important to hone in on what exactly I wanted to achieve. Understanding the requirements beforehand made it easier to narrow my scope into the relevant parts of the data

02

Data Cleaning

Removal of unnecessary columns and missing values. Standardizing the report for tabular analysis

03

Exploratory Data Analysis

Created some basic charts to understand the distribution of the data and identify outliers. I also found errors in my data entry during the categorization process in some rows (i.e. Books was spelled Bookss)

04

Data Visualization

Designed the overall template in Figma to ensure the dashboard follows design principles. Calculated custom measures within Tableau to better understand spending

We conducted user interviews, surveys, and analyzed in-app analytics to understand the pain points and user needs. We also studied competitor apps and industry trends to gather insights

01

Understanding Requirements

Given that I had such a large dataset, it was important to hone in on what exactly I wanted to achieve. Understanding the requirements beforehand made it easier to narrow my scope into the relevant parts of the data

02

Data Cleaning

Removal of unnecessary columns and missing values. Standardizing the report for tabular analysis

03

Exploratory Data Analysis

Created some basic charts to understand the distribution of the data and identify outliers. I also found errors in my data entry during the categorization process in some rows (i.e. Books was spelled Bookss)

04

Data Visualization

Designed the overall template in Figma to ensure the dashboard follows design principles. Calculated custom measures within Tableau to better understand spending

We conducted user interviews, surveys, and analyzed in-app analytics to understand the pain points and user needs. We also studied competitor apps and industry trends to gather insights

01

Understanding Requirements

Given that I had such a large dataset, it was important to hone in on what exactly I wanted to achieve. Understanding the requirements beforehand made it easier to narrow my scope into the relevant parts of the data

02

Data Cleaning

Removal of unnecessary columns and missing values. Standardizing the report for tabular analysis

03

Exploratory Data Analysis

Created some basic charts to understand the distribution of the data and identify outliers. I also found errors in my data entry during the categorization process in some rows (i.e. Books was spelled Bookss)

04

Data Visualization

Designed the overall template in Figma to ensure the dashboard follows design principles. Calculated custom measures within Tableau to better understand spending

We conducted user interviews, surveys, and analyzed in-app analytics to understand the pain points and user needs. We also studied competitor apps and industry trends to gather insights

01

Understanding Requirements

Given that I had such a large dataset, it was important to hone in on what exactly I wanted to achieve. Understanding the requirements beforehand made it easier to narrow my scope into the relevant parts of the data

02

Data Cleaning

Removal of unnecessary columns and missing values. Standardizing the report for tabular analysis

03

Exploratory Data Analysis

Created some basic charts to understand the distribution of the data and identify outliers. I also found errors in my data entry during the categorization process in some rows (i.e. Books was spelled Bookss)

04

Data Visualization

Designed the overall template in Figma to ensure the dashboard follows design principles. Calculated custom measures within Tableau to better understand spending

We conducted user interviews, surveys, and analyzed in-app analytics to understand the pain points and user needs. We also studied competitor apps and industry trends to gather insights