Contoso Retail Data Analysis Project

Introduction

This project analyzes the Contoso 100K retail dataset to uncover insights into sales performance, customer demographics, and product trends. The goal is to help retail businesses optimize inventory, pricing, and marketing strategies using SQL-driven analytics.

Background

Retail businesses face challenges in:

This project leverages the Contoso 100K database (simulating a global retail chain) to provide actionable insights through SQL queries.

Tools I Used

PostgreSQL SQL Git/GitHub

The Analysis

The project includes five key analyses:

1. Sales by Product Category

  • Identifies top-selling categories by revenue and quantity
  • Highlights underperforming segments for potential discounts or promotions

2. Customer Demographics (US Focus)

  • Breaks down customers by gender, occupation, and age
  • Uses ROLLUP for hierarchical summaries (e.g., gender × occupation)

3. Monthly Sales in USD

  • Converts sales to USD using exchange rate data
  • Tracks revenue trends over time to identify seasonal patterns

4. Price Optimization

  • Implements a 20% cost-based markup for products
  • Returns before/after price comparisons for validation

5. Store Performance Benchmarking

  • Compares store revenue against the national average
  • Flags underperforming locations for further investigation

What I Learned

Through this analysis, I discovered:

1. Revenue Drivers:

2. Customer Insights:

3. Pricing & Currency Effects:

4. Store Performance:

Conclusions

The analysis provides actionable recommendations for retail businesses:

1. Inventory Management:

2. Marketing Targeting:

3. Pricing Strategies:

4. Store Optimization:

Future Enhancements

  1. Temporal Analysis: Track sales before/after pricing changes.
  2. Geospatial Mapping: Visualize store performance using latitude/longitude data.
  3. Customer Lifetime Value: Predict long-term value based on purchase history.

How to Use This Project

  1. Clone the repository.
  2. Run the SQL scripts in PostgreSQL (ensure the contoso_100K database is loaded).
  3. Modify queries to test alternative scenarios (e.g., regional filters).
git clone https://github.com/Brtelfer/SQL-Contoso-100k-Analysis.git