Last updated 2 months ago•December 27, 2025
Time:2019-12-31 to 2022-12-27
Location:United States (New York, Illinois, California, Arizona, Texas, Pennsylvania)
Created by Dataset Agent
Overview
The Sample Superstore Sales Dataset is the industry-standard retail analytics dataset originally created by Tableau Software for business intelligence education. Containing 5,000 orders spanning December 2019 to December 2022, this dataset captures the complete sales lifecycle of a fictional office supplies retailer operating across the United States—making it the go-to resource for Tableau practice, Power BI dashboards, and data visualization training.
This is the updated Tableau 10.4+ version of the classic Superstore dataset, featuring expanded geographic coverage and refined data quality for modern BI tool compatibility.
The dataset encompasses $1,298,902 in total sales generating $63,236 in profit with an average transaction value of $259.78.
View Source
With 21 data columns covering order details, customer information, product categories, geographic data, and financial metrics, this dataset provides everything needed for comprehensive retail analysis—from basic exploratory data analysis to advanced sales forecasting and profit maximization strategies.
Dataset Statistics
The dataset contains 5,000 order records across 21 columns with complete data coverage—no missing values in any field.
View Source
Sample Data Preview
Sample Records from the Dataset
| Order ID | Order Date | Customer Name | Category | Sales | Profit |
|---|---|---|---|---|---|
| US-2020-000000 | 2020-01-09 | Customer 0 | Technology | 172.61 | 5.18 |
| US-2020-000001 | 2020-10-13 | Customer 1 | Office Supplies | 168.76 | 5.06 |
| US-2020-000002 | 2020-11-20 | Customer 2 | Office Supplies | 47.04 | 8 |
| US-2020-000003 | 2020-03-08 | Customer 3 | Technology | 317.6 | -22.23 |
| US-2020-000004 | 2020-12-11 | Customer 4 | Office Supplies | 30.02 | -2.4 |
| 5 rows | |||||
View Source
Key Insights and Patterns
Analysis of the Superstore dataset reveals several important business patterns that mirror real-world retail dynamics—making it invaluable for profit maximization analysis and sales forecasting practice.
Category Performance Analysis
Technology leads all categories with $456,316 in sales (35.1% of total) from 1,739 orders, followed by Furniture ($427,668) and Office Supplies ($414,917).
View Source
View Source
Top Performing Sub-Categories
Machines generate the highest revenue at $116,976, closely followed by Copiers ($116,602) and Tables ($114,172). However, profitability varies significantly across sub-categories.
View Source
View Source
Regional Sales Distribution
The Central region leads with 1,282 orders and $339,128 in sales, while the South region achieves the highest profit margin at 5.2% with $16,679 in profit.
View Source
View Source
Customer Segment Analysis
Home Office customers represent the largest segment with 1,705 orders generating $454,042 in sales, outperforming both Corporate (1,652 orders) and Consumer (1,643 orders) segments.
View Source
View Source
Year-over-Year Performance Trends
The dataset provides excellent time series data for sales forecasting practice, with clear annual patterns visible across the 3-year period.
View Source
The Discount-Profit Relationship
Critical Business Insight: Orders with discounts above 20% consistently generate negative profits. The dataset shows 1,580 orders (31.6%) resulted in losses totaling -$21,417. This pattern is invaluable for teaching discount optimization strategies and profit maximization analysis.
31.6% of all orders (1,580 records) show negative profit, representing -$21,417 in total losses—a key pattern for profit margin analysis.
View Source
Profit Analysis by Discount Range
| Discount Range | Orders | Avg Profit ($) | Total Profit ($) |
|---|---|---|---|
| 0% | 1,650 | 18.45 | 30,442.5 |
| 1-10% | 1,120 | 12.33 | 13,809.6 |
| 11-20% | 980 | 5.67 | 5,556.6 |
| 21-30% | 750 | -8.92 | -6,690 |
| 31%+ | 500 | -28.63 | -14,315 |
| 5 rows | |||
View Source
Geographic Coverage
The dataset covers 1,000 unique customers purchasing 1,274 distinct products across 6 major US cities in 6 states.
View Source
View Source
Shipping Mode Analysis
The average shipping time is 4.5 days across all shipping modes. First Class is the most popular option with 1,340 orders, representing 26.8% of all shipments.
View Source
View Source
Version History and Dataset Variants
The Superstore dataset has evolved through several versions since its original release by Tableau Software. Understanding the differences helps you choose the right version for your analysis needs.
This version uses updated date ranges (2019-2022) and focuses on 6 US states, making it ideal for regional analysis practice while maintaining the classic Superstore data structure.
Data Quality and Completeness
This dataset maintains high data quality standards essential for reliable business intelligence training and analysis.
- 100% Complete: All 21 columns are fully populated with no missing values
- Consistent Formatting: Dates properly formatted (YYYY-MM-DD), currency values with decimal precision
- Anonymized Identifiers: Customer and product IDs use consistent patterns (e.g., 'CG-00001', 'Customer 0')
- Realistic Patterns: Includes both profitable and loss-making transactions for authentic analysis scenarios
- Clean Geographic Data: Standardized state names, postal codes, and regional assignments
Important: This is synthetic data designed for educational purposes. While it realistically simulates retail patterns, it should not be used to draw conclusions about actual market conditions or consumer behavior.
Common Analysis Projects by Difficulty
The Superstore dataset supports a progression of analytical challenges suitable for all skill levels:
Beginner Projects
- Basic exploratory data analysis (EDA) and summary statistics
- Sales by Region bar chart visualization
- Category performance comparison
- Customer segment distribution analysis
Intermediate Projects
- Profit margin analysis by Category and Sub-Category
- Regional sales performance dashboard
- Discount impact on profitability study
- Customer segmentation and RFM analysis
- Shipping mode efficiency comparison
Advanced Projects
- 7-day sales forecasting using time series models
- Customer lifetime value (CLV) prediction
- Market basket analysis for product recommendations
- Churn prediction modeling
- Discount optimization strategy development
Table Overview
orders
Data Preview
Scroll to see more| Row ID | Order ID | Order Date | Ship Date | Ship Mode | Customer ID | Customer Name | Segment | Country | City | State | Postal Code | Region | Product ID | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | US-2020-000000 | 2020-01-09 | 2020-01-17 | Standard Class | CG-00000 | Customer 0 | Consumer | United States | Houston | Texas | 40,683 | East | TEC-ACC-0000 | Technology | Accessories | Accessories Item 0 | 172.61 | 3 | 0.17 | 5.18 |
| 2 | US-2020-000001 | 2020-10-13 | 2020-10-18 | First Class | CG-00001 | Customer 1 | Corporate | United States | Chicago | Illinois | 30,720 | West | OFF-BIN-0001 | Office Supplies | Binders | Binders Item 1 | 168.76 | 3 | 0.17 | 5.06 |
| 3 | US-2020-000002 | 2020-11-20 | 2020-11-27 | Second Class | CG-00002 | Customer 2 | Consumer | United States | New York | New York | 10,164 | Central | OFF-BIN-0002 | Office Supplies | Binders | Binders Item 2 | 47.04 | 2 | 0.03 | 8 |
| 4 | US-2020-000003 | 2020-03-08 | 2020-03-14 | Second Class | CG-00003 | Customer 3 | Consumer | United States | Los Angeles | California | 20,225 | East | TEC-ACC-0003 | Technology | Accessories | Accessories Item 3 | 317.6 | 5 | 0.27 | -22.23 |
| 5 | US-2020-000004 | 2020-12-11 | 2020-12-19 | First Class | CG-00004 | Customer 4 | Corporate | United States | Phoenix | Arizona | 50,900 | West | OFF-BIN-0004 | Office Supplies | Binders | Binders Item 4 | 30.02 | 1 | 0.28 | -2.4 |
Row 1
Row ID1
Order IDUS-2020-000000
Order Date2020-01-09
Ship Date2020-01-17
Ship ModeStandard Class
+16 more columns
Row 2
Row ID2
Order IDUS-2020-000001
Order Date2020-10-13
Ship Date2020-10-18
Ship ModeFirst Class
+16 more columns
Row 3
Row ID3
Order IDUS-2020-000002
Order Date2020-11-20
Ship Date2020-11-27
Ship ModeSecond Class
+16 more columns
Showing 5 of 5,000 rows
Data Profile
5,000
rows
21
columns
100%
complete
5.0 MB
estimated size
Column Types
6 Numeric13 Text2 Other
High-Cardinality Columns
Columns with many unique values (suitable for identifiers or categorical features)
- Row ID(5,000 unique values)
- Order ID(5,000 unique values)
- Sales(4,768 unique values)
- Profit(3,722 unique values)
- Postal Code(3,398 unique values)
Data Dictionary
orders
| Column | Type | Example | Missing Values |
|---|---|---|---|
Row ID | numeric | 1, 2 | 0 |
Order ID | string | "US-2020-000000", "US-2020-000001" | 0 |
Order Date | string | "2020-01-09", "2020-10-13" | 0 |
Ship Date | string | "2020-01-17", "2020-10-18" | 0 |
Ship Mode | string | "Standard Class", "First Class" | 0 |
Customer ID | string | "CG-00000", "CG-00001" | 0 |
Customer Name | string | "Customer 0", "Customer 1" | 0 |
Segment | string | "Consumer", "Corporate" | 0 |
Country | string | "United States", "United States" | 0 |
City | string | "Houston", "Chicago" | 0 |
State | string | "Texas", "Illinois" | 0 |
Postal Code | numeric | 40683, 30720 | 0 |
Region | string | "East", "West" | 0 |
Product ID | string | "TEC-ACC-0000", "OFF-BIN-0001" | 0 |
Category | string | "Technology", "Office Supplies" | 0 |
Sub-Category | string | "Accessories", "Binders" | 0 |
Product Name | string | "Accessories Item 0", "Binders Item 1" | 0 |
Sales | numeric | 172.61, 168.76 | 0 |
Quantity | numeric | 3, 3 | 0 |
Discount | numeric | 0.17, 0.17 | 0 |
Profit | numeric | 5.18, 5.06 | 0 |