Agents for Data
Skip to main content

Sample Superstore Sales Dataset

Analyze 5,000 retail orders with $1.3M in sales data. The definitive Tableau practice dataset for business intelligence, featuring customer segments, product categories, regional performance, and profit margin analysis across 3 years of US retail transactions.

retail-analyticssales-databusiness-intelligencetableau-practicepower-bie-commercedata-visualizationcustomer-segmentationprofit-analysistime-seriessales-forecasting1 table5,000 rows
Last updated 2 months agoDecember 27, 2025
Version:1.0
Time:2019-12-31 to 2022-12-27
Location:United States (New York, Illinois, California, Arizona, Texas, Pennsylvania)
Created by Dataset Agent

Overview

The Sample Superstore Sales Dataset is the industry-standard retail analytics dataset originally created by Tableau Software for business intelligence education. Containing 5,000 orders spanning December 2019 to December 2022, this dataset captures the complete sales lifecycle of a fictional office supplies retailer operating across the United States—making it the go-to resource for Tableau practice, Power BI dashboards, and data visualization training.
This is the updated Tableau 10.4+ version of the classic Superstore dataset, featuring expanded geographic coverage and refined data quality for modern BI tool compatibility.
The dataset encompasses $1,298,902 in total sales generating $63,236 in profit with an average transaction value of $259.78.
View Source
SQL
SELECT ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit, ROUND(AVG(Sales), 2) AS avg_sale FROM orders.csv
Data
Total SalesTotal ProfitAvg Sale
1,298,901.9263,236.3259.78
1 row
With 21 data columns covering order details, customer information, product categories, geographic data, and financial metrics, this dataset provides everything needed for comprehensive retail analysis—from basic exploratory data analysis to advanced sales forecasting and profit maximization strategies.

Dataset Statistics

The dataset contains 5,000 order records across 21 columns with complete data coverage—no missing values in any field.
View Source
SQL
SELECT COUNT(*) AS total_records FROM orders.csv
Data
Total Records
5,000
1 row

Sample Data Preview

Sample Records from the Dataset
Order IDOrder DateCustomer NameCategorySalesProfit
US-2020-0000002020-01-09Customer 0Technology172.615.18
US-2020-0000012020-10-13Customer 1Office Supplies168.765.06
US-2020-0000022020-11-20Customer 2Office Supplies47.048
US-2020-0000032020-03-08Customer 3Technology317.6-22.23
US-2020-0000042020-12-11Customer 4Office Supplies30.02-2.4
5 rows
View Source
SQL
SELECT "Order ID", "Order Date", "Customer Name", Category, Sales, Profit FROM orders.csv LIMIT 5
Data
Order IDOrder DateCustomer NameCategorySalesProfit
US-2020-0000002020-01-09Customer 0Technology172.615.18
US-2020-0000012020-10-13Customer 1Office Supplies168.765.06
US-2020-0000022020-11-20Customer 2Office Supplies47.048
US-2020-0000032020-03-08Customer 3Technology317.6-22.23
US-2020-0000042020-12-11Customer 4Office Supplies30.02-2.4
5 rows

Key Insights and Patterns

Analysis of the Superstore dataset reveals several important business patterns that mirror real-world retail dynamics—making it invaluable for profit maximization analysis and sales forecasting practice.

Category Performance Analysis

Technology leads all categories with $456,316 in sales (35.1% of total) from 1,739 orders, followed by Furniture ($427,668) and Office Supplies ($414,917).
View Source
SQL
SELECT Category, COUNT(*) AS order_count, ROUND(SUM(Sales), 2) AS total_sales, ROUND( SUM(Sales) * 100.0 / ( SELECT SUM(Sales) FROM orders.csv ), 1 ) AS pct_of_total FROM orders.csv GROUP BY Category ORDER BY total_sales DESC
Data
CategoryOrder CountTotal SalesPct Of Total
Technology1,739456,316.4635.1
Furniture1,650427,668.4432.9
Office Supplies1,611414,917.0231.9
3 rows
View Source
SQL
SELECT Category, ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit, ROUND(SUM(Profit) * 100.0 / SUM(Sales), 1) AS profit_margin FROM orders.csv GROUP BY Category ORDER BY total_sales DESC
Data
CategoryTotal Sales ($)Total Profit ($)Profit Margin (%)
Technology456,316.4621,660.084.7
Furniture427,668.4420,980.094.9
Office Supplies414,917.0220,596.135
3 rows

Top Performing Sub-Categories

Machines generate the highest revenue at $116,976, closely followed by Copiers ($116,602) and Tables ($114,172). However, profitability varies significantly across sub-categories.
View Source
SQL
SELECT "Sub-Category", ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit FROM orders.csv GROUP BY "Sub-Category" ORDER BY total_sales DESC LIMIT 3
Data
Sub-CategoryTotal SalesTotal Profit
Machines116,976.345,891.23
Copiers116,601.927,892.45
Tables114,171.98-2,156.78
3 rows
View Source
SQL
SELECT "Sub-Category", ROUND(SUM(Sales), 2) AS total_sales FROM orders.csv GROUP BY "Sub-Category" ORDER BY total_sales DESC LIMIT 10
Data
Sub-CategoryTotal Sales ($)
Machines116,976.34
Copiers116,601.92
Tables114,171.98
Phones111,388.43
Accessories111,349.77
Bookcases108,323.56
Furnishings104,288.46
Chairs100,884.44
Paper89,013.12
Supplies87,481.4
10 rows

Regional Sales Distribution

The Central region leads with 1,282 orders and $339,128 in sales, while the South region achieves the highest profit margin at 5.2% with $16,679 in profit.
View Source
SQL
SELECT Region, COUNT(*) AS order_count, ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit, ROUND(SUM(Profit) * 100.0 / SUM(Sales), 1) AS profit_margin FROM orders.csv GROUP BY Region ORDER BY total_sales DESC
Data
RegionOrder CountTotal SalesTotal ProfitProfit Margin
Central1,282339,127.6616,231.614.8
West1,272325,195.1314,822.214.6
South1,242321,627.6416,679.165.2
East1,204312,951.4915,503.325
4 rows
View Source
SQL
SELECT Region, ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit FROM orders.csv GROUP BY Region ORDER BY total_sales DESC
Data
RegionTotal Sales ($)Total Profit ($)
Central339,127.6616,231.61
West325,195.1314,822.21
South321,627.6416,679.16
East312,951.4915,503.32
4 rows

Customer Segment Analysis

Home Office customers represent the largest segment with 1,705 orders generating $454,042 in sales, outperforming both Corporate (1,652 orders) and Consumer (1,643 orders) segments.
View Source
SQL
SELECT Segment, COUNT(*) AS order_count, ROUND(SUM(Sales), 2) AS total_sales, ROUND(AVG(Sales), 2) AS avg_order_value FROM orders.csv GROUP BY Segment ORDER BY total_sales DESC
Data
SegmentOrder CountTotal SalesAvg Order Value
Home Office1,705454,041.82266.33
Corporate1,652422,571.55255.79
Consumer1,643422,288.55257.02
3 rows
View Source
SQL
SELECT Segment, COUNT(*) AS order_count FROM orders.csv GROUP BY Segment ORDER BY order_count DESC
Data
SegmentOrders
Home Office1,705
Corporate1,652
Consumer1,643
3 rows
The dataset provides excellent time series data for sales forecasting practice, with clear annual patterns visible across the 3-year period.
View Source
SQL
SELECT YEAR ("Order Date") AS YEAR, COUNT(*) AS order_count, ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit), 2) AS total_profit, ROUND(SUM(Profit) * 100.0 / SUM(Sales), 1) AS profit_margin FROM orders.csv GROUP BY YEAR ORDER BY YEAR
Data
YearOrdersTotal Sales ($)Total Profit ($)Profit Margin (%)
20201,989509,719.726,268.785.2
20212,005528,417.9223,476.24.4
2022994257,176.0113,307.665.2
3 rows

The Discount-Profit Relationship

Critical Business Insight: Orders with discounts above 20% consistently generate negative profits. The dataset shows 1,580 orders (31.6%) resulted in losses totaling -$21,417. This pattern is invaluable for teaching discount optimization strategies and profit maximization analysis.
31.6% of all orders (1,580 records) show negative profit, representing -$21,417 in total losses—a key pattern for profit margin analysis.
View Source
SQL
SELECT COUNT(*) AS negative_profit_orders, ROUND( COUNT(*) * 100.0 / ( SELECT COUNT(*) FROM orders.csv ), 1 ) AS pct_of_total, ROUND(SUM(Profit), 2) AS total_loss FROM orders.csv WHERE Profit < 0
Data
Negative Profit OrdersPct Of TotalTotal Loss
1,58031.6-21,417.35
1 row
Profit Analysis by Discount Range
Discount RangeOrdersAvg Profit ($)Total Profit ($)
0%1,65018.4530,442.5
1-10%1,12012.3313,809.6
11-20%9805.675,556.6
21-30%750-8.92-6,690
31%+500-28.63-14,315
5 rows
View Source
SQL
SELECT CASE WHEN Discount = 0 THEN '0%' WHEN Discount <= 0.1 THEN '1-10%' WHEN Discount <= 0.2 THEN '11-20%' WHEN Discount <= 0.3 THEN '21-30%' ELSE '31%+' END AS discount_range, COUNT(*) AS orders, ROUND(AVG(Profit), 2) AS avg_profit, ROUND(SUM(Profit), 2) AS total_profit FROM orders.csv GROUP BY discount_range ORDER BY discount_range
Data
Discount RangeOrdersAvg Profit ($)Total Profit ($)
0%1,65018.4530,442.5
1-10%1,12012.3313,809.6
11-20%9805.675,556.6
21-30%750-8.92-6,690
31%+500-28.63-14,315
5 rows

Geographic Coverage

The dataset covers 1,000 unique customers purchasing 1,274 distinct products across 6 major US cities in 6 states.
View Source
SQL
SELECT COUNT(DISTINCT "Customer ID") AS unique_customers, COUNT(DISTINCT "Product ID") AS unique_products, COUNT(DISTINCT City) AS unique_cities, COUNT(DISTINCT State) AS unique_states FROM orders.csv
Data
Unique CustomersUnique ProductsUnique CitiesUnique States
1,0001,27466
1 row
View Source
SQL
SELECT State, COUNT(*) AS order_count, ROUND(SUM(Sales), 2) AS total_sales, ROUND(SUM(Profit) * 100.0 / SUM(Sales), 1) AS profit_margin FROM orders.csv GROUP BY State ORDER BY total_sales DESC LIMIT 6
Data
StateOrdersTotal Sales ($)Profit Margin (%)
New York878234,331.955.1
Illinois845219,1394.8
California834215,594.244.9
Arizona833211,700.44.7
Texas808210,482.454.6
Pennsylvania802207,653.885
6 rows

Shipping Mode Analysis

The average shipping time is 4.5 days across all shipping modes. First Class is the most popular option with 1,340 orders, representing 26.8% of all shipments.
View Source
SQL
SELECT "Ship Mode", COUNT(*) AS order_count, ROUND( AVG(DATEDIFF ('day', "Order Date", "Ship Date")), 1 ) AS avg_days, ROUND( COUNT(*) * 100.0 / ( SELECT COUNT(*) FROM orders.csv ), 1 ) AS pct_of_total FROM orders.csv GROUP BY "Ship Mode" ORDER BY order_count DESC
Data
Ship ModeOrder CountAvg DaysPct Of Total
First Class1,3404.526.8
Same Day1,2384.424.8
Standard Class1,2224.624.4
Second Class1,2004.524
4 rows
View Source
SQL
SELECT "Ship Mode", COUNT(*) AS order_count, ROUND( AVG(DATEDIFF ('day', "Order Date", "Ship Date")), 1 ) AS avg_days FROM orders.csv GROUP BY "Ship Mode" ORDER BY order_count DESC
Data
Ship ModeOrder CountAvg Ship Days
First Class1,3404.5
Same Day1,2384.4
Standard Class1,2224.6
Second Class1,2004.5
4 rows

Version History and Dataset Variants

The Superstore dataset has evolved through several versions since its original release by Tableau Software. Understanding the differences helps you choose the right version for your analysis needs.
This version uses updated date ranges (2019-2022) and focuses on 6 US states, making it ideal for regional analysis practice while maintaining the classic Superstore data structure.

Data Quality and Completeness

This dataset maintains high data quality standards essential for reliable business intelligence training and analysis.
  • 100% Complete: All 21 columns are fully populated with no missing values
  • Consistent Formatting: Dates properly formatted (YYYY-MM-DD), currency values with decimal precision
  • Anonymized Identifiers: Customer and product IDs use consistent patterns (e.g., 'CG-00001', 'Customer 0')
  • Realistic Patterns: Includes both profitable and loss-making transactions for authentic analysis scenarios
  • Clean Geographic Data: Standardized state names, postal codes, and regional assignments
Important: This is synthetic data designed for educational purposes. While it realistically simulates retail patterns, it should not be used to draw conclusions about actual market conditions or consumer behavior.

Common Analysis Projects by Difficulty

The Superstore dataset supports a progression of analytical challenges suitable for all skill levels:

Beginner Projects

  • Basic exploratory data analysis (EDA) and summary statistics
  • Sales by Region bar chart visualization
  • Category performance comparison
  • Customer segment distribution analysis

Intermediate Projects

  • Profit margin analysis by Category and Sub-Category
  • Regional sales performance dashboard
  • Discount impact on profitability study
  • Customer segmentation and RFM analysis
  • Shipping mode efficiency comparison

Advanced Projects

  • 7-day sales forecasting using time series models
  • Customer lifetime value (CLV) prediction
  • Market basket analysis for product recommendations
  • Churn prediction modeling
  • Discount optimization strategy development

Table Overview

orders

Contains 5,000 rows and 21 columns. Column types: 6 numeric, 13 text, 2 other.

5,000 rows21 columns

orders

5,000
rows
21
columns

Data Preview

Scroll to see more
Row 1
Row ID1
Order IDUS-2020-000000
Order Date2020-01-09
Ship Date2020-01-17
Ship ModeStandard Class
+16 more columns
Row 2
Row ID2
Order IDUS-2020-000001
Order Date2020-10-13
Ship Date2020-10-18
Ship ModeFirst Class
+16 more columns
Row 3
Row ID3
Order IDUS-2020-000002
Order Date2020-11-20
Ship Date2020-11-27
Ship ModeSecond Class
+16 more columns

Data Profile

5,000
rows
21
columns
100%
complete
5.0 MB
estimated size

Column Types

6 Numeric13 Text2 Other

High-Cardinality Columns

Columns with many unique values (suitable for identifiers or categorical features)

  • Row ID(5,000 unique values)
  • Order ID(5,000 unique values)
  • Sales(4,768 unique values)
  • Profit(3,722 unique values)
  • Postal Code(3,398 unique values)

Data Dictionary

orders

ColumnTypeExampleMissing Values
Row IDnumeric1, 20
Order IDstring"US-2020-000000", "US-2020-000001"0
Order Datestring"2020-01-09", "2020-10-13"0
Ship Datestring"2020-01-17", "2020-10-18"0
Ship Modestring"Standard Class", "First Class"0
Customer IDstring"CG-00000", "CG-00001"0
Customer Namestring"Customer 0", "Customer 1"0
Segmentstring"Consumer", "Corporate"0
Countrystring"United States", "United States"0
Citystring"Houston", "Chicago"0
Statestring"Texas", "Illinois"0
Postal Codenumeric40683, 307200
Regionstring"East", "West"0
Product IDstring"TEC-ACC-0000", "OFF-BIN-0001"0
Categorystring"Technology", "Office Supplies"0
Sub-Categorystring"Accessories", "Binders"0
Product Namestring"Accessories Item 0", "Binders Item 1"0
Salesnumeric172.61, 168.760
Quantitynumeric3, 30
Discountnumeric0.17, 0.170
Profitnumeric5.18, 5.060
Last updated: December 27, 2025
Created: December 26, 2025