Last updated 3 weeks ago•December 27, 2025
Time:2018-12-31 to 2024-12-30
Location:United States
Created by Dataset Agent
Overview
The S&P 500 Stock Market Dataset provides comprehensive daily trading data for 10 of the most influential companies in the U.S. stock market. This curated sample spans technology giants, financial institutions, and healthcare leaders, offering researchers and analysts a focused view into market dynamics over a six-year period that includes the historic COVID-19 crash, the post-pandemic rally, and the AI-driven market surge of 2023-2024.
This dataset contains 15,660 daily price records spanning from December 31, 2018 to December 30, 2024.
View Source
The dataset covers 10 major S&P 500 companies with 1,566 trading days per stock, representing complete market coverage without gaps.
View Source
Understanding S&P 500 Data Fields
Each record contains standard OHLCV (Open, High, Low, Close, Volume) data that forms the foundation of technical analysis and quantitative trading strategies. Understanding these fields is critical for accurate analysis:
Data Quality Note: Prices in this dataset are historical trading prices, not adjusted for stock splits or dividends. For accurate long-term return calculations, apply split adjustments for AAPL (4:1 split Aug 2020), TSLA (5:1 split Aug 2020, 3:1 split Aug 2022), GOOGL (20:1 split Jul 2022), and AMZN (20:1 split Jun 2022). Pre/post market data is not included.
S&P 500 Index Composition
The S&P 500 is a market-capitalization-weighted index maintained by S&P Dow Jones Indices. Despite its name, the index actually contains 503 stocks due to companies with multiple share classes (e.g., both GOOGL and GOOG are included for Alphabet). This dataset samples 10 representative companies across key sectors:
Companies Included by Sector
| Symbol | Company Name | GICS Sector | Trading Days |
|---|---|---|---|
| AAPL | Apple Inc. | Information Technology | 1,566 |
| MSFT | Microsoft Corporation | Information Technology | 1,566 |
| GOOGL | Alphabet Inc. (Class A) | Communication Services | 1,566 |
| AMZN | Amazon.com Inc. | Consumer Discretionary | 1,566 |
| META | Meta Platforms Inc. | Communication Services | 1,566 |
| NVDA | NVIDIA Corporation | Information Technology | 1,566 |
| TSLA | Tesla Inc. | Consumer Discretionary | 1,566 |
| JPM | JPMorgan Chase & Co. | Financials | 1,566 |
| JNJ | Johnson & Johnson | Healthcare | 1,566 |
| V | Visa Inc. | Financials | 1,566 |
| 10 rows | |||
View Source
Sector Distribution
Key Market Insights
Analysis of this dataset reveals significant performance variations among the included stocks over the six-year period, with technology companies showing the strongest growth trajectories despite higher volatility.
View Source
Apple (AAPL) delivered the highest total return at 178.6%, followed by Microsoft (MSFT) at 168.59%, significantly outperforming the broader market.
View Source
Price Analysis
View Source
Apple (AAPL) reached the highest closing price of $688.77, while Johnson & Johnson (JNJ) maintained the most stable price profile with an average of $75.46 and the lowest volatility.
View Source
Market Trends Over Time
View Source
The data shows a consistent upward trend in average closing prices across the portfolio, with the average price increasing from $161.95 in 2019 to $242.90 in 2024, representing approximately 50% growth over the period despite significant drawdowns during the COVID-19 crash and 2022 tech correction.
Trading Volume Analysis
The dataset captures an average daily trading volume of 26 million shares across all stocks, with maximum single-day volume reaching 51 million shares.
View Source
View Source
Volatility Analysis
Stock volatility, measured by standard deviation of closing prices, reveals significant differences in price stability across the portfolio. Higher volatility stocks offer greater profit potential but also increased risk:
View Source
Apple (AAPL) exhibits the highest price volatility with a standard deviation of $128.38, while Johnson & Johnson (JNJ) shows the most stability at $10.76 — making JNJ suitable for conservative strategies and AAPL for momentum-based approaches.
View Source
Historical Context: Major Market Events
This dataset spans a particularly eventful period in market history, capturing several significant economic events that provide valuable data for studying market behavior during both normal conditions and extreme volatility:
The dataset includes the historic COVID-19 market crash of March 2020, where the S&P 500 fell 34% in 23 trading days, and the subsequent recovery. This provides exceptional data for studying market behavior during extreme volatility events, testing drawdown protection strategies, and analyzing V-shaped recovery patterns.
Why This Dataset
This dataset offers several advantages over alternative data sources for stock market analysis:
Sample Data Preview
Sample Data Records
| Date | Open | High | Low | Close | Volume | Symbol |
|---|---|---|---|---|---|---|
| 2018-12-31 | 242.13 | 242.16 | 241.18 | 242.13 | 25,879,561 | AAPL |
| 2019-01-01 | 242.43 | 245.36 | 239.65 | 243.23 | 44,464,946 | AAPL |
| 2019-01-02 | 244.62 | 248.29 | 243.19 | 246.22 | 31,592,282 | AAPL |
| 2019-01-03 | 246.09 | 247.75 | 245.76 | 247.26 | 41,551,454 | AAPL |
| 2019-01-06 | 249.57 | 253.36 | 244.64 | 249.12 | 31,330,353 | AAPL |
| 5 rows | ||||||
View Source
Data Limitations
This is a sample dataset containing only 10 of the 500+ companies in the S&P 500 index. For comprehensive market analysis or index-level studies, consider using the full S&P 500 dataset or combining with additional data sources.
- Limited Stock Selection: Only 10 companies included; may not represent overall market behavior or enable proper diversification studies
- No Adjusted Prices: Prices are not adjusted for stock splits or dividends — critical for accurate backtesting
- No Pre/Post Market Data: Only regular trading hours (9:30 AM - 4:00 PM ET) data included
- No Fundamental Data: Does not include earnings, P/E ratios, revenue, or other fundamental metrics
- No Index Constituent Changes: Historical additions/removals from S&P 500 not documented
- Sample Period: Six-year window may not capture longer-term market cycles or secular trends
Table Overview
daily_prices
Data Preview
Scroll to see more| date | open | high | low | close | volume | Name |
|---|---|---|---|---|---|---|
| 2018-12-31 | 242.13 | 242.16 | 241.18 | 242.13 | 25,879,561 | AAPL |
| 2019-01-01 | 242.43 | 245.36 | 239.65 | 243.23 | 44,464,946 | AAPL |
| 2019-01-02 | 244.62 | 248.29 | 243.19 | 246.22 | 31,592,282 | AAPL |
| 2019-01-03 | 246.09 | 247.75 | 245.76 | 247.26 | 41,551,454 | AAPL |
| 2019-01-06 | 249.57 | 253.36 | 244.64 | 249.12 | 31,330,353 | AAPL |
Row 1
date2018-12-31
open242.13
high242.16
low241.18
close242.13
+2 more columns
Row 2
date2019-01-01
open242.43
high245.36
low239.65
close243.23
+2 more columns
Row 3
date2019-01-02
open244.62
high248.29
low243.19
close246.22
+2 more columns
Showing 5 of 15,660 rows
Data Profile
15,660
rows
7
columns
100%
complete
5.2 MB
estimated size
Column Types
5 Numeric1 Text1 Other
High-Cardinality Columns
Columns with many unique values (suitable for identifiers or categorical features)
- volume(15,658 unique values)
- high(12,045 unique values)
- low(12,007 unique values)
- close(11,991 unique values)
- open(11,955 unique values)
Data Dictionary
daily_prices
| Column | Type | Example | Missing Values |
|---|---|---|---|
date | string | "2018-12-31", "2019-01-01" | 0 |
open | numeric | 242.13, 242.43 | 0 |
high | numeric | 242.16, 245.36 | 0 |
low | numeric | 241.18, 239.65 | 0 |
close | numeric | 242.13, 243.23 | 0 |
volume | numeric | 25879561, 44464946 | 0 |
Name | string | "AAPL", "AAPL" | 0 |