https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
Graph and download economic data for Dow Jones Industrial Average (DJIA) from 2015-08-31 to 2025-08-29 about stock market, average, industry, and USA.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The collected tweets and Earnings Announcements (EA) cover the period of three years, from June 1, 2013 to June 3, 2016. Companies are ordered by the total number of tweets collected. For each company, there is the sentiment distribution, market capitalization, and the prevailing timing of EAs with respect to the NYSE trading hours. Each company issues four EAs per year, therefore there is a total of 360 EAs (30 companies, three years, four EAs per year)1.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset captures historical financial market data and macroeconomic indicators spanning over three decades, from 1990 onwards. It is designed for financial analysis, time series forecasting, and exploring relationships between market volatility, stock indices, and macroeconomic factors. This dataset is particularly relevant for researchers, data scientists, and enthusiasts interested in studying: - Volatility forecasting (VIX) - Stock market trends (S&P 500, DJIA, HSI) - Macroeconomic influences on markets (joblessness, interest rates, etc.) - The effect of geopolitical and economic uncertainty (EPU, GPRD)
The data has been aggregated from a mix of historical financial records and publicly available macroeconomic datasets: - VIX (Volatility Index): Chicago Board Options Exchange (CBOE). - Stock Indices (S&P 500, DJIA, HSI): Yahoo Finance and historical financial databases. - Volume Data: Extracted from official exchange reports. - Macroeconomic Indicators: Bureau of Economic Analysis (BEA), Federal Reserve, and other public records. - Uncertainty Metrics (EPU, GPRD): Economic Policy Uncertainty Index and Global Policy Uncertainty Database.
dt
: Date of observation in YYYY-MM-DD format.vix
: VIX (Volatility Index), a measure of expected market volatility.sp500
: S&P 500 index value, a benchmark of the U.S. stock market.sp500_volume
: Daily trading volume for the S&P 500.djia
: Dow Jones Industrial Average (DJIA), another key U.S. market index.djia_volume
: Daily trading volume for the DJIA.hsi
: Hang Seng Index, representing the Hong Kong stock market.ads
: Aruoba-Diebold-Scotti (ADS) Business Conditions Index, reflecting U.S. economic activity.us3m
: U.S. Treasury 3-month bond yield, a short-term interest rate proxy.joblessness
: U.S. unemployment rate, reported as quartiles (1 represents lowest quartile and so on).epu
: Economic Policy Uncertainty Index, quantifying policy-related economic uncertainty.GPRD
: Geopolitical Risk Index (Daily), measuring geopolitical risk levels.prev_day
: Previous day’s S&P 500 closing value, added for lag-based time series analysis.Feel free to use this dataset for academic, research, or personal projects.
The Dow Jones Industrial Average (DJIA) index dropped around ***** points in the four weeks from February 12 to March 11, 2020, but has since recovered and peaked at ********* points as of November 24, 2024. In February 2020 - just prior to the global coronavirus (COVID-19) pandemic, the DJIA index stood at a little over ****** points. U.S. markets suffer as virus spreads The COVID-19 pandemic triggered a turbulent period for stock markets – the S&P 500 and Nasdaq Composite also recorded dramatic drops. At the start of February, some analysts remained optimistic that the outbreak would ease. However, the increased spread of the virus started to hit investor confidence, prompting a record plunge in the stock markets. The Dow dropped by more than ***** points in the week from February 21 to February 28, which was a fall of **** percent – its worst percentage loss in a week since October 2008. Stock markets offer valuable economic insights The Dow Jones Industrial Average is a stock market index that monitors the share prices of the 30 largest companies in the United States. By studying the performance of the listed companies, analysts can gauge the strength of the domestic economy. If investors are confident in a company’s future, they will buy its stocks. The uncertainty of the coronavirus sparked fears of an economic crisis, and many traders decided that investment during the pandemic was too risky.
Throughout the 1920s, prices on the U.S. stock exchange rose exponentially, however, by the end of the decade, uncontrolled growth and a stock market propped up by speculation and borrowed money proved unsustainable, resulting in the Wall Street Crash of October 1929. This set a chain of events in motion that led to economic collapse - banks demanded repayment of debts, the property market crashed, and people stopped spending as unemployment rose. Within a year the country was in the midst of an economic depression, and the economy continued on a downward trend until late-1932.
It was during this time where Franklin D. Roosevelt (FDR) was elected president, and he assumed office in March 1933 - through a series of economic reforms and New Deal policies, the economy began to recover. Stock prices fluctuated at more sustainable levels over the next decades, and developments were in line with overall economic development, rather than the uncontrolled growth seen in the 1920s. Overall, it took over 25 years for the Dow Jones value to reach its pre-Crash peak.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Detailed information about the detected events from the Twitter data and their polarity. We show the 118 detected EA events and 182 detected non-EA events. (PDF)
https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
View data of the S&P 500, an index of the stocks of 500 leading companies in the US economy, which provides a gauge of the U.S. equity market.
The value of the DJIA index amounted to ****** at the end of June 2025, up from ********* at the end of March 2020. Global panic about the coronavirus epidemic caused the drop in March 2020, which was the worst drop since the collapse of Lehman Brothers in 2008. Dow Jones Industrial Average index – additional information The Dow Jones Industrial Average index is a price-weighted average of 30 of the largest American publicly traded companies on New York Stock Exchange and NASDAQ, and includes companies like Goldman Sachs, IBM and Walt Disney. This index is considered to be a barometer of the state of the American economy. DJIA index was created in 1986 by Charles Dow. Along with the NASDAQ 100 and S&P 500 indices, it is amongst the most well-known and used stock indexes in the world. The year that the 2018 financial crisis unfolded was one of the worst years of the Dow. It was also in 2008 that some of the largest ever recorded losses of the Dow Jones Index based on single-day points were registered. On September 29, 2008, for instance, the Dow had a loss of ****** points, one of the largest single-day losses of all times. The best years in the history of the index still are 1915, when the index value increased by ***** percent in one year, and 1933, year when the index registered a growth of ***** percent.
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Prices for United States Stock Market Index (US30) including live quotes, historical charts and news. United States Stock Market Index (US30) was last updated by Trading Economics this September 1 of 2025.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The inter-annotator agreement is computed from the examples labeled twice. The classifier performance is estimated from the 10-fold cross-validation.
https://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for Dow-Jones Industrial Stock Price Index for United States (M1109BUSM293NNBR) from Dec 1914 to Dec 1968 about stock market, industry, price index, indexes, price, and USA.
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Companies are ordered as in Table 1. The arrows indicate a statistically significant Granger causality relation for a company, at the 5% significance level. A right arrow indicates that the Twitter variable (sentiment polarity Pd or volume TWd) Granger-causes the market variable (return Rd), while a left arrow indicates that the market variable Granger-causes the Twitter variable. The counts at the bottom show the total number of companies passing the Granger test.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The main stock market index of United States, the US500, rose to 6464 points on September 1, 2025, gaining 0.06% from the previous session. Over the past month, the index has climbed 2.13% and is up 16.92% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks this benchmark index from United States. United States Stock Market Index - values, historical data, forecasts and news - updated on September of 2025.
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Context Predicting stock market movements is a classic challenge in machine learning. While raw Open, High, Low, Close, and Volume (OHLCV) data is the standard starting point, its predictive power is often limited. To build robust models, data scientists require a much richer feature set that captures different aspects of market dynamics, from technical patterns to sentiment hidden in financial news.
This dataset was created to bridge that gap. It provides a highly-enriched, pre-processed collection of features for the Dow Jones Industrial Average (DJIA), designed to accelerate research and modeling for stock price prediction.
Content The dataset is organized into several files, each representing a distinct category of engineered features. This modular structure allows you to easily select, combine, or test the importance of different feature types.
Description: Each day's top 25 news headlines have been transformed into a sophisticated knowledge graph. These graphs, enriched with data from Wikidata, are then encoded into 128-dimensional vectors using a Graph Convolutional Network (GCN). This file captures the semantic meaning and relationships within the news, providing a powerful non-price-based feature.
Description: Contains fundamental features derived directly from OHLCV data. These are crucial for capturing intraday volatility and price action.
Example Features: intraday_range, body_size, price_change, simple_return, log_return, price_volume_interaction.
Description: A wide array of popular technical indicators calculated using the pandas-ta library. These features are staples of financial analysis and help identify trends, momentum, and volatility.
Example Features: Simple Moving Averages (SMA_20, SMA_50, SMA_200), Exponential Moving Averages (EMA_12, EMA_26), MACD, RSI, Bollinger Bands (BBL, BBM, BBU), On-Balance Volume (OBV), and more.
Description: This file includes features based on the statistical properties of returns over an optimized rolling window, as well as cyclical time-based features. The optimal window was determined by finding the period with the highest correlation to future returns.
Example Features: rolling_mean, rolling_std (volatility), rolling_skew, rolling_kurt, day_of_week_sin, day_of_week_cos, is_month_end.
Description: More complex and transformational features designed to capture deeper market dynamics.
Example Features: Lagged returns and RSI, quantitative candlestick pattern features, wavelet transform coefficients (to decompose price signals into different frequencies), and the Hurst Exponent (to measure long-term memory in the time series).
Methodology The features were systematically generated using a series of Python scripts.
News Embeddings: Headlines were processed to extract named entities. These entities were used to build knowledge subgraphs from Wikidata. Finally, a Graph Convolutional Network (GCN) model encoded these graphs into dense vectors.
Tabular Features: All other features were generated from the raw DJIA price and volume data. The process involved several stages, from basic price calculations to advanced transformations. For features requiring a lookback period (e.g., rolling statistics, Hurst exponent), an optimal window length was programmatically determined to maximize its correlation with the target variable.
Acknowledgements The raw OHLCV and news data was originally sourced from: https://www.kaggle.com/datasets/aaron7sun/stocknews. We thank them for making the data available.
Inspiration This dataset is perfect for a variety of financial machine learning tasks:
Can you build a model to predict the next day's market direction (Up/Down)?
Which feature set is the most powerful? The technical indicators, the news embeddings, or a combination of all?
How do advanced features like the Hurst exponent or wavelet coefficients contribute to model performance?
Can you use these features to build a profitable trading strategy (backtesting required)?
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This table contains 14 series, with data starting from 1953 (not all combinations necessarily have data for all years). This table contains data described by the following dimensions (Not all combinations are available): Geography (1 items: Canada ...), Stock market statistics (14 items: Toronto Stock Exchange; value of shares traded; United States common stocks; Dow-Jones industrials; high; United States common stocks; Dow-Jones industrials; low; Toronto Stock Exchange; volume of shares traded ...).
https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
Graph and download economic data for Dow Jones Industrial Average (DJIA) from 2015-08-31 to 2025-08-29 about stock market, average, industry, and USA.