Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Non-professional investors often try to find an interesting stock among those in an index (such as the Standard and Poor's 500, Nasdaq, etc.). They need only one company, the best, and they don't want to fail (perform poorly). So, the metric to optimize is accuracy, described as:
Accuracy = True Positives / (True Positives + False Positives)
And the predictive model can be a binary classifier.
The data covers the price and volume of shares of 31 NASDAQ companies in the year 2022.
Every data set I found to predict a stock price (investing) aims to find the price for the next day, and only for that stock. But in practical terms, people like to find the best stocks to buy from an index and wait a few days hoping to get an increase in the price of this investment.
Rows are grouped by companies and their age (newest to oldest) on a common date. The first column is the company. The following are the age, market, date (separated by year, month, day, hour, minute), share volume, various traditional prices of that share (close, open, high...), some price and volume statistics and target. The target is mainly defined as 1 when the closing price increases by at least 5% in 5 days (open market days). The target is 0 in any other case.
Complex features and target were made by executing: https://www.kaggle.com/code/luisandresgarcia/202307
Many thanks to everyone who participates in scientific papers and Kaggle notebooks related to financial investment.
https://brightdata.com/licensehttps://brightdata.com/license
Use our Stock Market dataset to access comprehensive financial and corporate data, including company profiles, stock prices, market capitalization, revenue, and key performance metrics. This dataset is tailored for financial analysts, investors, and researchers to analyze market trends and evaluate company performance.
Popular use cases include investment research, competitor benchmarking, and trend forecasting. Leverage this dataset to make informed financial decisions, identify growth opportunities, and gain a deeper understanding of the business landscape. The dataset includes all major data points: company name, company ID, summary, stock ticker, earnings date, closing price, previous close, opening price, and much more.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The main stock market index of United States, the US500, rose to 6464 points on September 1, 2025, gaining 0.06% from the previous session. Over the past month, the index has climbed 2.13% and is up 16.92% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks this benchmark index from United States. United States Stock Market Index - values, historical data, forecasts and news - updated on September of 2025.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains historical daily prices for all tickers currently trading on NASDAQ. The up to date list is available from nasdaqtrader.com. The historic data is retrieved from Yahoo finance via yfinance python package.
It contains prices for up to 01 of April 2020. If you need more up to date data, just fork and re-run data collection script also available from Kaggle.
The date for every symbol is saved in CSV format with common fields:
All that ticker data is then stored in either ETFs or stocks folder, depending on a type. Moreover, each filename is the corresponding ticker symbol. At last, symbols_valid_meta.csv
contains some additional metadata for each ticker such as full name.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Hascol
End-of-day prices refer to the closing prices of various financial instruments, such as equities (stocks), bonds, and indices, at the end of a trading session on a particular trading day. These prices are crucial pieces of market data used by investors, traders, and financial institutions to track the performance and value of these assets over time. The Techsalerator closing prices dataset is considered the most up-to-date, standardized valuation of a security trading commences again on the next trading day. This data is used for portfolio valuation, index calculation, technical analysis and benchmarking throughout the financial industry. The End-of-Day Pricing service covers equities, equity derivative bonds, and indices listed on 170 markets worldwide.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains comprehensive stock market data for June 2025, capturing daily trading information across multiple companies and sectors. The dataset represents a substantial collection of market data with detailed financial metrics and trading statistics.
Column Name | Data Type | Description | Example Values |
---|---|---|---|
Date | Date | Trading date in DD-MM-YYYY format | 01-06-2025, 02-06-2025 |
Ticker | String | Stock ticker symbol (3-4 characters) | AAPL, GOOGL, TSLA |
Open Price | Float | Opening price of the stock | 34.92, 206.5, 125.1 |
Attribute | Details |
---|---|
Dataset Name | Stock Market Data - June 2025 |
File Format | CSV |
File Size | ~2.5 MB |
Number of Records | 11,600+ |
Number of Features | 13 |
Time Period | June 1-21, 2025 |
Column Name | Data Type | Description | Example Values |
---|---|---|---|
Date | Date | Trading date in DD-MM-YYYY format | 01-06-2025, 02-06-2025 |
Ticker | String | Stock ticker symbol (3-4 characters) | AAPL, GOOGL, TSLA, SLH |
Open Price | Float | Opening price of the stock | 34.92, 206.5, 125.1 |
Close Price | Float | Closing price of the stock | 34.53, 208.45, 124.03 |
High Price | Float | Highest price during the trading day | 35.22, 210.51, 127.4 |
Low Price | Float | Lowest price during the trading day | 34.38, 205.12, 121.77 |
Volume Traded | Integer | Number of shares traded | 2,966,611, 1,658,738 |
Market Cap | Float | Market capitalization in dollars | 57,381,363,838.88 |
PE Ratio | Float | Price-to-Earnings ratio | 29.63, 13.03, 29.19 |
Dividend Yield | Float | Dividend yield percentage | 2.85, 2.73, 2.64 |
EPS | Float | Earnings per Share | 1.17, 16.0, 4.25 |
52 Week High | Float | Highest price in the last 52 weeks | 39.39, 227.38, 138.35 |
52 Week Low | Float | Lowest price in the last 52 weeks | 28.44, 136.79, 100.69 |
Sector | String | Industry sector classification | Industrials, Energy, Healthcare |
✅ Authentic Price Ranges: Based on realistic 2025 market projections ✅ Sector-Appropriate Volatility: Different volatility patterns by industry ✅ Correlated Metrics: P/E ratios, dividend yields, and EPS align with market caps ✅ Realistic Trading Volumes: Volume scaled appropriately to market cap ✅ Temporal Consistency: Logical price progression over 53-day period ✅ Market Cap Accuracy: Daily fluctuations reflect actual price movements
This dataset provides a comprehensive foundation for quantitative finance research, offering both breadth across market sectors and depth in daily trading dynamics while maintaining statistical realism throughout the observation period...
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset captures historical financial market data and macroeconomic indicators spanning over three decades, from 1990 onwards. It is designed for financial analysis, time series forecasting, and exploring relationships between market volatility, stock indices, and macroeconomic factors. This dataset is particularly relevant for researchers, data scientists, and enthusiasts interested in studying: - Volatility forecasting (VIX) - Stock market trends (S&P 500, DJIA, HSI) - Macroeconomic influences on markets (joblessness, interest rates, etc.) - The effect of geopolitical and economic uncertainty (EPU, GPRD)
The data has been aggregated from a mix of historical financial records and publicly available macroeconomic datasets: - VIX (Volatility Index): Chicago Board Options Exchange (CBOE). - Stock Indices (S&P 500, DJIA, HSI): Yahoo Finance and historical financial databases. - Volume Data: Extracted from official exchange reports. - Macroeconomic Indicators: Bureau of Economic Analysis (BEA), Federal Reserve, and other public records. - Uncertainty Metrics (EPU, GPRD): Economic Policy Uncertainty Index and Global Policy Uncertainty Database.
dt
: Date of observation in YYYY-MM-DD format.vix
: VIX (Volatility Index), a measure of expected market volatility.sp500
: S&P 500 index value, a benchmark of the U.S. stock market.sp500_volume
: Daily trading volume for the S&P 500.djia
: Dow Jones Industrial Average (DJIA), another key U.S. market index.djia_volume
: Daily trading volume for the DJIA.hsi
: Hang Seng Index, representing the Hong Kong stock market.ads
: Aruoba-Diebold-Scotti (ADS) Business Conditions Index, reflecting U.S. economic activity.us3m
: U.S. Treasury 3-month bond yield, a short-term interest rate proxy.joblessness
: U.S. unemployment rate, reported as quartiles (1 represents lowest quartile and so on).epu
: Economic Policy Uncertainty Index, quantifying policy-related economic uncertainty.GPRD
: Geopolitical Risk Index (Daily), measuring geopolitical risk levels.prev_day
: Previous day’s S&P 500 closing value, added for lag-based time series analysis.Feel free to use this dataset for academic, research, or personal projects.
End-of-day prices refer to the closing prices of various financial instruments, such as equities (stocks), bonds, and indices, at the end of a trading session on a particular trading day. These prices are crucial pieces of market data used by investors, traders, and financial institutions to track the performance and value of these assets over time. The Techsalerator closing prices dataset is considered the most up-to-date, standardized valuation of a security trading commences again on the next trading day. This data is used for portfolio valuation, index calculation, technical analysis and benchmarking throughout the financial industry. The End-of-Day Pricing service covers equities, equity derivative bonds, and indices listed on 170 markets worldwide.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
DAX
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By [source]
This dataset contains 862,231 labeled tweets and associated stock returns, providing a comprehensive look into the impact of social media on company-level stock market performance. For each tweet, researchers have extracted data such as the date of the tweet and its associated stock symbol, along with metrics such as last price and various returns (1-day return, 2-day return, 3-day return, 7-day return). Also recorded are volatility scores for both 10 day intervals and 30 day intervals. Finally, sentiment scores from both Long Short - Term Memory (LSTM) and TextBlob models have been included to quantify the overall tone in which these messages were delivered. With this dataset you will be able to explore how tweets can affect a company's share prices both short term and long term by leveraging all of these data points for analysis!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
In order to use this dataset, users can utilize descriptive statistics such as histograms or regression techniques to establish relationships between tweet content & sentiment with corresponding stock return data points such as 1-day & 7-day returns measurements.
The primary fields used for analysis include Tweet Text (TWEET), Stock symbol (STOCK), Date (DATE), Closing Price at the time of Tweet (LAST_PRICE) a range of Volatility measures 10 day Volatility(VOLATILITY_10D)and 30 day Volatility(VOLATILITY_30D ) for each Stock which capture changes in market fluctuation during different periods around when Twitter reactions occur. Additionally Sentiment Polarity analysis undertaken via two Machine learning algorithms LSTM Polarity(LSTM_POLARITY)and Textblob polarity provide insight into whether people are expressing positive or negative sentiments about each company at given times which again could influence thereby potentially influence Stock Prices over shorter term periods like 1-Day Returns(1_DAY_RETURN),2-Day Returns(2_DAY_RETURN)or longer term horizon like 7 Day Returns*7DAY RETURNS*.Finally MENTION field indicates if names/acronyms associated with Companies were specifically mentioned in each Tweet or not which gives extra insight into whether company specific contexts were present within individual Tweets aka “Company Relevancy”
- Analyzing the degree to which tweets can influence stock prices. By analyzing relationships between variables such as tweet sentiment and stock returns, correlations can be identified that could be used to inform investment decisions.
- Exploring natural language processing (NLP) models for predicting future market trends based on textual data such as tweets. Through testing and evaluating different text-based models using this dataset, better predictive models may emerge that can give investors advance warning of upcoming market shifts due to news or other events.
- Investigating the impact of different types of tweets (positive/negative, factual/opinionated) on stock prices over specific time frames. By studying correlations between the sentiment or nature of a tweet and its effect on stocks, insights may be gained into what sort of news or events have a greater impact on markets in general
If you use this dataset in your research, please credit the original authors. Data Source
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: reduced_dataset-release.csv | Column name | Description | |:----------------------|:-------------------------------------------------------------------------------------------------------| | TWEET | Text of the tweet. (String) | | STOCK | Company's stock mentioned in the tweet. (String) | | DATE | Date the tweet was posted. (Date) | | LAST_PRICE | Company's last price at the time of tweeting. (Float) ...
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Stock Market Tweets Data
Overview
This dataset is the same as the Stock Market Tweets Data on IEEE by Bruno Taborda.
Data Description
This dataset contains 943,672 tweets collected between April 9 and July 16, 2020, using the S&P 500 tag (#SPX500), the references to the top 25 companies in the S&P 500 index, and the Bloomberg tag (#stocks).
Dataset Structure
created_at: The exact time this tweet was posted. text: The text of the tweet, providing… See the full description on the dataset page: https://huggingface.co/datasets/StephanAkkerman/stock-market-tweets-data.
End-of-day prices refer to the closing prices of various financial instruments, such as equities (stocks), bonds, and indices, at the end of a trading session on a particular trading day. These prices are crucial pieces of market data used by investors, traders, and financial institutions to track the performance and value of these assets over time. The Techsalerator closing prices dataset is considered the most up-to-date, standardized valuation of a security trading commences again on the next trading day. This data is used for portfolio valuation, index calculation, technical analysis and benchmarking throughout the financial industry. The End-of-Day Pricing service covers equities, equity derivative bonds, and indices listed on 170 markets worldwide.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains the historical stock prices and related financial information for five major technology companies: Apple (AAPL), Microsoft (MSFT), Amazon (AMZN), Google (GOOGL), and Tesla (TSLA). The dataset spans a five-year period from January 1, 2019, to January 1, 2024. It includes key stock metrics such as Open, High, Low, Close, Adjusted Close, and Volume for each trading day.
The data was sourced using the yfinance library in Python, which provides convenient access to historical market data from Yahoo Finance.
The dataset contains the following columns:
Date: The trading date. Open: The opening price of the stock on that date. High: The highest price of the stock on that date. Low: The lowest price of the stock on that date. Close: The closing price of the stock on that date. Adj Close: The adjusted closing price, accounting for dividends and splits. Volume: The number of shares traded on that date. Ticker: The stock ticker symbol representing each company.
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data Introduction • The Stock Market Dataset contains metadata on stocks and ETFs listed on NASDAQ, including attributes such as ticker symbol, company name, market classification, ETF status, start date, and last trading date.
2) Data Utilization (1) Characteristics of the Stock Market Dataset: • Since the dataset includes only static metadata without price data, it is well-suited for preprocessing and classification tasks such as stock filtering, sector labeling, and distinguishing between ETFs and regular stocks.
(2) Applications of the Stock Market Dataset: • Automated sector classification of stocks: This dataset can be used to automatically tag or analyze stocks by sector using text-based industry keywords.
Discover the Huge Stock Market Dataset with historical price and volume data from NYSE, NASDAQ, and NYSE MKT.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan's main stock market index, the JP225, fell to 42718 points on August 29, 2025, losing 0.26% from the previous session. Over the past month, the index has climbed 5.08% and is up 10.53% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks this benchmark index from Japan. Japan Stock Market Index (JP225) - values, historical data, forecasts and news - updated on August of 2025.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
The Yahoo Stocks Dataset is an invaluable resource for analysts, traders, and developers looking to enhance their financial data models or trading strategies. Sourced from Yahoo Finance, this dataset includes historical stock prices, market trends, and financial indicators. With its accurate and comprehensive data, it empowers users to analyze patterns, forecast trends, and build robust machine learning models.
Whether you're a seasoned stock market analyst or a beginner in financial data science, this dataset is tailored to meet diverse needs. It features details like stock prices, trading volume, and market capitalization, enabling a deep dive into investment opportunities and market dynamics.
For machine learning and AI enthusiasts, the Yahoo Stocks Dataset is a goldmine. It’s perfect for developing predictive models, such as stock price forecasting and sentiment analysis. The dataset's structured format ensures seamless integration into Python, R, and other analytics platforms, making data visualization and reporting effortless.
Additionally, this dataset supports long-term trend analysis, helping investors make informed decisions. It’s also an essential resource for those conducting research in algorithmic trading and portfolio management.
Key benefits include:
Download the Yahoo Stocks Dataset today and harness the power of financial data for your projects. Whether for AI, financial reporting, or trend analysis, this dataset equips you with the tools to succeed in the dynamic world of stock markets.
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Dataset Information
This dataset includes daily price data for various stock indices.
Instruments Included
ADSMI: United Arab Emirates Stock Market (ADX General) - United Arab Emirates AEX: Netherlands Stock Market (AEX) - Netherlands (NL) AS30: Australian All - Australia (AU) AS51: Australia S&P/ASX 200 Stock Market Index - Australia (AU) AS52: ASX 50 - Australia (AU) ASE: Greece Stock Market (ASE) - Greece (GR) ATX: Austria Stock Market (ATX) - Austria (AT) BEL20:… See the full description on the dataset page: https://huggingface.co/datasets/paperswithbacktest/Indices-Daily-Price.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Non-professional investors often try to find an interesting stock among those in an index (such as the Standard and Poor's 500, Nasdaq, etc.). They need only one company, the best, and they don't want to fail (perform poorly). So, the metric to optimize is accuracy, described as:
Accuracy = True Positives / (True Positives + False Positives)
And the predictive model can be a binary classifier.
The data covers the price and volume of shares of 31 NASDAQ companies in the year 2022.
Every data set I found to predict a stock price (investing) aims to find the price for the next day, and only for that stock. But in practical terms, people like to find the best stocks to buy from an index and wait a few days hoping to get an increase in the price of this investment.
Rows are grouped by companies and their age (newest to oldest) on a common date. The first column is the company. The following are the age, market, date (separated by year, month, day, hour, minute), share volume, various traditional prices of that share (close, open, high...), some price and volume statistics and target. The target is mainly defined as 1 when the closing price increases by at least 5% in 5 days (open market days). The target is 0 in any other case.
Complex features and target were made by executing: https://www.kaggle.com/code/luisandresgarcia/202307
Many thanks to everyone who participates in scientific papers and Kaggle notebooks related to financial investment.