13 datasets found

NASDAQ and NYSE stocks histories
kaggle.com
Updated Nov 5, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiun Yen (2018). NASDAQ and NYSE stocks histories [Dataset]. https://www.kaggle.com/qks1lver/nasdaq-and-nyse-stocks-histories/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 5, 2018
Dataset provided by
Kaggle
Authors
Jiun Yen
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
NASDAQ and NYSE stocks histories

Update every Saturday night because I'm too tired to do anything on Friday

Full history of stock symbols on NASDAQ and NYSE:

Unzip fh_< version_date >.zip

Each stock symbol has a .csv file under full_history/

i.e. AMD.csv

Columns in .csv

date - year-month-day, 2018-08-08

volume - int, volume of the day

open - float, opening price of the day

close - float, closing price of the day

high - float, highest price of the day

low - float, lowest price of the day

adjclose - float, adjusted closing price of the day

Other files:

all_symbols.txt - All the stock symbols with history

excluded_symbols.txt - All the ones that I couldn't retrieve data for

NASDAQ.txt - NASDAQ listing

NYSE.txt - NYSE listing

All data compiled from Yahoo Finance

If you have questions, e-mail me: jiunyyen@gmail.com

Happy mining!
AMEX, NYSE, NASDAQ stock histories
kaggle.com
Updated Jul 4, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiun Yen (2020). AMEX, NYSE, NASDAQ stock histories [Dataset]. https://www.kaggle.com/qks1lver/amex-nyse-nasdaq-stock-histories/home
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 4, 2020
Dataset provided by
Kaggle
Authors
Jiun Yen
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
AMEX, NYSE, and NASDAQ stocks histories

Update every Satur... Sun... I mean Friday... >_< sometime during the weekend. I lied, I've been too busy the past few months and haven't updated in forever until today (2020.6.14) - Last scrape 2020.06.12 Friday evening (p.s. Download shows 3GB unzipped, zipped file is ~600MB)

Full history of stock symbols:

Unzip fh_< version_date >.zip

Each stock symbol has a .csv file under full_history/

i.e. AMD.csv

Columns in .csv

date - year-month-day, 2018-08-08

volume - int, volume of the day

open - float, opening price of the day

close - float, closing price of the day

high - float, highest price of the day

low - float, lowest price of the day

adjclose - float, adjusted closing price of the day

Other files:

all_symbols.txt - All the stock symbols with history

excluded_symbols.txt - All the ones that I couldn't retrieve data for

NASDAQ.txt - NASDAQ listing

NYSE.txt - NYSE listing

AMEX.txt - AMEX listing

Disclaimer

This dataset contains almost all the stocks listed on these exchanges as of the date shown in the file name. Some of the symbols cannot be found on Yahoo Finance, which I plan on using CNN Money to scrape. There are other symbols that have different classes that require some modification before I can make them queryable... I have yet to decide on the best course of action. If you want to know what these excluded symbols are, see excluded_symbols.txt.

Note: there used to be some tickers missing because of poor connection, that's been solved now.

I've also been asked why I don't put everything into one table, and here's my rationale (copy/pasted from my email):

It is possible and I've debated this before, but I've decided to go with individual files for quite a number of reasons, and I highly recommend you consider these before combining them: 1) I don't need to load everything into memory or search for the right rows if I only want to work with particular sets, 2) easier and faster to manipulate (append, remove, or whatever) when all the data of a ticker is in the same place, 3) I don't need to repeat ticker names for each row just to know which row belongs to which ticker, 4) reduce risk, latency, and waits during parallel processing of different ticker data, 5) in case of any unforeseen bad writes or termination, this way reduces the chances of affecting the entire dataset and allows for restart anytime without the need to keep backup things up every 5 minutes. I get all these benefits only at the cost of slightly larger compressed file and a few more lines of code. To me it's worth it, but I can understand if you are frustrated, but it is possible to concatenate everything.

Github - for you to DIY:

https://github.com/qks1lver/redtide

Data source

Listing files (i.e. NYSE.txt) are from http://eoddata.com/symbols.aspx

Daily historical data compiled from Yahoo Finance

Need someone to talk to?

If you have questions, e-mail me: jiunyyen@gmail.com

Happy mining!
Stock Market Dataset
kaggle.com
zip
Updated Apr 2, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Oleh Onyshchak (2020). Stock Market Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/1054465
Explore at:
zip(547714524 bytes)Available download formats
Unique identifier
https://doi.org/10.34740/kaggle/dsv/1054465
Dataset updated
Apr 2, 2020
Authors
Oleh Onyshchak
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Overview

This dataset contains historical daily prices for all tickers currently trading on NASDAQ. The up to date list is available from nasdaqtrader.com. The historic data is retrieved from Yahoo finance via yfinance python package.

It contains prices for up to 01 of April 2020. If you need more up to date data, just fork and re-run data collection script also available from Kaggle.

Data Structure

The date for every symbol is saved in CSV format with common fields:

Date - specifies trading date

Open - opening price

High - maximum price during the day

Low - minimum price during the day

Close - close price adjusted for splits

Adj Close - adjusted close price adjusted for both dividends and splits.

Volume - the number of shares that changed hands during a given day

All that ticker data is then stored in either ETFs or stocks folder, depending on a type. Moreover, each filename is the corresponding ticker symbol. At last, symbols_valid_meta.csv contains some additional metadata for each ticker such as full name.
Get OHLCV, MBO, equities market events, and more from NYSE Integrated
databento.com
csv, dbn, json
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Databento (2025). Get OHLCV, MBO, equities market events, and more from NYSE Integrated [Dataset]. https://databento.com/datasets/XNYS.PILLAR
Explore at:
json, dbn, csvAvailable download formats
Dataset updated
Jan 15, 2025
Dataset provided by
Databento Inc.
Authors
Databento
Time period covered
Mar 28, 2023 - Present
Area covered
United States
Description
NYSE Integrated is a proprietary data feed that disseminates full order book updates from the New York Stock Exchange (XNYS). It delivers every quote and order at each price level, along with any event that updates the order book after an order is placed, such as trade executions, modifications, or cancellations.

NYSE is the leading venue for listing blue-chip companies and large-cap stocks. Powered by NYSE's Pillar platform, its hybrid market model of floor-based auction and electronic trading allows it to capture a significant portion of trading activity during the US equity market open and close. As of January 2025, the NYSE represented approximately 6.31% of the average daily volume (ADV) across all exchange-listed US securities, including those listed on Nasdaq, other NYSE venues, and Cboe exchanges.

NYSE is also the only exchange to offer Designated Market Maker (DMM) privileges, allowing the floor to send D-Quote Orders, short for Discretionary Orders, throughout the day. Most D-Quote Orders execute in the closing auction, where they're known as Closing D Orders and allow traders to access the NYSE closing auction after 3:50 PM. This creates significant price discovery during the NYSE Closing Auction, where interest represented via the floor contributes more than 40% of total volume.

NYSE is also unique for being the only exchange with a Parity/Priority Allocation model for matching. This resembles a mixed FIFO and pro-rata matching algorithm, where the participant who sets the best price is matched first, and then the remaining shares are allocated to other orders entered by floor brokers at that price (parity allocation). Floor brokers may utilize e-Quotes to to receive such parity allocation of incoming executions.

With L3 granularity, NYSE Integrated captures information beyond the L1, top-of-book data available through SIP feeds, enabling accurate modeling of the book imbalances, queue dynamics, and the auction process. This data includes explicit trade aggressor side, odd lots, and imbalances. Auction imbalances offer valuable insights into NYSE’s opening and closing auctions by providing details like imbalance quantity, paired quantity, imbalance reference price, and book clearing price.

Historical data is available for usage-based rates or with any Databento US Equities subscription. Visit our pricing page for more details or to upgrade your plan.

Asset class: Equities

Origin: Directly captured at Equinix NY4 (Secaucus, NJ) with an FPGA-based network card and hardware timestamping. Synchronized to UTC with PTP.

Supported data encodings: DBN, CSV, JSON (Learn more)

Supported market data schemas: MBO, MBP-1, MBP-10, TBBO, Trades, BBO-1s, BBO-1m, OHLCV-1s, OHLCV-1m, OHLCV-1h, OHLCV-1d, Definition, Imbalance, Statistics, Status (Learn more)

Resolution: Immediate publication, nanosecond-resolution timestamps
F
S&P 500
fred.stlouisfed.org
json
Updated Jun 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). S&P 500 [Dataset]. https://fred.stlouisfed.org/series/SP500
Explore at:
jsonAvailable download formats
Dataset updated
Jun 20, 2025
License
https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
Description
View data of the S&P 500, an index of the stocks of 500 leading companies in the US economy, which provides a gauge of the U.S. equity market.
d
Historical volatility time series and Live prices on Equity Options
datarade.ai
Updated Mar 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Canari (2023). Historical volatility time series and Live prices on Equity Options [Dataset]. https://datarade.ai/data-products/historical-volatility-time-series-and-live-prices-on-equity-o-canari
Explore at:
Dataset updated
Mar 9, 2023
Dataset authored and provided by
Canari
Area covered
Switzerland, United Kingdom, France, Norway, Netherlands, Sweden, Belgium, Germany, Italy, Spain
Description
This dataset offers both live (delayed) prices and End Of Day time series on equity options

1/ Live (delayed) prices for options on European stocks and indices including: Reference spot price, bid/ask screen price, fair value price (based on surface calibration), implicit volatility, forward Greeks : delta, vega Canari.dev computes AI-generated forecast signals indicating which option is over/underpriced, based on the holders strategy (buy and hold until maturity, 1 hour to 2 days holding horizon...). From these signals is derived a "Canari price" which is also available in this live tables.
Visit our website (canari.dev ) for more details about our forecast signals.

The delay ranges from 15 to 40 minutes depending on underlyings.

2/ Historical time series: Implied vol Realized vol Smile Forward
See a full API presentation here : https://youtu.be/qitPO-SFmY4 .

These data are also readily accessible in Excel thanks the provided Add-in available on Github: https://github.com/canari-dev/Excel-macro-to-consume-Canari-API

If you need help, contact us at: contact@canari.dev

User Guide: You can get a preview of the API by typing "data.canari.dev" in your web browser. This will show you a free version of this API with limited data.

Here are examples of possible syntaxes:

For live options prices: data.canari.dev/OPT/DAI data.canari.dev/OPT/OESX/0923 The "csv" suffix to get a csv rather than html formating, for example: data.canari.dev/OPT/DB1/1223/csv For historical parameters: Implied vol : data.canari.dev/IV/BMW

data.canari.dev/IV/ALV/1224

data.canari.dev/IV/DTE/1224/csv

Realized vol (intraday, maturity expressed as EWM, span in business days): data.canari.dev/RV/IFX ... Implied dividend flow: data.canari.dev/DIV/IBE ... Smile (vol spread between ATM strike and 90% strike, normalized to 1Y with factor 1/√T): data.canari.dev/SMI/DTE ... Forward: data.canari.dev/FWD/BNP ...

List of available underlyings: Code Name OESX Eurostoxx50 ODAX DAX OSMI SMI (Swiss index) OESB Eurostoxx Banks OVS2 VSTOXX ITK AB Inbev ABBN ABB ASM ASML ADS Adidas AIR Air Liquide EAD Airbus ALV Allianz AXA Axa BAS BASF BBVD BBVA BMW BMW BNP BNP BAY Bayer DBK Deutsche Bank DB1 Deutsche Boerse DPW Deutsche Post DTE Deutsche Telekom EOA E.ON ENL5 Enel INN ING IBE Iberdrola IFX Infineon IES5 Intesa Sanpaolo PPX Kering LOR L Oreal MOH LVMH LIN Linde DAI Mercedes-Benz MUV2 Munich Re NESN Nestle NOVN Novartis PHI1 Philips REP Repsol ROG Roche SAP SAP SNW Sanofi BSD2 Santander SND Schneider SIE Siemens SGE Société Générale SREN Swiss Re TNE5 Telefonica TOTB TotalEnergies UBSN UBS CRI5 Unicredito SQU Vinci VO3 Volkswagen ANN Vonovia ZURN Zurich Insurance Group
f
Data from: Trading Imbalance in Chinese Stock Market - A High-Frequency View...
figshare.com
txt
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jichang Zhao; Shan Lu (2023). Trading Imbalance in Chinese Stock Market - A High-Frequency View [Dataset]. http://doi.org/10.6084/m9.figshare.5835936.v3
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5835936.v3
Dataset updated
May 31, 2023
Dataset provided by
figshare
Authors
Jichang Zhao; Shan Lu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The series of files named as ‘*_polarity.csv’ in folder ‘polarity’ includes the trading polarities of stocks listed on Shenzhen Stock Exchange from May 4 to July 31 2015. The eight numbers in the filenames specify the dates. The columns of these dataframes indicate the stock names, while the indices of dataframes indicate the time. The granularity of trading polarity is 1 minute for every stock. These trading polarities are calculated from the serial numbers for buyers and sellers in transactions data. The original transactions data is not publicly available due to the company’s license requirement.2. The files in the 'log_ret' folder cover the log returns of 1646 stocks listed on Shenzhen Stock Exchange from May 4 to July 31 2015. These data are calculated from the intraday price trends data provided by Thomson Reuters’ Tick History. The original price trends data is not publicly available due to the company’s license requirement.3. The file named as "stock_market_value.csv" gives the capitalization of stocks in June 31 2015, which is downloaded from Wind Information and we have converted the unit of measure from RMB into a dollar. Due to license requirements of the data companies, all of the above files have converted the names of stocks into integers in a consistent way. 4. Please cite the following paper:Shan Lu, Jichang Zhao and Huiwen Wang. Trading Imbalance in Chinese Stock Market—A High-Frequency View. Entropy, 2020, 22(8), 897.
o
Nairobi Securities Exchange Prices 2008-2012 for 6 selected stocks
explore.openaire.eu
data.mendeley.com
Updated Mar 10, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Barack Wanjawa (2020). Nairobi Securities Exchange Prices 2008-2012 for 6 selected stocks [Dataset]. http://doi.org/10.17632/95fb84nzcd
Explore at:
Unique identifier
https://doi.org/10.17632/95fb84nzcd
Dataset updated
Mar 10, 2020
Authors
Barack Wanjawa
Description
Stock market prediction remains active research in a quest to inform investors on how to trade (buy/sell) at the most opportune time. The prevalent methods used by stock market players in trying to predict the likely future trade prices are either technical, fundamental or time series analysis. This research wanted to try out machine learning methods, in contrast to the existing prevalent methods. Artificial neural networks (ANNs) tend to be the preferred machine learning method for this type of application. However, ANNs require some historical data to learn from, in order to do predictions. The research used an ANN model to test the hypothesis that the next day price (prediction) can be determined from the stock prices of the immediate last five days. The final ANN model used for the tests was a feedforward multi-layer perceptron (MLP) with error backpropagation, using sigmoid activation function, with network configuration 5:21:21:1. The data period used was a 5-year dataset (2008 to 2012), with 80% of the data (4-year data) used for training and the balance 20% used for testing (last 1-year data). The original raw data for Nairobi Securities Exchange (NSE) was scrapped from a publicly available and accessible website of a stock market analysis company in Kenya (Synergy, 2020). This daily prices data was first exported to a spreadsheet, then cleaned off headers and other redundant information, leaving only the data with stock name, date of trade and the related data such as volumes, low prices, high prices and adjusted prices. The data was further sorted by the stock names and then the trading dates. The data dimension was finally reduced to only what was needed for the research, which was the stock name, the date of trade and the adjusted price (average trade price). This final dataset was in CSV format, as hereby presented. The research tested three NSE stocks with the mean absolute percentage error (MAPE) ranging between 0.77% to 1.91%, over the 3-month testing period, while the root mean squared error (RMSE) ranged between 1.83 and 3.07. This raw data can be used to train and test any machine learning model that requires training and testing data. The data can also be used to validate and reproduce the results already presented in this research. There could be slight variance between what is obtained when reproducing the results, due to the differences in the final exact weights that the trained ANN model eventually achieves. However, these differences should not be significant. List of data files on this dataset: stock01_NSE_01jan2008_to_31dec2012_Kakuzi.csv stock02_NSE_01jan2008_to_31dec2012_StandardBank.csv stock03_NSE_01jan2008_to_31dec2012_KenyaAirways.csv stock04_NSE_01jan2008_to_31dec2012_BamburiCement.csv stock05_NSE_01jan2008_to_31dec2012_Kengen.csv stock06_NSE_01jan2008_to_31dec2012_BAT.csv References: Synergy Systems Ltd. (2020). MyStocks. Retrieved March 9, 2020, from http://live.mystocks.co.ke/
m
Nairobi Stock Exchange Prices 2008-2012 for 6 selected stocks
data.mendeley.com
Updated Mar 9, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Barack Wanjawa (2020). Nairobi Stock Exchange Prices 2008-2012 for 6 selected stocks [Dataset]. http://doi.org/10.17632/95fb84nzcd.1
Explore at:
Unique identifier
https://doi.org/10.17632/95fb84nzcd.1
Dataset updated
Mar 9, 2020
Authors
Barack Wanjawa
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Stock market prediction remains active research in a quest to inform investors on how to trade (buy/sell) at the most opportune time. The prevalent methods used by stock market players in trying to predict the likely future trade prices are either technical, fundamental or time series analysis. This research wanted to try out machine learning methods, in contrast to the existing prevalent methods. Artificial neural networks (ANNs) tend to be the preferred machine learning method for this type of application. However, ANNs require some historical data to learn from, in order to do predictions. The research used an ANN model to test the hypothesis that the next day price (prediction) can be determined from the stock prices of the immediate last five days.

The final ANN model used for the tests was a feedforward multi-layer perceptron (MLP) with error backpropagation, using sigmoid activation function, with network configuration 5:21:21:1. The data period used was a 5-year dataset (2008 to 2012), with 80% of the data (4-year data) used for training and the balance 20% used for testing (last 1-year data).

The original raw data for Nairobi Securities Exchange (NSE) was scrapped from a publicly available and accessible website of a stock market analysis company in Kenya (Synergy, 2020). This data was first exported to a spreadsheet, then cleaned off headers and other redundant information, leaving only the data with stock name, date of trade and the related data such as volumes, low prices, high prices and adjusted prices. The data was further sorted by the stock names and then the trading dates. The data dimension was finally reduced to only what was needed for the research, which was the stock name, the date of trade and the adjusted price (average trade price). This final dataset was in CSV format, as hereby presented.

The research tested three NSE stocks with the mean absolute percentage error (MAPE) ranging between 0.77% to 1.91%, over the 3-month testing period, while the root mean squared error (RMSE) ranged between 1.83 and 3.07.

This raw data can be used to train and test any machine learning model that requires training and testing data. The data can also be used to validate and reproduce the results already presented in this research. There could be slight variance between what is obtained when reproducing the results, due to the differences in the final exact weights that the trained ANN model eventually achieves. However, these differences should not be significant.

List of data files on this dataset: stock01_NSE_01jan2008_to_31dec2012_Kakuzi.csv stock02_NSE_01jan2008_to_31dec2012_StandardBank.csv stock03_NSE_01jan2008_to_31dec2012_KenyaAirways.csv stock04_NSE_01jan2008_to_31dec2012_BamburiCement.csv stock05_NSE_01jan2008_to_31dec2012_Kengen.csv stock06_NSE_01jan2008_to_31dec2012_BAT.csv

References: Synergy Systems Ltd. (2020). MyStocks. Retrieved March 9, 2020, from http://live.mystocks.co.ke/
Stock Market Dataset (NIFTY-500)
kaggle.com
Updated Jun 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sourav Banerjee (2023). Stock Market Dataset (NIFTY-500) [Dataset]. https://www.kaggle.com/datasets/iamsouravbanerjee/nifty500-stocks-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 10, 2023
Dataset provided by
Kaggle
Authors
Sourav Banerjee
Description
Context

NIFTY 500 is India’s first broad-based stock market index of the Indian stock market. It contains the top 500 listed companies on the NSE. The NIFTY 500 index represents about 96.1% of free-float market capitalization and 96.5% of the total turnover on the National Stock Exchange (NSE).

NIFTY 500 companies are disaggregated into 72 industry indices. Industry weights in the index reflect industry weights in the market. For example, if the banking sector has a 5% weight in the universe of stocks traded on the NSE, banking stocks in the index would also have an approximate representation of 5% in the index. NIFTY 500 can be used for a variety of purposes such as benchmarking fund portfolios, launching index funds, ETFs, and other structured products.

Other Notable Indices -

NIFTY 50: Top 50 listed companies on the NSE. A diversified 50-stock index accounting for 13 sectors of the Indian economy.

NIFTY Next 50: Also called NIFTY Juniors. Represents 50 companies from NIFTY 100 after excluding the NIFTY 50 companies.

NIFTY 100: Diversified 100 stock index representing major sectors of the economy. NIFTY 100 represents the top 100 companies based on full market capitalization from NIFTY 500.

NIFTY 200: Designed to reflect the behavior and performance of large and mid-market capitalization companies.

Content

The dataset comprises various parameters and features for each of the NIFTY 500 Stocks, including Company Name, Symbol, Industry, Series, Open, High, Low, Previous Close, Last Traded Price, Change, Percentage Change, Share Volume, Value in Indian Rupee, 52 Week High, 52 Week Low, 365 Day Percentage Change, and 30 Day Percentage Change.

Dataset Glossary (Column-Wise)

Company Name: Name of the Company.

Symbol: A stock symbol is a unique series of letters assigned to a security for trading purposes.

Industry: Name of the industry to which the stock belongs.

Series: EQ stands for Equity. In this series intraday trading is possible in addition to delivery and BE stands for Book Entry. Shares falling in the Trade-to-Trade or T-segment are traded in this series and no intraday is allowed. This means trades can only be settled by accepting or giving the delivery of shares.

Open: It is the price at which the financial security opens in the market when trading begins. It may or may not be different from the previous day's closing price. The security may open at a higher price than the closing price due to excess demand for the security.

High: It is the highest price at which a stock is traded during the course of the trading day and is typically higher than the closing or equal to the opening price.

Low: Today's low is a security's intraday low trading price. Today's low is the lowest price at which a stock trades over the course of a trading day.

Previous Close: The previous close almost always refers to the prior day's final price of a security when the market officially closes for the day. It can apply to a stock, bond, commodity, futures or option co-contract, market index, or any other security.

Last Traded Price: The last traded price (LTP) usually differs from the closing price of the day. This is because the closing price of the day on NSE is the weighted average price of the last 30 mins of trading. The last traded price of the day is the actual last traded price.

Change: For a stock or bond quote, change is the difference between the current price and the last trade of the previous day. For interest rates, change is benchmarked against a major market rate (e.g., LIBOR) and may only be updated as infrequently as once a quarter.

Percentage Change: Take the selling price and subtract the initial purchase price. The result is the gain or loss. Take the gain or loss from the investment and divide it by the original amount or purchase price of the investment. Finally, multiply the result by 100 to arrive at the percentage change in the investment.

Share Volume: Volume is an indicator that means the total number of shares that have been bought or sold in a specific period of time or during the trading day. It will also involve the buying and selling of every share during a specific time period.

Value (Indian Rupee): Market value—also known as market cap—is calculated by multiplying a company's outstanding shares by its current market price.

52-Week High: A 52-week high is the highest share price that a stock has traded at during a passing year. Many market aficionados view the 52-week high as an important factor in determining a stock's current value and predicting future price movement. 52-week High prices are adjusted for Bonus, Split & Rights Corporate actions.

52-Week Low: A 52-week low is the lowest ...
US Stock Market Data
kaggle.com
zip
Updated Jan 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohammed Obeidat (2023). US Stock Market Data [Dataset]. https://www.kaggle.com/mohammedobeidat/us-stock-market-data
Explore at:
zip(42432995 bytes)Available download formats
Dataset updated
Jan 14, 2023
Authors
Mohammed Obeidat
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The dataset contains the file required for training and testing and split accordingly.

There are two groups of features that you can use for prediction:

Fundamentals and ratios: Values collected form statements and balance sheets for each ticker

Technical indicators and strategy flags: Technical indicators calculated on close value of each day and buy and sell signals generated using some commonly used trading strategies.

Files found in Fundamentals folder is a processed format of the files found in raw folder. Ratios and other values are stretched to match the length of the closing price column such that the value in the pe_ratio column for example is the PE ratio from the most recent quarter and this applies for every column.

Technical indicators are calculated with the default parameters used in Pandas_TA package.

Data is collected form finance.yahoo.com and macrotrends.net Timeframe for the given data is different from one ticker to another because of unavailability of some stocks for a given time frame on either of the websites.

All code required to collect the data and perform preprocessing and feature engineering to get the data in the given format can be found in the following notebooks:

https://www.kaggle.com/code/mohammedobeidat/us-stocks-data-collection

https://www.kaggle.com/code/mohammedobeidat/us-stocks-technicals-feature-engineering-and-eda

https://www.kaggle.com/code/mohammedobeidat/us-stocks-fundamentals-preprocessing-and-eda

Files

{<>_ticker_train}.csv - the training set

{<>_ticker_train}.csv - the test set

Columns

Columns names are supposed to be self-explanatory assuming you are familiar with the stock market. Some acronyms you may encounter:

tmm is short for Trailing Twelve Months

pe is short for Price to Earnings

pb is short for Price to Book Value

ps is short for Price to Sales

fcf is short for Free Cash Flow

eps is short for Earnings per Share
Twitter Stocks Dataset
kaggle.com
Updated Nov 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MaharshiPandya (2022). Twitter Stocks Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/4463401
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/4463401
Dataset updated
Nov 7, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
MaharshiPandya
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Content

This is a dataset of Twitter stock prices over a range of 9 years. The stock prices' date ranges from November 2013 to October 2022. The data is in CSV format which is tabular and can be loaded quickly.

Usage

The dataset can be used for:

Time Series Analysis of the stock prices

Forecasting whether the stock will go into an uptrend or downtrend

Finding underlying patterns or trends

Any other application that you can think of. Feel free to discuss!

Column Description

There are 7 columns in this dataset.

Note: The currency is in USD ($)

Date: The date for which the stock data is considered.

Open: The stock's opening price on that day.

High: The stock's highest price on that day.

Low: The stock's lowest price on that day.

Close: The stock's closing price on that day. The close price is adjusted for splits.

Adj Close: Adjusted close price adjusted for splits and dividend and/or capital gain distributions.

Volume: Volume measures the number of shares traded in a stock or contracts traded in futures or options.

Acknowledgement

Image credits: IndiaTimes
NIFTY-50 Stock Market Data (2000 - 2021)
kaggle.com
zip
Updated May 1, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vopani (2021). NIFTY-50 Stock Market Data (2000 - 2021) [Dataset]. https://www.kaggle.com/rohanrao/nifty50-stock-market-data
Explore at:
zip(19302363 bytes)Available download formats
Dataset updated
May 1, 2021
Authors
Vopani
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

Stock market data is widely analyzed for educational, business and personal interests.

Content

The data is the price history and trading volumes of the fifty stocks in the index NIFTY 50 from NSE (National Stock Exchange) India. All datasets are at a day-level with pricing and trading values split across .cvs files for each stock along with a metadata file with some macro-information about the stocks itself. The data spans from 1st January, 2000 to 30th April, 2021.

Update Frequency

Since new stock market data is generated and made available every day, in order to have the latest and most useful information, the dataset will be updated once a month.

Acknowledgements

NSE India: https://www.nseindia.com/
Thanks to NSE for providing all the data publicly.

Inspiration

Various machine learning techniques can be applied and explored to stock market data, especially for trading algorithms and learning time series models.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Jiun Yen (2018). NASDAQ and NYSE stocks histories [Dataset]. https://www.kaggle.com/qks1lver/nasdaq-and-nyse-stocks-histories/code

NASDAQ and NYSE stocks histories

Full daily historic prices of 5800+ stocks on NASDAQ and NYSE listings

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Nov 5, 2018

Dataset provided by

Kaggle

Authors

Jiun Yen

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

NASDAQ and NYSE stocks histories

Update every Saturday night because I'm too tired to do anything on Friday

Full history of stock symbols on NASDAQ and NYSE:

Unzip fh_< version_date >.zip
Each stock symbol has a .csv file under full_history/
- i.e. AMD.csv
Columns in .csv
- date - year-month-day, 2018-08-08
- volume - int, volume of the day
- open - float, opening price of the day
- close - float, closing price of the day
- high - float, highest price of the day
- low - float, lowest price of the day
- adjclose - float, adjusted closing price of the day

Other files:

all_symbols.txt - All the stock symbols with history
excluded_symbols.txt - All the ones that I couldn't retrieve data for
NASDAQ.txt - NASDAQ listing
NYSE.txt - NYSE listing

All data compiled from Yahoo Finance

If you have questions, e-mail me: jiunyyen@gmail.com

Happy mining!

Clear search

Close search

Google apps

Main menu

NASDAQ and NYSE stocks histories

NASDAQ and NYSE stocks histories

Update every Saturday night because I'm too tired to do anything on Friday

Full history of stock symbols on NASDAQ and NYSE:

Other files:

AMEX, NYSE, NASDAQ stock histories

AMEX, NYSE, and NASDAQ stocks histories

Update every Satur... Sun... I mean Friday... >_< sometime during the weekend. I lied, I've been too busy the past few months and haven't updated in forever until today (2020.6.14) - Last scrape 2020.06.12 Friday evening (p.s. Download shows 3GB unzipped, zipped file is ~600MB)

Full history of stock symbols:

Other files:

Disclaimer

Github - for you to DIY:

Data source

Need someone to talk to?

Stock Market Dataset

Overview

Data Structure

Get OHLCV, MBO, equities market events, and more from NYSE Integrated

S&P 500

Historical volatility time series and Live prices on Equity Options

Data from: Trading Imbalance in Chinese Stock Market - A High-Frequency View...

Nairobi Securities Exchange Prices 2008-2012 for 6 selected stocks

Nairobi Stock Exchange Prices 2008-2012 for 6 selected stocks

Stock Market Dataset (NIFTY-500)

Context

Content

Dataset Glossary (Column-Wise)

US Stock Market Data

Files

Columns

Twitter Stocks Dataset

Content

Usage

Column Description

Acknowledgement

NIFTY-50 Stock Market Data (2000 - 2021)

Context

Content

Update Frequency

Acknowledgements

Inspiration

NASDAQ and NYSE stocks histories

Full daily historic prices of 5800+ stocks on NASDAQ and NYSE listings

NASDAQ and NYSE stocks histories

Update every Saturday night because I'm too tired to do anything on Friday

Full history of stock symbols on NASDAQ and NYSE:

Other files: