https://brightdata.com/license
The Yahoo Finance dataset provides information on the most-traded companies. It contains financial information on each company, including stock ticker and risk scores, and general company information such as location and industry. Each record in the dataset is a unique stock, and multiple stocks can be related to the same company. Attributes include: company name, company ID, entity type, summary, stock ticker, currency, earnings, exchange, closing price, previous close, open, bid, ask, day range, week range, volume, and much more.
https://crawlfeeds.com/privacy_policy
The Yahoo Stocks Dataset is an invaluable resource for analysts, traders, and developers looking to enhance their financial data models or trading strategies. Sourced from Yahoo Finance, this dataset includes historical stock prices, market trends, and financial indicators. With its accurate and comprehensive data, it empowers users to analyze patterns, forecast trends, and build robust machine learning models.
Whether you're a seasoned stock market analyst or a beginner in financial data science, this dataset is tailored to meet diverse needs. It features details like stock prices, trading volume, and market capitalization, enabling a deep dive into investment opportunities and market dynamics.
For machine learning and AI enthusiasts, the Yahoo Stocks Dataset is a goldmine. It’s perfect for developing predictive models, such as stock price forecasting and sentiment analysis. The dataset's structured format ensures seamless integration into Python, R, and other analytics platforms, making data visualization and reporting effortless.
Additionally, this dataset supports long-term trend analysis, helping investors make informed decisions. It’s also an essential resource for those conducting research in algorithmic trading and portfolio management.
Key benefits include:
Download the Yahoo Stocks Dataset today and harness the power of financial data for your projects. Whether for AI, financial reporting, or trend analysis, this dataset equips you with the tools to succeed in the dynamic world of stock markets.
https://choosealicense.com/licenses/odc-by/
The Financial data from Yahoo!
*** Key Points to Note ***
All financial data is sourced from Yahoo!Ⓡ Finance, Nasdaq!Ⓡ, and the U.S. Department of the Treasury via publicly available APIs, and is intended for research and educational purposes. I will update the data regularly, and you are welcome to follow this project and use the data. Each time the data is updated, I will record the update time in spec.json.
Data Usage Instructions
Use DuckDB or… See the full description on the dataset page: https://huggingface.co/datasets/bwzheng2010/yahoo-finance-data.
https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains historical daily prices for all tickers currently trading on NASDAQ. The up-to-date list is available from nasdaqtrader.com. The historical data is retrieved from Yahoo Finance via the yfinance Python package.
It contains prices up to 1 April 2020. If you need more recent data, fork and re-run the data collection script, which is also available from Kaggle.
The data for every symbol is saved in CSV format with common fields:
All ticker data is then stored in either the ETFs or the stocks folder, depending on its type. Each filename is the corresponding ticker symbol. Finally, symbols_valid_meta.csv contains some additional metadata for each ticker, such as its full name.
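The folder layout described above can be read with the standard library alone. The following is a minimal sketch, not the dataset author's code: the folder names tried (`stocks`, `etfs`, `ETFs`) and the numeric column names (`Open`, `High`, `Low`, `Close`, `Adj Close`, `Volume`) are assumptions, since the field list is not reproduced here, and `find_ticker_file` and `load_prices` are hypothetical helper names.

```python
import csv
from pathlib import Path


def find_ticker_file(root, symbol, folders=("stocks", "etfs", "ETFs")):
    """Return the path of <symbol>.csv in the first folder that has it, else None."""
    for folder in folders:  # folder names are assumed, adjust to the actual layout
        candidate = Path(root) / folder / f"{symbol}.csv"
        if candidate.is_file():
            return candidate
    return None


def load_prices(path, numeric=("Open", "High", "Low", "Close", "Adj Close", "Volume")):
    """Read one ticker CSV into a list of dicts, converting numeric columns to float."""
    rows = []
    with open(path, newline="") as handle:
        for record in csv.DictReader(handle):
            for name in numeric:  # assumed column names
                # Columns missing from the file, or empty cells, are left untouched.
                if record.get(name):
                    record[name] = float(record[name])
            rows.append(record)
    return rows
```

Calling `find_ticker_file(root, "AAPL")` and passing the result to `load_prices` would yield one dict per trading day. A symbol that is in neither folder returns `None` rather than raising.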
This dataset provides comprehensive access to financial market data from Google Finance in real-time. Get detailed information on stocks, market quotes, trends, ETFs, international exchanges, forex, crypto, and related news. Perfect for financial applications, trading platforms, and market analysis tools. The dataset is delivered in a JSON format via REST API.
This dataset includes daily historical price data for Bitcoin (BTC-USD) from 2014 to 2025, obtained through web scraping from the Yahoo Finance page using Selenium. The primary data source can be accessed at Yahoo Finance - Bitcoin Historical Data. The dataset contains daily information such as opening price (Open), highest price (High), lowest price (Low), closing price (Close), adjusted closing price (Adj Close), and trading volume (Volume).
About Bitcoin: Bitcoin (BTC) is the world's first decentralized digital currency, introduced in 2009 by an anonymous creator known as Satoshi Nakamoto. It operates on a peer-to-peer network powered by blockchain technology, enabling secure, transparent, and trustless transactions without the need for intermediaries like banks. Bitcoin's limited supply of 21 million coins and its growing adoption have made it a popular asset for investment, trading, and as a hedge against inflation.
We are excited to share this dataset and look forward to seeing the insights it can provide. We hope it will inspire collaboration and innovation within the community. By leveraging this daily data, we can explore trends, develop predictive models, and design innovative trading strategies that deepen our understanding of Bitcoin's market behavior. Together, we can unlock new opportunities and contribute to the collective advancement of cryptocurrency research and analysis.
Auto-generated structured data of Yahoo! Finance Price from table Fields
Use the Yahoo Finance Business Information dataset to access comprehensive details on companies, including financial data and business profiles. Popular use cases include market analysis, investment research, and competitive benchmarking.
Use our Yahoo Finance Business Information dataset to access comprehensive financial and corporate data, including company profiles, stock prices, market capitalization, revenue, and key performance metrics. This dataset is tailored for financial analysts, investors, and researchers to analyze market trends and evaluate company performance.
Popular use cases include investment research, competitor benchmarking, and trend forecasting. Leverage this dataset to make informed financial decisions, identify growth opportunities, and gain a deeper understanding of the business landscape.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Time Series Forecasting with Yahoo Stock Price’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/arashnic/time-series-forecasting-with-yahoo-stock-price on 28 January 2022.
--- Dataset description provided by original source is as follows ---
Trading stocks and other financial instruments is a lucrative proposition. Stock markets across the world facilitate such trades, and thus wealth changes hands. Stock prices move up and down all the time, and the ability to predict their movement has immense potential to make one rich. Stock price prediction has kept people interested for a long time. There are hypotheses like the Efficient Market Hypothesis, which says that it is almost impossible to beat the market consistently, and there are others which disagree with it.
There are a number of known approaches, and new research is ongoing, to find the magic formula to make you rich. One of the traditional methods is time series forecasting. Fundamental analysis is another method, where numerous performance ratios are analyzed to assess a given stock. On the emerging front, there are neural networks, genetic algorithms, and ensembling techniques.
Another challenging problem in stock price prediction is the black swan event: an unpredictable event that causes stock market turbulence. These events occur from time to time, are unpredictable, and often come with little or no warning.
A black swan event is an event that is completely unexpected and cannot be predicted. Unexpected events are generally referred to as black swans when they have significant consequences, though an event with few consequences might also be a black swan event. It may or may not be possible to provide explanations for the occurrence after the fact – but not before. In complex systems, like economies, markets and weather systems, there are often several causes. After such an event, many of the explanations for its occurrence will be overly simplistic.
[Image: black swan events, https://www.visualcapitalist.com/wp-content/uploads/2020/03/mm3_black_swan_events_shareable.jpg]
New bleeding-edge, state-of-the-art deep learning models for stock prediction, e.g. "Transformer and Time Embeddings", are overcoming such obstacles. One objective is to apply these novel models to forecast stock prices.
Stock price prediction is the task of forecasting the future value of a given stock. Given the historical daily close price for the S&P 500 Index, prepare and compare forecasting solutions. The S&P 500, or Standard and Poor's 500 index, is an index comprising 500 stocks from different sectors of the US economy and is an indicator of US equities. Other such indices are the Dow 30, NIFTY 50, Nikkei 225, etc. For the purpose of understanding, we use the S&P 500 index; the concepts and knowledge can be applied to other stocks as well.
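Before comparing forecasting solutions, it can help to have simple baselines to beat. The sketch below is an illustration, not part of the original source: it compares a persistence forecast (tomorrow equals today) with a moving-average forecast using mean absolute error. The function names and the price values are made up for the example and are not real index data.

```python
def persistence_forecast(closes):
    """Predict each day's close as the previous day's close."""
    return closes[:-1]  # predictions for closes[1:], aligned index by index


def moving_average_forecast(closes, window):
    """Predict each day's close as the mean of the previous `window` closes."""
    return [
        sum(closes[i - window:i]) / window
        for i in range(window, len(closes))
    ]


def mean_absolute_error(actual, predicted):
    """Average absolute difference between two equal-length sequences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


closes = [100.0, 102.0, 101.0, 105.0, 107.0, 106.0]  # made-up values
window = 3
targets = closes[window:]  # score both baselines on the same days
naive_mae = mean_absolute_error(targets, persistence_forecast(closes)[window - 1:])
ma_mae = mean_absolute_error(targets, moving_average_forecast(closes, window))
```

Both baselines are scored on the same target days so the errors are comparable. A more elaborate model is only interesting if it improves on these numbers on held-out data.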
The historical stock price information is also publicly available. For our current use case, we will utilize the pandas_datareader library to get the required S&P 500 index history from Yahoo Finance. We use the closing price from the dataset, though other information, such as opening price and adjusted closing price, is also available. We prepare a utility function, get_raw_data(), to extract the required information into a pandas DataFrame. The function takes the index ticker name as input. For the S&P 500 index, the ticker name is ^GSPC. The following snippet uses the utility function to get the required data. (See Simple LSTM Regression)
Features and Terminology: In stock trading, the high and low refer to the maximum and minimum prices in a given time period. Open and close are the prices at which a stock began and ended trading in the same period. Volume is the total amount of trading activity. Adjusted values factor in corporate actions such as dividends, stock splits, and new share issuance.
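To make the idea of adjusted values concrete, here is a minimal sketch of back-adjusting closing prices for stock splits only. It is an illustration under stated assumptions: it ignores dividends and new share issuance, it does not claim to reproduce how Yahoo Finance computes its adjusted close, and `back_adjust_for_splits` is a hypothetical helper name.

```python
def back_adjust_for_splits(closes, splits):
    """Divide every close before a split by that split's ratio.

    closes: chronological list of raw closing prices.
    splits: dict mapping the index of the first post-split day to the
            split ratio (2.0 for a 2-for-1 split).
    """
    adjusted = list(closes)
    for index, ratio in splits.items():
        for day in range(index):
            # Earlier prices are scaled down so the series is continuous
            # across the split; several splits compound naturally.
            adjusted[day] /= ratio
    return adjusted


raw = [100.0, 102.0, 51.0, 52.0]  # made-up prices with a 2-for-1 split on day 2
adjusted = back_adjust_for_splits(raw, {2: 2.0})  # [50.0, 51.0, 51.0, 52.0]
```

Without the adjustment, the drop from 102 to 51 would look like a 50% loss even though shareholders' positions were unchanged.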
Mining and updating of this dataset will depend upon Yahoo Finance.
Variations of sequence modeling and bleeding-edge techniques, e.g. attention, can be applied for research and forecasting.
--- Original source retains full ownership of the source dataset ---
Yahoo Shares
This data set contains historical share information for the analysis and modelling of share price predictions. It can be used to train machine learning models that predict future share prices. All data was retrieved from the Yahoo Finance API.
Content of the data record
Column: Description
- Adj Close: Adjusted closing price
- Close: Closing price
- High: Highest price of the day
- Low: Lowest price of the day
- Open: Opening price
- Volume: Trading Volume…
See the full description on the dataset page: https://huggingface.co/datasets/jonas-is-coding/yahoo-shares.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These are the names of 38 major global stock indexes, collected from Yahoo! Finance. For convenience of expression and later computation, we numbered them. Each item gives its serial number first, followed by the corresponding stock index.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Overall, this project was meant to test the relationship between social media posts and their short-term effect on stock prices. We decided to use Reddit posts from finance-specific subreddit communities like r/wallstreetbets, r/investing, and r/stocks to see the changes in the market associated with a variety of posts made by users. The idea came from the GameStop short squeeze, which showed the power of social media in the market. Typically, a stock price should purely represent the present value of the company's future value, but the question we are asking is whether social media can impact that intrinsic value. Our research question was known from the start: do Reddit posts for or against a certain stock provide insight into how the market will move in a short window? To answer it, we selected five large tech companies: Apple, Tesla, Amazon, Microsoft, and Google. These companies would likely give us more data in the subreddits and would have less day-to-day volatility, making it easier to simulate an experiment. They trade at very high values, so a change following a Reddit post would have to be significant, giving us proof that there is an effect.
Next, we had to choose our data sources. First, we tried to obtain the Reddit data through a Reddit API, but because Reddit requires approval to use its data, we switched to a Kaggle dataset that contained metadata from Reddit. For our second dataset we had planned to use Yahoo Finance through yfinance, but because of the large amount of data we were pulling from this public API, our IP address was temporarily blocked. This caused us to switch our second data source to Alpha Vantage. While switching public APIs was a large change, it was a minor roadblock, and fixing the finance-pulling section allowed everything else to continue working in succession. Once both datasets were pulled programmatically into our local VS Code environment, we implemented a pipeline to clean, merge, and analyze all the data. At the end, we implemented a Snakemake workflow to ensure the project was easily reproducible. We then used TextBlob to label our Reddit posts with a sentiment value of positive, negative, or neutral and to provide a sentiment score for the correlation analysis. Finally, we matched the time frame of each post with the stock data, computed any changes, found a correlation coefficient, and graphed our findings.
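The labeling and correlation steps can be sketched without any third-party dependency. This is an illustration, not the project's actual pipeline: it assumes each post already has a polarity score in [-1, 1] (the range TextBlob reports), the zero thresholds for the three labels are an assumption, and the sample numbers are made up.

```python
import math


def label_sentiment(polarity):
    """Map a polarity score in [-1, 1] to a coarse label (thresholds assumed)."""
    if polarity > 0:
        return "positive"
    if polarity < 0:
        return "negative"
    return "neutral"


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences.

    Raises ZeroDivisionError if either sequence has zero variance.
    """
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


polarities = [0.5, -0.2, 0.0, 0.8]     # hypothetical scores, one per post
price_changes = [1.2, -0.4, 0.1, 1.5]  # hypothetical % change after each post
labels = [label_sentiment(p) for p in polarities]
r = pearson(polarities, price_changes)
```

A coefficient near zero across all companies, with larger values for individual tickers, would match the pattern reported in the conclusion, though correlation alone says nothing about which way any effect runs.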
To conclude the data analysis, we found relatively little or no correlation across the companies taken together, but Microsoft and Google do show stronger correlations when analyzed on their own. However, this may be due to other circumstances, such as why the post was made or whether the market already had other trends on those dates. A larger analysis with more data from other social media platforms would be needed to support our hypothesis that there is a strong correlation.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Material published at "https://opencodecom.net/post/2021-07-22-como-baixar-e-zipar-csv-utilizando-python/"
https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains stock price information for the S&P 500 from 1927 to June 2023, with features such as Date, Open, High, Low, Close, Volume, Dividends, and Splits. The dataset can be used for EDA as well as time series analysis.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The data files contain seven low-dimensional financial research data sets (in .txt format) and four high-dimensional daily stock price data sets (in .csv format). The low-dimensional data sets are provided by Lorenzo Garlappi on his website, while the high-dimensional data sets were downloaded from Yahoo! Finance by the contributor. The description of the low-dimensional data sets can be found in DeMiguel et al. (2009, RFS).
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by Vatsal Lakhmani
Released under MIT
Attribution 1.0 (CC BY 1.0) https://creativecommons.org/licenses/by/1.0/
License information was derived automatically
This data set includes stock information for the companies Tesla, Porsche, Nio, and Ferrari for each day from 11/08/2019 to 11/08/2020. Specifically, it shows the opening, closing, maximum, and minimum price of each session, as well as the volume, the dividends granted to investors, and whether a stock split took place that day. This dataset was created to analyze how quotes in the automotive sector evolved during the COVID-19 pandemic.
The AccionesSectorAutomovil.xlsx dataset contains 4 sheets (TESLA, PAH3.DE, NIO, RACE) and 9 variables per sheet:
- Fecha: date in dd/MM/yyyy format
- Abrir: value of the share at the market opening expressed in US dollars (USD)
- Max: maximum value of the share throughout the day expressed in USD
- Cierre*: value of the share at the close of the market expressed in USD
- Cierre ajus.*: estimated share value at market close, expressed in USD.
- Volumen: the amount of a specific asset invested in during a day.
- Dividends: money received by shareholders in the form of dividends that day.
- Stock Splits: Whether or not a stock split operation was carried out that day.
For more information about the project visit the link on [Github](https://github.com/paulamlago/Financial-Web-Scrapping)
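Because the sheets use Spanish column names and dd/MM/yyyy dates, a small normalization step may be useful once the rows have been read into dictionaries. The sketch below is an assumption-laden illustration: the English target names are chosen for this example, only the columns listed above are mapped, and reading the .xlsx file itself (which needs a third-party library) is left out.

```python
from datetime import date, datetime

# Spanish column names from the sheets mapped to English names chosen for this sketch.
COLUMN_NAMES = {
    "Fecha": "date",
    "Abrir": "open",
    "Max": "high",
    "Cierre*": "close",
    "Cierre ajus.*": "adj_close",
    "Volumen": "volume",
}


def normalize_row(row):
    """Rename known columns and parse the date; other keys pass through unchanged."""
    renamed = {COLUMN_NAMES.get(key, key): value for key, value in row.items()}
    if isinstance(renamed.get("date"), str):
        # dd/MM/yyyy in the dataset description corresponds to %d/%m/%Y here.
        renamed["date"] = datetime.strptime(renamed["date"], "%d/%m/%Y").date()
    return renamed


sample = {"Fecha": "11/08/2019", "Abrir": 235.0, "Dividends": 0}  # made-up values
clean = normalize_row(sample)
```

Parsing with an explicit day-first format avoids the common mistake of reading 11/08/2019 as November 8 instead of 11 August.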
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Cryptocurrency historical datasets from January 2012 (if available) to October 2021 were obtained and integrated from various sources and Application Programming Interfaces (APIs), including Yahoo Finance, Cryptodownload, CoinMarketCap, various Kaggle datasets, and multiple APIs. Because these datasets used various time granularities (e.g., minutes, hours, days), a daily format was used to integrate them in this research study. The integrated historical datasets for 80 cryptocurrencies, including but not limited to Bitcoin (BTC), Ethereum (ETH), Binance Coin (BNB), Cardano (ADA), Tether (USDT), Ripple (XRP), Solana (SOL), Polkadot (DOT), USD Coin (USDC), Dogecoin (DOGE), Tron (TRX), Bitcoin Cash (BCH), Litecoin (LTC), EOS (EOS), Cosmos (ATOM), Stellar (XLM), Wrapped Bitcoin (WBTC), Uniswap (UNI), Terra (LUNA), SHIBA INU (SHIB), and 60 more, were uploaded to this online Mendeley data repository. Although the primary criterion for including these cryptocurrencies was market capitalization, a subject matter expert, i.e., a professional trader, also guided the initial selection by analyzing various indicators such as Relative Strength Index (RSI), Moving Average Convergence/Divergence (MACD), MYC Signals, Bollinger Bands, Fibonacci Retracement, Stochastic Oscillator, and Ichimoku Cloud. The primary features of this dataset that were used as the decision-making criteria of the CLUS-MCDA II approach are Timestamps, Open, High, Low, Closed, Volume (Currency), % Change (7 days and 24 hours), Market Cap, and Weighted Price values.
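Bringing minute or hourly data to a daily format usually means aggregating each day's bars into a single open, high, low, close, and volume record. The sketch below shows one conventional way to do that. It is an assumption about the general technique, not the study's actual integration code, and the tuple layout and sample values are made up.

```python
from datetime import datetime


def to_daily_bars(bars):
    """Aggregate chronological intraday bars into one OHLCV bar per calendar day.

    bars: iterable of (timestamp, open, high, low, close, volume) tuples,
          sorted by timestamp, where timestamp is a datetime.
    """
    daily = {}
    for ts, open_, high, low, close, volume in bars:
        day = ts.date()
        if day not in daily:
            # The first bar of the day fixes the daily open.
            daily[day] = {"open": open_, "high": high, "low": low,
                          "close": close, "volume": volume}
        else:
            bar = daily[day]
            bar["high"] = max(bar["high"], high)
            bar["low"] = min(bar["low"], low)
            bar["close"] = close      # the last bar seen wins
            bar["volume"] += volume
    return daily


intraday = [  # made-up hourly bars
    (datetime(2021, 10, 1, 9), 10.0, 11.0, 9.5, 10.5, 100),
    (datetime(2021, 10, 1, 10), 10.5, 12.0, 10.0, 11.5, 150),
    (datetime(2021, 10, 2, 9), 11.5, 11.8, 11.0, 11.2, 80),
]
daily = to_daily_bars(intraday)
```

Sources that are already daily can pass through unchanged, so every source ends up on the same time grid before merging.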
The available Excel and CSV files in this data set are just part of the integrated data. The other databases, datasets, and API references that were used in this study are as follows:
[1] https://finance.yahoo.com/
[2] https://coinmarketcap.com/historical/
[3] https://cryptodatadownload.com/
[4] https://kaggle.com/philmohun/cryptocurrency-financial-data
[5] https://kaggle.com/deepshah16/meme-cryptocurrency-historical-data
[6] https://kaggle.com/sudalairajkumar/cryptocurrencypricehistory
[7] https://min-api.cryptocompare.com/data/price?fsym=BTC&tsyms=USD
[8] https://min-api.cryptocompare.com/
[9] https://p.nomics.com/cryptocurrency-bitcoin-api
[10] https://www.coinapi.io/
[11] https://www.coingecko.com/en/api
[12] https://cryptowat.ch/
[13] https://www.alphavantage.co/
This dataset is part of the CLUS-MCDA (Cluster analysis for improving Multiple Criteria Decision Analysis) and CLUS-MCDAII Project: https://aimaghsoodi.github.io/CLUSMCDA-R-Package/ https://github.com/Aimaghsoodi/CLUS-MCDA-II https://github.com/azadkavian/CLUS-MCDA
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The dataset reports a collection of earnings call transcripts, the related stock prices, and the sector index. In terms of volume, there is a total of 188 transcripts, 11970 stock prices, and 1196 sector index values. All of these data originated in the period 2016-2020 and are related to the NASDAQ stock market. The data collection was made possible by Yahoo Finance and Thomson Reuters Eikon: Yahoo Finance enabled the search for stock values, and Thomson Reuters Eikon provided the earnings call transcripts. The dataset can be used as a benchmark for evaluating several NLP techniques to understand their potential for financial applications. It is also possible to expand the dataset by extending the period in which the data originated, following a similar procedure. Contact at Tilburg University: Francesco Lelli
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
2023