Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset encompasses the historical data of major stock indices from around the world, sourced directly from Yahoo Finance. With data reaching back to the early 1920s (where available), it serves as an invaluable repository for academic researchers, financial analysts, and market enthusiasts. Users can delve into trends across decades, evaluate historical market behaviors, or even design and validate predictive financial models.
Photo by Tötös Ádám on Unsplash
all_indices_data.csv:
date: The date of the data point (formatted as YYYY-MM-DD).open: The opening value of the index on that date.high: The highest value of the index during the trading session.low: The lowest value of the index during the trading session.close: The closing value of the index.volume: The trading volume of the index on that date.ticker: The ticker symbol of the stock index.individual_indices_data/[SYMBOL]_data.csv:
[SYMBOL] denotes the ticker symbol of the respective stock index. Each dataset is curated from Yahoo Finance's historical data archives.date: The date of the data point (formatted as YYYY-MM-DD).open: The opening value of the index on that date.high: The highest value of the index during the trading session.low: The lowest value of the index during the trading session.close: The closing value of the index.volume: The trading volume of the index on that date.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The provided dataset is extracted from yahoo finance using pandas and yahoo finance library in python. This deals with stock market index of the world best economies. The code generated data from Jan 01, 2003 to Jun 30, 2023 that’s more than 20 years. There are 18 CSV files, dataset is generated for 16 different stock market indices comprising of 7 different countries. Below is the list of countries along with number of indices extracted through yahoo finance library, while two CSV files deals with annualized return and compound annual growth rate (CAGR) has been computed from the extracted data.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15657145%2F90ce8a986761636e3edbb49464b304d8%2FNumber%20of%20Index.JPG?generation=1688490342207096&alt=media" alt="">
This dataset is useful for research purposes, particularly for conducting comparative analyses involving capital market performance and could be used along with other economic indicators.
There are 18 distinct CSV files associated with this dataset. First 16 CSV files deals with number of indices and last two CSV file deals with annualized return of each year and CAGR of each index. If data in any column is blank, it portrays that index was launch in later years, for instance: Bse500 (India), this index launch in 2007, so earlier values are blank, similarly China_Top300 index launch in year 2021 so early fields are blank too.
The extraction process involves applying different criteria, like in 16 CSV files all columns are included, Adj Close is used to calculate annualized return. The algorithm extracts data based on index name (code given by the yahoo finance) according start and end date.
Annualized return and CAGR has been calculated and illustrated in below image along with machine readable file (CSV) attached to that.
To extract the data provided in the attachment, various criteria were applied:
Content Filtering: The data was filtered based on several attributes, including the index name, start and end date. This filtering process ensured that only relevant data meeting the specified criteria.
Collaborative Filtering: Another filtering technique used was collaborative filtering using yahoo finance, which relies on index similarity. This approach involves finding indices that are similar to other index or extended dataset scope to other countries or economies. By leveraging this method, the algorithm identifies and extracts data based on similarities between indices.
In the last two CSV files, one belongs to annualized return, that was calculated based on the Adj close column and new DataFrame created to store its outcome. Below is the image of annualized returns of all index (if unreadable, machine-readable or CSV format is attached with the dataset).
As far as annualised rate of return is concerned, most of the time India stock market indices leading, followed by USA, Canada and Japan stock market indices.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15657145%2F37645bd90623ea79f3708a958013c098%2FAnnualized%20Return.JPG?generation=1688525901452892&alt=media" alt="">
The best performing index based on compound growth is Sensex (India) that comprises of top 30 companies is 15.60%, followed by Nifty500 (India) that is 11.34% and Nasdaq (USA) all is 10.60%.
The worst performing index is China top300, however this is launch in 2021 (post pandemic), so would not possible to examine at that stage (due to less data availability). Furthermore, UK and Russia indices are also top 5 in the worst order.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15657145%2F58ae33f60a8800749f802b46ec1e07e7%2FCAGR.JPG?generation=1688490409606631&alt=media" alt="">
Geography: Stock Market Index of the World Top Economies
Time period: Jan 01, 2003 – June 30, 2023
Variables: Stock Market Index Title, Open, High, Low, Close, Adj Close, Volume, Year, Month, Day, Yearly_Return and CAGR
File Type: CSV file
This is not a financial advice; due diligence is required in each investment decision.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the name of the 38 global main stock indexes in the world. We collected from Yahoo! Finance. For the convenience of expression and computation later, we numbered it. For each item, the front is its serial number, followed by the corresponding stock index.
Facebook
TwitterThis dataset was created by EL Younes
Facebook
TwitterCollected from Yahoo Finance, Investing.com and WJS, this dataset consists of the following indices ranging from July 17, 2017 to July 22, 2022:
Facebook
TwitterDaily price data for World indices stock exchanges from all over the world (United States, China, Canada, Germany, Japan, and more). The data was all collected from Yahoo Finance, which had several decades of data available for most exchanges. Prices are quoted in terms of the USD currency of where each exchange is located.
Data collected from Yahoo Finance.
Facebook
TwitterAs of October 2025, Google represented ***** percent of the global online search engine referrals on desktop devices. Despite being much ahead of its competitors, this represents a modest increase from the previous months. Meanwhile, its longtime competitor Bing accounted for ***** percent, as tools like Yahoo and Yandex held shares of over **** percent and **** percent respectively. Google and the global search market Ever since the introduction of Google Search in 1997, the company has dominated the search engine market, while the shares of all other tools has been rather lopsided. The majority of Google revenues are generated through advertising. Its parent corporation, Alphabet, was one of the biggest internet companies worldwide as of 2024, with a market capitalization of **** trillion U.S. dollars. The company has also expanded its services to mail, productivity tools, enterprise products, mobile devices, and other ventures. As a result, Google earned one of the highest tech company revenues in 2024 with roughly ****** billion U.S. dollars. Search engine usage in different countries Google is the most frequently used search engine worldwide. But in some countries, its alternatives are leading or competing with it to some extent. As of the last quarter of 2023, more than ** percent of internet users in Russia used Yandex, whereas Google users represented little over ** percent. Meanwhile, Baidu was the most used search engine in China, despite a strong decrease in the percentage of internet users in the country accessing it. In other countries, like Japan and Mexico, people tend to use Yahoo along with Google. By the end of 2024, nearly half of the respondents in Japan said that they had used Yahoo in the past four weeks. In the same year, over ** percent of users in Mexico said they used Yahoo.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Stocks and financial instrument trading is a lucrative proposition. Stock markets across the world facilitate such trades and thus wealth exchanges hands. Stock prices move up and down all the time and having ability to predict its movement has immense potential to make one rich. Stock price prediction has kept people interested from a long time. There are hypothesis like the Efficient Market Hypothesis, which says that it is almost impossible to beat the market consistently and there are others which disagree with it.
There are a number of known approaches and new research going on to find the magic formula to make you rich. One of the traditional methods is the time series forecasting. Fundamental analysis is another method where numerous performance ratios are analyzed to assess a given stock. On the emerging front, there are neural networks, genetic algorithms, and ensembling techniques.
Another challenging problem in stock price prediction is Black Swan Event, unpredictable events that cause stock market turbulence. These are events that occur from time to time, are unpredictable and often come with little or no warning.
A black swan event is an event that is completely unexpected and cannot be predicted. Unexpected events are generally referred to as black swans when they have significant consequences, though an event with few consequences might also be a black swan event. It may or may not be possible to provide explanations for the occurrence after the fact – but not before. In complex systems, like economies, markets and weather systems, there are often several causes. After such an event, many of the explanations for its occurrence will be overly simplistic.
#
#
https://www.visualcapitalist.com/wp-content/uploads/2020/03/mm3_black_swan_events_shareable.jpg">
#
#
New bleeding age state-of-the-art deep learning models stock predictions is overcoming such obstacles e.g. "Transformer and Time Embeddings". An objectives are to apply these novel models to forecast stock price.
Stock price prediction is the task of forecasting the future value of a given stock. Given the historical daily close price for S&P 500 Index, prepare and compare forecasting solutions. S&P 500 or Standard and Poor's 500 index is an index comprising of 500 stocks from different sectors of US economy and is an indicator of US equities. Other such indices are the Dow 30, NIFTY 50, Nikkei 225, etc. For the purpose of understanding, we are utilizing S&P500 index, concepts, and knowledge can be applied to other stocks as well.
The historical stock price information is also publicly available. For our current use case, we will utilize the pandas_datareader library to get the required S&P 500 index history using Yahoo Finance databases. We utilize the closing price information from the dataset available though other information such as opening price, adjusted closing price, etc., are also available. We prepare a utility function get_raw_data() to extract required information in a pandas dataframe. The function takes index ticker name as input. For S&P 500 index, the ticker name is ^GSPC. The following snippet uses the utility function to get the required data.(See Simple LSTM Regression)
Features and Terminology: In stock trading, the high and low refer to the maximum and minimum prices in a given time period. Open and close are the prices at which a stock began and ended trading in the same period. Volume is the total amount of trading activity. Adjusted values factor in corporate actions such as dividends, stock splits, and new share issuance.
Mining and updating of this dateset will depend upon Yahoo Finance .
Sort of variation of sequence modeling and bleeding age e.g. attention can be applied for research and forecasting
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Baltic Dry rose to 2,600 Index Points on December 2, 2025, up 0.66% from the previous day. Over the past month, Baltic Dry's price has risen 33.68%, and is up 110.19% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks the benchmark market for this commodity. Baltic Exchange Dry Index - values, historical data, forecasts and news - updated on December of 2025.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Overview This dataset provides daily snapshots of cryptocurrency, stock market, and forex data.
Sources Yahoo Finance (via yfinance)
Features Automated daily updates Covers major global indices and top cryptocurrencies Includes sentiment analysis for financial news
Use Cases Financial market analysis Machine learning for price prediction Trading strategy research
License Data compiled from public APIs for educational and analytical use.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
In this dataset you can find the Top 100 companies in the technology sector. You can also find 5 of the most important and used indices in the financial market as well as a list of all the companies in the S&P 500 index and in the technology sector.
The Global Industry Classification Standard also known as GICS is the primary financial industry standard for defining sector classifications. The Global Industry Classification Standard was developed by index providers MSCI and Standard and Poor’s. Its hierarchy begins with 11 sectors which can be further delineated to 24 industry groups, 69 industries, and 158 sub-industries.
You can read the definition of each sector here.
The 11 broad GICS sectors commonly used for sector breakdown reporting include the following: Energy, Materials, Industrials, Consumer Discretionary, Consumer Staples, Health Care, Financials, Information Technology, Telecommunication Services, Utilities and Real Estate.
In this case we will focuse in the Technology Sector. You can see all the sectors and industry groups here.
To determine which companies, correspond to the technology sector, we use Yahoo Finance, where we rank the companies according to their “Market Cap”. After having the list of the Top 100 best valued companies in the sector, we proceeded to download the historical data of each of the companies using the NASDAQ website.
Regarding to the indices, we searched various sources to find out which were the most used and determined that the 5 most frequently used indices are: Dow Jones Industrial Average (DJI), S&P 500 (SPX), NASDAQ Composite (IXIC), Wilshire 5000 Total Market Inde (W5000) and to specifically view the technology sector SPDR Select Sector Fund - Technology (XLK). Historical data for these indices was also obtained from the NASDQ website.
In total there are 107 files in csv format. They are composed as follows:
Every company and index file has the same structure with the same columns:
Date: It is the date on which the prices were recorded. High: Is the highest price at which a stock traded during the course of the trading day. Low: Is the lowest price at which a stock traded during the course of the trading day. Open: Is the price at which a stock started trading when the opening bell rang. Close: Is the last price at which a stock trades during a regular trading session. Volume: Is the number of shares that changed hands during a given day. Adj Close: The adjusted closing price factors in corporate actions, such as stock splits, dividends, and rights offerings.
The two other files have different columns names:
List of S&P 500 companies
Symbol: Ticker symbol of the company. Name: Name of the company. Sector: The sector to which the company belongs.
Technology Sector Companies List
Symbol: Ticker symbol of the company. Name: Name of the company. Price: Current price at which a stock can be purchased or sold. (11/24/20) Change: Net change is the difference between closing prices from one day to the next. % Change: Is the difference between closing prices from one day to the next in percentage. Volume: Is the number of shares that changed hands during a given day. Avg Vol: Is the daily average of the cumulative trading volume during the last three months. Market Cap (Billions): Is the total value of a company’s shares outstanding at a given moment in time. It is calculated by multiplying the number of shares outstanding by the price of a single share. PE Ratio: Is the ratio of a company's share (stock) price to the company's earnings per share. The ratio is used for valuing companies and to find out whether they are overvalued or undervalued.
SEC EDGAR | Company Filings NASDAQ | Historical Quotes Yahoo Finance | Technology Sector Wikipedia | List of S&P 500 companies S&P Dow Jones Indices | S&P 500 [S&P Dow Jones Indices | DJI](https://www.spglobal.com/spdji/en/i...
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Coffee fell to 408.66 USd/Lbs on December 2, 2025, down 0.95% from the previous day. Over the past month, Coffee's price has risen 0.50%, and is up 38.54% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks the benchmark market for this commodity. Coffee - values, historical data, forecasts and news - updated on December of 2025.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Walmart Inc. (NYSE: WMT) is a multinational retail corporation headquartered in Bentonville, Arkansas. As one of the world’s largest companies by revenue and employment, Walmart operates a global chain of hypermarkets, discount department stores, and grocery stores. Founded in 1962 by Sam Walton, the company has become a leading player in both brick-and-mortar and e-commerce retail sectors.
Walmart's business model focuses on offering low prices and a wide range of products to attract value-conscious customers. In recent years, the company has expanded its digital presence, supply chain efficiency, and sustainability efforts to adapt to changing consumer trends.
Walmart stock (WMT) is included in major indices such as the Dow Jones Industrial Average and S&P 500, and is widely followed for its performance as a stable, dividend-paying blue-chip stock. Investors often view Walmart as a defensive play due to its resilience during economic downturns and consistent cash flow generation.
📊 Financial Analysis: Analyze trends in Walmart stock performance over time.
🤖 Machine Learning Models: Train models for price prediction, volatility estimation, or portfolio simulation.
📈 Time Series Forecasting: Apply statistical models (ARIMA, LSTM, Prophet) to forecast future stock movements.
🧮 Correlation Studies: Compare Walmart stocks with other kind of stores to predict and analyze his behaviour.
📰 Event Impact Analysis: Study how product launches, earnings calls, or regulatory news affect Walmart's stock.
-💼 Investment Strategy Backtesting: Test strategies like momentum investing, mean reversion, or moving average crossovers.
Facebook
TwitterTime series modelling for the prediction of stocks prices is a challenging task. Political events, market expectations and economic factors are just a few known factors that can impact financial market behaviour. The financial market is a complex, noisy, evolutionary and chaotic field of study that attracts many enthusiasts and researches — the first, usually driven by the economic benefit of it, the latter, inspired by the challenge of handling such complex data.
This project aims to predict Facebook (FB) next day stock price direction with machine learning algorithms. Technical indicators and global market indexes are used, and their influence on the forecast accuracy is analysed.
Daily values were retrieved (volume, open, close, low and high prices) from Yahoo! Finance website. For Facebook (FB), July 2012 was the earliest data available. The date range is July 2012 to November 2018.
The closing price of current day C(t) and closing price from the previous day C(t-1) are compared to build the initial dataset. The objective is to define if the price trend is going up or down by analysing these two values. For each instance, a comparison was made and recorded. If the price is going up, C(t) > C(t-1), class “1” is assigned. Class “0” is assigned for the opposite case.
Research was initiated to understand which features could help the model to forecast the stock direction. Three main routes were found: Lag features, Technical Indicators and Global Market Indexes. Below is an explanation of each group of features.
Lag features are features that contain the closing price and direction of previous days and it is a common strategy for Time Series models. The following features were added:
Technical indicators are used by researches and financial market analysts to support stock market trend forecasting. Common indicators retrieved from the literature were selected and calculated for Facebook stock. Techical Indicators added:
Technical indicators provide a suggestion of the stock price movement. Additional features were created for each technical indicator by analysing its daily value and assigning a class according to their meaning. Class “1” is given if the indicator numerical value suggests upper trend, class “0” for a downtrend. In other words, financial market analysis is performed at a simplistic level, in the attempt to translate what the continuous value means.
For a given country or region, the stock market index characterises the performance of its financial market and the overall local economy. For this reason, the same day performance of these markets could contribute to the machine learning model predictions. Six global indexes were added as features, with their closing direction as up or down, class “1” or “0”, respectively. Data for these indexes (Nikkei, Hang Seng, All Ordinaries, Euronext 100, SSE and DAX) were also retrieved from Yahoo! Finance.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset encompasses the historical data of major stock indices from around the world, sourced directly from Yahoo Finance. With data reaching back to the early 1920s (where available), it serves as an invaluable repository for academic researchers, financial analysts, and market enthusiasts. Users can delve into trends across decades, evaluate historical market behaviors, or even design and validate predictive financial models.
Photo by Tötös Ádám on Unsplash
all_indices_data.csv:
date: The date of the data point (formatted as YYYY-MM-DD).open: The opening value of the index on that date.high: The highest value of the index during the trading session.low: The lowest value of the index during the trading session.close: The closing value of the index.volume: The trading volume of the index on that date.ticker: The ticker symbol of the stock index.individual_indices_data/[SYMBOL]_data.csv:
[SYMBOL] denotes the ticker symbol of the respective stock index. Each dataset is curated from Yahoo Finance's historical data archives.date: The date of the data point (formatted as YYYY-MM-DD).open: The opening value of the index on that date.high: The highest value of the index during the trading session.low: The lowest value of the index during the trading session.close: The closing value of the index.volume: The trading volume of the index on that date.