License: https://creativecommons.org/publicdomain/zero/1.0/
Context
The stock market has consistently proven to be a good place to invest and save for the future. There are many compelling reasons to invest in stocks: it can help fight inflation, create wealth, and provide some tax benefits. Steady returns on investments over a long period of time can also grow far more than seems possible. Thanks to the power of compound interest, the earlier one starts investing, the larger the corpus one can have for retirement. Overall, investing in stocks can help meet life's financial aspirations.
It is important to maintain a diversified portfolio when investing in stocks in order to maximise earnings under any market condition. A diversified portfolio tends to yield higher returns and carry lower risk by tempering potential losses when the market is down. It is easy to get lost in a sea of financial metrics while determining the worth of a stock, and repeating that analysis for a multitude of stocks to identify the right picks for an individual can be tedious. Cluster analysis can identify stocks that exhibit similar characteristics and stocks that exhibit minimal correlation. This helps investors analyze stocks across different market segments and protect the portfolio against risks that could make it vulnerable to losses.
Objective
Trade&Ahead is a financial consultancy firm that provides its customers with personalized investment strategies. They have hired you as a Data Scientist and provided you with data comprising stock prices and some financial indicators for a few companies listed on the New York Stock Exchange. They have assigned you the tasks of analyzing the data, grouping the stocks based on the attributes provided, and sharing insights about the characteristics of each group.
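The grouping task described above could be sketched with a small k-means routine on standardized attributes. This is only an illustration under stated assumptions: the tickers and numbers are made up, the four attributes are an arbitrary subset, the number of clusters (2) is chosen by hand, and the centroids are seeded with the first k rows to keep the run deterministic. A real analysis would more likely rely on a library implementation and compare several cluster counts.

```python
from statistics import mean, pstdev

# Hypothetical rows: (ticker, price change %, volatility, ROE, P/E ratio).
# Tickers and values are invented for illustration; they are not from the dataset.
STOCKS = [
    ("AAA", 10.0, 1.2, 15.0, 18.0),
    ("CCC", -12.0, 3.5, 90.0, 60.0),
    ("BBB", 9.0, 1.1, 14.0, 20.0),
    ("DDD", -10.0, 3.2, 85.0, 55.0),
    ("EEE", 8.5, 1.3, 16.0, 19.0),
]


def standardize(rows):
    """Z-score each column so that no single attribute dominates the distance."""
    columns = list(zip(*rows))
    stats = [(mean(c), pstdev(c) or 1.0) for c in columns]
    return [tuple((v - m) / s for v, (m, s) in zip(row, stats)) for row in rows]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=20):
    """Plain k-means; seeds the centroids with the first k points (a simplification)."""
    centroids = list(points[:k])
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest centroid.
        labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties out
                centroids[j] = tuple(mean(c) for c in zip(*members))
    return labels


features = standardize([row[1:] for row in STOCKS])
labels = kmeans(features, k=2)

groups = {}
for (ticker, *_), label in zip(STOCKS, labels):
    groups.setdefault(label, []).append(ticker)
print(groups)
```

Standardizing first matters because attributes such as net income (in dollars) and volatility live on very different scales; without it the largest-valued column would decide the clusters almost alone.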
Data Dictionary
• Ticker Symbol: An abbreviation used to uniquely identify publicly traded shares of a particular stock on a particular stock market
• Company: Name of the company
• GICS Sector: The specific economic sector assigned to a company by the Global Industry Classification Standard (GICS) that best defines its business operations
• GICS Sub Industry: The specific sub-industry group assigned to a company by the Global Industry Classification Standard (GICS) that best defines its business operations
• Current Price: Current stock price in dollars
• Price Change: Percentage change in the stock price in 13 weeks
• Volatility: Standard deviation of the stock price over the past 13 weeks
• ROE: A measure of financial performance calculated by dividing net income by shareholders' equity (shareholders' equity is equal to a company's assets minus its debt)
• Cash Ratio: The ratio of a company's total reserves of cash and cash equivalents to its total current liabilities
• Net Cash Flow: The difference between a company's cash inflows and outflows (in dollars)
• Net Income: Revenues minus expenses, interest, and taxes (in dollars)
• Earnings Per Share: Company's net profit divided by the number of common shares it has outstanding (in dollars)
• Estimated Shares Outstanding: Company's stock currently held by all its shareholders
• P/E Ratio: Ratio of the company's current stock price to the earnings per share
• P/B Ratio: Ratio of the company's stock price per share to its book value per share (book value of a company is the net difference between that company's total assets and total liabilities)
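The ratio definitions in the data dictionary can be written out as a few one-line functions. The sketch below assumes a hypothetical `Fundamentals` record; its field names and the sample figures are illustrative and are not the dataset's actual column names or values.

```python
from dataclasses import dataclass


@dataclass
class Fundamentals:
    """Hypothetical inputs for one company (all money amounts in dollars)."""
    price: float                 # current stock price per share
    net_income: float            # revenues minus expenses, interest, and taxes
    shareholders_equity: float   # assets minus debt, as defined above
    shares_outstanding: float    # number of common shares outstanding
    cash_and_equivalents: float  # total reserves of cash and cash equivalents
    current_liabilities: float   # total current liabilities
    book_value: float            # total assets minus total liabilities


def roe(f: Fundamentals) -> float:
    """Return on equity: net income divided by shareholders' equity."""
    return f.net_income / f.shareholders_equity


def earnings_per_share(f: Fundamentals) -> float:
    """Net profit divided by the number of common shares outstanding."""
    return f.net_income / f.shares_outstanding


def cash_ratio(f: Fundamentals) -> float:
    """Cash and cash equivalents divided by total current liabilities."""
    return f.cash_and_equivalents / f.current_liabilities


def pe_ratio(f: Fundamentals) -> float:
    """Current stock price divided by earnings per share."""
    return f.price / earnings_per_share(f)


def pb_ratio(f: Fundamentals) -> float:
    """Stock price per share divided by book value per share."""
    return f.price / (f.book_value / f.shares_outstanding)


# Made-up example company.
example = Fundamentals(
    price=50.0,
    net_income=200e6,
    shareholders_equity=1e9,
    shares_outstanding=100e6,
    cash_and_equivalents=300e6,
    current_liabilities=600e6,
    book_value=1e9,
)
print(roe(example), earnings_per_share(example), cash_ratio(example))
print(pe_ratio(example), pb_ratio(example))
```

Note that the P/E ratio is undefined when earnings per share is zero and changes sign when earnings are negative, so rows like that may need separate handling before clustering.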
License: Open Database License (ODbL) v1.0, https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
EGPB - An Event-based Gold Price Benchmark Dataset
This benchmark dataset consists of 8030 rows and 36 variables sourced from multiple credible economic websites, covering a period from January 2001 to December 2022. This dataset can be utilized to predict gold prices specifically or to aid any economic field that is influenced by the variables in this dataset.
Key variables & Features include:
• Previous gold prices
• Future gold prices with predictions for one day, one week, and one month
• Oil prices
• Standard & Poor's 500 Index (S&P 500)
• Dow Jones Industrial (DJI)
• US dollar index
• US treasury
• Inflation rate
• Consumer price index (CPI)
• Federal funds rate
• Silver prices
• Copper prices
• Iron prices
• Platinum prices
• Palladium prices
Additionally, the dataset considers global events that may impact gold prices. These events were collected from three distinct sources: the Al-Jazeera website (2019 to 2022), the Investing website (2016 to 2018), and the Yahoo Finance website (2001 to 2007).
The event data were then divided into multiple groups:
• Economic data
• Politics
• Logistics
• Oil
• OPEC
• Dollar currency
• Sterling pound currency
• Russian ruble currency
• Yen currency
• Euro currency
• US stocks
• Global stocks
• Inflation
• Job reports
• Unemployment rates
• CPI rate
• Interest rates
• Bonds
These events were encoded using a numeric value, where 0 represented no events, 1 represented low events, 2 represented high events, 3 represented stable events, 4 represented unstable events, and 5 represented events that were observed during the day but had no effect on the dataset.
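The numeric event codes described above could be decoded with a small enumeration. This is a sketch only: the member names and the `decode_events` helper are hypothetical, and the sample record is invented; only the code-to-meaning mapping comes from the description.

```python
from enum import IntEnum


class EventLevel(IntEnum):
    """Event codes as described in the dataset summary (member names are illustrative)."""
    NO_EVENT = 0
    LOW = 1
    HIGH = 2
    STABLE = 3
    UNSTABLE = 4
    OBSERVED_NO_EFFECT = 5


def decode_events(row):
    """Map {event group: code} to {event group: EventLevel}.

    Raises ValueError if a code falls outside 0-5.
    """
    return {group: EventLevel(code) for group, code in row.items()}


# Hypothetical single-day record covering three of the event groups.
row = {"Oil": 2, "OPEC": 0, "Inflation": 4}
decoded = decode_events(row)
print({group: level.name for group, level in decoded.items()})
```

The codes read as category labels rather than a scale (3 for "stable" is not "more" than 2 for "high"), so a model may be better served by treating them as categorical, for example through one-hot encoding, than as ordered numbers.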
Cite this dataset: Farah Mansour and Wael Etaiwi, "EGPBD: An Event-based Gold Price Benchmark Dataset," 2023 3rd International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), Tenerife, Canary Islands, Spain, 2023, pp. 1-7, doi: 10.1109/ICECCME57830.2023.10252987.
@INPROCEEDINGS{10252987, author={Mansour, Farah and Etaiwi, Wael}, booktitle={2023 3rd International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)}, title={EGPBD: An Event-based Gold Price Benchmark Dataset}, year={2023}, volume={}, number={}, pages={1-7}, doi={10.1109/ICECCME57830.2023.10252987}}
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan's main stock market index, the JP225, rose to 49553 points on December 2, 2025, gaining 0.51% from the previous session. Over the past month, the index has declined 3.78%, though it remains 26.25% higher than a year ago, according to trading on a contract for difference (CFD) that tracks this benchmark index from Japan. Japan Stock Market Index (JP225) - values, historical data, forecasts and news - updated in December 2025.
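The percentage figures quoted above can be turned back into approximate earlier index levels. The helper below is a hypothetical illustration, not part of any data provider's API, and because the published percentages are rounded, the results are estimates only.

```python
def level_before(current, pct_change):
    """Return the earlier level that, after moving by pct_change percent, gives `current`."""
    return current / (1 + pct_change / 100)


current = 49553  # JP225 level quoted for December 2, 2025

prev_session = level_before(current, 0.51)   # gained 0.51% from the previous session
month_ago = level_before(current, -3.78)     # declined 3.78% over the past month
year_ago = level_before(current, 26.25)      # 26.25% higher than a year ago

print(round(prev_session, 1), round(month_ago, 1), round(year_ago, 1))
```

Dividing by `1 + pct/100` rather than subtracting the percentage from the current level matters here: a 26.25% gain is measured against the year-ago level, not against today's.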
License: https://www.ycharts.com/terms
View monthly updates and historical trends for the S&P 500 Shiller CAPE Ratio from the United States. Source: Robert Shiller. Track economic data with YCharts an…
License: https://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Real-time Sahm Rule Recession Indicator (SAHMREALTIME) from Dec 1959 to Sep 2025 about recession indicators, academic data, and USA.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
China's main stock market index, the SHANGHAI, fell to 3898 points on December 2, 2025, losing 0.42% from the previous session. Over the past month, the index has declined 1.98%, though it remains 15.36% higher than a year ago, according to trading on a contract for difference (CFD) that tracks this benchmark index from China. China Shanghai Composite Stock Market Index - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Turkey's main stock market index, the BIST 100, rose to 11132 points on December 2, 2025, gaining 0.14% from the previous session. Over the past month, the index has climbed 0.64% and is up 13.27% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks this benchmark index from Turkey. Turkey Stock Market - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Poland's main stock market index, the WIG, fell to 110618 points on December 2, 2025, losing 1.16% from the previous session. Over the past month, the index has declined 1.29%, though it remains 36.78% higher than a year ago, according to trading on a contract for difference (CFD) that tracks this benchmark index from Poland. Warsaw Stock Exchange WIG Index - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Gold fell to 4,199.97 USD/t.oz on December 2, 2025, down 0.75% from the previous day. Over the past month, Gold's price has risen 4.93%, and is up 58.92% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks the benchmark market for this commodity. Gold - values, historical data, forecasts and news - updated in December 2025.
License: https://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative trading Buy/Sell strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
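As one example of the technical indicators listed above, a simple moving average over daily closing prices could be computed as follows. This is a minimal sketch: the function name and the sample prices are assumptions, and the dataset's own indicator columns may be computed differently.

```python
def simple_moving_average(closes, window):
    """Return the mean of each run of `window` consecutive closing prices.

    The result has len(closes) - window + 1 values; the first one covers
    closes[0:window].
    """
    if window <= 0:
        raise ValueError("window must be positive")
    return [
        sum(closes[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(closes))
    ]


# Hypothetical closing prices, oldest first.
closes = [1, 2, 3, 4, 5]
print(simple_moving_average(closes, 3))
```

When such a feature feeds a prediction model, each value should only use prices available up to that day, which this trailing window does; a centered window would leak future prices into the features.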
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Consumer Price Index (CPI) in the United States increased to 324.80 points in September 2025 from 323.98 points in August 2025. This dataset provides the latest reported value for the United States Consumer Price Index (CPI), plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
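The two index levels quoted above imply a month-over-month change that can be worked out directly. The sketch below uses a hypothetical `pct_change` helper; the annualized figure is only a compounding illustration of that single month and should not be read as the official year-over-year inflation rate.

```python
def pct_change(new, old):
    """Percentage change from `old` to `new`."""
    return (new - old) / old * 100


august, september = 323.98, 324.80  # CPI index points quoted above

monthly = pct_change(september, august)
# Compounding the one-month change over 12 months (an illustrative assumption).
annualized = ((september / august) ** 12 - 1) * 100

print(f"month-over-month: {monthly:.2f}%")
print(f"annualized from one month: {annualized:.2f}%")
```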
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Coffee fell to 408.66 USd/Lbs on December 2, 2025, down 0.95% from the previous day. Over the past month, Coffee's price has risen 0.50%, and is up 38.54% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks the benchmark market for this commodity. Coffee - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Canada's main stock market index, the TSX, fell to 30943 points on December 2, 2025, losing 0.51% from the previous session. Over the past month, the index has climbed 2.21% and is up 20.70% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks this benchmark index from Canada. Canada Stock Market Index (TSX) - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Orange Juice fell to 147.99 USd/Lbs on December 2, 2025, down 0.38% from the previous day. Over the past month, Orange Juice's price has fallen 15.22%, and is down 71.10% compared to the same time last year, according to trading on a contract for difference (CFD) that tracks the benchmark market for this commodity. Orange Juice - values, historical data, forecasts and news - updated in December 2025.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The USD/CNY exchange rate fell to 7.0696 on December 2, 2025, down 0.05% from the previous session. Over the past month, the Chinese Yuan has strengthened 0.81%, and is up by 3.15% over the last 12 months. Chinese Yuan - values, historical data, forecasts and news - updated in December 2025.