100+ datasets found

S1 Data -
plos.figshare.com
application/csv
Updated Mar 13, 2024
Cite
Yuancheng Si; Saralees Nadarajah; Zongxin Zhang; Chunmin Xu (2024). S1 Data - [Dataset]. http://doi.org/10.1371/journal.pone.0299164.s001
Available download formats: application/csv
Unique identifier
https://doi.org/10.1371/journal.pone.0299164.s001
Dataset updated
Mar 13, 2024
Dataset provided by
PLOS ONE
Authors
Yuancheng Si; Saralees Nadarajah; Zongxin Zhang; Chunmin Xu
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the dynamic landscape of financial markets, accurate forecasting of stock indices remains a pivotal yet challenging task, essential for investors and policymakers alike. This study is motivated by the need to enhance the precision of predicting the Shanghai Composite Index’s opening price spread, a critical measure reflecting market volatility and investor sentiment. Traditional time series models like ARIMA have shown limitations in capturing the complex, nonlinear patterns inherent in stock price movements, prompting the exploration of advanced methodologies. The aim of this research is to bridge the gap in forecasting accuracy by developing a hybrid model that integrates the strengths of ARIMA with deep learning techniques, specifically Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks. This novel approach leverages the ARIMA model’s proficiency in linear trend analysis and the deep learning models’ capability in modeling nonlinear dependencies, aiming to provide a comprehensive tool for market prediction. Utilizing a comprehensive dataset covering the period from December 20, 1990, to June 2, 2023, the study develops and assesses the efficacy of ARIMA, LSTM, GRU, ARIMA-LSTM, and ARIMA-GRU models in forecasting the Shanghai Composite Index’s opening price spread. The evaluation of these models is based on key statistical metrics, including Mean Squared Error (MSE) and Mean Absolute Error (MAE), to gauge their predictive accuracy. The findings indicate that the hybrid models, ARIMA-LSTM and ARIMA-GRU, perform better in forecasting the opening price spread of the Shanghai Composite Index than their standalone counterparts. This outcome suggests that combining traditional statistical methods with advanced deep learning algorithms can enhance stock market prediction. 
The research contributes to the field by providing evidence of the potential benefits of integrating different modeling approaches for financial forecasting, offering insights that could inform investment strategies and financial decision-making.
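The two accuracy metrics named in this description, MSE and MAE, can be sketched as follows. This is a minimal illustration only: the function names and the sample values are hypothetical and are not taken from the S1 dataset or from the authors' code.

```python
def mean_squared_error(actual, predicted):
    """Average of the squared forecast errors."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def mean_absolute_error(actual, predicted):
    """Average of the absolute forecast errors."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical opening price spreads and model forecasts (illustrative values).
actual = [1.0, -2.0, 0.5, 3.0]
predicted = [0.5, -1.0, 0.5, 2.0]

mse = mean_squared_error(actual, predicted)
mae = mean_absolute_error(actual, predicted)
print(mse, mae)
```

A lower value on either metric indicates a closer fit; MSE penalizes large errors more heavily than MAE does.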
Short/Long Term Stocks: Dow Jones New Zealand Index Stock Forecast...
kappasignal.com
Updated Oct 21, 2022
Cite
KappaSignal (2022). Short/Long Term Stocks: Dow Jones New Zealand Index Stock Forecast (Forecast) [Dataset]. https://www.kappasignal.com/2022/10/shortlong-term-stocks-dow-jones-new.html
Dataset updated
Oct 21, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Short/Long Term Stocks: Dow Jones New Zealand Index Stock Forecast

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
Data from: CPZ Calamos Long/Short Equity & Dynamic Income Trust Common Stock...
kappasignal.com
Updated Mar 23, 2023
Cite
KappaSignal (2023). CPZ Calamos Long/Short Equity & Dynamic Income Trust Common Stock (Forecast) [Dataset]. https://www.kappasignal.com/2023/03/cpz-calamos-longshort-equity-dynamic.html
Dataset updated
Mar 23, 2023
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

CPZ Calamos Long/Short Equity & Dynamic Income Trust Common Stock

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
Short/Long Term Stocks: Tadawul All Share Index Stock Forecast (Forecast)
kappasignal.com
Updated Nov 2, 2022
Cite
KappaSignal (2022). Short/Long Term Stocks: Tadawul All Share Index Stock Forecast (Forecast) [Dataset]. https://www.kappasignal.com/2022/11/shortlong-term-stocks-tadawul-all-share.html
Dataset updated
Nov 2, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Short/Long Term Stocks: Tadawul All Share Index Stock Forecast

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
Replication Data for: The Surrogate Index: Combining Short-Term Proxies to...
search.dataone.org
dataverse.harvard.edu
Updated Nov 12, 2023
Cite
Athey, Susan; Chetty, Raj; Imbens, Guido; Kang, Hyunseung (2023). Replication Data for: The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely [Dataset]. http://doi.org/10.7910/DVN/QCKJYL
Unique identifier
https://doi.org/10.7910/DVN/QCKJYL
Dataset updated
Nov 12, 2023
Dataset provided by
Harvard Dataverse
Authors
Athey, Susan; Chetty, Raj; Imbens, Guido; Kang, Hyunseung
Description
This dataset contains replication files for "The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely" by Susan Athey, Raj Chetty, Guido Imbens, and Hyunseung Kang. For more information, see https://opportunityinsights.org/paper/the-surrogate-index/. A summary of the related publication follows. The impacts of many policies, such as efforts to increase upward income mobility or improve health outcomes, are only observed with long delays. For example, it can take decades to see the effects of early childhood interventions on lifetime earnings. This problem has greatly limited researchers’ and policymakers’ ability to test and improve policies and arises frequently in our own work at Opportunity Insights on the determinants of economic opportunity. In this study, we develop a new method of estimating the long-term impacts of policies more rapidly and precisely using short-term proxies. We predict long-term outcomes (e.g., lifetime earnings) using short-term outcomes (e.g., earnings in early adulthood or test scores). We then show that the causal effects of policies on this predictive index (which we term a “surrogate index”, following terminology in the statistics literature) can help us learn about their long-term impacts more quickly under certain assumptions that are described in the full paper. We apply our method to analyze the long-term impacts of a job training experiment in California. Using short-term employment rates as surrogates, we show that one could have estimated the program’s impact on mean employment rates over a 9 year horizon within 1.5 years, with a 35% reduction in standard errors. The success of the surrogate index in this job training application suggests that our method could be applied to predict the long-term impacts of other programs as well. 
Going forward, we hope to build a public library of early indicators (surrogate indices) for social science by harnessing historical experiments along with the large-scale datasets we have built. If you would like to contribute to this effort by reporting a surrogate index that predicts long-term impacts estimated in an experiment, as in the GAIN program, please contact us.
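The idea of predicting a long-term outcome from a short-term outcome and then comparing the prediction across experimental arms can be sketched as below. This is a simplified illustration with a single surrogate and a linear fit, not the authors' replication code; all names and numbers are hypothetical, and the identifying assumptions described in the full paper are not checked here.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def surrogate_index_effect(hist_short, hist_long, exp_short, exp_treated):
    """Difference in mean predicted long-term outcome, treated minus control."""
    # Step 1: learn the short-term -> long-term relationship in historical data.
    intercept, slope = fit_line(hist_short, hist_long)
    # Step 2: build the surrogate index for each unit in the experiment.
    index = [intercept + slope * s for s in exp_short]
    # Step 3: compare the index across experimental arms.
    treated = [v for v, t in zip(index, exp_treated) if t]
    control = [v for v, t in zip(index, exp_treated) if not t]
    return mean(treated) - mean(control)


# Hypothetical historical sample where both outcomes are observed.
hist_short = [0.0, 1.0, 2.0, 3.0]
hist_long = [1.0, 3.0, 5.0, 7.0]
# Hypothetical experiment where only the short-term outcome is observed so far.
exp_short = [1.0, 2.0, 2.0, 3.0]
exp_treated = [False, False, True, True]

effect = surrogate_index_effect(hist_short, hist_long, exp_short, exp_treated)
print(effect)
```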
Value-at-risk for long and short trading positions (replication data)
journaldata.zbw.eu
jda-test.zbw.eu
.data, txt
Updated Dec 8, 2022
Cite
Pierre Giot; Sébastien Laurent (2022). Value-at-risk for long and short trading positions (replication data) [Dataset]. http://doi.org/10.15456/jae.2022314.1316858395
Available download formats: txt(2441), .data(102325), .data(106150), .data(45920), .data(164969)
Unique identifier
https://doi.org/10.15456/jae.2022314.1316858395
Dataset updated
Dec 8, 2022
Dataset provided by
ZBW - Leibniz Informationszentrum Wirtschaft
Authors
Pierre Giot; Sébastien Laurent
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In this paper we model Value-at-Risk (VaR) for daily asset returns using a collection of parametric univariate and multivariate models of the ARCH class based on the skewed Student distribution. We show that models that rely on a symmetric density distribution for the error term underperform with respect to skewed density models when the left and right tails of the distribution of returns must be modelled. Thus, VaR for traders having both long and short positions is not adequately modelled using usual normal or Student distributions. We suggest using an APARCH model based on the skewed Student distribution (combined with a time-varying correlation in the multivariate case) to fully take into account the fat left and right tails of the returns distribution. This allows for an adequate modelling of large returns defined on long and short trading positions. The performances of the univariate models are assessed on daily data for three international stock indexes and three US stocks of the Dow Jones index. In a second application, we consider a portfolio of three US stocks and model its long and short VaR using a multivariate skewed Student density.
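The distinction between long and short VaR can be illustrated with a small sketch. It uses plain historical simulation with a nearest-rank quantile, which is a deliberately simpler technique than the APARCH skewed Student model the description proposes; the function names and the return series are hypothetical.

```python
import math


def empirical_quantile(values, q):
    """Nearest-rank empirical quantile for 0 < q <= 1."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def long_short_var(returns, alpha=0.05):
    """Return (long VaR, short VaR) as positive loss figures.

    A long position loses when returns are negative (left tail);
    a short position loses when returns are positive (right tail).
    """
    long_var = -empirical_quantile(returns, alpha)
    short_var = -empirical_quantile([-r for r in returns], alpha)
    return long_var, short_var


# Hypothetical daily returns with a fatter right tail than left tail.
returns = [-0.04, -0.02, -0.01, 0.0, 0.005, 0.01, 0.015, 0.02, 0.03, 0.06]

long_var, short_var = long_short_var(returns, alpha=0.1)
print(long_var, short_var)
```

When the two tails differ, the two figures differ as well, which is the case a symmetric error density cannot represent.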
A deep learning framework for financial time series using stacked...
plos.figshare.com
tiff
Updated Jun 4, 2023
Cite
Wei Bao; Jun Yue; Yulei Rao (2023). A deep learning framework for financial time series using stacked autoencoders and long-short term memory [Dataset]. http://doi.org/10.1371/journal.pone.0180944
Available download formats: tiff
Unique identifier
https://doi.org/10.1371/journal.pone.0180944
Dataset updated
Jun 4, 2023
Dataset provided by
PLOS ONE
Authors
Wei Bao; Jun Yue; Yulei Rao
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The application of deep learning approaches to finance has received a great deal of attention from both investors and researchers. This study presents a novel deep learning framework where wavelet transforms (WT), stacked autoencoders (SAEs) and long-short term memory (LSTM) are combined for stock price forecasting. SAEs, which hierarchically extract deep features, are introduced into stock price forecasting for the first time. The deep learning framework comprises three stages. First, the stock price time series is decomposed by WT to eliminate noise. Second, SAEs are applied to generate deep high-level features for predicting the stock price. Third, the high-level denoised features are fed into LSTM to forecast the next day’s closing price. Six market indices and their corresponding index futures are chosen to examine the performance of the proposed model. Results show that the proposed model outperforms other similar models in both predictive accuracy and profitability performance.
Short/Long Term Stocks: LON:PMG Stock Forecast (Forecast)
kappasignal.com
Updated Nov 10, 2022
Cite
KappaSignal (2022). Short/Long Term Stocks: LON:PMG Stock Forecast (Forecast) [Dataset]. https://www.kappasignal.com/2022/11/shortlong-term-stocks-lonpmg-stock.html
Dataset updated
Nov 10, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Short/Long Term Stocks: LON:PMG Stock Forecast

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
Dow Jones Industrial Average Index assigned short-term B1 & long-term Ba1...
kappasignal.com
Updated Oct 24, 2022
Cite
KappaSignal (2022). Dow Jones Industrial Average Index assigned short-term B1 & long-term Ba1 forecasted stock rating. (Forecast) [Dataset]. https://www.kappasignal.com/2022/10/dow-jones-industrial-average-index.html
Dataset updated
Oct 24, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Dow Jones Industrial Average Index assigned short-term B1 & long-term Ba1 forecasted stock rating.

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
A New Index to Measure U.S. Financial Conditions
catalog.data.gov
Updated Dec 18, 2024
Cite
Board of Governors of the Federal Reserve System (2024). A New Index to Measure U.S. Financial Conditions [Dataset]. https://catalog.data.gov/dataset/a-new-index-to-measure-u-s-financial-conditions
Dataset updated
Dec 18, 2024
Dataset provided by
Federal Reserve Board of Governors
Federal Reserve System, http://www.federalreserve.gov/
Description
An index that can be used to gauge broad financial conditions and assess how these conditions are related to future economic growth. The index is broadly consistent with how the FRB/US model generally relates key financial variables to economic activity. The index aggregates changes in seven financial variables: the federal funds rate, the 10-year Treasury yield, the 30-year fixed mortgage rate, the triple-B corporate bond yield, the Dow Jones total stock market index, the Zillow house price index, and the nominal broad dollar index using weights implied by the FRB/US model and other models in use at the Federal Reserve Board. These models relate households' spending and businesses' investment decisions to changes in short- and long-term interest rates, house and equity prices, and the exchange value of the dollar, among other factors. These financial variables are weighted using impulse response coefficients (dynamic multipliers) that quantify the cumulative effects of unanticipated permanent changes in each financial variable on real gross domestic product (GDP) growth over the subsequent year. The resulting index is named Financial Conditions Impulse on Growth (FCI-G). One appealing feature of the FCI-G is that its movements can be used to measure whether financial conditions have tightened or loosened, to summarize how changes in financial conditions are associated with real GDP growth over the following year, or both.
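The aggregation step described above, a weighted sum of changes in financial variables, can be sketched as follows. This is a heavily simplified illustration: the variable names, the weights, and the sign convention (positive meaning tighter conditions) are assumptions for the example, not the published FCI-G weights, and the sketch ignores the timing structure of the impulse response coefficients.

```python
def weighted_conditions_index(changes, weights):
    """Weighted sum of changes in financial variables."""
    mismatched = set(changes) ^ set(weights)
    if mismatched:
        raise ValueError(f"variables without a matching entry: {sorted(mismatched)}")
    return sum(weights[name] * changes[name] for name in changes)


# Hypothetical changes (percentage points for rates, percent for equities).
changes = {
    "federal_funds_rate": 0.5,
    "treasury_10y_yield": 0.25,
    "equity_index": -2.0,
}
# Hypothetical weights: higher rates tighten conditions, so they get a
# positive weight; higher equity prices loosen them, so the weight is negative.
weights = {
    "federal_funds_rate": 0.5,
    "treasury_10y_yield": 1.0,
    "equity_index": -0.125,
}

index_value = weighted_conditions_index(changes, weights)
print(index_value)
```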
Data from: Short/Long Term Stocks: Karachi 100 Index Stock Forecast...
kappasignal.com
Updated Nov 3, 2022
Cite
KappaSignal (2022). Short/Long Term Stocks: Karachi 100 Index Stock Forecast (Forecast) [Dataset]. https://www.kappasignal.com/2022/11/shortlong-term-stocks-karachi-100-index.html
Dataset updated
Nov 3, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Short/Long Term Stocks: Karachi 100 Index Stock Forecast

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
Replication data for: predicting the brazilian stock market using sentiment...
redu.unicamp.br
bin
Updated Sep 22, 2022
Cite
Repositório de Dados de Pesquisa da Unicamp (2022). Replication data for: predicting the brazilian stock market using sentiment analysis, technical indicators, and stock prices [Dataset]. http://doi.org/10.25824/redu/GFJHFK
Available download formats: bin(5393278), bin(10558), bin(248443), bin(13971), bin(835573)
Unique identifier
https://doi.org/10.25824/redu/GFJHFK
Dataset updated
Sep 22, 2022
Dataset provided by
Repositório de Dados de Pesquisa da Unicamp
License
CC0 1.0 Universal Public Domain Dedication, https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Brazil
Dataset funded by
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Description
This package contains the datasets and source code used in the PhD thesis entitled Predicting the Brazilian stock market using sentiment analysis, technical indicators and stock prices. The following files are included:

File Labeled.zip - financial news labeled in two classes (Positive and Negative), organized to train Sentiment Analysis models. Some of this news was initially presented in [1]. In the related PhD thesis, the training dataset was complemented with the labeled news presented in [2].

File Unlabeled.zip - general unlabeled financial news collected during the period 2010-2020 from the following online sources: G1, Folha de São Paulo and Estadão. This file contains news about the Bovespa index and the following companies: Banco do Brasil, Itau, Gerdau and Ambev.

File Stocks.zip - stock prices for the companies Banco do Brasil, Itau, Gerdau and Ambev, and for the Bovespa index. The period covered ranges from 2010 to 2020.

File Models.zip - source code of the models used in the PhD thesis (i.e., Multilayer Perceptron, Long Short-Term Memory, Bidirectional Long Short-Term Memory, Convolutional Neural Network, and Support Vector Machines).

File Utils.zip - source code of the preprocessing step designed for the methodology of this work (i.e., loading data and generating the word embeddings), along with stock manipulation and investment evaluation.

[1] Carosia, A. E. D. O., Januário, B. A., da Silva, A. E. A., & Coelho, G. P. (2021). Sentiment Analysis Applied to News from the Brazilian Stock Market. IEEE Latin America Transactions, 100. DOI: 10.1109/TLA.2022.9667151

[2] MARTINS, R. F.; PEREIRA, A.; BENEVENUTO, F. An approach to sentiment analysis of web applications in portuguese. Proceedings of the 21st Brazilian Symposium on Multimedia and the Web, ACM, p. 105–112, 2015. DOI: 10.1145/2820426.2820446
Description of the input variables.
figshare.com
plos.figshare.com
xls
Updated Jun 1, 2023
Cite
Wei Bao; Jun Yue; Yulei Rao (2023). Description of the input variables. [Dataset]. http://doi.org/10.1371/journal.pone.0180944.t001
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0180944.t001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS, http://plos.org/
Authors
Wei Bao; Jun Yue; Yulei Rao
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Description of the input variables.
Supplement: Commodity Index Report.
datadiscoverystudio.org
data.wu.ac.at
txt
Updated Jan 12, 2014
Cite
(2014). Supplement: Commodity Index Report. [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/a8e2d560e64e46d8a10bd13f10d9d3d8/html
Available download formats: txt
Dataset updated
Jan 12, 2014
Description
Shows index traders in selected agricultural markets. These traders are drawn from the noncommercial and commercial categories. The noncommercial category includes positions of managed funds, pension funds, and other investors that are generally seeking exposure to a broad index of commodity prices as an asset class in an unleveraged and passively-managed manner. The commercial category includes positions for entities whose trading predominantly reflects hedging of over-the-counter transactions involving commodity indices, for example, a swap dealer holding long futures positions to hedge a short commodity index exposure opposite institutional traders, such as pension funds.
Blended Index – Long Term
open.canada.ca
catalogue.arctic-sdi.org
esri rest, geotif +3
Updated Aug 13, 2024
Cite
Agriculture and Agri-Food Canada (2024). Blended Index – Long Term [Dataset]. https://open.canada.ca/data/en/dataset/b1c73404-1aee-4a59-ad5d-eda7db046676
Available download formats: html, pdf, geotif, wms, esri rest
Dataset updated
Aug 13, 2024
Dataset provided by
Agriculture and Agri-Food Canada
License
Open Government Licence - Canada 2.0 https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
The Blended Index (BI) is a model that employs multiple potential indicators of drought and excess moisture, such as the Palmer drought index, rolling precipitation amounts, and soil moisture, and combines them into a weighted, normalized value between 0 and 100. The inputs and weights used in this model are subject to change periodically as it is optimized to best represent the extent, duration, and severity of impactful weather conditions. The blended index is deployed in two variations: short term (st), focusing on 1 to 3 months, and long term (lt), focusing on 6 months to 5 years.
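As a rough illustration of what "a weighted, normalized value between 0 and 100" can mean, the sketch below blends a few indicators with a weighted average. It is not the provider's actual model: the function name, the indicator names, the weights, and the assumption that each input has already been scaled to the range 0 to 1 are all hypothetical.

```python
def blended_index(indicators, weights):
    """Blend indicators into a single 0-100 score.

    Hypothetical sketch: assumes every indicator is already scaled to
    [0, 1]; the real inputs and weights are not given in the description.
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # Weighted average of the indicators, normalized by the total weight.
    score = sum(weights[name] * indicators[name] for name in weights) / total_weight
    # Clamp to [0, 1] before rescaling, in case an input falls outside the range.
    return 100.0 * min(max(score, 0.0), 1.0)


# Illustrative values only; these are not real observations or real weights.
example = blended_index(
    {"palmer": 0.5, "precipitation": 1.0, "soil_moisture": 0.0},
    {"palmer": 2.0, "precipitation": 1.0, "soil_moisture": 1.0},
)
```

Dividing by the total weight keeps the result on the same scale when the weights are changed, which matters for a model whose weights are revised periodically.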
Predictive accuracy in developing markets.
plos.figshare.com
xls
Updated May 31, 2023
Cite
Wei Bao; Jun Yue; Yulei Rao (2023). Predictive accuracy in developing markets. [Dataset]. http://doi.org/10.1371/journal.pone.0180944.t003
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0180944.t003
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Wei Bao; Jun Yue; Yulei Rao
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Predictive accuracy in developing markets.
United States S&P Case-Shiller 20-City Composite Home Price Index
tradingeconomics.com
pt.tradingeconomics.com
+11 more
csv, excel, json, xml
Updated May 27, 2025
Cite
TRADING ECONOMICS (2025). United States S&P Case-Shiller 20-City Composite Home Price Index [Dataset]. https://tradingeconomics.com/united-states/case-shiller-home-price-index
Available download formats: json, excel, xml, csv
Dataset updated
May 27, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 31, 2000 - Mar 31, 2025
Area covered
United States
Description
The Case-Shiller Home Price Index in the United States increased to 338.39 points in March from 335.08 points in February of 2025. This dataset provides the latest reported value for the United States S&P Case-Shiller Home Price Index, plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus, and news.
Short/Long Term Stocks: SPB Stock Forecast (Forecast)
kappasignal.com
Updated Sep 21, 2022
Cite
KappaSignal (2022). Short/Long Term Stocks: SPB Stock Forecast (Forecast) [Dataset]. https://www.kappasignal.com/2022/09/shortlong-term-stocks-spb-stock-forecast.html
Dataset updated
Sep 21, 2022
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

Short/Long Term Stocks: SPB Stock Forecast

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
United States Economic Optimism Index
tradingeconomics.com
ru.tradingeconomics.com
+12 more
csv, excel, json, xml
Updated Apr 1, 2025
Cite
TRADING ECONOMICS (2025). United States Economic Optimism Index [Dataset]. https://tradingeconomics.com/united-states/economic-optimism-index
Available download formats: csv, json, xml, excel
Dataset updated
Apr 1, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Feb 28, 2001 - Jun 30, 2025
Area covered
United States
Description
The Economic Optimism Index in the United States increased to 49.20 points in June from 47.90 points in May of 2025. This dataset provides the latest reported value for the United States IBD/TIPP Economic Optimism Index, plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus, and news.
Returns obtained from investment in SZSE.
plos.figshare.com
xls
Updated Jun 2, 2025
Cite
Hongfei Xiao (2025). Returns obtained from investment in SZSE. [Dataset]. http://doi.org/10.1371/journal.pone.0322737.t007
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0322737.t007
Dataset updated
Jun 2, 2025
Dataset provided by
PLOS ONE
Authors
Hongfei Xiao
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
LSTM (Long Short-Term Memory network) is currently used extensively for forecasting financial time series, primarily because of its distinct advantage in separating long-term from short-term memory information within a sequence. However, the experimental results presented in this paper indicate that LSTM may struggle to clearly differentiate between these two types of information. To overcome this limitation, we propose the ARMA-RNN-LSTM Hybrid Model, aimed at enhancing the separation between long-term and short-term memory information on top of the LSTM framework. The experiment in this paper is inspired by an observation: when LSTMs and RNNs are each used to forecast the same time series containing only short-term memory information, LSTMs exhibit significantly lower forecasting accuracy than RNNs. We attribute this to LSTMs potentially misclassifying some short-term memory information as long-term during the forecasting process. Further, we speculate that this confusion might also arise when LSTMs are used to forecast time series containing both long-term and short-term memory information. To verify this hypothesis and improve forecasting accuracy for financial time series, this paper combines RNNs with LSTMs, proposing a method of ARMA-RNN-LSTM hybrid modelling, and conducts an experiment with stock index prices. The experimental results show that the ARMA-RNN-LSTM Hybrid Model outperforms standalone RNNs and LSTMs in forecasting stock index series containing both long-term and short-term memory information, confirming that the hybrid model effectively enhances the separation between long-term and short-term memory information within a sequence.
This hybrid modelling approach addresses the confusion between long-term and short-term memory information in a sequence during LSTM’s forecasting process, improving the accuracy of forecasting financial time series, and demonstrates that neural networks’ forecasting errors are an area worth exploring in the future.
