We compile all return and macroeconomic data from Kenneth French's website and the OECD statistical data warehouse, respectively, for the period from January 1990 to December 2018. All return and macroeconomic data include the following countries: Austria, Belgium, Denmark, Finland, France, Germany, Greece, Ireland, Italy, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and United Kingdom.The dataset comprises the following series:
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT The purpose of this work is to present the Weighted Forward Search (FSW) method for the detection of outliers in asset pricing data. This new estimator, which is based on an algorithm that downweights the most anomalous observations of the dataset, is tested using both simulated and empirical asset pricing data. The impact of outliers on the estimation of asset pricing models is assessed under different scenarios, and the results are evaluated with associated statistical tests based on this new approach. Our proposal generates an alternative procedure for robust estimation of portfolio betas, allowing for the comparison between concurrent asset pricing models. The algorithm, which is both efficient and robust to outliers, is used to provide robust estimates of the models’ parameters in a comparison with traditional econometric estimation methods usually used in the literature. In particular, the precision of the alphas is highly increased when the Forward Search (FS) method is used. We use Monte Carlo simulations, and also the well-known dataset of equity factor returns provided by Prof. Kenneth French, consisting of the 25 Fama-French portfolios on the United States of America equity market using single and three-factor models, on monthly and annual basis. Our results indicate that the marginal rejection of the Fama-French three-factor model is influenced by the presence of outliers in the portfolios, when using monthly returns. In annual data, the use of robust methods increases the rejection level of null alphas in the Capital Asset Pricing Model (CAPM) and the Fama-French three-factor model, with more efficient estimates in the absence of outliers and consistent alphas when outliers are present.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This paper illustrates the pitfalls of the conventional heteroskedasticity and autocorrelation robust (HAR) Wald test and the advantages of new HAR tests developed by Kiefer and Vogelsang in 2005 and by Phillips, Sun and Jin in 2003 and 2006. The illustrations use the 1993 Fama-French three-factor model. The null that the intercepts are zero is tested for 5-year, 10-year and longer sub-periods. The conventional HAR test with asymptotic P-values rejects the null for most 5-year and 10-year sub-periods. By contrast, the null is not rejected by the new HAR tests. This conflict is explained by showing that inferences based on the conventional HAR test are misleading for the sample sizes used in this application.
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
Using all stocks listed in the Tokyo Stock Exchange and macroeconomic data for Japan, the dataset comprises the following series:
We have produced all return series using the following data from Datastream: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), (iv) total assets (WC02999 series), (v) return on equity (WC08301 series), (vi) price-to-cash flow ratio (PC series), and (vii) dividend yield (DY series). We have used the generic rules suggested by Griffin, Kelly, & Nardari (2010) for excluding non-common equity securities from Datastream data. We also exclude stocks with less than twelve observations in the period from July 1992 to June 2018. Accordingly, our sample comprises a total number of 5,312 stocks.
REFERENCES:
Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116, 1–22. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277. Hou K, Xue C, Zhang L. (2014). Digesting anomalies: An investment approach. Review of Financial Studies, 28, 650-705.
We compile raw data from the Datastream database for all stocks traded on the Spanish equity market. Particularly, we compile the following data series: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), (iv) total assets (WC02999 series), (v) return on equity (WC08301 series), (vi) dividend yield (DY series), (vii) price-to-earnings ratio (PE series), and (viii) effective tax rate (WC08346 series). We use the filters suggested by Griffin, Kelly, and Nardari (2010) for the Datastream database to exclude assets other than ordinary shares from our sample. Hence, our sample comprises 443 companies, including all firms that started trading within the time interval under study, as well as those that were delisted. As a proxy for the risk-free rate, we use the three-month Treasury Bill rate for Spain, as provided by the OECD. Accordingly, the dataset comprises the following series:
REFERENCES:
Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116, 1–22. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Using an intangible intensity factor that is orthogonal to the Fama–French factors, we compare the role of intangible investment in predicting stock returns over the periods 1963–1992 and 1993–2022. For 1963–1992, intangible investment is weak in predicting stock returns, but for 1993–2022, the predictive power of intangible investment becomes very strong. Intangible investment has a significant impact not only on the MTB ratio (Fama-French HML factor) but also on operating profitability (Fama-French RMW factor) when forecasting stock returns from 1993 to 2022. For intangible asset-intensive firms, intangible investment is the main predictor of stock returns, rather than MTB ratio and profitability. Our evidence suggests that intangible investment has become an important factor in explaining stock returns over time, independent of other factors such as profitability and MTB ratio.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The replication data contain MATLAB and GAUSS codes as well as the data required for replication of the results from the paper
Contains codes and data for simulation study from Section 3.
MV.mat, MV.txt- monthly data on market capitalization of the 205 stocks of the S&P500 index obtained from DataStream for the period 01.01.1974-01.05.2015
sp500_edata.mat - monthly data on close prices of components of S&P500 index for the period 01.01.1974-01.05.2015 processed to obtain excess returns using as a risk free return data on the risk free return from French & Fama database. Description of the price data from DataStream: "The ‘current’ prices taken at the close of market are stored each day. These stored prices are adjusted for subsequent capital actions, and this adjusted figure then becomes the default price offered on all Research programs. " Description of the excess return of the market from French & Fama database : "the excess return on the market, value-weight return of all CRSP firms incorporated in the US and listed on the NYSE, AMEX, or NASDAQ that have a CRSP share code of 10 or 11 at the beginning of month t, good shares and price data at the beginning of t, and good return data for t minus theone-month Treasury bill rate (from Ibbotson Associates)." From the latest file two separate data files were created (see CAPMsim.m):
sp500_stocks.txt, sp500_stocks.mat - monthly data on close prices of 205 components of S&P500 index for the period 01.01.1974-01.05.2015
FactorData.txt, FactorData.txt - The Fama & French factors from French & Fama database for a period July 1926 - May 2015.
CAPMsim.m - the main code that replicates the Monte Carlo simulation of the artificial market and proxy indexes subject to different types of the measurement error.
sure.m- obtains the estimated parameters for the SUR system and performs hypothesis testing of the significance of the coefficients.
Contains codes and data for empirical application from Section 4.
data1204.txt - 120 monthly observations on the excess returns on 30 stocks from DJIA, S&P500 index return, DJIA return from DataStream and excess return of the CRSP index from French & Fama database for a period 01/06/2005-01/05/2015.
DJSTOCKS_60_FF_Z.dat - 60 monthly observations on the excess returns on 30 stocks from DJIA from DataStream and excess return of the CRSP index from French & Fama database for a period 01/06/2010-01/05/2015.
DJSTOCKS_60_SP_Z.dat - 60 monthly observations on the excess returns on 30 stocks from DJIA and S&P500 index return from DataStream for a period 01/06/2010-01/05/2015.
STOCKS_60_DJ_Z.dat - 60 monthly observations on the excess returns on 20 random stocks from S&P500 and DJIA return from DataStream for a period 01/06/2010-01/05/2015.
Description of the variables in the data sets:
Z_1, Z_2,...,Z_20,..., Z_30 - returns of individual stocks depending on the data set.
For calculation of the returns adjusted prices from DataStream were used (see data from Monte Carlo simulation part). Risk free return is taken from French & Fama database.
Time period was shortened from 120 to 60 observations: 01/06/2010-01/05/2015
Excess returns from the market and indeces:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Our research is about robust analysis for high dimensional factor model in present of heavy-tailed data. We propose novel methods by integrating the modified Huber loss function and the common Principal Component Analysis. The methods are superior or comparable to others in numerical studies and the estimated factor number is more aligned with financial practice.
The real data in finance is from Kenneth R. French's website: http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html. We use three portfolio pools: Pool A, Pool B, and Pool C to do factor analysis. Each pool contains 100 portfolios with complete monthly average value-weighted returns data from July 2016 to June 2024. The Portfolios in each pool are influenced by two primary factors. The authors have no permission to share the data or make the data public available.
We give the R codes for data generating, parameter setting and computational details in simulations.
Using all stocks listed in the London Stock Exchange for the period from January 1989 to December 2018, the dataset comprises the following series: 1. Annual returns for 20 asset growth portfolios, following Fama and French (1993) methodology. 2. Annual returns for 25 portfolios size-book to market equity, following Fama and French (1993) methodology. 3. Annual returns for 62 industry portfolios, using two-digit SIC codes. 4. Fama and French (1993) factors for their three-factor model (RM, SMB and HML). 5. Fama and French (2015) factors for their five-factor model (RM, SMB, HML, RMW, and CMA). 6. Variation of the Amihid illiquidy measure for the London Stock Exchange, following Amihud (2002) methodology. 7. Three-month interest rate of the Treasury Bill for the United Kingdom, as provided by the OECD database. We have produced these series using the following data from Thomson Reuters Datastream: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), (iv) total assets (WC02999 series), (v) return on equity (WC08301 series), (vi) tax rate (WC08346 series), (vii) primary SIC codes, (viii) turnover by volume (VO series), and (ix) the market price (P series). Following Griffin et al. (2010), we use the generic rules provided by the authors for excluding non-common equity securities from Datastream data. REFERENCES: Amihud, Y. (2002). Illiquidity and stock returns: Cross-section and time-series effects. Journal of Financial Markets, 5, 31–56. Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116, 1–22. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We compile raw data from the Datastream database for all stocks traded on the Tokyo Stock Exchance, Osaka Exchange, Fukuoka Stock Exchange, Nagoya Stock Exchange and Sapporo Securities Exchange. Particularly, we collect the following data series, on a monthly basis: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), and (iv) primary SIC codes. Following Griffing et al. (2010), we exclude non-common equity securities from Datastream data. Additionally, we remove all companies with less than 12 observations in RI series for the period under analysis. Hence, our sample comprises 5,627 stocks, considering all companies that started trading or were delisted in the period under analysis. We use the three-month Treasury Bill rate for Japan, as provided by the OECD database, as a proxy for the risk-free rate. Accordingly, the dataset comprises the following series:
REFERENCES:
Cochrane, J.H. (1991), Production-based asset pricing and the link between stock returns and economic fluctuations. The Journal of Finance, 46, 209-237. Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Sample size after data cleansing and multiple imputation procedures.
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
Using all stocks listed on the Japanese equity market and macroeconomic data for Japan, the dataset comprises the following series:
We have produced all return series using the following data from Datastream: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), (iv) total assets (WC02999 series), (v) return on equity (WC08301 series), (vi) price-to-earnings ratio (PE series), and (vii) industry (SECTOR series). We have used the generic rules suggested by Griffin, Kelly, & Nardari (2010) for excluding non-common equity securities from Datastream data. We also exclude stocks with less than twelve observations. Accordingly, our sample comprises a total number of 5,212 stocks.
REFERENCES:
Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116, 1–22. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277.
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
Using all stocks listed in the Australian Securities Exchange and macroeconomic data for Australia, the dataset comprises the following series:
We have produced all return series using the following data from Datastream: (i) total return index (RI series), (ii) market value (MV series), (iii) market-to-book equity (PTBV series), (iv) price-to-cash flow ratio (PC series), (v) primary SIC codes, and (vi) tax rate (WC08346 series). We use the rules suggested by Griffin, Kelly, & Nardari (2010) for excluding non-common equity securities from Datastream data.
REFERENCES:
Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33, 3–56. Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116, 1–22. Griffin, J. M., Kelly, P., and Nardari, F. (2010). Do market efficiency measures yield correct inferences? A comparison of developed and emerging markets. Review of Financial Studies, 23, 3225–3277.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data contains information from ESG Corporate Ranking by RAEX from December 2020 to February 2022, its dynamics, RSPP indices "Vector of sustainable development" and "Responsibility and openness", data for the Fama-French model, market capitalization and assets of companies and daily stock closing prices.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Fama French factor returns