33 datasets found
  1. m

    Low- and High-Dimensional Asset Prices Data

    • data.mendeley.com
    Updated Oct 18, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chi Seng Pun (2017). Low- and High-Dimensional Asset Prices Data [Dataset]. http://doi.org/10.17632/ndxfrshm74.2
    Explore at:
    Dataset updated
    Oct 18, 2017
    Authors
    Chi Seng Pun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data files contain seven low-dimensional financial research data (in .txt format) and four high-dimensional daily stock prices data (in .csv format). The low-dimensional data sets are provided by Lorenzo Garlappi on his website, while the high-dimensional data sets are downloaded from Yahoo!Finance by the contributor's own efforts. The description of the low-dimensional data sets can be found in DeMiguel et al. (2009, RFS).

  2. D

    Finance Data Fusion Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Finance Data Fusion Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/finance-data-fusion-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Oct 4, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Finance Data Fusion Market Outlook



    The global finance data fusion market size is projected to grow at a CAGR of 12.5% from 2024 to 2032, with the market value increasing from USD 2.5 billion in 2023 to an estimated USD 7.4 billion by 2032. This impressive growth is driven by an intensifying demand for real-time analytics, the increasing complexity of financial transactions, and the need for improved risk management and fraud detection mechanisms in the financial sector.



    One of the primary growth factors propelling the finance data fusion market is the rising necessity for robust risk management solutions. Financial institutions are increasingly recognizing the importance of integrating diverse data sources to gain comprehensive insights into potential risks. With the advent of big data and advanced analytics, data fusion technologies enable organizations to synthesize information from multiple datasets, including market data, transactional data, and social media feeds, thereby enhancing their ability to predict and manage risks in a dynamic market environment. This capability is particularly critical in an era where financial stability and regulatory compliance are paramount.



    Another significant driver of market growth is the surging demand for enhanced fraud detection systems. Financial fraud has become increasingly sophisticated, necessitating the adoption of advanced technologies that can detect and mitigate fraudulent activities in real-time. Data fusion solutions allow for the integration of diverse data points, providing a holistic view of customer behavior and transaction patterns. This multi-dimensional analysis significantly improves the accuracy of fraud detection systems, enabling financial institutions to safeguard their assets and maintain customer trust. The growing reliance on digital payment systems further underscores the need for advanced fraud detection technologies.



    Furthermore, the growing importance of customer analytics in the financial sector is contributing to the market's expansion. Financial institutions are leveraging data fusion technologies to gain deeper insights into customer preferences, behavior, and needs. By integrating data from various sources, such as transaction histories, social media interactions, and demographic information, organizations can create detailed customer profiles that drive personalized marketing strategies and improve customer satisfaction. The ability to deliver tailored financial products and services based on comprehensive data analysis is a key competitive advantage in the financial industry.



    Regionally, North America is expected to dominate the finance data fusion market, owing to its advanced financial infrastructure and the early adoption of innovative technologies. The presence of major financial institutions and a highly developed regulatory framework further supports market growth in this region. Europe and Asia Pacific are also anticipated to witness substantial growth, driven by increasing investments in financial technology and the rising demand for advanced data analytics solutions. In contrast, Latin America and the Middle East & Africa are projected to experience moderate growth, influenced by varying levels of technological adoption and economic development.



    Component Analysis



    The finance data fusion market can be segmented by component into software, hardware, and services. The software segment is expected to hold the largest market share, driven by the increasing adoption of advanced analytic tools and platforms that enable the integration and analysis of diverse data sources. Financial institutions are investing heavily in software solutions that provide real-time insights and predictive analytics, facilitating more informed decision-making and enhancing operational efficiency. The proliferation of cloud-based software solutions is also contributing to the segment's growth, as they offer scalable and cost-effective alternatives to traditional on-premises systems.



    The hardware segment, although smaller in comparison to software, plays a crucial role in supporting data fusion activities. High-performance computing systems, storage solutions, and network infrastructure are essential for managing and processing the vast amounts of data generated in the financial sector. As financial institutions continue to expand their data capabilities, the demand for robust and scalable hardware solutions is expected to rise. Innovations in hardware technology, such as advanced processors and high-speed storage devices, are further driving the segment's growth.


    <b

  3. f

    Data from: Quasi Maximum Likelihood Estimation for Large-Dimensional Matrix...

    • tandf.figshare.com
    • datasetcatalog.nlm.nih.gov
    txt
    Updated Sep 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sainan Xu; Chaofeng Yuan; Jianhua Guo (2024). Quasi Maximum Likelihood Estimation for Large-Dimensional Matrix Factor Models [Dataset]. http://doi.org/10.6084/m9.figshare.26791894.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Sep 23, 2024
    Dataset provided by
    Taylor & Francis
    Authors
    Sainan Xu; Chaofeng Yuan; Jianhua Guo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In this study, we introduce a novel approach, called the quasi maximum likelihood estimation (Q-MLE), for estimating large-dimensional matrix factor models. In contrast to the principal component analysis based approach, Q-MLE considers the heteroscedasticity of the idiosyncratic error term, the heteroscedasticity of which is simultaneously estimated with other parameters. Interestingly, under the homoscedasticity assumption of the idiosyncratic error, the Q-MLE estimator encompassed the projected estimator (PE) as a special case. We provide the convergence rates and asymptotic distributions of the Q-MLE estimators under mild conditions. Extensive numerical experiments demonstrate that the Q-MLE method performs better, especially when heteroscedasticity exists. Furthermore, two real examples in finance and macroeconomics reveal factor patterns across rows and columns, which coincide with financial, economic, or geographical interpretations.

  4. f

    Data from: Dependence in Financial and High-dimensional Time Series

    • uvaauas.figshare.com
    zip
    Updated Jul 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    H. Li (2022). Dependence in Financial and High-dimensional Time Series [Dataset]. http://doi.org/10.21942/uva.16615975.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 25, 2022
    Dataset provided by
    University of Amsterdam / Amsterdam University of Applied Sciences
    Authors
    H. Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PhD thesis Hao Li

  5. Credit Card Fraud Dataset

    • kaggle.com
    Updated Jan 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vishal Painjane (2025). Credit Card Fraud Dataset [Dataset]. https://www.kaggle.com/datasets/vishalpainjane/dataset101
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 28, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Vishal Painjane
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Credit risk assessment remains a critical function within financial services, influencing lending decisions, portfolio risk management, and regulatory compliance. It integrates multiple categories of financial, transactional, and behavioral data to enable advanced machine learning applications in the domain of financial risk modeling.

    Data Composition and Structure

    The dataset comprises a total of 1,212 distinct features, systematically grouped into four principal categories, alongside a binary target variable. Each feature category represents a specific dimension of credit risk assessment, reflecting both internal transactional data and externally sourced credit bureau information.

    Target Variable

    The dependent variable, denoted as bad_flag, represents a binary risk classification outcome associated with each customer account. The variable takes the following values:

    • 0: Denotes a low-risk, creditworthy customer
    • 1: Denotes a high-risk, default-prone customer

    This variable serves as the target for binary classification models aimed at predicting credit risk propensity.

    Feature Groups

    CategoryNumber of FeaturesDescription
    Transaction Attributes664Customer-level transaction behavior, repayment patterns, financial habits
    Bureau Credit Data452Credit scores, external bureau records, delinquency flags, historical credit data
    Bureau Enquiries50Credit inquiry history, frequency and type of external credit applications
    ONUS Attributes48Internal bank relationship metrics, account engagement indicators

    Each feature within a category follows a systematic sequential naming convention (e.g., transaction_attribute_1, bureau_1), facilitating programmatic identification and group-level analysis.

    Data Characteristics

    The dataset exhibits several characteristics that mirror operational credit risk data environments:

    • High Dimensionality: The feature space exceeds 1,200 variables
    • Mixed Data Types: Numerical values (continuous and discrete), binary indicators
    • High Sparsity: A substantial proportion of features contain zero values or missing entries
    • Value Range Disparity: Feature values exhibit significant variance, with magnitudes ranging from small ratios (0.001) to large transaction amounts (288,500)

    Methodological Rationale

    The dataset was constructed by simulating data generation processes typical within financial services institutions. Transactional behaviors, bureau records, and inquiry histories were aggregated and engineered into derivative features.

  6. f

    Data from: High-Dimensional Elliptical Sliced Inverse Regression in...

    • tandf.figshare.com
    txt
    Updated Feb 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Chen; Jia Zhang; Wang Zhou (2024). High-Dimensional Elliptical Sliced Inverse Regression in Non-Gaussian Distributions [Dataset]. http://doi.org/10.6084/m9.figshare.14357504.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 13, 2024
    Dataset provided by
    Taylor & Francis
    Authors
    Xin Chen; Jia Zhang; Wang Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sliced inverse regression (SIR) is the most widely used sufficient dimension reduction method due to its simplicity, generality and computational efficiency. However, when the distribution of covariates deviates from multivariate normal distribution, the estimation efficiency of SIR gets rather low, and the SIR estimator may be inconsistent and misleading, especially in the high-dimensional setting. In this article, we propose a robust alternative to SIR—called elliptical sliced inverse regression (ESIR), to analysis high-dimensional, elliptically distributed data. There are wide applications of elliptically distributed data, especially in finance and economics where the distribution of the data is often heavy-tailed. To tackle the heavy-tailed elliptically distributed covariates, we novelly use the multivariate Kendall’s tau matrix in a framework of generalized eigenvalue problem in sufficient dimension reduction. Methodologically, we present a practical algorithm for our method. Theoretically, we investigate the asymptotic behavior of the ESIR estimator under the high-dimensional setting. Extensive simulation results show ESIR significantly improves the estimation efficiency in heavy-tailed scenarios, compared with other robust SIR methods. Analysis of the Istanbul stock exchange dataset also demonstrates the effectiveness of our proposed method. Moreover, ESIR can be easily extended to other sufficient dimension reduction methods and applied to nonelliptical heavy-tailed distributions.

  7. m

    Data from: Global spillovers from multi-dimensional US monetary policy

    • data.mendeley.com
    Updated Jul 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Georgios Georgiadis (2025). Global spillovers from multi-dimensional US monetary policy [Dataset]. http://doi.org/10.17632/22kwvzy2md.1
    Explore at:
    Dataset updated
    Jul 23, 2025
    Authors
    Georgios Georgiadis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the replication file for the paper "Global spillovers from multi-dimensional US monetary policy" by G. Georgiadis and M. Jarocinski.

  8. l

    Supplementary Information Files for A three-dimensional asymmetric power...

    • repository.lboro.ac.uk
    pdf
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stavroula Yfanti; Georgios Chortareas; Menelaos Karanasos; Emmanouil Noikokyris (2023). Supplementary Information Files for A three-dimensional asymmetric power HEAVY model [Dataset]. http://doi.org/10.17028/rd.lboro.15330600.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Loughborough University
    Authors
    Stavroula Yfanti; Georgios Chortareas; Menelaos Karanasos; Emmanouil Noikokyris
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Information Files for A three-dimensional asymmetric power HEAVY modelThis article proposes the three‐dimensional HEAVY system of daily, intra‐daily, and range‐based volatility equations. We augment the bivariate model with a third volatility metric, the Garman–Klass estimator, and enrich the trivariate system with power transformations and asymmetries. Most importantly, we derive the theoretical properties of the multivariate asymmetric power model and explore its finite‐sample performance through a simulation experiment on the size and power properties of the diagnostic tests employed. Our empirical application shows that all three power transformed conditional variances are found to be significantly affected by the powers of squared returns, realized measure, and range‐based volatility as well. We demonstrate that the augmentation of the HEAVY framework with the range‐based volatility estimator, leverage and power effects improves remarkably its forecasting accuracy. Finally, our results reveal interesting insights for investments, market risk measurement, and policymaking.

  9. m

    Robust Estimation for Factor Models Based on Modiffed Huber Loss

    • data.mendeley.com
    Updated Jun 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xinyu Yuan (2025). Robust Estimation for Factor Models Based on Modiffed Huber Loss [Dataset]. http://doi.org/10.17632/r57s759ykz.2
    Explore at:
    Dataset updated
    Jun 26, 2025
    Authors
    Xinyu Yuan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Our research is about robust analysis for high dimensional factor model in present of heavy-tailed data. We propose novel methods by integrating the modified Huber loss function and the common Principal Component Analysis. The methods are superior or comparable to others in numerical studies and the estimated factor number is more aligned with financial practice.

    The real data in finance is from Kenneth R. French's website: http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html. We use three portfolio pools: Pool A, Pool B, and Pool C to do factor analysis. Each pool contains 100 portfolios with complete monthly average value-weighted returns data from July 2016 to June 2024. The Portfolios in each pool are influenced by two primary factors. The authors have no permission to share the data or make the data public available.

    We give the R codes for data generating, parameter setting and computational details in simulations.

  10. g

    2021-2027 Planned finances detailed (categorisation, multi funds) |...

    • gimi9.com
    Updated Jul 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). 2021-2027 Planned finances detailed (categorisation, multi funds) | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_hgyj-gyin/
    Explore at:
    Dataset updated
    Jul 5, 2025
    Description

    NOTE: The EU Budget for Hungary of EUR 21.7 billion presented in this dataset reflects the allocations to Hungary in decisions adopted on 22 December 2022. As of 1 January 2025, the total EU budget effectively allocated to Hungary is EUR 20.7 billion. Once the detailed mid term decisions are adopted during 2025 the dataset will reflect the corrected amounts. Find out more in the FAQ - question 2.10 - https://cohesiondata.ec.europa.eu/stories/s/jxcd-m4vy. 2021-2027 financial details broken down by Fund / MS / programme / Policy objective / specific objective / Categorisation dimension. This dataset provides information on planned total and EU financing for 11 different EU Funds (2021-2027) in current prices. The data is taken from multiple categorisation tables in adopted programmes and is broken down by fund, programme, priority, policy objective objective, specific objective, category of region (more developed, less developed, etc. where available) and by the dimensions in the categorisation system. The values from the differnent dimensions should not be aggregated as this would lead to doublecounting. Find out more about the cohesion policy categorisation system here: https://cohesiondata.ec.europa.eu/stories/s/2021-2027-categorisation-information-system/hhu3-atyz. It is updated daily to reflect any modifications (i.e. thematic reallocations) agreed between the Member States and the Commission. The data covers the more than 480 programmes and includes the EU and national co-financing covered by the adoption decision. Financial allocations in the adopted programmes may change over time (i.e. transfers between themes, between funds). For ERDF, ESF and CF the "priority" columns relate financing envelopes in the programmes; the policy objectives and specific objectives relate to the fixed lists of defined objectives set in the specific fund Regulations. If downloading, check the format of financial amounts and where necessary change your regional settings and default formatting in your chosen software. You can check the results of your aggregation of the financial data by comparing results either the charts on the public platform. Please refer to the user guide - https://cohesiondata.ec.europa.eu/stories/s/Cohesion-Open-Data-User-Guide/cf5w-2b26 - for information on how to download data, etc.

  11. Data from: Familial Responses to Financial Instability: The Financial...

    • icpsr.umich.edu
    ascii, delimited, sas +2
    Updated May 20, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dew, Jeffrey; Xiao, Jing Jian (2010). Familial Responses to Financial Instability: The Financial Management Behaviors Scale, 2009 [United States] [Dataset]. http://doi.org/10.3886/ICPSR26542.v1
    Explore at:
    sas, ascii, spss, stata, delimitedAvailable download formats
    Dataset updated
    May 20, 2010
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    Dew, Jeffrey; Xiao, Jing Jian
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/26542/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/26542/terms

    Time period covered
    2009
    Area covered
    United States
    Description

    This study focused on how financial difficulties may hinder or facilitate sound financial management. A survey of 1,000 adults aged 18 years and older from the general population was conducted by Knowledge Networks on behalf of the National Center for Family and Marriage Research. The survey was completed by 1,014 respondents out of 1,517 cases (66.8 percent response rate). Although financial behavior research is common in the literature, no financial behavior scale exists that is both multi-dimensional and psychometrically validated. Using data from a national sample, this study developed and examined the psychometric properties of a new scale of financial management behaviors. The Financial Behavior Scale (FBS) displayed adequate reliability (alpha = .81). Further, it was highly associated with other measures of financial behavior and discriminated between financial behaviors and time use behaviors. Finally, the scale was highly predictive of savings, consumer debt, and investments. Thus, the FBS appears to be a reliable and valid scale of financial behaviors.

  12. Xometry (XMTR): A New Dimension in Manufacturing? (Forecast)

    • kappasignal.com
    Updated Feb 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KappaSignal (2024). Xometry (XMTR): A New Dimension in Manufacturing? (Forecast) [Dataset]. https://www.kappasignal.com/2024/02/xometry-xmtr-new-dimension-in.html
    Explore at:
    Dataset updated
    Feb 27, 2024
    Dataset authored and provided by
    KappaSignal
    License

    https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html

    Description

    This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

    Xometry (XMTR): A New Dimension in Manufacturing?

    Financial data:

    • Historical daily stock prices (open, high, low, close, volume)

    • Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

    • Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

    Machine learning features:

    • Feature engineering based on financial data and technical indicators

    • Sentiment analysis data from social media and news articles

    • Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

    Potential Applications:

    • Stock price prediction

    • Portfolio optimization

    • Algorithmic trading

    • Market sentiment analysis

    • Risk management

    Use Cases:

    • Researchers investigating the effectiveness of machine learning in stock market prediction

    • Analysts developing quantitative trading Buy/Sell strategies

    • Individuals interested in building their own stock market prediction models

    • Students learning about machine learning and financial applications

    Additional Notes:

    • The dataset may include different levels of granularity (e.g., daily, hourly)

    • Data cleaning and preprocessing are essential before model training

    • Regular updates are recommended to maintain the accuracy and relevance of the data

  13. Nano Dimension (NNDM) Stock Forecast: Printing Money with 3D Electronics...

    • kappasignal.com
    Updated Jun 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KappaSignal (2024). Nano Dimension (NNDM) Stock Forecast: Printing Money with 3D Electronics (Forecast) [Dataset]. https://www.kappasignal.com/2024/06/nano-dimension-nndm-stock-forecast.html
    Explore at:
    Dataset updated
    Jun 3, 2024
    Dataset authored and provided by
    KappaSignal
    License

    https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html

    Description

    This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

    Nano Dimension (NNDM) Stock Forecast: Printing Money with 3D Electronics

    Financial data:

    • Historical daily stock prices (open, high, low, close, volume)

    • Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

    • Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

    Machine learning features:

    • Feature engineering based on financial data and technical indicators

    • Sentiment analysis data from social media and news articles

    • Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

    Potential Applications:

    • Stock price prediction

    • Portfolio optimization

    • Algorithmic trading

    • Market sentiment analysis

    • Risk management

    Use Cases:

    • Researchers investigating the effectiveness of machine learning in stock market prediction

    • Analysts developing quantitative trading Buy/Sell strategies

    • Individuals interested in building their own stock market prediction models

    • Students learning about machine learning and financial applications

    Additional Notes:

    • The dataset may include different levels of granularity (e.g., daily, hourly)

    • Data cleaning and preprocessing are essential before model training

    • Regular updates are recommended to maintain the accuracy and relevance of the data

  14. Expense Management Software Market Analysis, Size, and Forecast 2025-2029:...

    • technavio.com
    Updated Jun 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Expense Management Software Market Analysis, Size, and Forecast 2025-2029: North America (US and Canada), Europe (France, Germany, Italy, Spain, and UK), APAC (China, India, and Japan), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/expense-management-software-market-analysis
    Explore at:
    Dataset updated
    Jun 27, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Global
    Description

    Snapshot img

    Expense Management Software Market Size 2025-2029

    The expense management software market size is forecast to increase by USD 7.5 billion at a CAGR of 16.2% between 2024 and 2029.

    The market is witnessing significant growth, driven by the increasing adoption of cloud-based solutions. Companies are increasingly turning to these solutions to streamline their expense management processes and improve operational efficiency. Another key trend is the integration of artificial intelligence (AI) and machine learning (ML) technologies, which enable automated expense categorization and approval workflows. However, the market also faces challenges. Security and privacy concerns continue to be a major obstacle, as companies must ensure the protection of sensitive financial data.
    Ensuring compliance with data protection regulations, such as GDPR and HIPAA, is crucial for maintaining customer trust and avoiding potential legal issues. Additionally, the implementation of these advanced technologies requires significant investment in IT infrastructure and expertise, which may be a barrier for smaller organizations. Companies seeking to capitalize on market opportunities and navigate these challenges effectively must prioritize data security, invest in IT capabilities, and offer user-friendly, cost-effective solutions. Multi-dimensional analysis, staff communications, and workflow and data integration are essential features for modern expense management systems.
    

    What will be the Size of the Expense Management Software Market during the forecast period?

    Explore in-depth regional segment analysis with market size data - historical 2019-2023 and forecasts 2025-2029 - in the full report.
    Request Free Sample

    In the travel expense management market, a centralized system with multi-platform software and cloud-based deployments is becoming increasingly popular among buyers and users. This shift towards cloud-based expenditure enables staff communications and spending visibility, enhancing organizational effectiveness and reducing operating costs. File attachments, such as receipts, are seamlessly integrated into these systems, ensuring audit trails and streamlining the expense reporting process. Artificial intelligence (AI) and machine learning technologies are revolutionizing the market, offering multidimensional analysis capabilities that go beyond simple expense categorization. Travel technology companies are partnering with software providers to offer integrated solutions, enhancing workflow efficiency and minimizing manual data entry.

    Employee hard drives are no longer the primary storage solution for expense reports, as cloud-based systems provide secure access to information from any mobile device. Automation of expense reporting processes and real-time data analysis enable quicker decision-making and improved financial management. Integration with other business systems, such as accounting software and HR platforms, further enhances the value proposition of travel expense management solutions. The market is expected to continue growing, driven by the need for streamlined expense management and increased operational efficiency.

    How is this Expense Management Software Industry segmented?

    The expense management software industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

    Component
    
      Solution
      Service
    
    
    Application
    
      Large enterprises
      SMEs
    
    
    Deployment
    
      Cloud-based
      On-premises
    
    
    Geography
    
      North America
    
        US
        Canada
    
    
      Europe
    
        France
        Germany
        Italy
        Spain
        UK
    
    
      APAC
    
        China
        India
        Japan
    
    
      Rest of World (ROW)
    

    By Component Insights

    The Solution segment is estimated to witness significant growth during the forecast period. The market is experiencing significant growth due to its ability to streamline business-related activities for organizations. This market encompasses solutions like expense reporting software, travel and expense management software, invoice management software, and more, collectively referred to as expense management software-as-a-service. These solutions cater to both on-premises and cloud-based deployments. While cloud-based solutions are hosted externally and accessed via the Internet, on-premises solutions are installed locally. Phishing and cybercriminals pose threats to financial data security, making predictive analytics a crucial component in expense management software. Mobile devices are increasingly used for business activities, necessitating mobile terminal support in expense management solutions.

    Client responses and employee hard drives are also targeted by malware and social engineering attacks, emphasizing the importance of robust security measures. Large enterprises

  15. n

    Measuring Financial Inclusion: A State-Wise Impact Analysis of India’s...

    • narcis.nl
    • data.mendeley.com
    Updated Sep 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Phadke, M (via Mendeley Data) (2021). Measuring Financial Inclusion: A State-Wise Impact Analysis of India’s National Mission on Financial Inclusion [Dataset]. http://doi.org/10.17632/z36hbjhkdy.1
    Explore at:
    Dataset updated
    Sep 20, 2021
    Dataset provided by
    Data Archiving and Networked Services (DANS)
    Authors
    Phadke, M (via Mendeley Data)
    Area covered
    India
    Description

    This dataset contains data for 'Measuring Financial Inclusion: A State-Wise Impact Analysis of India’s National Mission on Financial Inclusion'. This dataset contains 3 files where the individual dimensional scores are calculated, and a FIS Calculation file which combines the three dimensions into one comprehensive value.

    Purpose: This study aims to calculate the level of financial inclusion across India’s states/ UTs and Union Territories from 2011 to 2019, to quantify the impact of India’s National Mission on Financial Inclusion (Pradhan Mantri Jan Dhan Yojana) which was launched in 2014. Approach: We utilise the Index of Financial Inclusion, which has 3 dimensions- Banking Penetration, Banking Services Availability and Banking Services Usage, comprised of sub-dimensions to capture various individual indicators of financial inclusion. This allows us to factor in the various aspects of financial inclusion into one score, which can then be used for empirical studies. Findings: We find that the National Mission on Financial Inclusion (PMJDY) has been able to accelerate the growth of financial inclusion in India, and as of 2019, 24 of India’s 31 states/UTs, considered in this paper, have been able to achieve High Financial Inclusion, with the remaining 7 achieving Medium Financial Inclusion. This represents a significant improvement over the state of financial inclusion in 2011. Originality: We have enhanced the Index of Financial Inclusion used in previous studies to include additional indicators of financial inclusion based on existing literature. The application of the measurement framework to quantify the impact of India’s National Mission of Financial Inclusion is original. The financial inclusion scores have been calculated over an extended period from 2011 to 2019.

  16. f

    S1 Data -

    • figshare.com
    • plos.figshare.com
    xlsx
    Updated Apr 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hui Zhu; Tianchu Feng; Xiaoliang Li (2024). S1 Data - [Dataset]. http://doi.org/10.1371/journal.pone.0300579.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 5, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Hui Zhu; Tianchu Feng; Xiaoliang Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Energy plays a crucial role in global economic development, but it also contributes significantly to CO2 emissions. China has proposed a “dual-carbon” goal, and a key aspect to achieving this objective is finding effective ways to promote the decarbonization of the energy consumption structure (DECS). Compared with traditional finance, green finance is pivotal in advancing green and low-carbon development. However, the mechanism through which green finance impacts DECS has not been thoroughly explored. This study employs an enhanced weighted multi-dimensional vector angle method, which is more systematic and scientific, to measure DECS. Then, dynamic panel data from 30 provinces in China spanning the years 2003 to 2020 are used. A double fixed-effects model is applied to investigate the impact of green finance on the DECS and identify potential pathways. Results reveal that green finance significantly enhances DECS, primarily by reinforcing green development. The critical impact pathway involves the promotion of green technology innovation and green industry development. Moreover, the enhancing effect of green finance on the DECS is considerably significant in regions with relatively low government spending on science and technology (S&T), and where the focus is not on the “Atmospheric Ten” policy. The measurement of DECS is innovative, and the conclusions derived from it can offer compelling evidence for various social stakeholders. The government has the opportunity to establish a green financial system, supporting green technological innovation and the development of green industries. This approach can accelerate the DECS and work toward achieving the “double carbon” goal at an earlier date.

  17. f

    Regression results for different dimensional benchmark models.

    • plos.figshare.com
    xls
    Updated Jan 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hongying Sun; Yipei Luo; Jia Liu; Miraj Ahmed Bhuiyan (2024). Regression results for different dimensional benchmark models. [Dataset]. http://doi.org/10.1371/journal.pone.0297264.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 19, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Hongying Sun; Yipei Luo; Jia Liu; Miraj Ahmed Bhuiyan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Regression results for different dimensional benchmark models.

  18. TRACE_DJIA

    • kaggle.com
    Updated Aug 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guoxuan Sun (2025). TRACE_DJIA [Dataset]. https://www.kaggle.com/datasets/williamtage/trace-djia
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Guoxuan Sun
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Context Predicting stock market movements is a classic challenge in machine learning. While raw Open, High, Low, Close, and Volume (OHLCV) data is the standard starting point, its predictive power is often limited. To build robust models, data scientists require a much richer feature set that captures different aspects of market dynamics, from technical patterns to sentiment hidden in financial news.

    This dataset was created to bridge that gap. It provides a highly-enriched, pre-processed collection of features for the Dow Jones Industrial Average (DJIA), designed to accelerate research and modeling for stock price prediction.

    Content The dataset is organized into several files, each representing a distinct category of engineered features. This modular structure allows you to easily select, combine, or test the importance of different feature types.

    • final_daily_news_graph_embeddings.npy This is a 3D NumPy tensor with the shape (Number of Days, 25, 128).

    Description: Each day's top 25 news headlines have been transformed into a sophisticated knowledge graph. These graphs, enriched with data from Wikidata, are then encoded into 128-dimensional vectors using a Graph Convolutional Network (GCN). This file captures the semantic meaning and relationships within the news, providing a powerful non-price-based feature.

    • DJIA_engineered_features_1.csv

    Description: Contains fundamental features derived directly from OHLCV data. These are crucial for capturing intraday volatility and price action.

    Example Features: intraday_range, body_size, price_change, simple_return, log_return, price_volume_interaction.

    • DJIA_technical_indicators_2.csv

    Description: A wide array of popular technical indicators calculated using the pandas-ta library. These features are staples of financial analysis and help identify trends, momentum, and volatility.

    Example Features: Simple Moving Averages (SMA_20, SMA_50, SMA_200), Exponential Moving Averages (EMA_12, EMA_26), MACD, RSI, Bollinger Bands (BBL, BBM, BBU), On-Balance Volume (OBV), and more.

    • DJIA_statistical_time_features_3.csv

    Description: This file includes features based on the statistical properties of returns over an optimized rolling window, as well as cyclical time-based features. The optimal window was determined by finding the period with the highest correlation to future returns.

    Example Features: rolling_mean, rolling_std (volatility), rolling_skew, rolling_kurt, day_of_week_sin, day_of_week_cos, is_month_end.

    • DJIA_advanced_features_4.csv

    Description: More complex and transformational features designed to capture deeper market dynamics.

    Example Features: Lagged returns and RSI, quantitative candlestick pattern features, wavelet transform coefficients (to decompose price signals into different frequencies), and the Hurst Exponent (to measure long-term memory in the time series).

    Methodology The features were systematically generated using a series of Python scripts.

    News Embeddings: Headlines were processed to extract named entities. These entities were used to build knowledge subgraphs from Wikidata. Finally, a Graph Convolutional Network (GCN) model encoded these graphs into dense vectors.

    Tabular Features: All other features were generated from the raw DJIA price and volume data. The process involved several stages, from basic price calculations to advanced transformations. For features requiring a lookback period (e.g., rolling statistics, Hurst exponent), an optimal window length was programmatically determined to maximize its correlation with the target variable.

    Acknowledgements The raw OHLCV and news data was originally sourced from: https://www.kaggle.com/datasets/aaron7sun/stocknews. We thank them for making the data available.

    Inspiration This dataset is perfect for a variety of financial machine learning tasks:

    Can you build a model to predict the next day's market direction (Up/Down)?

    Which feature set is the most powerful? The technical indicators, the news embeddings, or a combination of all?

    How do advanced features like the Hurst exponent or wavelet coefficients contribute to model performance?

    Can you use these features to build a profitable trading strategy (backtesting required)?

  19. f

    Descriptive statistics of variables.

    • plos.figshare.com
    xls
    Updated Mar 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaiwei Jia; Yunqing Du (2024). Descriptive statistics of variables. [Dataset]. http://doi.org/10.1371/journal.pone.0295575.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Mar 15, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Kaiwei Jia; Yunqing Du
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Climate change-induced pan-financial market and the contagion of systemic financial risks are becoming important issues in the financial sector. The paper measures the temperature difference in terms of the degree and direction of deviation of the actual temperature relative to the average temperature of the same historical period. Based on the high-dimensional time-series variable LASSO-VAR-DY framework, we construct a pan-financial market volatility correlation network consisting of 112 Chinese listed companies in banking, insurance, securities, real estate, traditional energy, and new energy, use eigenvector centrality to measure the systematic risk of each firm, and then empirically test the effect of temperature difference on systematic risk under pan-financial market scenario. The results of the study show that (ⅰ) There is a significant difference among the systemic risk of financial sectors such as banking, insurance, and securities in the financial market pan-financial market scenario and the systemic risk when the financial market pan-financial market is not taken into account;(ⅱ) Higher temperature significantly exacerbates systemic financial risk, while colder temperature significantly mitigates systemic risk, but both have an asymmetric effect on systemic risk, and there is sectoral heterogeneity.(ⅲ) From the dynamic evolutionary characteristics, there are significant differences in the response of systemic financial risk to positive and negative temperature shocks;(iv) The results of the systemic risk variance decomposition indicate that the temperature change contributes more to the variance of systemic risk in the banking and securities sectors in pan-financial market;(ⅴ) The contagion source of financial systemic risk shows an obvious path of leaping and changing characteristics, and the contagion source of systemic risk (source of impact) shows the evolution law of "bank → real estate → new energy → temperature difference," which means that the temperature difference has become the contagion source of systemic financial risk. This study provides a reference for preventing and resolving systemic risks under pan-financial market scenario and provides a basis for improving the current macroprudential regulatory framework.

  20. f

    Descriptive statistics results of each variables.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yufei Lei (2024). Descriptive statistics results of each variables. [Dataset]. http://doi.org/10.1371/journal.pone.0302845.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Yufei Lei
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An increase in a currency internationalization levels can positively impact its credibility in international economic activities, and expand the effective demand and optimize the supply structure for the country’s financial service trade. In this way, a state can improve its financial service trade competitiveness in the international market. This study builds a vector autoregressive model based on time-series data of China-US financial services trade from 2010 to 2021, analyzes the impact of different quantitative indicators of RMB internationalization on this trade from the impulse response results, and validates the conclusions using various inspection methods. The results show that the increase in RMB internationalization helps to narrow the China-US financial services trade balance, but with a significant lag. And this effect is heterogeneous in different dimensions, demonstrated by the fact that the development of overseas RMB securities business is more important for the level of RMB internationalization to narrow the China-US financial services trade balance. Finally, among the specific measures to improve its financial services trade, China should focus on developing the international competitiveness of the traditional RMB deposit and loan financial sector, while the competition in the overseas market for high value-added financial businesses must also not be neglected. Furthermore, China needs to implement more targeted RMB internationalization development policies at different levels in the future to provide high-quality financial services to the rest of the world and aid in the economic recovery of the world in the "post-pandemic" era.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Chi Seng Pun (2017). Low- and High-Dimensional Asset Prices Data [Dataset]. http://doi.org/10.17632/ndxfrshm74.2

Low- and High-Dimensional Asset Prices Data

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Oct 18, 2017
Authors
Chi Seng Pun
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The data files contain seven low-dimensional financial research data (in .txt format) and four high-dimensional daily stock prices data (in .csv format). The low-dimensional data sets are provided by Lorenzo Garlappi on his website, while the high-dimensional data sets are downloaded from Yahoo!Finance by the contributor's own efforts. The description of the low-dimensional data sets can be found in DeMiguel et al. (2009, RFS).

Search
Clear search
Close search
Google apps
Main menu