Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
The "yahoo_finance_dataset(2018-2023)" dataset is a financial dataset containing daily stock market data for multiple assets such as equities, ETFs, and indexes. It spans from April 1, 2018 to March 31, 2023, and contains 1257 rows and 7 columns. The data was sourced from Yahoo Finance, and the purpose of the dataset is to provide researchers, analysts, and investors with a comprehensive dataset that they can use to analyze stock market trends, identify patterns, and develop investment strategies. The dataset can be used for various tasks, including stock price prediction, trend analysis, portfolio optimization, and risk management. The dataset is provided in XLSX format, which makes it easy to import into various data analysis tools, including Python, R, and Excel.
The dataset includes the following columns:
Date: The date on which the stock market data was recorded. Open: The opening price of the asset on the given date. High: The highest price of the asset on the given date. Low: The lowest price of the asset on the given date. Close*: The closing price of the asset on the given date. Note that this price does not take into account any after-hours trading that may have occurred after the market officially closed. Adj Close**: The adjusted closing price of the asset on the given date. This price takes into account any dividends, stock splits, or other corporate actions that may have occurred, which can affect the stock price. Volume: The total number of shares of the asset that were traded on the given date.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This repository contains a meticulously scraped dataset from various financial websites. The data extraction process ensures high-quality and accurate text, including content from both the websites and their embedded PDFs.
We applied the advanced Mixtral 7X8 model to generate the following additional fields:
The prompt used to generate the additional fields was highly effective, thanks to extensive discussions and collaboration with the Mistral AI team. This ensures that the dataset provides valuable insights and is ready for further analysis and model training.
This dataset can be used for various applications, including but not limited to:
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Dataset Description 📊🔍
The Sujet-Finance-QA-Vision-100k is a comprehensive dataset containing over 100,000 question-answer pairs derived from more than 9,800 financial document images. This dataset is designed to support research and development in the field of financial document analysis and visual question answering.
Key Features:
🖼️ 9,801 unique financial document images ❓ 107,050 question-answer pairs 🇬🇧 English language 📄 Diverse financial document types… See the full description on the dataset page: https://huggingface.co/datasets/sujet-ai/Sujet-Finance-QA-Vision-100k.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Paper |Homepage |Github
🛠️ Usage
Regarding the data, first of all, you should download the MMfin.tsv and MMfin_CN.tsv files, as well as the relevant financial images. The folder structure is shown as follows: ├─ datasets ├─ images ├─ MMfin ... ├─ MMfin_CN ... │ MMfin.tsv │ MMfin_CN.tsv
The following is the process of inference and evaluation (Qwen2-VL-2B-Instruct as an example): export LMUData="The path of the datasets" python… See the full description on the dataset page: https://huggingface.co/datasets/hithink-ai/MME-Finance.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
The Financial Risk Assessment Dataset provides detailed information on individual financial profiles. It includes demographic, financial, and behavioral data to assess financial risk. The dataset features various columns such as income, credit score, and risk rating, with intentional imbalances and missing values to simulate real-world scenarios.
Facebook
TwitterVOSA Financial system incorporating General Ledger, Accounts Payable and Accounts Receivable, Cash Management
Facebook
TwitterThis dataset provides comprehensive access to financial market data from Google Finance in real-time. Get detailed information on stocks, market quotes, trends, ETFs, international exchanges, forex, crypto, and related news. Perfect for financial applications, trading platforms, and market analysis tools. The dataset is delivered in a JSON format via REST API.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
dataset contains detailed financial and demographic data for 20,000 individuals, focusing on income, expenses, and potential savings across various categories. The data aims to provide insights into personal financial management and spending patterns.
Income: Monthly income in currency units.Age: Age of the individual.Dependents: Number of dependents supported by the individual.Occupation: Type of employment or job role.City_Tier: A categorical variable representing the living area tier (e.g., Tier 1, Tier 2).Rent, Loan_Repayment, Insurance, Groceries, Transport, Eating_Out, Entertainment, Utilities, Healthcare, Education, and Miscellaneous record various monthly expenses.Desired_Savings_Percentage and Desired_Savings: Targets for monthly savings.Disposable_Income: Income remaining after all expenses are accounted for.Groceries, Transport, Eating_Out, Entertainment, Utilities, Healthcare, Education, and Miscellaneous.
Facebook
TwitterAll financial transactions made by the Intellectual Property Office as part of the Government’s commitment to transparency in expenditure
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
A comprehensive list of the columns in your dataset, along with descriptions for each:
| Column Name | Description |
|---|---|
| Company | The name of the company (e.g., Apple, Facebook). |
| Ticker | The stock ticker symbol for the company (e.g., AAPL for Apple, META for Facebook). |
| Date | The trading date for the stock data. |
| Open | The opening price of the stock for the trading day. |
| High | The highest price of the stock during the trading day. |
| Low | The lowest price of the stock during the trading day. |
| Close | The closing price of the stock for the trading day. |
| Adj Close | The adjusted closing price, which accounts for dividends and stock splits. |
| Volume | The number of shares traded during the day. |
| Market Cap | The total market value of a company's outstanding shares. |
| PE Ratio | Price-to-earnings ratio; a measure of a company's current share price relative to its per-share earnings. |
| Beta | A measure of a stock's volatility in relation to the market. |
| EPS (Earnings Per Share) | The portion of a company's profit allocated to each outstanding share of common stock. |
| Forward PE | The price-to-earnings ratio using forecasted earnings. |
| Revenue | Total revenue reported by the company. |
| Gross Profit | The profit a company makes after deducting the costs associated with making and selling its products. |
| Operating Income | The profit realized from a business's normal operations, excluding any income derived from non-operational activities. |
| Net Income | The total profit of a company after all expenses, taxes, and costs have been deducted from total revenue. |
| Debt to Equity | A financial ratio indicating the relative proportion of shareholders' equity and debt used to finance a company's assets. |
| Return on Equity (ROE) | A measure of financial performance calculated by dividing net income by shareholders' equity. |
| Current Ratio | A liquidity ratio that measures a company's ability to pay short-term obligations or those due within one year. |
| Dividends Paid | The total dividend payments made by the company. |
| Dividend Yield | A financial ratio that shows how much a company pays out in dividends each year relative to its stock price. |
| Quarterly Revenue Growth | The year-over-year percentage growth in revenue for the most recent quarter compared to the same quarter last year. |
| Analyst Recommendation | Analysts' consensus rating for the stock (e.g., buy, sell, hold). |
| Target Price | The forecasted price for the stock as estimated by analysts. |
| Free Cash Flow | The cash generated by the company after accounting for capital expenditures. |
| Operating Margin | A measure of how much... |
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This chart provides a comprehensive view of the total app spend in the Finance sector, segmented by region. It illustrates how much stores in different regions are investing in apps and software to enhance their business operations and customer services. In United States, the total app spend is significant, with stores investing $2.35M, which accounts for 64.43% of the overall app expenditure in the Finance. United Kingdom follows, with a total spend of $235.58K, making up 6.46% of the category's total. Canada also shows considerable investment in technology, with a spend of $138.72K, representing 3.81% of the total. These figures not only reveal the financial commitment of stores in each region towards technology but also indicate regional trends and priorities in the Finance market.
Facebook
TwitterThe first table of the G.20 shows seasonally adjusted data for the flows and levels of finance company receivables outstanding. These data include simple annual percent changes of total, consumer, real estate, and business receivables. The percent change in a given period is calculated as the flow of receivables in the current period divided by the level in the previous period. Percent changes and levels are calculated from unrounded data. The second and third pages of the G.20 show data that are not seasonally adjusted. The second page contains levels of outstanding receivables by receivable type, while the third page contains flow of receivables by type.
Facebook
TwitterThis (financial and personal) data is required to be kept as part of the auditing process of the co-ordinating country. It is required to be retained for several years after the ESSnet is completed.
Facebook
TwitterThis data set contains a summary of information about candidate campaigns and political committees by election year. For candidate campaigns and single-year/election committees, a single record is provided that covers all activity of the campaign for the given election year. Information for continuing political committees is summarized by calendar/reporting year. The data set covers that prior 16 years plus the current election year. The data are compiled from the campaign reports deposit (C3), campaign summary reports (C4), campaign registrations (C1/C1pc) and candidate declarations and elections data provided to the PDC by the Washington Secretary of State. Records are updated in near real-time, typically less than 2 minutes from the time the campaign submits new data. This dataset is a best-effort by the PDC to provide a complete set of records as described herewith. The PDC provides access to the original reports for the purpose of record verification. Descriptions attached to this dataset do not constitute legal definitions; please consult RCW 42.17A and WAC Title 390 for legal definitions and additional information regarding political finance disclosure requirements. CONDITION OF RELEASE: This publication and or referenced documents constitutes a list of individuals prepared by the Washington State Public Disclosure Commission and may not be used for commercial purposes. This list is provided on the condition and with the understanding that the persons receiving it agree to this statutorily imposed limitation on its use. See RCW 42.56.070(9) and AGO 1975 No. 15.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Description
The Twitter Financial News dataset is an English-language dataset containing an annotated corpus of finance-related tweets. This dataset is used to classify finance-related tweets for their sentiment.
The dataset holds 11,932 documents annotated with 3 labels:
sentiments = { "LABEL_0": "Bearish", "LABEL_1": "Bullish", "LABEL_2": "Neutral" }
The data was collected using the Twitter API. The current dataset supports the multi-class classification… See the full description on the dataset page: https://huggingface.co/datasets/zeroshot/twitter-financial-news-sentiment.
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Finance Companies; FHLB Advances; Liability, Level (BOGZ1FL613169333Q) from Q4 1945 to Q3 2025 about FHLB, advances, finance companies, companies, finance, liabilities, financial, and USA.
Facebook
TwitterTallyFormer Finance Datasets
This repository contains the cleaned and curated datasets used to train TallyFormer-Finance-51M, a compact decoder-only transformer specialized in financial language understanding. All datasets are provided in Apache Parquet format, optimized for high-throughput training and deterministic sampling.
📊 Dataset Overview
The data is organized by training stage:
1️⃣ Pretraining Data
Used for continual pretraining and general language… See the full description on the dataset page: https://huggingface.co/datasets/haidar-ali/tallyformer-finance-dataset.
Facebook
TwitterThis dataset contains a list of all campaign finance reports (C3, C4, C5, C6, and LMC) for the last 10 years including attached schedules. It includes reports that have been superseded by an amendment. The primary purpose of this dataset is for data consumers to track report amendments and to examine the reporting history for a filer. Refer to other datasets to get actual values for any of the reports referenced herewith. For candidates, the number of years is determined by the year of the election, not necessarily the year the report was filed. For political committees, the number of years is determined by the calendar year of the reporting period. This dataset is a best-effort by the PDC to provide a complete set of records as described herewith and may contain incomplete or incorrect information. The PDC provides access to the original reports for the purpose of record verification. Descriptions attached to this dataset do not constitute legal definitions; please consult RCW 42.17A and WAC Title 390 for legal definitions and additional information regarding political finance disclosure requirements. CONDITION OF RELEASE: This publication and or referenced documents constitutes a list of individuals prepared by the Washington State Public Disclosure Commission and may not be used for commercial purposes. This list is provided on the condition and with the understanding that the persons receiving it agree to this statutorily imposed limitation on its use. See RCW 42.56.070(9) and AGO 1975 No. 15.
Facebook
Twitterhttps://www.technavio.com/content/privacy-noticehttps://www.technavio.com/content/privacy-notice
Facebook
TwitterFinance Datasets
Historical stock and cryptocurrency price data.
Contents
Stocks (5 years of daily OHLCV data)
AAPL - Apple Inc. GOOGL - Alphabet Inc. MSFT - Microsoft Corp. AMZN - Amazon.com Inc. TSLA - Tesla Inc. META - Meta Platforms NVDA - NVIDIA Corp. AMD - Advanced Micro Devices INTC - Intel Corp. NFLX - Netflix Inc.
Cryptocurrencies (full history)
BTC_USD - Bitcoin ETH_USD - Ethereum SOL_USD - Solana ADA_USD - Cardano DOT_USD - Polkadot… See the full description on the dataset page: https://huggingface.co/datasets/misterdonn/finance-datasets.
Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
The "yahoo_finance_dataset(2018-2023)" dataset is a financial dataset containing daily stock market data for multiple assets such as equities, ETFs, and indexes. It spans from April 1, 2018 to March 31, 2023, and contains 1257 rows and 7 columns. The data was sourced from Yahoo Finance, and the purpose of the dataset is to provide researchers, analysts, and investors with a comprehensive dataset that they can use to analyze stock market trends, identify patterns, and develop investment strategies. The dataset can be used for various tasks, including stock price prediction, trend analysis, portfolio optimization, and risk management. The dataset is provided in XLSX format, which makes it easy to import into various data analysis tools, including Python, R, and Excel.
The dataset includes the following columns:
Date: The date on which the stock market data was recorded. Open: The opening price of the asset on the given date. High: The highest price of the asset on the given date. Low: The lowest price of the asset on the given date. Close*: The closing price of the asset on the given date. Note that this price does not take into account any after-hours trading that may have occurred after the market officially closed. Adj Close**: The adjusted closing price of the asset on the given date. This price takes into account any dividends, stock splits, or other corporate actions that may have occurred, which can affect the stock price. Volume: The total number of shares of the asset that were traded on the given date.