Facebook
TwitterThe data sets below provide selected information extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL).
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset, titled "Financial-QA-10k", contains 10,000 question-answer pairs derived from company financial reports, specifically the 10-K filings. The questions are designed to cover a wide range of topics relevant to financial analysis, company operations, and strategic insights, making it a valuable resource for researchers, data scientists, and finance professionals. Each entry includes the question, the corresponding answer, the context from which the answer is derived, the company's stock ticker, and the specific filing year. The dataset aims to facilitate the development and evaluation of natural language processing models in the financial domain.
About the Dataset Dataset Structure:
Sample Data:
Question: What area did NVIDIA initially focus on before expanding into other markets? Answer: NVIDIA initially focused on PC graphics. Context: Since our original focus on PC graphics, we have expanded into various markets. Ticker: NVDA Filing: 2023_10K
Potential Uses:
Natural Language Processing (NLP): Develop and test NLP models for question answering, context understanding, and information retrieval. Financial Analysis: Extract and analyze specific financial and operational insights from large volumes of textual data. Educational Purposes: Serve as a training and testing resource for students and researchers in finance and data science.
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Yahoo Finance dataset provides information on top traded companies. It contains financial information on each company including stock ticker and risk scores and general company information such as company location and industry. Each record in the dataset is a unique stock, where multiple stocks can be related to the same company. Yahoo Finance dataset attributes include: company name, company ID, entity type, summary, stock ticker, currency, earnings, exchange, closing price, previous close, open, bid, ask, day range, week range, volume, and much more.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset is a combination of Stanford's Alpaca (https://github.com/tatsu-lab/stanford_alpaca) and FiQA (https://sites.google.com/view/fiqa/) with another 1.3k pairs custom generated using GPT3.5 Script for tuning through Kaggle's (https://www.kaggle.com) free resources using PEFT/LoRa: https://www.kaggle.com/code/gbhacker23/wealth-alpaca-lora GitHub repo with performance analyses, training and data generation scripts, and inference notebooks: https://github.com/gaurangbharti1/wealth-alpaca… See the full description on the dataset page: https://huggingface.co/datasets/gbharti/finance-alpaca.
Facebook
TwitterSuccess.ai offers a cutting-edge solution for businesses and organizations seeking Company Financial Data on private and public companies. Our comprehensive database is meticulously crafted to provide verified profiles, including contact details for financial decision-makers such as CFOs, financial analysts, corporate treasurers, and other key stakeholders. This robust dataset is continuously updated and validated using AI technology to ensure accuracy and relevance, empowering businesses to make informed decisions and optimize their financial strategies.
Key Features of Success.ai's Company Financial Data:
Global Coverage: Access data from over 70 million businesses worldwide, including public and private companies across all major industries and regions. Our datasets span 250+ countries, offering extensive reach for your financial analysis and market research.
Detailed Financial Profiles: Gain insights into company financials, including revenue, profit margins, funding rounds, and operational costs. Profiles are enriched with key contact details, including work emails, phone numbers, and physical addresses, ensuring direct access to decision-makers.
Industry-Specific Data: Tailored datasets for sectors such as financial services, manufacturing, technology, healthcare, and energy, among others. Each dataset is customized to meet the unique needs of industry professionals and analysts.
Real-Time Accuracy: With continuous updates powered by AI-driven validation, our financial data maintains a 99% accuracy rate, ensuring you have access to the most reliable and up-to-date information available.
Compliance and Security: All data is collected and processed in strict adherence to global compliance standards, including GDPR, ensuring ethical and lawful usage.
Why Choose Success.ai for Company Financial Data?
Best Price Guarantee: We pride ourselves on offering the most competitive pricing in the industry, ensuring you receive unparalleled value for comprehensive financial data.
AI-Validated Accuracy: Our advanced AI algorithms meticulously verify every data point to ensure precision and reliability, helping you avoid costly errors in your financial decision-making.
Customized Data Solutions: Whether you need data for a specific region, industry, or type of business, we tailor our datasets to align perfectly with your requirements.
Scalable Data Access: From small startups to global enterprises, our platform caters to businesses of all sizes, delivering scalable solutions to suit your operational needs.
Comprehensive Use Cases for Financial Data:
Leverage our detailed financial profiles to create accurate budgets, forecasts, and strategic plans. Gain insights into competitors’ financial health and market positions to make data-driven decisions.
Access key financial details and contact information to streamline your M&A processes. Identify potential acquisition targets or partners with verified profiles and financial data.
Evaluate the financial performance of public and private companies for informed investment decisions. Use our data to identify growth opportunities and assess risk factors.
Enhance your sales outreach by targeting CFOs, financial analysts, and other decision-makers with verified contact details. Utilize accurate email and phone data to increase conversion rates.
Understand market trends and financial benchmarks with our industry-specific datasets. Use the data for competitive analysis, benchmarking, and identifying market gaps.
APIs to Power Your Financial Strategies:
Enrichment API: Integrate real-time updates into your systems with our Enrichment API. Keep your financial data accurate and current to drive dynamic decision-making and maintain a competitive edge.
Lead Generation API: Supercharge your lead generation efforts with access to verified contact details for key financial decision-makers. Perfect for personalized outreach and targeted campaigns.
Tailored Solutions for Industry Professionals:
Financial Services Firms: Gain detailed insights into revenue streams, funding rounds, and operational costs for competitor analysis and client acquisition.
Corporate Finance Teams: Enhance decision-making with precise data on industry trends and benchmarks.
Consulting Firms: Deliver informed recommendations to clients with access to detailed financial datasets and key stakeholder profiles.
Investment Firms: Identify potential investment opportunities with verified data on financial performance and market positioning.
What Sets Success.ai Apart?
Extensive Database: Access detailed financial data for 70M+ companies worldwide, including small businesses, startups, and large corporations.
Ethical Practices: Our data collection and processing methods are fully comp...
Facebook
TwitterAll financial transactions made by Companies House as part of the Government’s commitment to transparency in expenditure
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Sujet Finance Dataset Overview
The Sujet Finance dataset is a comprehensive collection designed for the fine-tuning of Language Learning Models (LLMs) for specialized tasks in the financial sector. It amalgamates data from 18 distinct datasets hosted on HuggingFace, resulting in a rich repository of 177,597 entries. These entries span across seven key financial LLM tasks, making Sujet Finance a versatile tool for developing and enhancing financial applications of AI.… See the full description on the dataset page: https://huggingface.co/datasets/sujet-ai/Sujet-Finance-Instruct-177k.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains financial information for the top 500 companies in India, including their market capitalization and quarterly sales. The data is categorized based on market cap and sales quartiles, allowing for detailed analysis and comparison. This dataset can be used to identify trends, patterns, and key metrics that are crucial for understanding the competitive landscape in the Indian market.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains important financial information and accounting ratios of the top 200 US Companies. Source of data in Yfiannce
Facebook
TwitterSuccess.ai’s Company Financial Data for Banking & Capital Markets Professionals in the Middle East offers a reliable and comprehensive dataset designed to connect businesses with key stakeholders in the financial sector. Covering banking executives, capital markets professionals, and financial advisors, this dataset provides verified contact details, decision-maker profiles, and firmographic insights tailored for the Middle Eastern market.
With access to over 170 million verified professional profiles and 30 million company profiles, Success.ai ensures your outreach and strategic initiatives are powered by accurate, continuously updated, and AI-validated data. Backed by our Best Price Guarantee, this solution empowers your organization to build meaningful connections in the region’s thriving financial industry.
Why Choose Success.ai’s Company Financial Data?
Verified Contact Data for Financial Professionals
Targeted Insights for the Middle East Financial Sector
Continuously Updated Datasets
Ethical and Compliant
Data Highlights:
Key Features of the Dataset:
Decision-Maker Profiles in Banking & Capital Markets
Advanced Filters for Precision Targeting
Firmographic and Leadership Insights
AI-Driven Enrichment
Strategic Use Cases:
Sales and Lead Generation
Market Research and Competitive Analysis
Partnership Development and Vendor Evaluation
Recruitment and Talent Solutions
Why Choose Success.ai?
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides a structured and machine-readable collection of financial statements filed with the Companies Registration Office (CRO) in Ireland. It currently includes financial statements for the year 2022, with additional years to be added as they become available. The dataset aligns with the European Union’s Open Data Directive (Directive (EU) 2019/1024) and the Implementing Regulation (EU) 2023/138, which designates company and company ownership data as a high-value dataset. It is available for bulk download and API access under the Creative Commons Attribution 4.0 (CC BY 4.0) licence, allowing unrestricted reuse with appropriate attribution. By increasing transparency and enabling data-driven insights, this dataset supports public sector initiatives, financial analysis, and digital services development. The API endpoints can be accessed using these links - Query - https://opendata.cro.ie/api/3/action/datastore_search Query (via SQL) - https://opendata.cro.ie/api/3/action/datastore_search_sql
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Finance-Instruct-500k Dataset
Overview
Finance-Instruct-500k is a comprehensive and meticulously curated dataset designed to train advanced language models for financial tasks, reasoning, and multi-turn conversations. Combining data from numerous high-quality financial datasets, this corpus provides over 500,000 entries, offering unparalleled depth and versatility for finance-related instruction tuning and fine-tuning. The dataset includes content tailored for financial… See the full description on the dataset page: https://huggingface.co/datasets/Josephgflowers/Finance-Instruct-500k.
Facebook
TwitterAdapting LLMs to Domains via Continual Pre-Training (ICLR 2024)
This repo contains the evaluation datasets for our paper Adapting Large Language Models via Reading Comprehension. We explore continued pre-training on domain-specific corpora for large language models. While this approach enriches LLMs with domain knowledge, it significantly hurts their prompting ability for question answering. Inspired by human learning via reading comprehension, we propose a simple method to… See the full description on the dataset page: https://huggingface.co/datasets/AdaptLLM/finance-tasks.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset offers a comprehensive overview of over 50,000 companies, detailing essential information such as website domain, company name, and stock ticker. It also includes variables indicating companies' involvement in specific controversial activities. This rich dataset is ideal for a variety of analyses, including ESG (Environmental, Social, and Governance) assessments, allowing users to explore corporate practices concerning these significant metrics. Whether you're a financial analyst, academic researcher, or data enthusiast, this dataset provides fertile ground for in-depth analysis of the global corporate landscape.
-**domain**: The official web domain of the company.
-**name**: The name of the company.
-**ticker** : The stock ticker symbol used for the company in stock exchanges.
-**involvement**: Indicators of the company's involvement in various controversial sectors: 1. Alcoholic Beverages: Involvement in alcoholic beverages. 2. Adult Entertainment: Involvement in adult entertainment. 3. Gambling: Involvement in gambling. 4. Tobacco Products: Involvement in tobacco products. 5. Animal Testing: Involvement in animal testing. 6. Fur and Specialty Leather: Involvement in fur and specialty leather. 7. Controversial Weapons: Involvement in controversial weapons. 8. Small Arms: Involvement in small arms. 9. Catholic Values: Involvement in activities contrary to Catholic values. 10. GMO: Involvement in genetically modified organisms. 11. Military Contracting: Involvement in military contracting. 12. Pesticides: Involvement in pesticides. 13. Thermal Coal: Involvement in thermal coal. 14. Palm Oil: Involvement in palm oil.
-**employees**: The number of employees in the company.
-**industry**: The industrial sector to which the company belongs.
-**sector**: The market sector to which the company belongs.
-**altman_score**: The company's Altman Z-score, used to predict bankruptcy risk.
-**piotroski_score**: The company's Piotroski F-score, used to assess the financial health of the company.
-**Controversies**: Indicators of the company's ESG controversies: 1. Environment: Environmental controversies. 2. Social: Social controversies. 3. Customers: Customer-related controversies. 4. Human Rights & Community: Human rights and community-related controversies. 5. Labor Rights & Supply Chain: Labor rights and supply chain-related controversies. 6. Governance: Governance-related controversies.
-**Decarbonization Target**: The company's decarbonization targets: 1. Target Year: The target year for the decarbonization goal. 2. Comprehensiveness: The percentage comprehensiveness of the decarbonization target. 3. Ambition p.a.: The annual percentage ambition of the decarbonization target. 4. Decarbonization Target: Whether the company has a decarbonization target. 5. Decarbonization Target on Temperature Rise: Whether the decarbonization target addresses temperature rise. 6. Temperature Goal: The company's temperature goal in degrees Celsius.
-**sdg**: The company's alignment with the United Nations Sustainable Development Goals: 1. No Poverty: No poverty. 2. No Hunger: Zero hunger. 3. Good Health and Well-Being: Good health and well-being. 4. Quality Education: Quality education. 5. Gender Equality: Gender equality. 6. Clean Water and Sanitation: Clean water and sanitation. 7. Affordable and Clean Energy: Affordable and clean energy. 8. Decent Work and Economic Growth: Decent work and economic growth. 9. Industry, Innovation and Infrastructure: Industry, innovation and infrastructure. 10. Reduced Inequalities: Reduced inequalities. 11. Sustainable Cities and Communities: Sustainable cities and communities. 12. Responsible Consumption and Production: Responsible consumption and production. 13. Climate Action: Climate action. 14. Life under Water: Life under water. 15. Life on Land: Life on land. 16. Peace, Justice and Strong Institutions: Peace, justice and strong institutions. 17. Partnerships for the Goals: Partnerships for the goals.
-**esg**: The company's overall Environmental, Social, and Governance (ESG) score.
-**involvement_msci**: The company's involvement in controversial sectors according to MSCI: 1. Controversial Weapons: Involvement in controversial weapons. 2. Gambling: Involvement in gambling. 3. Tobacco Products: Involvement in tobacco products. 4. Alcoholic Beverages: Involvement in alcoholic beverages.
The data may be incorrect, being scraped from YahooFinance.com, Investing.com, StockAnalysis.com, MSCI
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Agent Finance Reasoning
This dataset includes traces and associated metadata from agentic interactions on a Financial Reasoning task. We built the agentic system in langgraph and ReAct agents. In each sample, the agent is given a question that must be answered with information from one or more company 10-K's. The 10-K's are accessible to the agents through tools. Our question-answer pairs have been carefully co-created with Snorkel's Expert Data-as-a-Service network of financial… See the full description on the dataset page: https://huggingface.co/datasets/snorkelai/agent-finance-reasoning.
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Stay informed with our comprehensive Financial News Dataset, designed for investors, analysts, and businesses to track market trends, monitor financial events, and make data-driven decisions.
Dataset Features
Financial News Articles: Access structured financial news data, including headlines, summaries, full articles, publication dates, and source details. Market & Economic Indicators: Track financial reports, stock market updates, economic forecasts, and corporate earnings announcements. Sentiment & Trend Analysis: Analyze news sentiment, categorize articles by financial topics, and monitor emerging trends in global markets. Historical & Real-Time Data: Retrieve historical financial news archives or access continuously updated feeds for real-time insights.
Customizable Subsets for Specific Needs Our Financial News Dataset is fully customizable, allowing you to filter data based on publication date, region, financial topics, sentiment, or specific news sources. Whether you need broad coverage for market research or focused data for investment analysis, we tailor the dataset to your needs.
Popular Use Cases
Investment Strategy & Risk Management: Monitor financial news to assess market risks, identify investment opportunities, and optimize trading strategies. Market & Competitive Intelligence: Track industry trends, competitor financial performance, and economic developments. AI & Machine Learning Training: Use structured financial news data to train AI models for sentiment analysis, stock prediction, and automated trading. Regulatory & Compliance Monitoring: Stay updated on financial regulations, policy changes, and corporate governance news. Economic Research & Forecasting: Analyze financial news trends to predict economic shifts and market movements.
Whether you're tracking stock market trends, analyzing financial sentiment, or training AI models, our Financial News Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In dataset, 324 corporate customers were taken as reference. Dataset were created from information's between January 2022 and September 2023 time intervals. Financial ratios are calculated by taking financial information of these customers from balance sheet and income statement items.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This repository contains a meticulously scraped dataset from various financial websites. The data extraction process ensures high-quality and accurate text, including content from both the websites and their embedded PDFs.
We applied the advanced Mixtral 7X8 model to generate the following additional fields:
The prompt used to generate the additional fields was highly effective, thanks to extensive discussions and collaboration with the Mistral AI team. This ensures that the dataset provides valuable insights and is ready for further analysis and model training.
This dataset can be used for various applications, including but not limited to:
Facebook
TwitterThe first table of the G.20 shows seasonally adjusted data for the flows and levels of finance company receivables outstanding. These data include simple annual percent changes of total, consumer, real estate, and business receivables. The percent change in a given period is calculated as the flow of receivables in the current period divided by the level in the previous period. Percent changes and levels are calculated from unrounded data. The second and third pages of the G.20 show data that are not seasonally adjusted. The second page contains levels of outstanding receivables by receivable type, while the third page contains flow of receivables by type.
Facebook
TwitterThe FR 3033p is the first part of a two-stage survey series, which has been conducted at regular five-year intervals since 1955. It is a census survey designed to identify the universe of finance companies eligible for potential inclusion in the FR 3033s. It gathers limited information including total assets, areas of specialization, and information on the corporate structure of such companies. The second part of these information collections, the FR 3033s, collects balance sheet data on major categories of consumer and business credit receivables and major liabilities, along with income and expenses, and is used to gather information on the scope of a company's operations and loan and lease servicing activities. In addition, additional questions were added to collect lending information related to the COVID-19 impacts.
Facebook
TwitterThe data sets below provide selected information extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL).