82 datasets found

Financial Statements of Major Companies(2009-2023)
kaggle.com
Updated Dec 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rishabh Patil (2023). Financial Statements of Major Companies(2009-2023) [Dataset]. https://www.kaggle.com/datasets/rish59/financial-statements-of-major-companies2009-2023
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 1, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Rishabh Patil
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
This is a compiled datasets comprising of data from various companies' 10-K annual reports and balance sheets. The data is a longitudinal or panel data, from year 2009-2022(/23) and also consists of a few bankrupt companies to help for investigating factors. The names of the companies are given according to their Stocks. Companies divided into specific categories.
Financial Statement Data Sets
kaggle.com
Updated Jul 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vadim Vanak (2025). Financial Statement Data Sets [Dataset]. https://www.kaggle.com/datasets/vadimvanak/company-facts-2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 4, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Vadim Vanak
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset offers a detailed collection of US-GAAP financial data extracted from the financial statements of exchange-listed U.S. companies, as submitted to the U.S. Securities and Exchange Commission (SEC) via the EDGAR database. Covering filings from January 2009 onwards, this dataset provides key financial figures reported by companies in accordance with U.S. Generally Accepted Accounting Principles (GAAP).

Dataset Features:

Data Scope: The dataset is restricted to figures reported under US-GAAP standards, with the exception of EntityCommonStockSharesOutstanding and EntityPublicFloat.

Currency and Units: The dataset exclusively includes figures reported in USD or shares, ensuring uniformity and comparability. It excludes ratios and non-financial metrics to maintain focus on financial data.

Company Selection: The dataset is limited to companies with U.S. exchange tickers, providing a concentrated analysis of publicly traded firms within the United States.

Submission Types: The dataset only incorporates data from 10-Q, 10-K, 10-Q/A, and 10-K/A filings, ensuring consistency in the type of financial reports analyzed.

Data Sources and Extraction:

This dataset primarily relies on the SEC's Financial Statement Data Sets and EDGAR APIs: - SEC Financial Statement Data Sets - EDGAR Application Programming Interfaces

In instances where specific figures were missing from these sources, data was directly extracted from the companies' financial statements to ensure completeness.

Please note that the dataset presents financial figures exactly as reported by the companies, which may occasionally include errors. A common issue involves incorrect reporting of scaling factors in the XBRL format. XBRL supports two tag attributes related to scaling: 'decimals' and 'scale.' The 'decimals' attribute indicates the number of significant decimal places but does not affect the actual value of the figure, while the 'scale' attribute adjusts the value by a specific factor.

However, there are several instances, numbering in the thousands, where companies have incorrectly used the 'decimals' attribute (e.g., 'decimals="-6"') under the mistaken assumption that it controls scaling. This is not correct, and as a result, some figures may be inaccurately scaled. This dataset does not attempt to detect or correct such errors; it aims to reflect the data precisely as reported by the companies. A future version of the dataset may be introduced to address and correct these issues.

The source code for data extraction is available here
d
Financial Statement Data Sets
catalog.data.gov
s.cnmilf.com
Updated Jul 9, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Economic and Risk Analysis (2025). Financial Statement Data Sets [Dataset]. https://catalog.data.gov/dataset/financial-statement-data-sets
Explore at:
Dataset updated
Jul 9, 2025
Dataset provided by
Economic and Risk Analysis
Description
The data sets below provide selected information extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL).
Data from: Company Financials Dataset
kaggle.com
Updated Aug 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Atharva Arya (2023). Company Financials Dataset [Dataset]. https://www.kaggle.com/datasets/atharvaarya25/financials
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 1, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Atharva Arya
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
This is a dataset that requires a lot of preprocessing with amazing EDA insights for a company. A dataset consisting of sales and profit data sorted by market segment and country/region.

Tips for pre-processing: 1. Check for column names and find error there itself!! 2. Remove '$' sign and '-' from all columns where they are present 3. Change datatype from objects to int after the above two. 4. Challenge: Try removing " , " (comma) from all numerical numbers. 5. Try plotting sales and profit with respect to timeline
d
Financial Statements API - 50,000+ Companies Covered
datarade.ai
.json, .csv
Updated Oct 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Financial Modeling Prep (2022). Financial Statements API - 50,000+ Companies Covered [Dataset]. https://datarade.ai/data-products/financial-statements-api-50-000-companies-covered-financial-modeling-prep
Explore at:
.json, .csvAvailable download formats
Dataset updated
Oct 28, 2022
Dataset authored and provided by
Financial Modeling Prep
Area covered
Colombia, Hungary, Greece, Switzerland, Spain, Thailand, United States of America, Singapore, Norway, Germany
Description
Our Financial API provides access to a vast collection of historical financial statements for over 50,000+ companies listed on major exchanges. With this powerful tool, you can easily retrieve balance sheets, income statements, and cash flow statements for any company in our extensive database. Stay informed about the financial health of various organizations and make data-driven decisions with confidence. Our API is designed to deliver accurate and up-to-date financial information, enabling you to gain valuable insights and streamline your analysis process. Experience the convenience and reliability of our company financial API today.
a
S.Korea Financial statements datasets
aiceltech.com
Updated Jun 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KED Aicel (2024). S.Korea Financial statements datasets [Dataset]. https://www.aiceltech.com/datasets/financial-statements
Explore at:
Dataset updated
Jun 23, 2024
Dataset authored and provided by
KED Aicel
License
https://www.aiceltech.com/termshttps://www.aiceltech.com/terms
Time period covered
2016 - 2024
Area covered
South Korea
Description
Korean Companies’ Financial Data provides important information to analyze a company’s financial status and performance. This data includes financial indicators such as revenue, expenses, assets, and liabilities. Collected from corporate financial reports and stock market data, it helps investors evaluate financial health and discover investment opportunities, essential for valuing Korean companies.
Consolidated Financial Statements for Bank Holding Companies, Parent Company...
catalog.data.gov
catalog-dev.data.gov
Updated Dec 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Board of Governors of the Federal Reserve System (2024). Consolidated Financial Statements for Bank Holding Companies, Parent Company Only Financial Statements for Large Holding Companies, Parent Company Only Financial Statements for Small Holding Companies, Financial Statements Employee Stock Ownership Plan Holding Companies, Supplement to the Consolidated Financial Statements for Bank Holding Companies [Dataset]. https://catalog.data.gov/dataset/consolidated-financial-statements-for-bank-holding-companies-parent-company-only-financial
Explore at:
Dataset updated
Dec 18, 2024
Dataset provided by
Federal Reserve Systemhttp://www.federalreserve.gov/
Federal Reserve Board of Governors
Description
The Financial Statements of Holding Companies (FR Y-9 Reports) collects standardized financial statements from domestic holding companies (HCs). This is pursuant to the Bank Holding Company Act of 1956, as amended (BHC Act), and the Home Owners Loan Act (HOLA). The FR Y-9C is used to identify emerging financial risks and monitor the safety and soundness of HC operations. HCs file the FR Y-9C and FR Y-9LP quarterly, the FR Y-9SP semiannually, the FR Y-9ES annually, and the FR Y-9CS on a schedule that is determined when this supplement is used.
Financial Statement Extracts
kaggle.com
Updated Sep 13, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Securities and Exchange Commission (2017). Financial Statement Extracts [Dataset]. https://www.kaggle.com/securities-exchange-commission/financial-statement-extracts/notebooks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 13, 2017
Dataset provided by
Kaggle
Authors
Securities and Exchange Commission
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The Financial Statement Data Sets below provide numeric information from the face financials of all financial statements. This data is extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL). As compared to the more extensive Financial Statement and Notes Data Sets, which provide the numeric and narrative disclosures from all financial statements and their notes, the Financial Statement Data Sets are more compact.

The information is presented without change from the "as filed" financial reports submitted by each registrant. The data is presented in a flattened format to help users analyze and compare corporate disclosure information over time and across registrants. The data sets also contain additional fields including a company's Standard Industrial Classification to facilitate the data's use.

Content

Each quarter's data is stored as a json of the original text files. This was necessary to limit the overall number of files. The num.txt file will likely be of most interest.

Acknowledgements

This dataset was kindly made available by the SEC. You can find the original dataset, which is updated quarterly, here.
Z
Annual Reports Assessment Dataset
data.niaid.nih.gov
Updated Jan 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sisodia Yogendra (2023). Annual Reports Assessment Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7536331
Explore at:
Dataset updated
Jan 14, 2023
Dataset authored and provided by
Sisodia Yogendra
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Annual reports Assessment Dataset

This dataset will help investors, merchant bankers, credit rating agencies, and the community of equity research analysts explore annual reports in a more automated way, saving them time.

Following Sub Dataset(s) are there :

a) pdf and corresponding OCR text of 100 Indian annual reports These 100 annual reports are for the 100 largest companies listed on the Bombay Stock Exchange. The total number of words in OCRed text is 12.25 million.

b) A Few Examples of Sentences with Corresponding Classes The author defined 16 widely used topics used in the investment community as classes like:

Accounting Standards

Accounting for Revenue Recognition

Corporate Social Responsbility

Credit Ratings

Diversity Equity and Inclusion

Electronic Voting

Environment and Sustainability

Hedging Strategy

Intellectual Property Infringement Risk

Litigation Risk

Order Book

Related Party Transaction

Remuneration

Research and Development

Talent Management

Whistle Blower Policy

These classes should help generate ideas and investment decisions, as well as identify red flags and early warning signs of trouble when everything appears to be proceeding smoothly.

ABOUT DATA ::

"scrips.json" is a json with name of companies "SC_CODE" is BSE Scrip Id "SC_NAME" is Listed Companies Name "NET_TURNOV" is Turnover on the day of consideration

"source_pdf" is folder containing both PDF and OCR Output from Tesseract "raw_pdf.zip" contains raw PDF and it can be used to try another OCR. "ocr.zip" contains json file (annual_report_content.json) containing OCR text for each pdf. "annual_report_content.json" is an array of 100 elements and each element is having two keys "file_name" and "content"

"classif_data_rank_freezed.json" is used for evaluation of results contains "sentence" and corresponding "class"
Financial Documents Clustering
kaggle.com
Updated May 24, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr.Anonymous (2021). Financial Documents Clustering [Dataset]. https://www.kaggle.com/drcrabkg/financial-statements-clustering/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 24, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Dr.Anonymous
Description
Every public company publishes a financial report to declare the financial activities and position of a business. This financial statement contains many tables to present the information. We classify these tables into predefined categories, such as below.

1) Income Statements 2) Balance Sheets 3) Cash Flows 4) Notes 5) Others

Datasets: Within the given dataset you will find 5 folders with the above category names. Every folder contains .html files with respective tabular data.

Expecting the grouping of documents in such a way that the files appear distinguished as per their category. The categories can only be used as a benchmark for evaluation later.

Data extracted: The data has been taken from the Publically available Hexaware Technologies financial annual reports. You can find here on link https://hexaware.com/investors/

Thank you for your Patience, Enjoy the dataset and Explore and learn more. Peace out✌️
US Company Filings Database
lseg.com
Updated Feb 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LSEG (2025). US Company Filings Database [Dataset]. https://www.lseg.com/en/data-analytics/financial-data/filings/company-filings-database
Explore at:
csv,html,json,pdf,python,text,user interface,xmlAvailable download formats
Dataset updated
Feb 3, 2025
Dataset provided by
London Stock Exchange Grouphttp://www.londonstockexchangegroup.com/
Authors
LSEG
License
https://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
Area covered
United States
Description
Browse LSEG's US Company Filings Database, and find a range of filings content and history including annual reports, municipal bonds, and more.
Financial Statements of U.S. Nonbank Subsidiaries of U.S. Holding Companies
catalog.data.gov
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Board of Governors of the Federal Reserve System (2024). Financial Statements of U.S. Nonbank Subsidiaries of U.S. Holding Companies [Dataset]. https://catalog.data.gov/dataset/financial-statements-of-u-s-nonbank-subsidiaries-of-u-s-holding-companies
Explore at:
Dataset updated
Dec 18, 2024
Dataset provided by
Federal Reserve Systemhttp://www.federalreserve.gov/
Federal Reserve Board of Governors
Area covered
United States
Description
The Financial Statements of U.S. Nonbank Subsidiaries of U.S. Holding Companies (FR Y-11; FR Y-11S) reporting forms collect financial information for individual nonfunctional regulated U.S. nonbank subsidiaries of domestic holding companies, which is essential for monitoring the subsidiaries' potential impact on the condition of the holding company or its subsidiary banks. Holding companies file the FR Y-11 on a quarterly or annual basis or the FR Y-11S on an annual basis, predominantly based on whether the organization meets certain asset size thresholds. The FR Y-11 data are used with other holding company data to assess the condition of holding companies that are heavily engaged in nonbanking activities and to monitor the volume, nature, and condition of their nonbanking operations.
E
European Financial Filings Database
financialreports.eu
json
Updated 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FinancialReports UG (2024). European Financial Filings Database [Dataset]. https://financialreports.eu/
Explore at:
jsonAvailable download formats
Dataset updated
2024
Dataset authored and provided by
FinancialReports UG
Time period covered
2022 - 2024
Area covered
Europe
Description
Comprehensive database of over 100,000 financial filings from 8,000+ European companies
d
CTOS Basis Private Companies Financials Data
datarade.ai
Updated Aug 7, 1980
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CTOS Basis (1980). CTOS Basis Private Companies Financials Data [Dataset]. https://datarade.ai/data-products/ctos-basis-private-companies-financials-data-ctos-basis
Explore at:
.bin, .json, .xml, .csv, .xls, .sqlAvailable download formats
Dataset updated
Aug 7, 1980
Dataset authored and provided by
CTOS Basis
Area covered
Cuba, Netherlands, Singapore, Curaçao, United Republic of, India, Malaysia, Macao, Kuwait, Suriname
Description
Our comprehensive and advanced database is completed with all the information you need, with up to >1.5 million company financial records at your disposal. This allows you to easily perform company search on company profile and company directory, with 99% coverage in Malaysia.

Our database also contains company profiles on private limited or limited companies globally, including information such as shareholders and financial accounts can be accessed instantly.
o
Yahoo Finance Business Information Dataset
opendatabay.com
.undefined
Updated Jun 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bright Data (2025). Yahoo Finance Business Information Dataset [Dataset]. https://www.opendatabay.com/data/premium/c7c8bf69-7728-4527-a2a2-7d1506e02263
Explore at:
.undefinedAvailable download formats
Dataset updated
Jun 23, 2025
Dataset authored and provided by
Bright Data
Area covered
Finance & Banking Analytics
Description
Yahoo Finance Business Information dataset to access comprehensive details on companies, including financial data and business profiles. Popular use cases include market analysis, investment research, and competitive benchmarking.

Use our Yahoo Finance Business Information dataset to access comprehensive financial and corporate data, including company profiles, stock prices, market capitalization, revenue, and key performance metrics. This dataset is tailored for financial analysts, investors, and researchers to analyze market trends and evaluate company performance.

Popular use cases include investment research, competitor benchmarking, and trend forecasting. Leverage this dataset to make informed financial decisions, identify growth opportunities, and gain a deeper understanding of the business landscape.

Dataset Features

name: Represents the company name.

company_id: Unique identifier assigned to each company.

entity_type: Denotes the type/category of the business entity.

summary: A brief description or summary of the company.

stock_ticker: The ticker symbol used for trading on stock exchanges.

currency: The currency in which financial values are expressed.

earnings_date: The date for the reported earnings.

exchange: The stock exchange on which the company is listed.

closing_price: The final stock price at the end of the trading day.

previous_close: The stock price at the close of the previous trading day.

open: The price at which the stock opened for the trading day.

bid: The current highest price that a buyer is willing to pay for the stock.

ask: The current lowest price that a seller is willing to accept.

day_range: The range between the lowest and highest prices during the trading day.

week_range: A broader price range over the past week.

volume: Number of shares that traded in the session.

avg_volume: Average daily share volume over a specific period.

market_cap: Total market capitalization of the company.

beta: A measure of the stock's volatility in comparison to the market.

pe_ratio: Price-to-earnings ratio for valuation.

eps: Earnings per share.

dividend_yield: Dividend yield percentage.

ex_dividend_date: The date on which the stock trades without the right to the declared dividend.

target_est: The analyst's target price estimate.

url: The URL to more detailed company information.

people_also_watch: Companies frequently watched alongside this company.

similar: Other companies with similar profiles.

risk_score: A quantified risk score.

risk_score_text: A textual interpretation of the risk score.

risk_score_percentile: The risk score expressed in percentile terms.

recommendation_rating: Analyst recommendation ratings.

analyst_price_target: Analyst provided stock price target.

company_profile_address: Company address from the profile.

company_profile_website: URL for the company’s website.

company_profile_phone: Contact phone number.

company_profile_sector: The sector in which the company operates.

company_profile_industry: Industry classification of the company.

company_profile_employees: Number of employees in the company.

company_profile_description: A detailed profile description of the company.

valuation_measures: Contains key valuation ratios and metrics such as enterprise value, price-to-book, and price-to-sales ratios.

Financial_highlights: Offers summary financial statistics including EPS, profit margin, revenue, and cash flow indicators.

financials: This column appears to provide financial statement data.

financials_quarterly: Similar to the previous field but intended to capture quarterly financial figures.

earnings_estimate: Contains consensus earnings estimates including average, high, and low estimates along with the number of analysts involved.

revenue_estimate: Provides revenue estimates with details such as average estimate, high and low values, and sales growth factors.

earnings_history: This field tracks historical earnings and surprises by comparing actual EPS with estimates.

eps_trend: Contains information on how the EPS has trended over various recent time intervals.

eps_revisions: Captures recent changes in EPS forecasts.

growth_estimates: Offers projections related to growth prospects over different time horizons.

top_analysts: Intended to list the top analysts covering the company.

upgrades_and_downgrades: This field shows recent analyst upgrades or downgrades.

recent_news: Meant to contain recent news articles related to the company.

fanacials_currency: Appears to indicate the currency used for financial reporting or valuation in the dataset.

**company_profile_he
Data from: SEC Filings
kaggle.com
zip
Updated Jun 5, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Google BigQuery (2020). SEC Filings [Dataset]. https://www.kaggle.com/datasets/bigquery/sec-filings
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Jun 5, 2020
Dataset provided by
BigQueryhttps://cloud.google.com/bigquery
Authors
Google BigQuery
Description
In the U.S. public companies, certain insiders and broker-dealers are required to regularly file with the SEC. The SEC makes this data available online for anybody to view and use via their Electronic Data Gathering, Analysis, and Retrieval (EDGAR) database. The SEC updates this data every quarter going back to January, 2009. For more information please see this site.

To aid analysis a quick summary view of the data has been created that is not available in the original dataset. The quick summary view pulls together signals into a single table that otherwise would have to be joined from multiple tables and enables a more streamlined user experience.

DISCLAIMER: The Financial Statement and Notes Data Sets contain information derived from structured data filed with the Commission by individual registrants as well as Commission-generated filing identifiers. Because the data sets are derived from information provided by individual registrants, we cannot guarantee the accuracy of the data sets. In addition, it is possible inaccuracies or other errors were introduced into the data sets during the process of extracting the data and compiling the data sets. Finally, the data sets do not reflect all available information, including certain metadata associated with Commission filings. The data sets are intended to assist the public in analyzing data contained in Commission filings; however, they are not a substitute for such filings. Investors should review the full Commission filings before making any investment decision.
d
Dataset of companies’ profitability, government debt, Financial Statements'...
search.dataone.org
dataverse.harvard.edu
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mgammal, Mahfoudh; Al-Matari, Ebrahim (2023). Dataset of companies’ profitability, government debt, Financial Statements' Key Indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts [Dataset]. http://doi.org/10.7910/DVN/HEL3YG
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/HEL3YG
Dataset updated
Nov 8, 2023
Dataset provided by
Harvard Dataverse
Authors
Mgammal, Mahfoudh; Al-Matari, Ebrahim
Description
The dataset included with this article contains three files describing and defining the sample and variables for VAT impact, and Excel file 1 consists of all raw and filtered data for the variables for the panel data sample. Excel file 2 depicts time-series and cross-sectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020. Excel file 3 presents the raw material of variables used in measuring the company's profitability of the panel data sample
S&P Compustat Database
lseg.com
sql
Updated Nov 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LSEG (2024). S&P Compustat Database [Dataset]. https://www.lseg.com/en/data-analytics/financial-data/company-data/fundamentals-data/standardized-fundamentals/sp-compustat-database
Explore at:
sqlAvailable download formats
Dataset updated
Nov 25, 2024
Dataset provided by
London Stock Exchange Grouphttp://www.londonstockexchangegroup.com/
Authors
LSEG
License
https://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
Description
Access historical and point-in-time financial statements, ratios, multiples, and press releases, with LSEG's S&P Compustat Database.
h
Financial-Fraud-Dataset
huggingface.co
Updated Mar 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Amit Shushil Kedia (2024). Financial-Fraud-Dataset [Dataset]. https://huggingface.co/datasets/amitkedia/Financial-Fraud-Dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 6, 2024
Authors
Amit Shushil Kedia
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Card for Financial Fraud Labeled Dataset

Dataset Details

This dataset collects financial filings from various companies submitted to the U.S. Securities and Exchange Commission (SEC). The dataset consists of 85 companies involved in fraudulent cases and an equal number of companies not involved in fraudulent activities. The Fillings column includes information such as the company's MD&A, and financial statement over the years the company stated on the SEC… See the full description on the dataset page: https://huggingface.co/datasets/amitkedia/Financial-Fraud-Dataset.
A
‘📊 Financial market screener’ analyzed by Analyst-2
analyst-2.ai
Updated Feb 1, 2001
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2001). ‘📊 Financial market screener’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-financial-market-screener-c319/db8cf920/?iid=003-370&v=presentation
Explore at:
Dataset updated
Feb 1, 2001
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘📊 Financial market screener’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/pierrelouisdanieau/financial-market-screener on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Context

In this dataset you will find several characteristics on global companies listed on the stock exchange. These characteristics are analyzed by millions of investors before they invest their money.

Analyze the stock market performance of thousands of companies ! This is the objective of this dataset !

Content

Among thse charateristics you will find :

The symbol : The stock symbol is a unique series of letters assigned to a security for trading purposes.

The shortname : The name of the company

The sector : The sector of the company (Technology, Financial services, consumer cyclical...)

The country : The location of the head office.

The market capitalisation : Market capitalization refers to the total dollar market value of a company's outstanding shares of stock. It is calculated by multiplying the total number of a company's outstanding shares by the current market price of one share.

The current ratio : The current ratio is a liquidity ratio that measures a company’s ability to pay short-term obligations. A current ratio that is in line with the industry average or slightly higher is generally considered acceptable. A current ratio that is lower than the industry average may indicate a higher risk of distress or default.

The beta : Beta is a measure of a stock's volatility in relation to the overall market. A beta greater than 1.0 suggests that the stock is more volatile than the broader market, and a beta less than 1.0 indicates a stock with lower volatility.

The dividend rate : Represents the ratio of a company's annual dividend compared to its share price. (%)

All this data is public data, obtained from the annual financial reports of these companies. They have been retrieved from the Yahoo Finance API and have been checked beforehand.

Inspiration

This dataset has been designed so that it is possible to build a recommendation engine. For example, from an existing position in a portfolio, recommend an alternative with similar characteristics (sector, market capitalization, current ratio,...) but more in line with an investor's expectations (may be with less risk or with more dividends etc...)

If you have question about this dataset you can contact me

--- Original source retains full ownership of the source dataset ---

Facebook

Twitter

Click to copy link

Link copied

Cite

Rishabh Patil (2023). Financial Statements of Major Companies(2009-2023) [Dataset]. https://www.kaggle.com/datasets/rish59/financial-statements-of-major-companies2009-2023

Financial Statements of Major Companies(2009-2023)

Data from 10-K reports and balance sheets.

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Dec 1, 2023

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Rishabh Patil

License

http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

Description

This is a compiled datasets comprising of data from various companies' 10-K annual reports and balance sheets. The data is a longitudinal or panel data, from year 2009-2022(/23) and also consists of a few bankrupt companies to help for investigating factors. The names of the companies are given according to their Stocks. Companies divided into specific categories.

Clear search

Close search

Google apps

Main menu

Financial Statements of Major Companies(2009-2023)

Financial Statement Data Sets

Dataset Features:

Data Sources and Extraction:

Financial Statement Data Sets

Data from: Company Financials Dataset

Financial Statements API - 50,000+ Companies Covered

S.Korea Financial statements datasets

Consolidated Financial Statements for Bank Holding Companies, Parent Company...

Financial Statement Extracts

Content

Acknowledgements

Annual Reports Assessment Dataset

Financial Documents Clustering

US Company Filings Database

Financial Statements of U.S. Nonbank Subsidiaries of U.S. Holding Companies

European Financial Filings Database

CTOS Basis Private Companies Financials Data

Yahoo Finance Business Information Dataset

Dataset Features

Data from: SEC Filings

Dataset of companies’ profitability, government debt, Financial Statements'...

S&P Compustat Database

Financial-Fraud-Dataset

‘📊 Financial market screener’ analyzed by Analyst-2

Context

Content

Inspiration

Financial Statements of Major Companies(2009-2023)

Data from 10-K reports and balance sheets.