Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides a structured and machine-readable collection of financial statements filed with the Companies Registration Office (CRO) in Ireland. It currently includes financial statements for the year 2022, with additional years to be added as they become available. The dataset aligns with the European Union’s Open Data Directive (Directive (EU) 2019/1024) and the Implementing Regulation (EU) 2023/138, which designates company and company ownership data as a high-value dataset. It is available for bulk download and API access under the Creative Commons Attribution 4.0 (CC BY 4.0) licence, allowing unrestricted reuse with appropriate attribution. By increasing transparency and enabling data-driven insights, this dataset supports public sector initiatives, financial analysis, and digital services development. The API endpoints can be accessed using these links - Query - https://opendata.cro.ie/api/3/action/datastore_search Query (via SQL) - https://opendata.cro.ie/api/3/action/datastore_search_sql
Facebook
TwitterFinancial Statement Analysis Dataset
This dataset provides a comprehensive collection of financial statement data from various companies, covering key financial metrics used for financial statement analysis. It includes information from income statements, balance sheets, and cash flow statements, enabling users to perform ratio analysis, trend analysis, and predictive modeling.
The dataset is collected from publicly available financial reports and regulatory filings. Users should verify data accuracy before making financial decisions. This dataset is for educational and research purposes only.
📥 Download, analyze, and gain insights into financial health! 🚀
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Explore the dynamic landscape of the Indian stock market with this extensive dataset featuring 4456 companies listed on both the National Stock Exchange (NSE) and Bombay Stock Exchange (BSE). Gain insights into each company's financial performance, quarterly and yearly profit and loss statements, balance sheets, cash flow data, and essential financial ratios. Dive deep into the intricacies of shareholding patterns, tracking the movements of promoters, foreign and domestic institutional investors, and the public.
This dataset is a rich resource for financial analysts, investors, and data enthusiasts. Perform thorough company evaluations, sector-wise comparisons, and predictive modeling. With figures presented in crore rupees, leverage the dataset for in-depth exploratory data analysis, time series forecasting, and machine learning applications. Stay tuned for updates as we enrich this dataset for a deeper understanding of the Indian stock market landscape. Unlock the potential of data-driven decision-making with this comprehensive repository of financial information.
4492 NSE & BSE Companies
Company_name folder
Company_name.csv
Quarterly_Profit_Loss.csv
Yearly_Profit_Loss.csv
Yearly_Balance_Sheet.csv
Yearly_Cash_flow.csv
Ratios.csv.csv
Quarterly_Shareholding_Pattern.csv
Yearly_Shareholding_Pattern.csv
Company_name.csv- `Company_name`: Name of the company.
- `Sector`: Industry sector of the company.
- `BSE`: Bombay Stock Exchange code.
- `NSE`: National Stock Exchange code.
- `Market Cap`: Market capitalization of the company.
- `Current Price`: Current stock price.
- `High/Low`: Highest and lowest stock prices.
- `Stock P/E`: Price to earnings ratio.
- `Book Value`: Book value per share.
- `Dividend Yield`: Dividend yield percentage.
- `ROCE`: Return on capital employed percentage.
- `ROE`: Return on equity percentage.
- `Face Value`: Face value of the stock.
- `Price to Sales`: Price to sales ratio.
- `Sales growth (1, 3, 5, 7, 10 years)`: Sales growth percentage over different time periods.
- `Profit growth (1, 3, 5, 7, 10 years)`: Profit growth percentage over different time periods.
- `EPS`: Earnings per share.
- `EPS last year`: Earnings per share in the last year.
- `Debt (1, 3, 5, 7, 10 years)`: Debt of the company over different time periods.
Quarterly_Profit_Loss.csv - `Sales`: Revenue generated by the company.
- `Expenses`: Total expenses incurred.
- `Operating Profit`: Profit from core operations.
- `OPM %`: Operating Profit Margin percentage.
- `Other Income`: Additional income sources.
- `Interest`: Interest paid.
- `Depreciation`: Depreciation of assets.
- `Profit before tax`: Profit before tax.
- `Tax %`: Tax percentage.
- `Net Profit`: Net profit after tax.
- `EPS in Rs`: Earnings per share.
Yearly_Profit_Loss.csv- Same as Quarterly_Profit_Loss.csv, but on a yearly basis.
Yearly_Balance_Sheet.csv- `Equity Capital`: Capital raised through equity.
- `Reserves`: Company's retained earnings.
- `Borrowings`: Company's borrowings.
- `Other Liabilities`: Other financial obligations.
- `Total Liabilities`: Sum of all liabilities.
- `Fixed Assets`: Company's long-term assets.
- `CWIP`: Capital Work in Progress.
- `Investments`: Company's investments.
- `Other Assets`: Other non-current assets.
- `Total Assets`: Sum of all assets.
Yearly_Cash_flow.csv- `Cash from Operating Activity`: Cash generated from core business operations.
- `Cash from Investing Activity`: Cash from investments.
- `Cash from Financing Activity`: Cash from financing (borrowing, stock issuance, etc.).
- `Net Cash Flow`: Overall net cash flow.
Ratios.csv.csv- `Debtor Days`: Number of days it takes to collect receivables.
- `Inventory Days`: Number of days inventory is held.
- `Days Payable`: Number of days a company takes to pay its bills.
- `Cash Conversion Cycle`: Time taken to convert sales into cash.
- `Wor...
Facebook
TwitterThe data sets below provide selected information extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset offers a detailed collection of US-GAAP financial data extracted from the financial statements of exchange-listed U.S. companies, as submitted to the U.S. Securities and Exchange Commission (SEC) via the EDGAR database. Covering filings from January 2009 onwards, this dataset provides key financial figures reported by companies in accordance with U.S. Generally Accepted Accounting Principles (GAAP).
This dataset primarily relies on the SEC's Financial Statement Data Sets and EDGAR APIs: - SEC Financial Statement Data Sets - EDGAR Application Programming Interfaces
In instances where specific figures were missing from these sources, data was directly extracted from the companies' financial statements to ensure completeness.
Please note that the dataset presents financial figures exactly as reported by the companies, which may occasionally include errors. A common issue involves incorrect reporting of scaling factors in the XBRL format. XBRL supports two tag attributes related to scaling: 'decimals' and 'scale.' The 'decimals' attribute indicates the number of significant decimal places but does not affect the actual value of the figure, while the 'scale' attribute adjusts the value by a specific factor.
However, there are several instances, numbering in the thousands, where companies have incorrectly used the 'decimals' attribute (e.g., 'decimals="-6"') under the mistaken assumption that it controls scaling. This is not correct, and as a result, some figures may be inaccurately scaled. This dataset does not attempt to detect or correct such errors; it aims to reflect the data precisely as reported by the companies. A future version of the dataset may be introduced to address and correct these issues.
The source code for data extraction is available here
Facebook
TwitterSuccess.ai offers a cutting-edge solution for businesses and organizations seeking Company Financial Data on private and public companies. Our comprehensive database is meticulously crafted to provide verified profiles, including contact details for financial decision-makers such as CFOs, financial analysts, corporate treasurers, and other key stakeholders. This robust dataset is continuously updated and validated using AI technology to ensure accuracy and relevance, empowering businesses to make informed decisions and optimize their financial strategies.
Key Features of Success.ai's Company Financial Data:
Global Coverage: Access data from over 70 million businesses worldwide, including public and private companies across all major industries and regions. Our datasets span 250+ countries, offering extensive reach for your financial analysis and market research.
Detailed Financial Profiles: Gain insights into company financials, including revenue, profit margins, funding rounds, and operational costs. Profiles are enriched with key contact details, including work emails, phone numbers, and physical addresses, ensuring direct access to decision-makers.
Industry-Specific Data: Tailored datasets for sectors such as financial services, manufacturing, technology, healthcare, and energy, among others. Each dataset is customized to meet the unique needs of industry professionals and analysts.
Real-Time Accuracy: With continuous updates powered by AI-driven validation, our financial data maintains a 99% accuracy rate, ensuring you have access to the most reliable and up-to-date information available.
Compliance and Security: All data is collected and processed in strict adherence to global compliance standards, including GDPR, ensuring ethical and lawful usage.
Why Choose Success.ai for Company Financial Data?
Best Price Guarantee: We pride ourselves on offering the most competitive pricing in the industry, ensuring you receive unparalleled value for comprehensive financial data.
AI-Validated Accuracy: Our advanced AI algorithms meticulously verify every data point to ensure precision and reliability, helping you avoid costly errors in your financial decision-making.
Customized Data Solutions: Whether you need data for a specific region, industry, or type of business, we tailor our datasets to align perfectly with your requirements.
Scalable Data Access: From small startups to global enterprises, our platform caters to businesses of all sizes, delivering scalable solutions to suit your operational needs.
Comprehensive Use Cases for Financial Data:
Leverage our detailed financial profiles to create accurate budgets, forecasts, and strategic plans. Gain insights into competitors’ financial health and market positions to make data-driven decisions.
Access key financial details and contact information to streamline your M&A processes. Identify potential acquisition targets or partners with verified profiles and financial data.
Evaluate the financial performance of public and private companies for informed investment decisions. Use our data to identify growth opportunities and assess risk factors.
Enhance your sales outreach by targeting CFOs, financial analysts, and other decision-makers with verified contact details. Utilize accurate email and phone data to increase conversion rates.
Understand market trends and financial benchmarks with our industry-specific datasets. Use the data for competitive analysis, benchmarking, and identifying market gaps.
APIs to Power Your Financial Strategies:
Enrichment API: Integrate real-time updates into your systems with our Enrichment API. Keep your financial data accurate and current to drive dynamic decision-making and maintain a competitive edge.
Lead Generation API: Supercharge your lead generation efforts with access to verified contact details for key financial decision-makers. Perfect for personalized outreach and targeted campaigns.
Tailored Solutions for Industry Professionals:
Financial Services Firms: Gain detailed insights into revenue streams, funding rounds, and operational costs for competitor analysis and client acquisition.
Corporate Finance Teams: Enhance decision-making with precise data on industry trends and benchmarks.
Consulting Firms: Deliver informed recommendations to clients with access to detailed financial datasets and key stakeholder profiles.
Investment Firms: Identify potential investment opportunities with verified data on financial performance and market positioning.
What Sets Success.ai Apart?
Extensive Database: Access detailed financial data for 70M+ companies worldwide, including small businesses, startups, and large corporations.
Ethical Practices: Our data collection and processing methods are fully comp...
Facebook
TwitterThe Financial Statements of Holding Companies (FR Y-9 Reports) collects standardized financial statements from domestic holding companies (HCs). This is pursuant to the Bank Holding Company Act of 1956, as amended (BHC Act), and the Home Owners Loan Act (HOLA). The FR Y-9C is used to identify emerging financial risks and monitor the safety and soundness of HC operations. HCs file the FR Y-9C and FR Y-9LP quarterly, the FR Y-9SP semiannually, the FR Y-9ES annually, and the FR Y-9CS on a schedule that is determined when this supplement is used.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains financial information for the top 500 companies in India, including their market capitalization and quarterly sales. The data is categorized based on market cap and sales quartiles, allowing for detailed analysis and comparison. This dataset can be used to identify trends, patterns, and key metrics that are crucial for understanding the competitive landscape in the Indian market.
Facebook
Twitterhttps://www.aiceltech.com/termshttps://www.aiceltech.com/terms
Korean Companies’ Financial Data provides important information to analyze a company’s financial status and performance. This data includes financial indicators such as revenue, expenses, assets, and liabilities. Collected from corporate financial reports and stock market data, it helps investors evaluate financial health and discover investment opportunities, essential for valuing Korean companies.
Facebook
Twitterhttps://www.usa.gov/government-works/https://www.usa.gov/government-works/
This dataset is from the SEC's Financial Statements and Notes Data Set.
It was a personal project to see if I could make the queries efficient.
It's just been collecting dust ever since, maybe someone will make good use of it.
Data is up to about early-2024.
It doesn't differ from the source, other than it's compiled - so maybe you can try it out, then compile your own (with the link below).
Dataset was created using SEC Files and SQL Server on Docker.
For details on the SQL Server database this came from, see: "dataset-previous-life-info" folder, which will contain:
- Row Counts
- Primary/Foreign Keys
- SQL Statements to recreate database tables
- Example queries on how to join the data tables.
- A pretty picture of the table associations.
Source: https://www.sec.gov/data-research/financial-statement-notes-data-sets
Happy coding!
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Annual reports Assessment Dataset
This dataset will help investors, merchant bankers, credit rating agencies, and the community of equity research analysts explore annual reports in a more automated way, saving them time.
Following Sub Dataset(s) are there :
a) pdf and corresponding OCR text of 100 Indian annual reports These 100 annual reports are for the 100 largest companies listed on the Bombay Stock Exchange. The total number of words in OCRed text is 12.25 million.
b) A Few Examples of Sentences with Corresponding Classes The author defined 16 widely used topics used in the investment community as classes like:
Accounting Standards
Accounting for Revenue Recognition
Corporate Social Responsbility
Credit Ratings
Diversity Equity and Inclusion
Electronic Voting
Environment and Sustainability
Hedging Strategy
Intellectual Property Infringement Risk
Litigation Risk
Order Book
Related Party Transaction
Remuneration
Research and Development
Talent Management
Whistle Blower Policy
These classes should help generate ideas and investment decisions, as well as identify red flags and early warning signs of trouble when everything appears to be proceeding smoothly.
ABOUT DATA ::
"scrips.json" is a json with name of companies "SC_CODE" is BSE Scrip Id "SC_NAME" is Listed Companies Name "NET_TURNOV" is Turnover on the day of consideration
"source_pdf" is folder containing both PDF and OCR Output from Tesseract "raw_pdf.zip" contains raw PDF and it can be used to try another OCR. "ocr.zip" contains json file (annual_report_content.json) containing OCR text for each pdf. "annual_report_content.json" is an array of 100 elements and each element is having two keys "file_name" and "content"
"classif_data_rank_freezed.json" is used for evaluation of results contains "sentence" and corresponding "class"
Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
This dataset contains 604 public company financial statement annually in IDX (Bursa Efek Indonesia), largest number that I can see in kaggle :D. Company that's not included in this dataset either do not report their financial statement or contains some irrelevant publishing date.
Please leave a message on suggestions!
| Type | Description | Translate (in Indonesia) |
|---|---|---|
| BS | Balance Sheet/Statement of FInancial Position | Laporan Posisi Neraca / Laporan Posisi Keuangan |
| IS | (Consolidated) Income Statement | Laporan Laba/Rugi (Konsolidasian) |
| CF | Statement of Cash Flow | Laporan Arus Kas |
| Account | Type | Translate (in Indonesia) |
|---|---|---|
| Accounts Payable | BS | Utang Usaha |
| Accounts Receivable | BS | Piutang Usaha |
| Accumulated Depreciation | BS | Akumulasi Penyusutan |
| Additional Paid In Capital (PIC) / Share Premium | BS | Saham premium |
| Allowance For Doubtful Accounts Receivable (AFDA) | BS | Cadangan Piutang Usaha |
| Buildings And Improvements | BS | Bangunan dan Pengembangan |
| Capital Stock | BS | Saham |
| Cash And Cash Equivalents | BS | Kas dan Setara Kas |
| Cash Cash Equivalents And Short Term Investments | BS | Kas, Setara Kas, dan Investasi Jangka Pendek |
| Cash Equivalents | BS | Setara Kas |
| Cash Financial | BS | Kas yang berhubungan dengan aktiviatas keuangan |
| Common Stock | BS | Saham Biasa |
| Common Stock Equity | BS | Ekuitas Saham Biasa |
| Construction In Progress | BS | Konstruksi yang Sedang Berlangsung |
| Current Assets | BS | Aset Lancar |
| Current Debt | BS | Utang Lancar |
| Current Debt And Capital Lease Obligation | BS | Utang Lancar dan Kewajiban Sewa Kapital |
| Current Liabilities | BS | Liabilitas Lancar |
| Finished Goods | BS | Barang Jadi |
| Goodwill | BS | Nilai Tambah (Goodwill) |
| Goodwill And Other Intangible Assets | BS | Nilai Tambah (Goodwill) dan Aset Tidak Berwujud Lainnya |
| Gross Accounts Receivable | BS | Piutang Usaha Bruto |
| Gross PPE | BS | Aktiva Tetap Bruto (Properti, Pabrik, dan Peralatan) |
| Inventory | BS | Persediaan |
| Invested Capital | BS | Kapital yang Diinvestasikan |
| Investmentsin Joint Venturesat Cost | BS | Investasi dalam Usaha Patungan dengan Harga Perolehan |
| Land And Improvements | BS | Tanah dan Pengembangan |
| Long Term Debt | BS | Utang Jangka Panjang |
| Long Term Debt And Capital Lease Obligation | BS | Utang Jangka Panjang dan Kewajiban Sewa Kapital |
| Long Term Equity Investment | BS | Investasi Ekuitas Jangka Panjang |
| Machinery Furniture Equipment | BS | Mesin, Perabotan dan Perlengkapan |
| Minority Interest | BS | Kepentingan Minoritas |
| Net Debt | BS | Utang Bersih |
| Net PPE | BS | Aktiva Tetap Bersih (Properti, Pabrik, dan Peralatan) |
| Net Tangible Assets | BS | Aset Berwujud Bersih |
| Non Current Deferred Taxes Assets | BS | Aset Pajak Tangguhan Non Lancar |
| Non Current Deferred Taxes Liabilities | BS | Liabilitas Pajak Tangguhan Non Lancar |
| Non Current Pension And Other Postretirement Benefit Plans | BS | Rencana Pensiun Non Lancar dan Manfaat Pasca Pensiun Lainnya |
| Ordinary Shares Number | BS | Jumlah Saham Biasa |
| Other Current Liabilities | BS | Liabilitas Lancar Lainnya |
| Other Equity Interest | BS | Kepentingan Ekuitas Lainnya |
| Other Inventories | BS | Persediaan Lainnya |
| Other Non Current Assets | BS | Aset Non Lancar Lainnya |
| Other Non Current Liabilities | BS | Liabilitas Non Lancar Lainnya |
| Other Payable | BS | Hutang Lainnya |
| Other Properties | BS | Properti Lainnya |
| Other Receivables | BS | Piutang Lainnya |
| Payables | BS | Utang |
| Pensionand Other Post Retirement Benefit Plans Current | BS | Rencana Pensiun dan Manfaat Pasca Pensiun Lainnya Saat Ini |
| Prepaid Assets | BS | Aset Dibayar Dimuka |
| Properties | BS | Properti |
| Raw Materials | BS | Bahan Baku |
| Retained Earnings | BS | Laba Ditahan |
| Share Issued | BS | Saham yang Diterbitkan |
| Stockholders Equity | BS | Ekuitas Pemegang Saham |
| Tangible Book Value | BS | Nilai Buku Berwujud |
| Total Assets | BS | Total Aset |
| Total Capitalization | BS | Total Kapitalisasi |
| Total Debt | BS | Total Utang |
| Total Equity Gross Minority Interest | BS | Total Ekuitas Bruto dengan Kepentingan Minoritas |
| Total Liabilities Net Minority Interest | BS | Total Liabilitas Bersih dengan Kepentingan Minoritas |
| Total Non Current Assets | BS | Total Aset Non Lancar |
| Total Non Current Liabilities Net Minority Interest | BS | Total Liabilitas Non Lancar Bersih dengan Kepentingan Minoritas |
| Total Tax Payable | BS | Total Utang Pajak |
| Treasury Shares Number | BS | Jumlah Saham Treasuri |
| Work In Process | BS | Pekerjaan dalam Proses |
| Working Capital | BS | Modal Kerja / Kapital Jangka Pendek |
| Beginning Cash Position | CF | Posisi Kas Awal |
| Capital Expenditure | CF | Pengeluaran - Kapital |
| Capital Expenditure Reported | CF | Pengeluaran - Kapital yang Dilaporkan |
| Cash Dividends Paid | CF | Dividen Tunai yang Dibayarkan |
| Cash Flowsfromusedin Operating Activities Direct | CF | Arus Kas yang Digunakan dalam Aktivitas Operasional Langsung |
| Changes In Cash... |
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
The Russian Financial Statements Database (RFSD) is an open, harmonized collection of annual unconsolidated financial statements of the universe of Russian firms:
🔓 First open data set with information on every active firm in Russia.
🗂️ First open financial statements data set that includes non-filing firms.
🏛️ Sourced from two official data providers: the Rosstat and the Federal Tax Service.
📅 Covers 2011-2023 initially, will be continuously updated.
🏗️ Restores as much data as possible through non-invasive data imputation, statement articulation, and harmonization.
The RFSD is hosted on 🤗 Hugging Face and Zenodo and is stored in a structured, column-oriented, compressed binary format Apache Parquet with yearly partitioning scheme, enabling end-users to query only variables of interest at scale.
The accompanying paper provides internal and external validation of the data: http://arxiv.org/abs/2501.05841.
Here we present the instructions for importing the data in R or Python environment. Please consult with the project repository for more information: http://github.com/irlcode/RFSD.
Importing The Data
You have two options to ingest the data: download the .parquet files manually from Hugging Face or Zenodo or rely on 🤗 Hugging Face Datasets library.
Python
🤗 Hugging Face Datasets
It is as easy as:
from datasets import load_dataset import polars as pl
RFSD = load_dataset('irlspbru/RFSD')
RFSD_2023 = pl.read_parquet('hf://datasets/irlspbru/RFSD/RFSD/year=2023/*.parquet')
Please note that the data is not shuffled within year, meaning that streaming first n rows will not yield a random sample.
Local File Import
Importing in Python requires pyarrow package installed.
import pyarrow.dataset as ds import polars as pl
RFSD = ds.dataset("local/path/to/RFSD")
print(RFSD.schema)
RFSD_full = pl.from_arrow(RFSD.to_table())
RFSD_2019 = pl.from_arrow(RFSD.to_table(filter=ds.field('year') == 2019))
RFSD_2019_revenue = pl.from_arrow( RFSD.to_table( filter=ds.field('year') == 2019, columns=['inn', 'line_2110'] ) )
renaming_df = pl.read_csv('local/path/to/descriptive_names_dict.csv') RFSD_full = RFSD_full.rename({item[0]: item[1] for item in zip(renaming_df['original'], renaming_df['descriptive'])})
R
Local File Import
Importing in R requires arrow package installed.
library(arrow) library(data.table)
RFSD <- open_dataset("local/path/to/RFSD")
schema(RFSD)
scanner <- Scanner$create(RFSD) RFSD_full <- as.data.table(scanner$ToTable())
scan_builder <- RFSD$NewScan() scan_builder$Filter(Expression$field_ref("year") == 2019) scanner <- scan_builder$Finish() RFSD_2019 <- as.data.table(scanner$ToTable())
scan_builder <- RFSD$NewScan() scan_builder$Filter(Expression$field_ref("year") == 2019) scan_builder$Project(cols = c("inn", "line_2110")) scanner <- scan_builder$Finish() RFSD_2019_revenue <- as.data.table(scanner$ToTable())
renaming_dt <- fread("local/path/to/descriptive_names_dict.csv") setnames(RFSD_full, old = renaming_dt$original, new = renaming_dt$descriptive)
Use Cases
🌍 For macroeconomists: Replication of a Bank of Russia study of the cost channel of monetary policy in Russia by Mogiliat et al. (2024) — interest_payments.md
🏭 For IO: Replication of the total factor productivity estimation by Kaukin and Zhemkova (2023) — tfp.md
🗺️ For economic geographers: A novel model-less house-level GDP spatialization that capitalizes on geocoding of firm addresses — spatialization.md
FAQ
Why should I use this data instead of Interfax's SPARK, Moody's Ruslana, or Kontur's Focus?hat is the data period?
To the best of our knowledge, the RFSD is the only open data set with up-to-date financial statements of Russian companies published under a permissive licence. Apart from being free-to-use, the RFSD benefits from data harmonization and error detection procedures unavailable in commercial sources. Finally, the data can be easily ingested in any statistical package with minimal effort.
What is the data period?
We provide financials for Russian firms in 2011-2023. We will add the data for 2024 by July, 2025 (see Version and Update Policy below).
Why are there no data for firm X in year Y?
Although the RFSD strives to be an all-encompassing database of financial statements, end users will encounter data gaps:
We do not include financials for firms that we considered ineligible to submit financial statements to the Rosstat/Federal Tax Service by law: financial, religious, or state organizations (state-owned commercial firms are still in the data).
Eligible firms may enjoy the right not to disclose under certain conditions. For instance, Gazprom did not file in 2022 and we had to impute its 2022 data from 2023 filings. Sibur filed only in 2023, Novatek — in 2020 and 2021. Commercial data providers such as Interfax's SPARK enjoy dedicated access to the Federal Tax Service data and therefore are able source this information elsewhere.
Firm may have submitted its annual statement but, according to the Uniform State Register of Legal Entities (EGRUL), it was not active in this year. We remove those filings.
Why is the geolocation of firm X incorrect?
We use Nominatim to geocode structured addresses of incorporation of legal entities from the EGRUL. There may be errors in the original addresses that prevent us from geocoding firms to a particular house. Gazprom, for instance, is geocoded up to a house level in 2014 and 2021-2023, but only at street level for 2015-2020 due to improper handling of the house number by Nominatim. In that case we have fallen back to street-level geocoding. Additionally, streets in different districts of one city may share identical names. We have ignored those problems in our geocoding and invite your submissions. Finally, address of incorporation may not correspond with plant locations. For instance, Rosneft has 62 field offices in addition to the central office in Moscow. We ignore the location of such offices in our geocoding, but subsidiaries set up as separate legal entities are still geocoded.
Why is the data for firm X different from https://bo.nalog.ru/?
Many firms submit correcting statements after the initial filing. While we have downloaded the data way past the April, 2024 deadline for 2023 filings, firms may have kept submitting the correcting statements. We will capture them in the future releases.
Why is the data for firm X unrealistic?
We provide the source data as is, with minimal changes. Consider a relatively unknown LLC Banknota. It reported 3.7 trillion rubles in revenue in 2023, or 2% of Russia's GDP. This is obviously an outlier firm with unrealistic financials. We manually reviewed the data and flagged such firms for user consideration (variable outlier), keeping the source data intact.
Why is the data for groups of companies different from their IFRS statements?
We should stress that we provide unconsolidated financial statements filed according to the Russian accounting standards, meaning that it would be wrong to infer financials for corporate groups with this data. Gazprom, for instance, had over 800 affiliated entities and to study this corporate group in its entirety it is not enough to consider financials of the parent company.
Why is the data not in CSV?
The data is provided in Apache Parquet format. This is a structured, column-oriented, compressed binary format allowing for conditional subsetting of columns and rows. In other words, you can easily query financials of companies of interest, keeping only variables of interest in memory, greatly reducing data footprint.
Version and Update Policy
Version (SemVer): 1.0.0.
We intend to update the RFSD annualy as the data becomes available, in other words when most of the firms have their statements filed with the Federal Tax Service. The official deadline for filing of previous year statements is April, 1. However, every year a portion of firms either fails to meet the deadline or submits corrections afterwards. Filing continues up to the very end of the year but after the end of April this stream quickly thins out. Nevertheless, there is obviously a trade-off between minimization of data completeness and version availability. We find it a reasonable compromise to query new data in early June, since on average by the end of May 96.7% statements are already filed, including 86.4% of all the correcting filings. We plan to make a new version of RFSD available by July.
Licence
Creative Commons License Attribution 4.0 International (CC BY 4.0).
Copyright © the respective contributors.
Citation
Please cite as:
@unpublished{bondarkov2025rfsd, title={{R}ussian {F}inancial {S}tatements {D}atabase}, author={Bondarkov, Sergey and Ledenev, Victor and Skougarevskiy, Dmitriy}, note={arXiv preprint arXiv:2501.05841}, doi={https://doi.org/10.48550/arXiv.2501.05841}, year={2025}}
Acknowledgments and Contacts
Data collection and processing: Sergey Bondarkov, sbondarkov@eu.spb.ru, Viktor Ledenev, vledenev@eu.spb.ru
Project conception, data validation, and use cases: Dmitriy Skougarevskiy, Ph.D.,
Facebook
Twitterhttps://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
Company fundamentals data provides the user with a company's current financial health and when combined historically, the financial 'life-story' of the company.
Facebook
TwitterOur comprehensive and advanced database is completed with all the information you need, with up to >1.5 million company financial records at your disposal. This allows you to easily perform company search on company profile and company directory, with 99% coverage in Malaysia.
Our database also contains company profiles on private limited or limited companies globally, including information such as shareholders and financial accounts can be accessed instantly.
Facebook
TwitterOur comprehensive and advanced database is completed with all the information you need, with up to >1million company financial records at your disposal. This allows you to easily perform company search on company profile and company directory, with maximised coverage in Thailand.
Our database also contains company profiles on private limited or limited companies globally, including information such as shareholders and financial accounts can be accessed instantly.
Facebook
TwitterIn the U.S. public companies, certain insiders and broker-dealers are required to regularly file with the SEC. The SEC makes this data available online for anybody to view and use via their Electronic Data Gathering, Analysis, and Retrieval (EDGAR) database. The SEC updates this data every quarter going back to January, 2009. For more information please see this site.
To aid analysis a quick summary view of the data has been created that is not available in the original dataset. The quick summary view pulls together signals into a single table that otherwise would have to be joined from multiple tables and enables a more streamlined user experience.
DISCLAIMER: The Financial Statement and Notes Data Sets contain information derived from structured data filed with the Commission by individual registrants as well as Commission-generated filing identifiers. Because the data sets are derived from information provided by individual registrants, we cannot guarantee the accuracy of the data sets. In addition, it is possible inaccuracies or other errors were introduced into the data sets during the process of extracting the data and compiling the data sets. Finally, the data sets do not reflect all available information, including certain metadata associated with Commission filings. The data sets are intended to assist the public in analyzing data contained in Commission filings; however, they are not a substitute for such filings. Investors should review the full Commission filings before making any investment decision.
Facebook
TwitterSee Readme.txt. Visit https://dataone.org/datasets/sha256%3A780638d72909580abd52dd18a16fa53928d07fc992dbc160bc66271c61cc1d9d for complete metadata about this dataset.
Facebook
Twitterhttps://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
Browse LSEG's US Company Filings Database, and find a range of filings content and history including annual reports, municipal bonds, and more.
Facebook
Twitterhttps://www.ontario.ca/page/terms-usehttps://www.ontario.ca/page/terms-use
(Formerly Public Accounts: Volume 2)
The Public Accounts of Ontario is a major accountability document which presents the financial statements of the province.
This dataset contains audited financial statements of consolidated organizations and Trusts under Administration.
Starting in 2018-19, Volume 2 is no longer part of the Public Accounts. Find the individual financial statements of government organizations (including hospitals, colleges and school boards), trusts under administration (such as the Workplace Safety and Insurance Board), businesses and other organizations on their websites. "https://www.ontario.ca/page/financial-statements-government-organizations-and-business-enterprises-2019-20">Access listing of organizations (2019-20).
The Financial Administration Act requires the preparation of the Public Accounts for each fiscal year.
The Public Accounts of Ontario are licenced under the Ontario.ca "https://www.ontario.ca/page/terms-use">terms of use (they are not subject to or licenced under the Open Government Licence).
See previous versions of the Public Accounts of Ontario. (Please note that the Public Accounts were only available in PDF format before 2015-16).
*[PDF]: Portable Document Format
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides a structured and machine-readable collection of financial statements filed with the Companies Registration Office (CRO) in Ireland. It currently includes financial statements for the year 2022, with additional years to be added as they become available. The dataset aligns with the European Union’s Open Data Directive (Directive (EU) 2019/1024) and the Implementing Regulation (EU) 2023/138, which designates company and company ownership data as a high-value dataset. It is available for bulk download and API access under the Creative Commons Attribution 4.0 (CC BY 4.0) licence, allowing unrestricted reuse with appropriate attribution. By increasing transparency and enabling data-driven insights, this dataset supports public sector initiatives, financial analysis, and digital services development. The API endpoints can be accessed using these links - Query - https://opendata.cro.ie/api/3/action/datastore_search Query (via SQL) - https://opendata.cro.ie/api/3/action/datastore_search_sql