Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The dataset contains the annual report of US public firms filing with the SEC EDGAR system. Each annual report (10K filing) is broken into 20 sections. Each section is split into individual sentences. Sentiment labels are provided on a per filing basis from the market reaction around the filing data. Additional metadata for each filing is included in the dataset.
The data sets below provide selected information extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL).
In the U.S. public companies, certain insiders and broker-dealers are required to regularly file with the SEC. The SEC makes this data available online for anybody to view and use via their Electronic Data Gathering, Analysis, and Retrieval (EDGAR) database. The SEC updates this data every quarter going back to January, 2009. For more information please see this site.
To aid analysis a quick summary view of the data has been created that is not available in the original dataset. The quick summary view pulls together signals into a single table that otherwise would have to be joined from multiple tables and enables a more streamlined user experience.
DISCLAIMER: The Financial Statement and Notes Data Sets contain information derived from structured data filed with the Commission by individual registrants as well as Commission-generated filing identifiers. Because the data sets are derived from information provided by individual registrants, we cannot guarantee the accuracy of the data sets. In addition, it is possible inaccuracies or other errors were introduced into the data sets during the process of extracting the data and compiling the data sets. Finally, the data sets do not reflect all available information, including certain metadata associated with Commission filings. The data sets are intended to assist the public in analyzing data contained in Commission filings; however, they are not a substitute for such filings. Investors should review the full Commission filings before making any investment decision.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains information found in the 10-K annual reports filed by companies in the US. It comes from the SEC official website found here. I scraped the data in a jupyter notebook and kept only a few of the important financial line items (there are 300+ for some 10k reports). No 10-K/A amendments were taken into account, so some information could be incorrect. In other words, don't bet the farm on a trading model built with this data. The price data was collected from the yfinance python API.
The Corporate Financial Fraud project is a study of company and top-executive characteristics of firms that ultimately violated Securities and Exchange Commission (SEC) financial accounting and securities fraud provisions compared to a sample of public companies that did not. The fraud firm sample was identified through systematic review of SEC accounting enforcement releases from 2005-2010, which included administrative and civil actions, and referrals for criminal prosecution that were identified through mentions in enforcement release, indictments, and news searches. The non-fraud firms were randomly selected from among nearly 10,000 US public companies censused and active during at least one year between 2005-2010 in Standard and Poor's Compustat data. The Company and Top-Executive (CEO) databases combine information from numerous publicly available sources, many in raw form that were hand-coded (e.g., for fraud firms: Accounting and Auditing Enforcement Releases (AAER) enforcement releases, investigation summaries, SEC-filed complaints, litigation proceedings and case outcomes). Financial and structural information on companies for the year leading up to the financial fraud (or around year 2000 for non-fraud firms) was collected from Compustat financial statement data on Form 10-Ks, and supplemented by hand-collected data from original company 10-Ks, proxy statements, or other financial reports accessed via Electronic Data Gathering, Analysis, and Retrieval (EDGAR), SEC's data-gathering search tool. For CEOs, data on personal background characteristics were collected from Execucomp and BoardEx databases, supplemented by hand-collection from proxy-statement biographies.
This dataset is a mirror of the Financial Statement and Notes Data Set (https://www.sec.gov/dera/data/financial-statement-and-notes-data-set.html) hosted by the SEC and is updated monthly.
From this page:
%3E The Financial Statement and Notes Data Sets provide the text and detailed numeric information from all financial statements and their notes. This data is extracted from exhibits to corporate financial reports filed with the Commission using eXtensible Business Reporting Language (XBRL). As compared to the more compact Financial Statement Data Sets which provide only the numeric information from face financials, the Financial Statement and Notes Data Sets provide significantly more disclosure data. The information is presented without change from the "as filed" financial reports submitted by each registrant. The data is presented in a flattened format to help users analyze and compare corporate disclosure information over time and across registrants. The data sets also contain additional fields such as a company's Standard Industrial Classification to facilitate the data's use.
%3E DISCLAIMER: The Financial Statement and Notes Data Sets contain information derived from structured data filed with the Commission by individual registrants as well as Commission-generated filing identifiers. Because the data sets are derived from information provided by individual registrants, we cannot guarantee the accuracy of the data sets. In addition, it is possible inaccuracies or other errors were introduced into the data sets during the process of extracting the data and compiling the data sets. Finally, the data sets do not reflect all available information, including certain metadata associated with Commission filings. The data sets are intended to assist the public in analyzing data contained in Commission filings; however, they are not a substitute for such filings. Investors should review the full Commission filings before making any investment decision.
Once a month, the second-to-latest dump of data (ex: August 2022 dump is downloaded in October 2022) is downloaded from the page and then the tables are extracted and appended to the existing ones in this Redivis dataset.
Please refer to this documentation file created by the SEC, which provides documentation of scope, organization, file formats and table definitions.
We deliver via API access to Companies Financial statements, Insider transaction, Stock Ownership and all information relative to Stock Fundamental
Here is the extensive list of all the information that you can access via our API:
STOCK FUNDAMENTALS
Financial Statements Annual/Quarter Financial Statements As Reported International Filings Annual/Quarter Quarterly Earnings Reports Shares Float SEC RSS Feeds Real-time SEC Filings Rss feed 8K (Important Events)
STOCK FUNDAMENTALS ANALYSIS
Financial Ratios Annual/Quarter Enterprise Value Annual/Quarter Financial Statements Growth Annual Key Metrics Annual/Quarter Financial Growth Annual/Quarter Rating Daily DCF Real-time
STOCK CALENDARS
Earnings Calendar Popular IPO Calendar Stock Split Calendar Dividend Calendar Economic Calendar
COMPANY INFORMATION
Profile Minute Key Executives Market Capitalization Daily Company Outlook New Stock Peers
The Savings Association Holding Company Report (FR LL-(b)11) collects from certain savings and loan holding companies (SLHCs) information about their Securities and Exchange Commission (SEC) filings, reports, financial statements, and other exhibits that the Board requires. The Board uses this data to analyze the financial condition of respondent SLHCs, and assess regulatory compliance. The FR LL-(b)11 is filed quarterly based on the institution’s fiscal year, and also when there has been a material change in any of the information reported. The fourth quarter report also includes audited financial statements.
https://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
Browse LSEG's US Company Filings Database, and find a range of filings content and history including annual reports, municipal bonds, and more.
We provide the financial as reported data for bulk download. The dataset is parsed from 10Q and 10K filings from 2010 to the end of Q1, 2022. We hope financial researchers will find this dataset useful. About us: AlphaResearch makes searching across SEC filings for all symbols super easy and efficient. Using state-of-the-art ML and NLP techniques, AlphaResearch provides keyword highlighting, blacklining, note-taking to make your search experience as smooth as possible. The documents that AlphaResearch supports searching are SEC filings, 10 k, 10q, form 8k, 13f filings, sec form 4.
The SEC Form 10-K is an annual report required by the U.S. Securities and Exchange Commission (SEC), that gives a comprehensive summary of a company's financial performance.
The full contents of the SEC 10-K are available through the SEC's EDGAR database. PUDL integrates only some of the 10-K metadata and data extracted from the unstructured Exhibit 21 attachement, which describes the ownershp relationships between the parent company and its subsidiaries. This data is used to create a linkage between EIA utilities and SEC reporting companies, to better understand the relationships between utlities and their affiliates, and the resulting economic and political impacts.
This data was originally downloaded from the SEC and processed using a machine learning pipeline found here: https://github.com/catalyst-cooperative/mozilla-sec-eia Archived from https://www.sec.gov/search-filings/edgar-application-programming-interfaces
This archive contains raw input data for the Public Utility Data Liberation (PUDL) software developed by Catalyst Cooperative. It is organized into Frictionless Data Packages. For additional information about this data and PUDL, see the following resources:
The PUDL Repository on GitHub
PUDL Documentation
Other Catalyst Cooperative data archives
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Financial Institutions: CR: LL: RT: CO: Sec by Nonfarm data was reported at 61.304 USD bn in Jun 2019. This records an increase from the previous number of 59.202 USD bn for May 2019. United States Financial Institutions: CR: LL: RT: CO: Sec by Nonfarm data is updated monthly, averaging 57.514 USD bn from Jan 2015 (Median) to Jun 2019, with 54 observations. The data reached an all-time high of 62.315 USD bn in Dec 2018 and a record low of 30.756 USD bn in Jan 2015. United States Financial Institutions: CR: LL: RT: CO: Sec by Nonfarm data remains active status in CEIC and is reported by Federal Reserve Board. The data is categorized under Global Database’s United States – Table US.KB042: Balance Sheet: Foreign Related Institutions: Monthly.
This dataset captures insider trading activity at publicly traded companies. The Securities and Exchange Commission has made these insider trading reports available on its web site in a structured format since mid-2003. However, most academic papers use proprietary commercial databases instead of regulatory filings directly, which makes replication challenging because the data manipulation and aggregation steps in commercial databases are opaque and historical records could be altered by the data provider over time. To overcome these limitations, the presented dataset is created from the original regulatory filings; it is updated daily and includes all information reported by insiders without alteration. Daily updates: https://dx.doi.org/10.34740/kaggle/ds/2973477
The data set includes information extracted from the prospectus regarding the corporate bond issue including amount, coupon rate, maturity, YTM, OID, credit rating (issuer and issue, if available), CUSIP ISIN and booker-runner / managers. The key benefit of this data set is that it provides recent market information on corporate bond issues by US SEC Registrants in all non-financial industries for 1Q2021 for users that need this information to assist the company’s finance & treasury department in negotiating pricing of its bond issues with investment banks.
With the sole mission to democratize financial data, Finnhub is excited to release the new Financials as Reported dataset for bulk download. The data is cleaned and sourced directly from SEC filings from 2010-2020.
If you don't need bulk download, you can query this data for free on our website: https://finnhub.io/docs/api#financials-reported. We also provide various type of financial data such as global fundamentals, deep historical tick data, estimates and alternative data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Financial Institutions: CR: LL: RT: CO: Sec by Multifamily Prop data was reported at 8.675 USD bn in Jun 2019. This records an increase from the previous number of 8.510 USD bn for May 2019. United States Financial Institutions: CR: LL: RT: CO: Sec by Multifamily Prop data is updated monthly, averaging 4.121 USD bn from Jan 2015 (Median) to Jun 2019, with 54 observations. The data reached an all-time high of 8.675 USD bn in Jun 2019 and a record low of 786.000 USD mn in Feb 2015. United States Financial Institutions: CR: LL: RT: CO: Sec by Multifamily Prop data remains active status in CEIC and is reported by Federal Reserve Board. The data is categorized under Global Database’s United States – Table US.KB042: Balance Sheet: Foreign Related Institutions: Monthly.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States Financial Institutions: sa: CR: LL: RT: CO: Sec by Multifamily Prop data was reported at 8.463 USD bn in Jun 2019. This records a decrease from the previous number of 8.485 USD bn for May 2019. United States Financial Institutions: sa: CR: LL: RT: CO: Sec by Multifamily Prop data is updated monthly, averaging 4.208 USD bn from Jan 2015 (Median) to Jun 2019, with 54 observations. The data reached an all-time high of 8.485 USD bn in May 2019 and a record low of 556.800 USD mn in Feb 2015. United States Financial Institutions: sa: CR: LL: RT: CO: Sec by Multifamily Prop data remains active status in CEIC and is reported by Federal Reserve Board. The data is categorized under Global Database’s United States – Table US.KB042: Balance Sheet: Foreign Related Institutions: Monthly.
https://www.lseg.com/en/policies/website-disclaimerhttps://www.lseg.com/en/policies/website-disclaimer
LSEG global Filings offers extensive coverage of developed and emerging markets, updated in real time. Discover the data.
Dataset containing over 5000 data metrics (including raw data and BQ calculated scores & metrics) for over 4000 public companies (~95% of the Russell 3000). Includes financials (from SEC filings) as well as data that is not reported to the SEC, including monthly headcount, detailed employee benefits data, credit events related to contributions to benefits plans. Also includes BQ scores, industry and macro statistics that provide a comprehensive view of the sector & industry.
BQ's Public Companies dataset is applicable to both quantitative investment managers as well as fundamentals public equity investors, who wish to use alternative (non-financial) data to enhance their investment analysis and investment decisions.
10-K offers a detailed picture of a company's business, the risks itfaces, and the operating and financial results for the fiscal year. Archived from https://www.sec.gov/files/form10-k.pdf
This archive contains raw input data for the Public Utility Data Liberation (PUDL) software developed by Catalyst Cooperative. It is organized into "https://specs.frictionlessdata.io/data-package/">Frictionless Data Packages. For additional information about this data and PUDL, see the following resources:
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The dataset contains the annual report of US public firms filing with the SEC EDGAR system. Each annual report (10K filing) is broken into 20 sections. Each section is split into individual sentences. Sentiment labels are provided on a per filing basis from the market reaction around the filing data. Additional metadata for each filing is included in the dataset.