More and more websites have started to embed structured data describing products, people, organizations, places, events into their HTML pages using markup standards such as RDFa, Microdata and Microformats. The Web Data Commons project extracts this data from several billion web pages. The project provides the extracted data for download and publishes statistics about the deployment of the different formats.
The Crowdfunding Offerings Data Sets below provide the structured data from crowdfunding offering statements, updates, annual reports, and terminations filed with the Commission. The data is extracted from the eXtensible Markup Language (XML) based fillable portion of Form C. The data is presented without change from the “as-filed” submissions. The data is presented in a flattened format to provide the public with readily available data about offerings that rely on the Regulation Crowdfunding exemption.
The data sets will be updated quarterly. Data contained in documents filed after 5:30PM Eastern on the last business day of a quarter will be included in the subsequent quarterly posting.
The Crowdfunding Offerings Data Sets (PDF, 218 kb) provides documentation of scope, organization, file formats and table definitions.
Find out more about Crowdfunding.
DISCLAIMER: The Crowdfunding Offerings Data Sets contain information derived from structured data filed with the Commission by individual registrants as well as Commission-generated filing identifiers. Because the data sets are derived from information provided by individual registrants, we cannot guarantee the accuracy of the data sets. In addition, it is possible inaccuracies or other errors were introduced into the data sets during the process of extracting the data and compiling the data sets. Finally, the data sets do not reflect all available information, including certain metadata associated with Commission filings. The data sets are intended to assist the public in analyzing data contained in Commission filings; however, they are not a substitute for such filings. Investors should review the full Commission filings before making any investment decision.
RDFa Core is a specification for attributes to express structured data in any markup language. The embedded data already available in the markup language (e.g., HTML) can often be reused by the RDFa markup, so that publishers don't need to repeat significant data in the document content. (from https://www.w3.org/TR/rdfa-core/)
Not seeing a result you expected?
Learn how you can add new datasets to our index.
More and more websites have started to embed structured data describing products, people, organizations, places, events into their HTML pages using markup standards such as RDFa, Microdata and Microformats. The Web Data Commons project extracts this data from several billion web pages. The project provides the extracted data for download and publishes statistics about the deployment of the different formats.