Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset provides a comprehensive view of UK companies, including their registration details, financial information, ownership, management, and recent filings for up to the 31st December 2023. The data has been meticulously processed using dbt (Data Build Tool) scripts to ensure accuracy and relevance.
Play with this dataset at the BI app. (Free registration is required)
https://www.youtube.com/watch?v=iybNM8UtQRA" alt="Dataset overview">
The dataset comprises the following tables:
Below is a detailed description of each table.
Description: Contains detailed information about companies registered in the UK up to January 1, 2024.
Columns:
company_number: Unique identifier for each company.company_type: Type of company (e.g., private limited, public limited).office_address: Registered office address.incorporation_date: Date of company incorporation.jurisdiction: Legal jurisdiction of the company.company_status: Current status (e.g., active, dissolved).account_type: Type of accounts filed.company_name: Official name of the company.sic_codes: Standard Industrial Classification codes.date_of_cessation: Date when the company ceased operations (if applicable).next_accounts_overdue: Indicator if the next accounts are overdue.confirmation_statement_overdue: Indicator if the confirmation statement is overdue.owners: Number of registered owners (persons with significant control).officers: Number of officers (directors, secretaries) associated with the company.average_number_employees_during_period: Average number of employees during the last accounting period.current_assets: Current assets as per the last accounts.last_accounts_period_end: End date of the last accounting period.company_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_psc and ch_officers tables.ch_accounts table to include financial information.Description: Provides detailed SIC (Standard Industrial Classification) codes for each company.
Columns:
company_number: Company identifier.sic_code: SIC code assigned to the company.sic_description: Description of the SIC code.sic_section: Section of the SIC code.sic_division: Division of the SIC code.company_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_companies_sic_codes with the sic_codes table to enrich SIC code information.Description: Lists up to the five most recent filings for each company as of January 1, 2024.
Columns:
transaction_id.company_number: Company identifier.date: Date of the filing.Data Generation Process:
ch_filings table.Description: Details up to five most recent officers and owners for each company, including their roles and personal information.
Columns:
company_number: Company identifier.name: Full name of the officer or owner.kind: Type of person (individual or corporate entity).officer_role: Role within the company.occupation: Occupation of the individual.date: Date of appointment or notification.is_owner: Boolean indicating if the person is an owner.country_of_residence: Country where the individual resides.nationality: Nationality of the individual.company_country: Country of the company (for corporate persons).person_id: Unique identifier for the person.person_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_officers and ch_psc tables.Segments Included:
Data Generation Process:
Facebook
TwitterThe Free Company Data Product is a downloadable data snapshot containing basic company data of live companies on the register. This snapshot is provided as ZIP files containing data in CSV format and is split into multiple files for ease of downloading.
This snapshot is provided free of charge and will not be supported.
The latest snapshot will be updated within 5 working days of the previous month end.
The contents of the snapshot have been compiled up to the end of the previous month.
A list of the data fields contained in the snapshot can be found here PDF.
Up-to-date company information can be obtained by following the URI links in the data. More details on URIs
If files are viewed with Microsoft Excel, it is recommended that you use version 2007 or later.
Facebook
TwitterUK Business Database | 15M Business Contacts, 2M Companies, and Bi-Weekly Data Refreshes
This dataset provides access to a structured UK B2B database built for teams that need reliable business contact and company data across the United Kingdom. It is designed for sales prospecting, outreach, market research, account targeting, and business development.
The database includes 15,000,000 business contacts linked to 2M UK companies. It can be used as a UK company database, UK business database, UK business contact database, or UK B2B leads database, depending on the type of records and workflows buyers need.
The dataset combines business contact data with company-level information, making it easier to identify target companies, understand their business profile, and reach the right people inside those organizations. This helps buyers work with more than a basic contact list by connecting contact records to structured company context.
Dataset Overview
• 15M business contacts • 2M UK companies • 95% reported accuracy • Email and phone data coverage • Bi-weekly refresh cycle • Structured contact and company records
This UK B2B database is useful for teams that need large-scale business contact coverage across the UK market in a format that is easy to filter, segment, and use.
Business Contact Data
Each record in the dataset is designed to support prospecting, outreach, and business contact discovery.
Typical contact fields may include:
• Full Name • Business Email Address • Phone Number • Job Title • Department or Role • Company Name • Industry • Company Domain • City or Region • and More
Field availability can vary by segment and source coverage. Buyers can request a sample to review the schema and confirm fit before purchase.
This makes the dataset useful for building UK B2B company lists, outreach files, and targeted business contact segments.
UK Company Coverage
The database includes contacts linked to 2M UK companies, making it useful for teams targeting businesses across England, Scotland, Wales, and Northern Ireland.
This supports use cases such as:
• UK-only sales prospecting • domestic B2B outreach • account list building for regional or national campaigns • company segmentation across the UK market • industry-specific targeting within the UK
Because the contact data is linked to company records, buyers can create more focused segments based on both business and contact criteria.
Decision-Maker and Account Targeting
The dataset is useful for teams that need structured UK decision makers databases and company-linked contact records for account targeting.
This can support teams that want to:
• identify target companies by industry or size • build contact lists for sales and outreach • organize account-based prospecting workflows • create filtered company and contact segments for the UK market
Because the contact data is tied to company information, buyers can build more relevant prospecting lists and reduce the need to work from broad, unqualified records.
Data Quality and Refreshes
The dataset is listed with 95% reported accuracy, which makes it suitable for teams that need more reliable business contact data for sales and market research.
The data is also refreshed on a bi-weekly basis, which helps keep contact and company records more current. Regular refreshes are important for business data because company information, contact details, and roles can change frequently over time.
Buyers are encouraged to request a sample and validate coverage against their target segment before purchase.
Filtering and Segmentation
The dataset can be filtered using several company and contact-level attributes so buyers can create more targeted prospecting and outreach lists.
Common filters may include:
• industry • company name • city or region • company type • contact availability • department or job role • company domain
These filters help teams build more focused UK B2B leads databases and reduce the need to work from broad exports.
Common Use Cases Sales Prospecting
Use the UK B2B database to build targeted account and contact lists for outbound sales and business development.
Account-Based Marketing
Identify target companies and decision-makers in the UK market based on industry, size, or region.
Market Research
Analyze UK company presence, business segments, and contact coverage using structured records.
Business Outreach
Use the dataset for email and phone-based contact discovery and campaign preparation across the UK market.
Data Enrichment
Use the dataset to add missing contact or company fields to internal records where appropriate.
Data Structure and Delivery
The dataset is organized into a structured format so buyers can work with it more easily in spreadsheets, CRM systems, sales tools, and internal workflows.
Data can typically be delivered in formats such as:
• CSV • Excel • Custom filtered extracts
A sample dataset and schema preview are available on reques...
Facebook
TwitterHere's the description without icons:
UK Companies House - Basic Company Data This dataset contains 850,000 registered companies from the official UK Companies House registry - the government agency responsible for incorporating and dissolving limited companies in the United Kingdom. What's Included ColumnDescriptionCompanyNameRegistered name of the companyCompanyNumberUnique 8-character identifierRegAddress.*Registered office address (street, city, postcode, country)CompanyCategoryType: Private Limited, PLC, LLP, etc.CompanyStatusCurrent status: Active, Dissolved, Liquidation, etc.IncorporationDateDate company was registeredSICCode.SicText_1/2Industry classification codesAccounts.*Financial filing information and due datesURIDirect link to company page on Companies House Dataset Stats
Rows: 849,999 companies Columns: 24 (cleaned from original 55) File Format: Parquet (57 MB, compressed from 398 MB CSV) Date: January 2026 snapshot
Source Official data from UK Companies House (https://www.gov.uk/government/organisations/companies-house), released under the Open Government License (OGL). Download original: https://download.companieshouse.gov.uk/en_output.html Use Cases
Market Research - Analyze industries, company formation trends Lead Generation - Find businesses by location, sector, size Economic Analysis - Track UK business growth over time Compliance - Verify company registration details Machine Learning - Classification, clustering, NLP on company names
Acknowledgements Data provided by UK Companies House under the Open Government License v3.0.
Facebook
TwitterAll statutory company information captured by Companies House. Includes basic company data (company name, registered office address, company status, incorporation date, country of origin, company type, nature of business, accounting reference date, date of last accounts/annual return filed, date of next accounts/annual return due, previous names, share allocation) and all statutory filings by companies contained in the Register.
Facebook
TwitterThese tables cover:
These figures are not official statistics.
<p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute"><abbr title="OpenDocument Spreadsheet" class="gem-c-attachment_abbr">ODS</abbr></span>, <span class="gem-c-attachment_attribute">77.3 KB</span></p>
<p class="gem-c-attachment_metadata">
This file is in an <a href="https://www.gov.uk/guidance/using-open-document-formats-odf-in-your-organisation" target="_self" class="govuk-link">OpenDocument</a> format
<p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">104 KB</span></p>
<p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
<details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
Request an accessible format.
If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:enquiries@compani
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive list containing 899 verified Database management company businesses in United Kingdom with latest contact information, ratings, reviews, and location data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Free Public Data Product Note The dates shown below relate to when the link has been added to data.gov.uk. The link references the latest data set as detailed below What is it? The Free Public Data Product is a downloadable data snapshot containing basic company data of live companies on the register. This snapshot is provided as ZIP files containing data in CSV format and is split into multiple files for ease of downloading. This snapshot is provided free of charge and will not be supported. When will it be updated? The latest snapshot will be updated within 5 working days of the previous month end. Additional Information The contents of the snapshot have been compiled up to the end of the previous month. Up-to-date company information can be obtained by following the URI links in the data. More details on URIs If files are viewed with Microsoft Excel, it is recommended that you use version 2007 or later.
Facebook
TwitterLocal businesses form the foundation of many economies. From independent service providers to growing regional companies, small and local businesses represent a large share of potential customers, partners, and opportunities.
The Local Business Database from Lead For Business provides structured information on local companies along with professional contact details connected to those businesses. The dataset helps organizations identify local companies, understand what they do, and connect with the professionals responsible for running them.
Rather than offering a simple list of businesses, the dataset combines business contact data with company information and location details. This allows teams to identify companies operating within specific areas and focus outreach on businesses that match their target market.
Organizations commonly use this dataset as a local business email list, a small business database, or a local business leads dataset when researching markets or building prospect lists.
Geographic Coverage
The dataset includes businesses operating across multiple cities, regions, and local markets.
Companies are associated with location information that allows users to identify businesses within specific geographic areas.
Typical geographic attributes included in the dataset may cover:
cities states or provinces regions postal areas
This location-based structure allows organizations to focus on companies operating within specific cities or local markets.
For example, a company expanding its services to a new city may use the dataset to identify businesses operating within that region and begin outreach to potential customers or partners.
Industry Representation
Local businesses operate across a wide range of industries, and the dataset reflects this diversity.
Common sectors represented include:
Professional services Retail businesses Healthcare and wellness Construction and home services Hospitality and tourism Education and training Marketing and advertising agencies Technology and IT services Automotive services Local service providers
Because business contacts are linked with company attributes, users can easily focus on specific industries or analyze businesses across sectors.
For example, a marketing agency may focus on retail and ecommerce businesses, while a technology provider may target professional services firms.
Data Fields Included
Each record combines business contact information with company and location details.
Business Contact Information
First Name Last Name Job Title Business Email Address LinkedIn Profile URL Company Name Company Domain Country City
Company Information
Company Name Website Domain Industry Employee Count Revenue Range Headquarters Location Company Description Founded Year
Location and Business Attributes
Company Size Business Category Geographic Region
These attributes help organizations understand both the company and the professionals working within it.
How Organizations Use This Dataset
Local business datasets are used by many types of organizations.
Sales Prospecting
Sales teams often target local businesses when introducing products or services. Access to business contacts allows them to reach companies within specific regions.
Local Marketing Campaigns
Marketing teams running regional campaigns may use the dataset to identify businesses operating within specific cities or markets.
Partnership Development
Companies seeking partnerships with local service providers or regional businesses can use the dataset to identify potential partners.
Market Research
Researchers may analyze business listings to understand how companies are distributed across local markets and industries.
Recruitment Research
Recruitment teams sometimes use company databases to identify small businesses operating in specific regions.
Because of these uses, the dataset is often searched as a business listing database, a local company database, or a small business contact list.
Data Sources and Organization
The dataset is compiled using information gathered from publicly available professional and business sources including:
company websites business directories professional profiles public business records
Collected information is structured and standardized so it can be easily used in CRM systems, analytics platforms, or internal prospecting tools.
This structured format helps ensure that businesses and contacts are represented consistently across the dataset.
Dataset Updates
Local business environments evolve as companies open, close, or change their operations.
To maintain relevance, the dataset is reviewed and refreshed periodically through update cycles that may include:
monthly updates quarterly dataset refreshes periodic record revisions
These updates help ensure that business listings and contact information remain useful over time.
Compliance
The dataset is maintained with attention to widely recognized data protection and marke...
Facebook
TwitterThe Business Structure Database (BSD) contains a small number of variables for almost all business organisations in the UK. The BSD is derived primarily from the Inter-Departmental Business Register (IDBR), which is a live register of data collected by HM Revenue and Customs via VAT and Pay As You Earn (PAYE) records. The IDBR data are complimented with data from ONS business surveys. If a business is liable for VAT (turnover exceeds the VAT threshold) and/or has at least one member of staff registered for the PAYE tax collection system, then the business will appear on the IDBR (and hence in the BSD). In 2004 it was estimated that the businesses listed on the IDBR accounted for almost 99 per cent of economic activity in the UK. Only very small businesses, such as the self-employed were not found on the IDBR.
The IDBR is frequently updated, and contains confidential information that cannot be accessed by non-civil servants without special permission. However, the ONS Virtual Micro-data Laboratory (VML) created and developed the BSD, which is a 'snapshot' in time of the IDBR, in order to provide a version of the IDBR for research use, taking full account of changes in ownership and restructuring of businesses. The 'snapshot' is taken around April, and the captured point-in-time data are supplied to the VML by the following September. The reporting period is generally the financial year. For example, the 2000 BSD file is produced in September 2000, using data captured from the IDBR in April 2000. The data will reflect the financial year of April 1999 to March 2000. However, the ONS may, during this time, update the IDBR with data on companies from its own business surveys, such as the Annual Business Survey (SN 7451).
The data are divided into 'enterprises' and 'local units'. An enterprise is the overall business organisation. A local unit is a 'plant', such as a factory, shop, branch, etc. In some cases, an enterprise will only have one local unit, and in other cases (such as a bank or supermarket), an enterprise will own many local units.
For each company, data are available on employment, turnover, foreign ownership, and industrial activity based on Standard Industrial Classification (SIC)92, SIC 2003 or SIC 2007. Year of 'birth' (company start-up date) and 'death' (termination date) are also included, as well as postcodes for both enterprises and their local units. Previously only pseudo-anonymised postcodes were available but now all postcodes are real.
The ONS is continually developing the BSD, and so researchers are strongly recommended to read all documentation pertaining to this dataset before using the data.
Linking to Other Business Studies
These data contain IDBR reference numbers. These are anonymous but unique reference numbers assigned to business organisations. Their inclusion allows researchers to combine different business survey sources together. Researchers may consider applying for other business data to assist their research.
Latest Edition Information
For the sixteenth edition (March 2024), data files and a variable catalogue document for 2023 have been added.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive list containing 5,898 verified Media company businesses in United Kingdom with latest contact information, ratings, reviews, and location data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive list containing 49 verified Title company businesses in United Kingdom with latest contact information, ratings, reviews, and location data.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Numbers of enterprises and local units produced from a snapshot of the Inter-Departmental Business Register (IDBR) taken on 14 March 2025.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I work with UK company information on a daily basis, and I thought it would be useful to publish a list of all active companies, in a way that could be used for machine learning.
There are 3,838,469 rows in the dataset, one for each active company. Each row, has the company name, date of incorporation and the Standard Industrial Classification Code.
The company list is from the publicly available 1st November 2017 Companies House snapshot.
The SIC code descriptions are from the gov.uk website.
In the file AllCompanies.csv each row is formatted as follows:
Inspiration
Possible uses for this data is to use ML to suggest a new unique but suitable name for a company based on what other companies of the same SIC are called.
Perhaps analyse how company names have evolved over time.
Using ML, perhaps determine what a typical company name looks like, maybe analyse if company names have got longer or more complicated over time.
I am sure there are many more possible uses for this data in ways, that I cannot imagine.
This is my second go (the first was published a few hours ago) at publishing a dataset on any medium, so any useful tips and hints would be extremely welcome.
Links to the raw data sources are here:
Facebook
TwitterFrom our comprehensive UK Data Lake, we proudly present 5M+ high-quality UK decision-makers and influencers.
Take your ABM strategy to the next level, build a strong pipeline and close deals by laser targeting key decision-makers and influencers based on their department, job functions, job responsibilities, interest areas and expertise, then utilise essential prospect information, including verified work email addresses and business phone and social links.
Our data is sourced directly from executives, businesses, official sources and registries, standardised, de-duped, and verified, and then processed through vigorous compliance procedures for GDPR/PECR on a legitimate interest basis and RTBI etc. This results in a highly accurate single source of quality and compliant B2B data.
It is with our B2B Live Data Lake that we can enrich your CRM data, supply new prospect data, verify leads, and provide you with a custom dataset tailored to your target audience specifications. We also cater for big data licensing to software providers and agencies that intend to supply our data to their customers and use it in their software solutions.
and much more
Why Choose 1 Stop Data?
Products and Services:
The oscar4.io web platform for self-service data on demand Bulk data feeds Data hygiene, standardisation, cleansing and enrichment Know Your Business (KYB)
Keywords:
B2B,Prospect Data,Validated Work Emails,Personal Emails,Email Enrichment,Company Data,Lead Enrichment,Data Enhancement,Account Based Marketing (ABM),Customer Data,Phone Enrichment,LinkedIn URL,Market Intelligence,Business Intelligence,Data Append,Contact Data,Lead Generation,360-Degree Customer View,Data Cleansing,Lead Data,Email and Phone Validation,Data Augmentation,Segmentation,Data Enrichment,Email Marketing,Data Intelligence,Direct Marketing,Customer Insights,Audience Targeting,Audience Generation,Mobile Phone,B2B Data Enrichment,Social Advertising,Due Diligence,B2B Advertising,Audience Insights,B2B Lead Retargeting,Contact Information,Demographic Data,Consumer Data Enrichment,People-Based Marketing,Contact Data Enrichment,Customer Data Insights,Prospecting,Sales Intelligence,Predictive Analytics,Email Address Validation,Company Data Enrichment,Audience Intelligence,Cold Outreach,Analytics,Marketing Data Enrichment,Customer Acquisition,Data Cleansing,B2C Data,People Data,Professional Information,Recruiting and HR,KYC,B2B List Validation,Lead Information,Sales Prospecting,B2B Sales,B2B Data,Lead Lists,Contact Validation,Competitive Intelligence,Customer Data Enrichment,Identity Resolution,Identity Validation,Data Science,B2C Data Enrichment,B2C,Lead Data Enrichment,Social Media Data.
Facebook
TwitterThe UK Business Data Survey is a telephone-based quantitative and qualitative study of UK businesses. It seeks to understand the role and importance of personal and non-personal data in UK businesses, domestic and international transfers of data, and the awareness of, and attitudes toward, data protection legislation and policy.
This is the first time this survey has been carried out. The quantitative survey took place from November 2020 to January 2021 and the qualitative interviews were undertaken in February 2021. The research was delayed from spring 2020 to minimise the impact of the COVID-19 lockdown on the quality of responses and the robustness of the results.
Facebook
TwitterThe 2014 London Business Survey (LBS) is an innovative survey designed by the Office for National Statistics, on behalf of the London Enterprise Panel and the GLA. The survey collected information from a representative sample of private sector businesses in London in May-July 2014. This dataset contains information on the profile of London businesses corresponding with Section 1 of the London Business Survey 2014: Main Findings report. Information is provided on: The country or region of business ownership of London businesses UK versus foreign ownership of London businesses What London businesses provide: goods, services and intellectual property The types of customers of London businesses The age of London businesses, including the numbers of start-ups As with any survey, the 2014 LBS is based on a sample and as such is subject to variability in the results. Care should therefore be taken in interpreting the survey findings. For all estimates, lower and upper limits of 95% confidence intervals are provided in the data files to assist with interpretation. The LBS results represent the population of business units in London. A business unit is defined as a site/workplace, which may also be a head office if the head office is in London. It will be the whole business in the case of businesses which only have one site, or part of the business in the case of multi-site firms. The results are presented by enterprise size band and industry sector.
Facebook
TwitterThe company information data is sourced from SEC filings and the official company websites. The data on essential company information includes: company profile overview including mission, vision and values, executive leadership, contact information, number of employees and account of the sector and the industry in which the business organizations operate.
If you are interested to learn more, check out the company website:
https://tradefeeds.com/company-information-api/
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Techsalerator’s Business Funding Data for the United Kingdom
Techsalerator’s Business Funding Data for the United Kingdom offers a comprehensive and insightful collection of information crucial for businesses, investors, and financial analysts. This dataset provides an in-depth examination of the funding activities of companies across various sectors in the UK, detailing data related to their funding rounds, investment sources, and significant financial milestones.
If you need the full dataset, reach out to us at info@techsalerator.com or https://www.techsalerator.com/contact-us.
Techsalerator’s Business Funding Data for the United Kingdom
Techsalerator’s Business Funding Data for the United Kingdom provides a thorough and insightful overview of essential information for businesses, investors, and financial analysts. This dataset offers an in-depth analysis of funding activities across various sectors in the UK, capturing data related to funding rounds, investment sources, and key financial milestones.
Top 5 Key Data Fields
Top 5 Funding Trends in the United Kingdom
Top 5 Companies with Notable Funding Data in the United Kingdom
Accessing Techsalerator’s Business Funding Data
To obtain Techsalerator’s Business Funding Data for the United Kingdom, contact info@techsalerator.com with your specific needs. Techsalerator will provide a customized quote based on the required data fields and records, with delivery available within 24 hours. Ongoing access options can also be discussed.
Included Data Fields
For detailed insights into funding activities and financial trends in the United Kingdom, Techsalerator’s...
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I work with UK company information on a daily basis, and I thought it would be useful to publish a list of all active companies, in a way that could be used for machine learning.
There are 3,801,733 rows in the dataset, one for each active company. The postcode which is included in the dataset has been geolocated, and the resultant latitude and longitudes have been included, along with the Standard Industrial Classification Code, and date of incorporation.
The company list is from the publicly available 1st November 2017 Companies House snapshot.
The postcode geolocations and SIC Codes are from the gov.uk website.
In the file AllCompanies.csv each row is formatted as follows:
Inspiration Possible uses for this data is to see where certain types of companies are located in the UK, and how over time they multiply and spread throughout the UK.
Training ML algorithms to predict where there are a high (or low) density of certain types of companies, and where would be a good area for a company to be located, if it wanted minimal competition, or the inverse, where there are clusters of high densities, where it might be easier to recruit specialised staff.
A useful addition would be to overlay population density, which I am currently working on as an option for this dataset.
I am sure there are many more possible uses for this data in ways, that I cannot imagine.
This is my first go at publishing a dataset on any medium, so any useful tips and hints would be extremely welcome.
Links to the raw data sources are here:
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset provides a comprehensive view of UK companies, including their registration details, financial information, ownership, management, and recent filings for up to the 31st December 2023. The data has been meticulously processed using dbt (Data Build Tool) scripts to ensure accuracy and relevance.
Play with this dataset at the BI app. (Free registration is required)
https://www.youtube.com/watch?v=iybNM8UtQRA" alt="Dataset overview">
The dataset comprises the following tables:
Below is a detailed description of each table.
Description: Contains detailed information about companies registered in the UK up to January 1, 2024.
Columns:
company_number: Unique identifier for each company.company_type: Type of company (e.g., private limited, public limited).office_address: Registered office address.incorporation_date: Date of company incorporation.jurisdiction: Legal jurisdiction of the company.company_status: Current status (e.g., active, dissolved).account_type: Type of accounts filed.company_name: Official name of the company.sic_codes: Standard Industrial Classification codes.date_of_cessation: Date when the company ceased operations (if applicable).next_accounts_overdue: Indicator if the next accounts are overdue.confirmation_statement_overdue: Indicator if the confirmation statement is overdue.owners: Number of registered owners (persons with significant control).officers: Number of officers (directors, secretaries) associated with the company.average_number_employees_during_period: Average number of employees during the last accounting period.current_assets: Current assets as per the last accounts.last_accounts_period_end: End date of the last accounting period.company_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_psc and ch_officers tables.ch_accounts table to include financial information.Description: Provides detailed SIC (Standard Industrial Classification) codes for each company.
Columns:
company_number: Company identifier.sic_code: SIC code assigned to the company.sic_description: Description of the SIC code.sic_section: Section of the SIC code.sic_division: Division of the SIC code.company_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_companies_sic_codes with the sic_codes table to enrich SIC code information.Description: Lists up to the five most recent filings for each company as of January 1, 2024.
Columns:
transaction_id.company_number: Company identifier.date: Date of the filing.Data Generation Process:
ch_filings table.Description: Details up to five most recent officers and owners for each company, including their roles and personal information.
Columns:
company_number: Company identifier.name: Full name of the officer or owner.kind: Type of person (individual or corporate entity).officer_role: Role within the company.occupation: Occupation of the individual.date: Date of appointment or notification.is_owner: Boolean indicating if the person is an owner.country_of_residence: Country where the individual resides.nationality: Nationality of the individual.company_country: Country of the company (for corporate persons).person_id: Unique identifier for the person.person_url: Where you can check the up-to-date company information. Free registration is required.Data Generation Process:
ch_officers and ch_psc tables.Segments Included:
Data Generation Process: