100+ datasets found
  1. US Industry Data by State, by Industry

    • kaggle.com
    zip
    Updated Jan 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). US Industry Data by State, by Industry [Dataset]. https://www.kaggle.com/datasets/thedevastator/2012-us-industry-data-by-state-by-industry
    Explore at:
    zip(53066 bytes)Available download formats
    Dataset updated
    Jan 15, 2023
    Authors
    The Devastator
    Area covered
    United States
    Description

    US Industry Data by State, by Industry

    Number of Establishments, Sales, Payroll, and Employees

    By Gary Hoover [source]

    About this dataset

    This data set provides a detailed look into the US economy. It includes information on establishments and nonemployer businesses, as well as sales revenue, payrolls, and the number of employees. Gleaned from the Economic Census done every five years, this data is a valuable resource to anyone curious about where the nation was economically at the time. With columns including geographic area name, North American Industry Classification System (NAICS) codes for industries, descriptions of those codes meaning of operation or tax status, and annual payroll, this information-rich dataset contains all you need to track economic trends over time. Whether you’re a researcher studying industry patterns or an entrepreneur looking for market insight — this dataset has what you’re looking for!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset provides detailed US industry data by state, including the number of establishments, value of sales, payroll, and number of employees. All the data is based on the North American Industry Classification System (NAICS) code for each specific industry. This will allow you to easily analyze and compare industries across different states or regions.

    Research Ideas

    • Analyzing the economic impact of a new business or industry trends in different states: Comparing the change in the number of establishments, payroll, and employees over time can give insight into how a state is affected by a new industry trend or introduction of a new service or product.
    • Estimating customer sales potential for businesses: This dataset can be used to estimate the potential customer base for businesses in different geographic areas. By analyzing total business done by non-employers in an area along with its estimated population can help estimate how much overall sales potential exists for a given region.
    • Tracking competitor performance: By looking at shipments, receipts, and value of business done across industries in different regions or even cities, companies can track their competitors’ performance and compare it to their own to better assess their strategies going forward

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

    Columns

    File: 2012 Industry Data by Industry and State.csv | Column name | Description | |:----------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------| | Geographic area name | The name of the geographic area the data is for. (String) | | NAICS code | The North American Industry Classification System (NAICS) code for the industry. (String) | | Meaning of NAICS code | The description of the NAICS code. (String) | | Meaning of Type of operation or tax status code | The description of the type of operation or tax status code. (String) ...

  2. c

    Employment by Industry - Datasets - CTData.org

    • data.ctdata.org
    Updated Mar 14, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Employment by Industry - Datasets - CTData.org [Dataset]. http://data.ctdata.org/dataset/employment-by-industry
    Explore at:
    Dataset updated
    Mar 14, 2016
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Employment by Industry reports several labor statistics related to employment and wage. Domain Frequency Annual Full Description Employment by Industry reports the total Number of Employers, the Annual Average Employment, and the Annual Average Wage by industry at the town, county, and state level. Industries included in this dataset vary from location to location. In as many locations as possible, five specific industry segments are consistently present (Construction, Manufacturing, Retail Trade, All Industries, Total Government) as well as the largest 3 out of the remaining segments for that location, ranked by Annual Average Employment. Not every location has data for every segment, and some may not have data for the five consistently reported segments. This data is from the Connecticut Department of Labor Quarterly Census of Employment and Wages (QCEW). The program produces a comprehensive tabulation of employment and wage information for workers covered by Connecticut Unemployment Insurance (UI) laws and Federal workers covered by the Unemployment Compensation for Federal Employees (UCFE) program.

  3. Industry Market Cap Dataset

    • kaggle.com
    zip
    Updated Jul 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zibran Zarif Amio (2024). Industry Market Cap Dataset [Dataset]. https://www.kaggle.com/datasets/zibranzarif/industry-market-cap-analysis-dataset
    Explore at:
    zip(154299 bytes)Available download formats
    Dataset updated
    Jul 25, 2024
    Authors
    Zibran Zarif Amio
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    About Dataset

    Context

    This dataset contains financial information of 1500 companies across 8 different industries scraped from companiesmarketcap.com on May 2024. It contains information about the company's name, industry, country, employees, marketcap, revenue, earnings, etc.

    Content

    The dataset contains 2 files with the same column names. scraped_company_data.csv file is further transformed and cleaned to produce the finaltransformed_company_data.csvfile.

    1. Company: Full name of the company
    2. Company Path: Website URL of the company
    3. Industry: Associated industry of the company
    4. Country: Location of the company
    5. Employees: Total number of employees of the company
    6. Market Cap: Market capital of the company (as of Mar 2024)
    7. Revenue: Company's current revenue (31 Mar 2023 - 31 Mar 2024)
    8. Earnings: Company's current earnings (31 Mar 2023 - 31 Mar 2024)
    9. Operating Margin: Company's current operating margin (at the end of 2023)
    10. Total Assets: Company's total assets (as of Mar 2024)
    11. Total Liabilities: Company's total liabilities
    12. Total Debt: Company's total debt
    13. Net Assets: Company’s net assets
    14. PE Ratio: Company's current price-to-earnings ratio (31 Mar 2023 - 31 Mar 2024)
    15. PS Ratio: Company's current price-to-sales ratio (31 Mar 2023 - 31 Mar 2024)

    Acknowledgements

    The website companiesmarketcap.com was used to scrape this dataset. Please include citations for this dataset if you use it in your own research.

    Inspiration

    The dataset can be used to find industries with the highest average market value, most profitable industries, most growth-oriented sectors, etc. More interesting insights can be found in this README file.

  4. N

    Industry, Maine Annual Population and Growth Analysis Dataset: A...

    • neilsberg.com
    csv, json
    Updated Jul 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2024). Industry, Maine Annual Population and Growth Analysis Dataset: A Comprehensive Overview of Population Changes and Yearly Growth Rates in Industry town from 2000 to 2023 // 2024 Edition [Dataset]. https://www.neilsberg.com/insights/industry-me-population-by-year/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Jul 30, 2024
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Maine, Industry
    Variables measured
    Annual Population Growth Rate, Population Between 2000 and 2023, Annual Population Growth Rate Percent
    Measurement technique
    The data presented in this dataset is derived from the 20 years data of U.S. Census Bureau Population Estimates Program (PEP) 2000 - 2023. To measure the variables, namely (a) population and (b) population change in ( absolute and as a percentage ), we initially analyzed and tabulated the data for each of the years between 2000 and 2023. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the Industry town population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of Industry town across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.

    Key observations

    In 2023, the population of Industry town was 801, a 0.50% increase year-by-year from 2022. Previously, in 2022, Industry town population was 797, an increase of 0.63% compared to a population of 792 in 2021. Over the last 20 plus years, between 2000 and 2023, population of Industry town increased by 16. In this period, the peak population was 928 in the year 2019. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).

    Content

    When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).

    Data Coverage:

    • From 2000 to 2023

    Variables / Data Columns

    • Year: This column displays the data year (Measured annually and for years 2000 to 2023)
    • Population: The population for the specific year for the Industry town is shown in this column.
    • Year on Year Change: This column displays the change in Industry town population for each year compared to the previous year.
    • Change in Percent: This column displays the year on year change as a percentage. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Industry town Population by Year. You can refer the same here

  5. a

    York Region 2022 Business Directory

    • hub.arcgis.com
    • data-markham.opendata.arcgis.com
    • +1more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Regional Municipality of York (2019). York Region 2022 Business Directory [Dataset]. https://hub.arcgis.com/maps/york::york-region-2022-business-directory
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    The Regional Municipality of York
    Area covered
    Description

    Displays a representation of where all the surveyed businesses across York Region are located. This data is collected through the Region’s annual comprehensive employment survey and each record contains employment and business contact information about each business with the exception of home and farm-based businesses. Home-based businesses are not included as they are distributed throughout residential communities within the Region and are difficult to survey. Employment data for farm-based businesses are collected through the Census of Agriculture conducted by Statistics Canada, and are not included in the York Region Employment Survey dataset.Update Frequency: Not PlannedDate Created: 17/03/2023Date Modified: 17/03/2023Metadata Date: 17/03/2023Citation Contacts: York Region, Long Range PlanningAttribute DefinitionsBUSINESSID: Unique key to identify a business.NAME: The common business name used in everyday transactions. FULL_ADDRESS: Full street address of the physical address. (This field concatenates the following fields: Street Number, Street Name, Street Type, Street Direction)STREET_NUM: Street number of the physical addressSTREET_NAME: Street name of the physical addressSTREET_TYPE: Street type of the physical addressSTREET_DIR: Street direction of the physical addressUNIT_NUM: Unit number of the physical addressCOMMUNITY: Community name where the business is physically locatedMUNICIPALITY: Municipality where the business is physically locatedPOST_CODE: Postal code corresponding to the physical street addressEMPLOYEE_RANGE: The numerical range of employees working in a given firm. PRIM_NAICS, PRIM_NAICS_DESC: The Primary 5-digit NAIC code defines the main business activity that occurs at that particular physical business location.SEC_NAICS, SEC_NAICS_DESC: If there is more than one business activity occurring at a particular business location (that is substantially different from the primary business activity), then a secondary NAIC is assigned.PRIM_BUS_CLUSTER, SEC_BUS_CLUSTER: A business cluster is defined as a geographic concentration of interconnected businesses and institutions in a common industry that both compete and cooperate. As defined by York Region, this field indicates the primary business cluster that this business belongs to.BUS_ACTIVITY_DESC: This is a comment box with a detailed text description of the business activity.TRAFFIC_ZONE: Specifies the traffic zone in which the business is located. MANUFACTURER: Indicates whether or not the business manufactures at the physical business location. CAN_HEADOFFICE: The business at this location is considered the Canadian head office.HEADOFFICEPROVSTATE: Indicates which state or province the head office is located if the head office is located in Canada (outside of Ontario) or in the United StatesHEADOFFICECOUNTRY: Indicates which country the head office is locatedYR_CURRENTLOC: Indicates the year that the business moved into its current address.MAIL_FULL_ADDRESS: The mailing address is the address through which a business receives postal service. This may or may not be the same as the physical street address.MAIL_STREET_NUM, MAIL_STREET_NAME, MAIL_STREET_TYPE, MAIL_STREET_DIR, MAIL_UNIT_NUM, MAIL_COMMUNITY, MAIL_MUNICIPALITY, MAIL_PROVINCE, MAIL_COUNTRY, MAIL_POST_CODE, MAIL_POBOX: Mailing address fields are similar to street address fields and in most cases will be the same as the Street Address. Some examples where the two addresses might not be the same include, multiple location businesses, home-based businesses, or when a business receives mail through a P.O. Box.WEBSITE: The General/Main business website.GEN_BUS_EMAIL: The general/main business e-mail address for that location.PHONE_NO: The general/main phone number for the business location.PHONE_EXT: The extension (if any) for the general/main business phone number.LAST_SURVEYED: The date the record was last surveyedLAST_UPDATED: The date the record was last updatedUPDATEMETHOD: Displays how the business was last updated, based on a predetermined list.X_COORD, Y_COORD: The x,y coordinates of the surveyed business locationFrequently Asked Questions How many businesses are included in the 2022 York Region Business Directory? The 2022 York Region Business Directory contains just over 34,000 business listings. In the past, businesses were annually surveyed, either in person or by telephone to improve the accuracy of the directory. Due to the COVID-19 Pandemic, a survey was not complete in 2020 and 2021. The Region may return to annual surveying in future years, however the next employment survey will be in 2024. This listing also includes home-based businesses that participated in the 2022 employment survey. What is a NAIC code? The North American Industrial Classification (NAIC) coding system is a hierarchical classification system developed in Canada, Mexico and the United States. It was developed to allow for the comparison of business and employment information across a variety of industry categories. The NAICS has a hierarchical structure, designed as follows: Two-digits = sector (e.g., 31-33 contain the Manufacturing sectors) Three-digits = subsector (e.g., 336 = Transportation Equipment Manufacturing) Four-digits = industry group (e.g., 3361 = Motor Vehicle Manufacturing) Five-digits = industry (e.g., 33611 = Automobile and Light Duty Motor Vehicle Manufacturing) For more information on the NAIC coding system click here How do I add or update my business information in the York Region Business Directory? To add or update your business information, please select one of the following methods: • Email: Please email businessdirectory@york.ca to request to be added to the Business Directory. • Online: Go to www.york.ca/employmentsurvey and participate in the employment survey - note, this will only be active in 2024 when the Region performs its next employment survey There is no charge for obtaining a basic listing of your business in the York Region Business Directory. How up-to-date is the information? This directory is based on the 2022 York Region Employment Survey, a survey of businesses which attempts to gather information from all businesses across York Region. In instances where we were unable to gather information, the most recent data was used. Farm-based businesses have not been included in the survey and home-based businesses that participated in the 2022 survey are included in the dataset. The date that the business listing was last updated is located in the LastUpdate column in the attached spreadsheet. Are different versions of the York Region Business Directory available? Yes, the directory is available in two online formats: • An interactive, map-based directory searchable by company name, street address, municipality and industry sector. • The entire dataset in downloadable Microsoft Excel format via York Region's Open Data Portal. This version of the York Region Business Directory 2022 is offered free of charge. The Directory allows for the detailed analysis of business and employment trends, as well as the construction of targeted contact lists. To view the map-based directory and dataset, go to: 2022 Business Directory - Map Is there any analysis of business and employment trends in York Region? Yes. The "2022 Employment and Industry Report" contains information on employment trends in York Region and is based on results from the employment survey. please visit www.york.ca/york-region/plans-reports-and-strategies/employment-and-industry-report to view the report. What other resources are available for York Region businesses? York Region offers an export advisory service and a number of other business development programs and seminars for interested individuals. For details, consult the York Region Economic Strategy Branch. Who do I contact to obtain more information about the Directory? For more information on the York Region Business Directory, contact the Planning and Economic Development Branch at: businessdirectory@york.ca.

  6. m

    Business establishments and jobs data by business size and industry

    • data.melbourne.vic.gov.au
    • researchdata.edu.au
    csv, excel, json
    Updated Oct 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Business establishments and jobs data by business size and industry [Dataset]. https://data.melbourne.vic.gov.au/explore/dataset/business-establishments-and-jobs-data-by-business-size-and-industry/
    Explore at:
    excel, json, csvAvailable download formats
    Dataset updated
    Oct 2, 2023
    Description

    Data collected as part of the City of Melbourne's Census of Land Use and Employment (CLUE). The data covers the period 2002-2023. It shows number of jobs and number of business establishments by business size, classified by their CLUE industry, ANZSIC1 and CLUE small area allocation.Business size is determined by the total number of jobs at ech business establishment and is categorised as follows:Non employing, no jobs allocated to the establishment.Small business, 1 to 19 jobs employed at a business establishment.Medium business, 20 to 199 jobs employed at a business establishment.Larger business, 200 or more jobs employed at a business establishment.This dataset has been confidentialised to protect the commercially sensitive information of individual businesses. Data in cells which pertain to two or fewer businesses have been suppressed and are shown as a blank cell. The 'City of Melbourne' row totals refer to the true total, including those suppressed cells.Non-confidentialised data may be made available subject to a data supply agreement. For more information contact cityfacts@melbourne.vic.gov.auFor CLUE small area spatial files see https://data.melbourne.vic.gov.au/explore/dataset/small-areas-for-census-of-land-use-and-employment-clue/mapFor more information about CLUE see http://www.melbourne.vic.gov.au/clueFor more information about the ANZSIC industry classification system see http://www.abs.gov.au/ausstats/abs@.nsf/mf/1292.0

  7. d

    Satellite US Construction Materials Dataset Package (Cemex, Vulcan, Martin...

    • datarade.ai
    .csv
    Updated Jan 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Space Know (2023). Satellite US Construction Materials Dataset Package (Cemex, Vulcan, Martin Marietta) [Dataset]. https://datarade.ai/data-products/satellite-us-construction-materials-dataset-package-cemex-v-space-know
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jan 18, 2023
    Dataset authored and provided by
    Space Know
    Area covered
    United States of America
    Description

    This dataset package is focused on U.S construction materials and three construction companies: Cemex, Martin Marietta & Vulcan.

    In this package, SpaceKnow tracks manufacturing and processing facilities for construction material products all over the US. By tracking these facilities, we are able to give you near-real-time data on spending on these materials, which helps to predict residential and commercial real estate construction and spending in the US.

    The dataset includes 40 indices focused on asphalt, cement, concrete, and building materials in general. You can look forward to receiving country-level and regional data (activity in the North, East, West, and South of the country) and the aforementioned company data.

    SpaceKnow uses satellite (SAR) data to capture activity and building material manufacturing and processing facilities in the US.

    Data is updated daily, has an average lag of 4-6 days, and history back to 2017.

    The insights provide you with level and change data for refineries, storage, manufacturing, logistics, and employee parking-based locations.

    SpaceKnow offers 3 delivery options: CSV, API, and Insights Dashboard

    Available Indices Companies: Cemex (CX): Construction Materials (covers all manufacturing facilities of the company in the US), Concrete, Cement (refinery and storage) indices, and aggregates Martin Marietta (MLM): Construction Materials (covers all manufacturing facilities of the company in the US), Concrete, Cement (refinery and storage) indices, and aggregates Vulcan (VMC): Construction Materials (covers all manufacturing facilities of the company in the US), Concrete, Cement (refinery and storage) indices, and aggregates

    USA Indices:

    Aggregates USA Asphalt USA Cement USA Cement Refinery USA Cement Storage USA Concrete USA Construction Materials USA Construction Mining USA Construction Parking Lots USA Construction Materials Transfer Hub US Cement - Midwest, Northeast, South, West Cement Refinery - Midwest, Northeast, South, West Cement Storage - Midwest, Northeast, South, West

    Why get SpaceKnow's U.S Construction Materials Package?

    Monitor Construction Market Trends: Near-real-time insights into the construction industry allow clients to understand and anticipate market trends better.

    Track Companies Performance: Monitor the operational activities, such as the volume of sales

    Assess Risk: Use satellite activity data to assess the risks associated with investing in the construction industry.

    Index Methodology Summary Continuous Feed Index (CFI) is a daily aggregation of the area of metallic objects in square meters. There are two types of CFI indices; CFI-R index gives the data in levels. It shows how many square meters are covered by metallic objects (for example employee cars at a facility). CFI-S index gives the change in data. It shows how many square meters have changed within the locations between two consecutive satellite images.

    How to interpret the data SpaceKnow indices can be compared with the related economic indicators or KPIs. If the economic indicator is in monthly terms, perform a 30-day rolling sum and pick the last day of the month to compare with the economic indicator. Each data point will reflect approximately the sum of the month. If the economic indicator is in quarterly terms, perform a 90-day rolling sum and pick the last day of the 90-day to compare with the economic indicator. Each data point will reflect approximately the sum of the quarter.

    Where the data comes from SpaceKnow brings you the data edge by applying machine learning and AI algorithms to synthetic aperture radar and optical satellite imagery. The company’s infrastructure searches and downloads new imagery every day, and the computations of the data take place within less than 24 hours.

    In contrast to traditional economic data, which are released in monthly and quarterly terms, SpaceKnow data is high-frequency and available daily. It is possible to observe the latest movements in the construction industry with just a 4-6 day lag, on average.

    The construction materials data help you to estimate the performance of the construction sector and the business activity of the selected companies.

    The foundation of delivering high-quality data is based on the success of defining each location to observe and extract the data. All locations are thoroughly researched and validated by an in-house team of annotators and data analysts.

    See below how our Construction Materials index performs against the US Non-residential construction spending benchmark

    Each individual location is precisely defined to avoid noise in the data, which may arise from traffic or changing vegetation due to seasonal reasons.

    SpaceKnow uses radar imagery and its own unique algorithms, so the indices do not lose their significance in bad weather conditions such as rain or heavy clouds.

    → Reach out to get free trial

    ...

  8. Z

    CompanyKG Dataset V2.0: A Large-Scale Heterogeneous Graph for Company...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lele Cao; Vilhelm von Ehrenheim; Mark Granroth-Wilding; Richard Anselmo Stahl; Drew McCornack; Armin Catovic; Dhiana Deva Cavacanti Rocha (2024). CompanyKG Dataset V2.0: A Large-Scale Heterogeneous Graph for Company Similarity Quantification [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7957401
    Explore at:
    Dataset updated
    Jun 4, 2024
    Dataset provided by
    EQT
    Authors
    Lele Cao; Vilhelm von Ehrenheim; Mark Granroth-Wilding; Richard Anselmo Stahl; Drew McCornack; Armin Catovic; Dhiana Deva Cavacanti Rocha
    Description

    CompanyKG is a heterogeneous graph consisting of 1,169,931 nodes and 50,815,503 undirected edges, with each node representing a real-world company and each edge signifying a relationship between the connected pair of companies.

    Edges: We model 15 different inter-company relations as undirected edges, each of which corresponds to a unique edge type. These edge types capture various forms of similarity between connected company pairs. Associated with each edge of a certain type, we calculate a real-numbered weight as an approximation of the similarity level of that type. It is important to note that the constructed edges do not represent an exhaustive list of all possible edges due to incomplete information. Consequently, this leads to a sparse and occasionally skewed distribution of edges for individual relation/edge types. Such characteristics pose additional challenges for downstream learning tasks. Please refer to our paper for a detailed definition of edge types and weight calculations.

    Nodes: The graph includes all companies connected by edges defined previously. Each node represents a company and is associated with a descriptive text, such as "Klarna is a fintech company that provides support for direct and post-purchase payments ...". To comply with privacy and confidentiality requirements, we encoded the text into numerical embeddings using four different pre-trained text embedding models: mSBERT (multilingual Sentence BERT), ADA2, SimCSE (fine-tuned on the raw company descriptions) and PAUSE.

    Evaluation Tasks. The primary goal of CompanyKG is to develop algorithms and models for quantifying the similarity between pairs of companies. In order to evaluate the effectiveness of these methods, we have carefully curated three evaluation tasks:

    Similarity Prediction (SP). To assess the accuracy of pairwise company similarity, we constructed the SP evaluation set comprising 3,219 pairs of companies that are labeled either as positive (similar, denoted by "1") or negative (dissimilar, denoted by "0"). Of these pairs, 1,522 are positive and 1,697 are negative.

    Competitor Retrieval (CR). Each sample contains one target company and one of its direct competitors. It contains 76 distinct target companies, each of which has 5.3 competitors annotated in average. For a given target company A with N direct competitors in this CR evaluation set, we expect a competent method to retrieve all N competitors when searching for similar companies to A.

    Similarity Ranking (SR) is designed to assess the ability of any method to rank candidate companies (numbered 0 and 1) based on their similarity to a query company. Paid human annotators, with backgrounds in engineering, science, and investment, were tasked with determining which candidate company is more similar to the query company. It resulted in an evaluation set comprising 1,856 rigorously labeled ranking questions. We retained 20% (368 samples) of this set as a validation set for model development.

    Edge Prediction (EP) evaluates a model's ability to predict future or missing relationships between companies, providing forward-looking insights for investment professionals. The EP dataset, derived (and sampled) from new edges collected between April 6, 2023, and May 25, 2024, includes 40,000 samples, with edges not present in the pre-existing CompanyKG (a snapshot up until April 5, 2023).

    Background and Motivation

    In the investment industry, it is often essential to identify similar companies for a variety of purposes, such as market/competitor mapping and Mergers & Acquisitions (M&A). Identifying comparable companies is a critical task, as it can inform investment decisions, help identify potential synergies, and reveal areas for growth and improvement. The accurate quantification of inter-company similarity, also referred to as company similarity quantification, is the cornerstone to successfully executing such tasks. However, company similarity quantification is often a challenging and time-consuming process, given the vast amount of data available on each company, and the complex and diversified relationships among them.

    While there is no universally agreed definition of company similarity, researchers and practitioners in PE industry have adopted various criteria to measure similarity, typically reflecting the companies' operations and relationships. These criteria can embody one or more dimensions such as industry sectors, employee profiles, keywords/tags, customers' review, financial performance, co-appearance in news, and so on. Investment professionals usually begin with a limited number of companies of interest (a.k.a. seed companies) and require an algorithmic approach to expand their search to a larger list of companies for potential investment.

    In recent years, transformer-based Language Models (LMs) have become the preferred method for encoding textual company descriptions into vector-space embeddings. Then companies that are similar to the seed companies can be searched in the embedding space using distance metrics like cosine similarity. The rapid advancements in Large LMs (LLMs), such as GPT-3/4 and LLaMA, have significantly enhanced the performance of general-purpose conversational models. These models, such as ChatGPT, can be employed to answer questions related to similar company discovery and quantification in a Q&A format.

    However, graph is still the most natural choice for representing and learning diverse company relations due to its ability to model complex relationships between a large number of entities. By representing companies as nodes and their relationships as edges, we can form a Knowledge Graph (KG). Utilizing this KG allows us to efficiently capture and analyze the network structure of the business landscape. Moreover, KG-based approaches allow us to leverage powerful tools from network science, graph theory, and graph-based machine learning, such as Graph Neural Networks (GNNs), to extract insights and patterns to facilitate similar company analysis. While there are various company datasets (mostly commercial/proprietary and non-relational) and graph datasets available (mostly for single link/node/graph-level predictions), there is a scarcity of datasets and benchmarks that combine both to create a large-scale KG dataset expressing rich pairwise company relations.

    Source Code and Tutorial:https://github.com/llcresearch/CompanyKG2

    Paper: to be published

  9. T

    United States Industrial Production

    • tradingeconomics.com
    • zh.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Sep 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States Industrial Production [Dataset]. https://tradingeconomics.com/united-states/industrial-production
    Explore at:
    xml, excel, json, csvAvailable download formats
    Dataset updated
    Sep 16, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 31, 1920 - Aug 31, 2025
    Area covered
    United States
    Description

    Industrial Production in the United States increased 0.90 percent in August of 2025 over the same month in the previous year. This dataset provides the latest reported value for - United States Industrial Production - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.

  10. j

    CA Jobs Dataset: Comprehensive Job Count Information by Company

    • jopilot.net
    zip
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tarta.ai, CA Jobs Dataset: Comprehensive Job Count Information by Company [Dataset]. https://jopilot.net/open-data/datasets/number-of-jobs-by-company-in-CA-0524
    Explore at:
    zip(3711706 bytes)Available download formats
    Dataset provided by
    Tarta.ai
    License

    https://jopilot.net/dataset-licencehttps://jopilot.net/dataset-licence

    Time period covered
    May 1, 2024 - May 31, 2024
    Area covered
    California
    Dataset funded by
    Tarta.ai
    Description

    The dataset provided by JoPilot, created in May 2024, contains information on the number of jobs by company and city in California. The data provides a comprehensive view of the job market, highlighting the companies and cities that have the highest number of job opportunities.

    The dataset includes a list of companies and the number of jobs they offer in different cities.

    The dataset provides valuable insights for job seekers, employers, and policymakers. It can help job seekers to identify companies and cities with the highest job opportunities in their preferred industry and location. Employers can use the data to understand the competitive landscape and adjust their recruitment strategies accordingly. Policymakers can leverage the information to develop policies that promote job growth and economic development in different regions.

    Overall, the JoPilot dataset is a valuable resource for anyone interested in the job market and provides a comprehensive view of the employment landscape across different industries and regions.

    Dataset Columns:
    1. Company name
    2. City
    3. State
    4. Jobs in total
  11. N

    Industry, IL Population Dataset: Yearly Figures, Population Change, and...

    • neilsberg.com
    csv, json
    Updated Sep 18, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2023). Industry, IL Population Dataset: Yearly Figures, Population Change, and Percent Change Analysis [Dataset]. https://www.neilsberg.com/research/datasets/6ea88d6e-3d85-11ee-9abe-0aa64bf2eeb2/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Sep 18, 2023
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Industry, Illinois
    Variables measured
    Annual Population Growth Rate, Population Between 2000 and 2022, Annual Population Growth Rate Percent
    Measurement technique
    The data presented in this dataset is derived from the 20 years data of U.S. Census Bureau Population Estimates Program (PEP) 2000 - 2022. To measure the variables, namely (a) population and (b) population change in ( absolute and as a percentage ), we initially analyzed and tabulated the data for each of the years between 2000 and 2022. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the Industry population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of Industry across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.

    Key observations

    In 2022, the population of Industry was 393, a 0.00% decrease year-by-year from 2021. Previously, in 2021, Industry population was 393, a decline of 1.26% compared to a population of 398 in 2020. Over the last 20 plus years, between 2000 and 2022, population of Industry decreased by 172. In this period, the peak population was 565 in the year 2000. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).

    Content

    When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).

    Data Coverage:

    • From 2000 to 2022

    Variables / Data Columns

    • Year: This column displays the data year (Measured annually and for years 2000 to 2022)
    • Population: The population for the specific year for the Industry is shown in this column.
    • Year on Year Change: This column displays the change in Industry population for each year compared to the previous year.
    • Change in Percent: This column displays the year on year change as a percentage. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Industry Population by Year. You can refer the same here

  12. c

    The global AI Training Dataset Market size will be USD 2962.4 million in...

    • cognitivemarketresearch.com
    pdf,excel,csv,ppt
    Updated Aug 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cognitive Market Research (2025). The global AI Training Dataset Market size will be USD 2962.4 million in 2025. [Dataset]. https://www.cognitivemarketresearch.com/ai-training-dataset-market-report
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Aug 15, 2025
    Dataset authored and provided by
    Cognitive Market Research
    License

    https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy

    Time period covered
    2021 - 2033
    Area covered
    Global
    Description

    According to Cognitive Market Research, the global AI Training Dataset Market size will be USD 2962.4 million in 2025. It will expand at a compound annual growth rate (CAGR) of 28.60% from 2025 to 2033.

    North America held the major market share for more than 37% of the global revenue with a market size of USD 1096.09 million in 2025 and will grow at a compound annual growth rate (CAGR) of 26.4% from 2025 to 2033.
    Europe accounted for a market share of over 29% of the global revenue, with a market size of USD 859.10 million.
    APAC held a market share of around 24% of the global revenue with a market size of USD 710.98 million in 2025 and will grow at a compound annual growth rate (CAGR) of 30.6% from 2025 to 2033.
    South America has a market share of more than 3.8% of the global revenue, with a market size of USD 112.57 million in 2025 and will grow at a compound annual growth rate (CAGR) of 27.6% from 2025 to 2033.
    Middle East had a market share of around 4% of the global revenue and was estimated at a market size of USD 118.50 million in 2025 and will grow at a compound annual growth rate (CAGR) of 27.9% from 2025 to 2033.
    Africa had a market share of around 2.20% of the global revenue and was estimated at a market size of USD 65.17 million in 2025 and will grow at a compound annual growth rate (CAGR) of 28.3% from 2025 to 2033.
    Data Annotation category is the fastest growing segment of the AI Training Dataset Market
    

    Market Dynamics of AI Training Dataset Market

    Key Drivers for AI Training Dataset Market

    Government-Led Open Data Initiatives Fueling AI Training Dataset Market Growth

    In recent years, Government-initiated open data efforts have strongly driven the development of the AI Training Dataset Market through offering affordable, high-quality datasets that are vital in training sound AI models. For instance, the U.S. government's drive for openness and innovation can be seen through portals such as Data.gov, which provides an enormous collection of datasets from many industries, ranging from healthcare, finance, and transportation. Such datasets are basic building blocks in constructing AI applications and training models using real-world data. In the same way, the platform data.gov.uk, run by the U.K. government, offers ample datasets to aid AI research and development, creating an environment that is supportive of technological growth. By releasing such information into the public domain, governments not only enhance transparency but also encourage innovation in the AI industry, resulting in greater demand for training datasets and helping to drive the market's growth.

    India's IndiaAI Datasets Platform Accelerates AI Training Dataset Market Growth

    India's upcoming launch of the IndiaAI Datasets Platform in January 2025 is likely to greatly increase the AI Training Dataset Market. The project, which is part of the government's ?10,000 crore IndiaAI Mission, will establish an open-source repository similar to platforms such as HuggingFace to enable developers to create, train, and deploy AI models. The platform will collect datasets from central and state governments and private sector organizations to provide a wide and rich data pool. Through improved access to high-quality, non-personal data, the platform is filling an important requirement for high-quality datasets for training AI models, thus driving innovation and development in the AI industry. This public initiative reflects India's determination to become a global AI hub, offering the infrastructure required to facilitate startups, researchers, and businesses in creating cutting-edge AI solutions. The initiative not only simplifies data access but also creates a model for public-private partnerships in AI development.

    Restraint Factor for the AI Training Dataset Market

    Data Privacy Regulations Impeding AI Training Dataset Market Growth

    Strict data privacy laws are coming up as a major constraint in the AI Training Dataset Market since governments across the globe are establishing legislation to safeguard personal data. In the European Union, explicit consent for using personal data is required under the General Data Protection Regulation (GDPR), reducing the availability of datasets for training AI. Likewise, the data protection regulator in Brazil ordered Meta and others to stop the use of Brazilian personal data in training AI models due to dangers to individuals' funda...

  13. H

    Census Manufacturing Statistics, 1958-1976 (M280)

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Oct 29, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Research Program in Competition and Business Policy (2015). Census Manufacturing Statistics, 1958-1976 (M280) [Dataset]. http://doi.org/10.7910/DVN/LRROMK
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 29, 2015
    Dataset provided by
    Harvard Dataverse
    Authors
    Research Program in Competition and Business Policy
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    United States
    Description

    There are three sections of this data. The first covers data from 1958-1971, the second contains data for 1972-1976, and the third is a combination of 1958-1976. 1958-1971: contains establishment statistics and industry structure data for 421 4-digit census manufacturing industries. All are SIC industries in the 2000 to 4000 range, with the exception of 6 "ordnance accessories" industries (SIC 1900's). Also, SIC 9999 is included as the last SIC. Industry definitions are based on the 1967 SIC. There are 15 records per industry for a total of 6315 records. 1972-1976: This file contains establishment statistics and industry structure data for 450 4-digit census manufacturing industries. Establishment data is from Industry Profiles. Industry structure data is from the Census of Manufactures: 1972 Special Report, Concentration Ratios in Manufacturing. Industry definitions are based on the 1972 SIC definitions. There is one record per industry. 1958-1976: A combination of the two datasets described above, organized to provide a time series for as many industries (343) as it was possible to maintain a complete and accurate data over time. There is one record for each industry,

  14. Company Datasets for Business Profiling

    • datarade.ai
    Updated Feb 23, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oxylabs (2017). Company Datasets for Business Profiling [Dataset]. https://datarade.ai/data-products/company-datasets-for-business-profiling-oxylabs
    Explore at:
    .json, .xml, .csv, .xlsAvailable download formats
    Dataset updated
    Feb 23, 2017
    Dataset provided by
    Oxy Labs
    Authors
    Oxylabs
    Area covered
    Tunisia, Nepal, Bangladesh, British Indian Ocean Territory, Taiwan, Andorra, Moldova (Republic of), Canada, Isle of Man, Northern Mariana Islands
    Description

    Company Datasets for valuable business insights!

    Discover new business prospects, identify investment opportunities, track competitor performance, and streamline your sales efforts with comprehensive Company Datasets.

    These datasets are sourced from top industry providers, ensuring you have access to high-quality information:

    • Owler: Gain valuable business insights and competitive intelligence. -AngelList: Receive fresh startup data transformed into actionable insights. -CrunchBase: Access clean, parsed, and ready-to-use business data from private and public companies. -Craft.co: Make data-informed business decisions with Craft.co's company datasets. -Product Hunt: Harness the Product Hunt dataset, a leader in curating the best new products.

    We provide fresh and ready-to-use company data, eliminating the need for complex scraping and parsing. Our data includes crucial details such as:

    • Company name;
    • Size;
    • Founding date;
    • Location;
    • Industry;
    • Revenue;
    • Employee count;
    • Competitors.

    You can choose your preferred data delivery method, including various storage options, delivery frequency, and input/output formats.

    Receive datasets in CSV, JSON, and other formats, with storage options like AWS S3 and Google Cloud Storage. Opt for one-time, monthly, quarterly, or bi-annual data delivery.

    With Oxylabs Datasets, you can count on:

    • Fresh and accurate data collected and parsed by our expert web scraping team.
    • Time and resource savings, allowing you to focus on data analysis and achieving your business goals.
    • A customized approach tailored to your specific business needs.
    • Legal compliance in line with GDPR and CCPA standards, thanks to our membership in the Ethical Web Data Collection Initiative.

    Pricing Options:

    Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.

    Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.

    Experience a seamless journey with Oxylabs:

    • Understanding your data needs: We work closely to understand your business nature and daily operations, defining your unique data requirements.
    • Developing a customized solution: Our experts create a custom framework to extract public data using our in-house web scraping infrastructure.
    • Delivering data sample: We provide a sample for your feedback on data quality and the entire delivery process.
    • Continuous data delivery: We continuously collect public data and deliver custom datasets per the agreed frequency.

    Unlock the power of data with Oxylabs' Company Datasets and supercharge your business insights today!

  15. r

    Data from: Dataset with condition monitoring vibration data annotated with...

    • researchdata.se
    Updated Jun 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karl Löwenmark; Fredrik Sandin; Marcus Liwicki; Stephan Schnabel (2025). Dataset with condition monitoring vibration data annotated with technical language, from paper machine industries in northern Sweden [Dataset]. http://doi.org/10.5878/hxc0-bd07
    Explore at:
    (200308), (124)Available download formats
    Dataset updated
    Jun 17, 2025
    Dataset provided by
    Luleå University of Technology
    Authors
    Karl Löwenmark; Fredrik Sandin; Marcus Liwicki; Stephan Schnabel
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    Sweden
    Description

    Labelled industry datasets are one of the most valuable assets in prognostics and health management (PHM) research. However, creating labelled industry datasets is both difficult and expensive, making publicly available industry datasets rare at best, in particular labelled datasets. Recent studies have showcased that industry annotations can be used to train artificial intelligence models directly on industry data ( https://doi.org/10.36001/ijphm.2022.v13i2.3137 , https://doi.org/10.36001/phmconf.2023.v15i1.3507 ), but while many industry datasets also contain text descriptions or logbooks in the form of annotations and maintenance work orders, few, if any, are publicly available. Therefore, we release a dataset consisting with annotated signal data from two large (80mx10mx10m) paper machines, from a Kraftliner production company in northern Sweden. The data consists of 21 090 pairs of signals and annotations from one year of production. The annotations are written in Swedish, by on-site Swedish experts, and the signals consist primarily of accelerometer vibration measurements from the two machines. The dataset is structured as a Pandas dataframe and serialized as a pickle (.pkl) file and a JSON (.json) file. The first column (‘id’) is the ID of the samples; the second column (‘Spectra’) are the fast Fourier transform and envelope-transformed vibration signals; the third column (‘Notes’) are the associated annotations, mapped so that each annotation is associated with all signals from ten days before the annotation date, up to the annotation date; and finally the fourth column (‘Embeddings’) are pre-computed embeddings using Swedish SentenceBERT. Each row corresponds to a vibration measurement sample, though there is no distinction in this data between which sensor or machine part each measurement is from.

  16. d

    Small Business Contact Data | North American Small Business Owners |...

    • datarade.ai
    Updated Oct 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2021). Small Business Contact Data | North American Small Business Owners | Verified Contact Details from 170M Profiles | Best Price Guaranteed [Dataset]. https://datarade.ai/data-products/small-business-contact-data-north-american-small-business-o-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 27, 2021
    Dataset provided by
    Success.ai
    Area covered
    Bermuda, Greenland, Saint Pierre and Miquelon, United States of America, Guatemala, Mexico, Panama, Honduras, Costa Rica, Belize
    Description

    Access B2B Contact Data for North American Small Business Owners with Success.ai—your go-to provider for verified, high-quality business datasets. This dataset is tailored for businesses, agencies, and professionals seeking direct access to decision-makers within the small business ecosystem across North America. With over 170 million professional profiles, it’s an unparalleled resource for powering your marketing, sales, and lead generation efforts.

    Key Features of the Dataset:

    Verified Contact Details

    Includes accurate and up-to-date email addresses and phone numbers to ensure you reach your targets reliably.

    AI-validated for 99% accuracy, eliminating errors and reducing wasted efforts.

    Detailed Professional Insights

    Comprehensive data points include job titles, skills, work experience, and education to enable precise segmentation and targeting.

    Enriched with insights into decision-making roles, helping you connect directly with small business owners, CEOs, and other key stakeholders.

    Business-Specific Information

    Covers essential details such as industry, company size, location, and more, enabling you to tailor your campaigns effectively. Ideal for profiling and understanding the unique needs of small businesses.

    Continuously Updated Data

    Our dataset is maintained and updated regularly to ensure relevance and accuracy in fast-changing market conditions. New business contacts are added frequently, helping you stay ahead of the competition.

    Why Choose Success.ai?

    At Success.ai, we understand the critical importance of high-quality data for your business success. Here’s why our dataset stands out:

    Tailored for Small Business Engagement Focused specifically on North American small business owners, this dataset is an invaluable resource for building relationships with SMEs (Small and Medium Enterprises). Whether you’re targeting startups, local businesses, or established small enterprises, our dataset has you covered.

    Comprehensive Coverage Across North America Spanning the United States, Canada, and Mexico, our dataset ensures wide-reaching access to verified small business contacts in the region.

    Categories Tailored to Your Needs Includes highly relevant categories such as Small Business Contact Data, CEO Contact Data, B2B Contact Data, and Email Address Data to match your marketing and sales strategies.

    Customizable and Flexible Choose from a wide range of filtering options to create datasets that meet your exact specifications, including filtering by industry, company size, geographic location, and more.

    Best Price Guaranteed We pride ourselves on offering the most competitive rates without compromising on quality. When you partner with Success.ai, you receive superior data at the best value.

    Seamless Integration Delivered in formats that integrate effortlessly with your CRM, marketing automation, or sales platforms, so you can start acting on the data immediately.

    Use Cases: This dataset empowers you to:

    Drive Sales Growth: Build and refine your sales pipeline by connecting directly with decision-makers in small businesses. Optimize Marketing Campaigns: Launch highly targeted email and phone outreach campaigns with verified contact data. Expand Your Network: Leverage the dataset to build relationships with small business owners and other key figures within the B2B landscape. Improve Data Accuracy: Enhance your existing databases with verified, enriched contact information, reducing bounce rates and increasing ROI. Industries Served: Whether you're in B2B SaaS, digital marketing, consulting, or any field requiring accurate and targeted contact data, this dataset serves industries of all kinds. It is especially useful for professionals focused on:

    Lead Generation Business Development Market Research Sales Outreach Customer Acquisition What’s Included in the Dataset: Each profile provides:

    Full Name Verified Email Address Phone Number (where available) Job Title Company Name Industry Company Size Location Skills and Professional Experience Education Background With over 170 million profiles, you can tap into a wealth of opportunities to expand your reach and grow your business.

    Why High-Quality Contact Data Matters: Accurate, verified contact data is the foundation of any successful B2B strategy. Reaching small business owners and decision-makers directly ensures your message lands where it matters most, reducing costs and improving the effectiveness of your campaigns. By choosing Success.ai, you ensure that every contact in your pipeline is a genuine opportunity.

    Partner with Success.ai for Better Data, Better Results: Success.ai is committed to delivering premium-quality B2B data solutions at scale. With our small business owner dataset, you can unlock the potential of North America's dynamic small business market.

    Get Started Today Request a sample or customize your dataset to fit your unique...

  17. T

    United States Industrial Production MoM

    • tradingeconomics.com
    • tr.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Sep 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States Industrial Production MoM [Dataset]. https://tradingeconomics.com/united-states/industrial-production-mom
    Explore at:
    xml, json, csv, excelAvailable download formats
    Dataset updated
    Sep 16, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Feb 28, 1919 - Aug 31, 2025
    Area covered
    United States
    Description

    Industrial Production in the United States increased 0.10 percent in August of 2025 over the previous month. This dataset provides the latest reported value for - United States Industrial Production MoM - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.

  18. Mental Health in the Tech Industry

    • kaggle.com
    zip
    Updated Jan 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Mental Health in the Tech Industry [Dataset]. https://www.kaggle.com/datasets/thedevastator/osmi-mental-health-survey
    Explore at:
    zip(3427606 bytes)Available download formats
    Dataset updated
    Jan 21, 2023
    Authors
    The Devastator
    Description

    Mental Health in the Tech Industry

    Exploring Mental Health Conditions in the Tech Industry

    By Olga Tsubiks [source]

    About this dataset

    This dataset contains survey responses from the tech industry about mental health, offering an insightful snapshot into the diagnoses, treatments, and attitudes of those in the field towards mental health. These data points allow people to understand more about how their peers in tech view mental health and can provide greater insight into how to better support those who work in this industry. This dataset includes questions on whether or not respondents have had a mental health disorder or sought treatment for a mental health issue in the past, if they currently have been diagnosed with a condition and what it is, their age group, location of work and residence as well as information on whether they are self-employed or working at a tech company with other questions. Additionally, this dataset also provides insight into respondents' attitudes towards speaking openly about their mental wellbeing versus physical wellbeing. To gain even more understanding of individual's experiences within their place of business overall employee count is included as well what role they fill within that organisation is related to technology/IT. This valuable data set may be used for medical research furthering our knowledge about workplace stressors effecting people seen within this particular field but also across multiple industries to help create support systems that reflect upon individual need rather than one-size fits all models previously employed by employers through out many parts globally

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    Research Ideas

    • Analyze the correlation between employment industry and mental health status, including self-identified diagnosis, use of mental health services and any history of mental illness in the family.
    • Determine if there are differences in how people experience and speak out about their own mental health based on geographic location.
    • Compare attitudes towards open conversations on physical vs mental health within different age groups both in the U.S. and abroad

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

    Columns

    File: OSMI_Survey_Data.csv | Column name | Description | |:-----------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------| | Are you selfemployed | Indicates whether the respondent is self-employed or not. (Boolean) | | How many employees does your company or organization have | Indicates the number of employees in the respondent's company or organization. (Numeric) | | Is your employer primarily a tech companyorganization | Indicates whether the respondent's employer is primarily a tech company or organization. (Boolean) | | Is your primary role within your company related to techIT | Indicates whether the respondent's primary role within their company is related to tech or IT. (Boolean) | | Do you have previous employers | Indicates whether the respondent has had previous employers. (Boolean) ...

  19. m

    Miscellaneous Manufacturing Industries - Price Series

    • macro-rankings.com
    csv, excel
    Updated Jul 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Miscellaneous Manufacturing Industries - Price Series [Dataset]. https://www.macro-rankings.com/industries/mg-miscellaneous-manufacturing-industries
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Jul 2, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    united states
    Description

    Industry Time Series for Miscellaneous Manufacturing Industries. The frequency of the observation is daily. Moving average series are also typically included. This major group includes establishments primarily engaged in manufacturing products not classified in any other manufacturing major group.

  20. d

    Employment in The ICT Industry - Dataset - MAMPU

    • archive.data.gov.my
    Updated Nov 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Employment in The ICT Industry - Dataset - MAMPU [Dataset]. https://archive.data.gov.my/data/dataset/employment-in-the-ict-industry
    Explore at:
    Dataset updated
    Nov 14, 2022
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset shows the Employment in The ICT Industry, 2005 - 2021 value below are estimate Base year Year 2005 2012 2010 2016 value below are preliminary Base year Year 2005 2013 2010 2017 No. of Views : 67

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
The Devastator (2023). US Industry Data by State, by Industry [Dataset]. https://www.kaggle.com/datasets/thedevastator/2012-us-industry-data-by-state-by-industry
Organization logo

US Industry Data by State, by Industry

Number of Establishments, Sales, Payroll, and Employees

Explore at:
zip(53066 bytes)Available download formats
Dataset updated
Jan 15, 2023
Authors
The Devastator
Area covered
United States
Description

US Industry Data by State, by Industry

Number of Establishments, Sales, Payroll, and Employees

By Gary Hoover [source]

About this dataset

This data set provides a detailed look into the US economy. It includes information on establishments and nonemployer businesses, as well as sales revenue, payrolls, and the number of employees. Gleaned from the Economic Census done every five years, this data is a valuable resource to anyone curious about where the nation was economically at the time. With columns including geographic area name, North American Industry Classification System (NAICS) codes for industries, descriptions of those codes meaning of operation or tax status, and annual payroll, this information-rich dataset contains all you need to track economic trends over time. Whether you’re a researcher studying industry patterns or an entrepreneur looking for market insight — this dataset has what you’re looking for!

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset provides detailed US industry data by state, including the number of establishments, value of sales, payroll, and number of employees. All the data is based on the North American Industry Classification System (NAICS) code for each specific industry. This will allow you to easily analyze and compare industries across different states or regions.

Research Ideas

  • Analyzing the economic impact of a new business or industry trends in different states: Comparing the change in the number of establishments, payroll, and employees over time can give insight into how a state is affected by a new industry trend or introduction of a new service or product.
  • Estimating customer sales potential for businesses: This dataset can be used to estimate the potential customer base for businesses in different geographic areas. By analyzing total business done by non-employers in an area along with its estimated population can help estimate how much overall sales potential exists for a given region.
  • Tracking competitor performance: By looking at shipments, receipts, and value of business done across industries in different regions or even cities, companies can track their competitors’ performance and compare it to their own to better assess their strategies going forward

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

Columns

File: 2012 Industry Data by Industry and State.csv | Column name | Description | |:----------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------| | Geographic area name | The name of the geographic area the data is for. (String) | | NAICS code | The North American Industry Classification System (NAICS) code for the industry. (String) | | Meaning of NAICS code | The description of the NAICS code. (String) | | Meaning of Type of operation or tax status code | The description of the type of operation or tax status code. (String) ...

Search
Clear search
Close search
Google apps
Main menu