Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides anonymized sample employee records commonly found in HR information systems. It includes details such as employee ID, name, job title, department, business unit, gender, ethnicity, age, hire date, and annual salary. It is ideal for educational projects, algorithm demonstrations (such as B-tree implementation), HR analytics exploration, salary-related analysis examples, and more.
Facebook
TwitterThe 10,000 Worlds Employee Dataset is a comprehensive dataset designed for analyzing workforce trends, employee performance, and organizational dynamics within a large-scale company setting. This dataset contains information on 10,000 employees, spanning various departments, roles, and experience levels. It is ideal for research in human resource analytics, machine learning applications in employee retention, performance prediction, and diversity analysis.
Key Features of the Dataset: Employee Demographics:
Age, gender, ethnicity Education level, degree specialization Years of experience Employment Details:
Department (e.g., HR, Engineering, Marketing) Job title and seniority level Employment type (full-time, part-time, contract) Performance & Productivity Metrics:
Annual performance ratings Work hours, overtime details Training programs attended Compensation & Benefits:
Salary, bonuses, stock options Benefits (healthcare, pension plans, remote work options) Employee Engagement & Retention:
Job satisfaction scores Attrition and turnover rates Promotion history and career growth Workplace Environment Factors:
Team collaboration metrics Employee feedback and survey results Work-life balance indicators Use Cases: HR Analytics: Identifying patterns in employee satisfaction, retention, and performance. Predictive Modeling: Forecasting attrition risks and promotion likelihoods. Diversity & Inclusion Analysis: Understanding representation across departments. Compensation Benchmarking: Comparing salaries and benefits within and across industries. This dataset is highly valuable for data scientists, HR professionals, and business analysts looking to gain insights into workforce dynamics and improve organizational strategies.
Would you like any additional details or a sample schema for the dataset?
Facebook
Twitter➡️ You can choose from multiple data formats, delivery frequency options, and delivery methods;
➡️ You can select raw or clean and AI-enriched datasets;
➡️ Multiple APIs designed for effortless search and enrichment (accessible using a user-friendly self-service tool);
➡️ Fresh data: daily updates, easy change tracking with dedicated data fields, and a constant flow of new data;
➡️ You get all necessary resources for evaluating our data: a free consultation, a data sample, or free credits for testing our APIs.
Coresignal's employee data enables you to create and improve innovative data-driven solutions and extract actionable business insights. These datasets are popular among companies from different industries, including HR and sales technology and investment.
Employee Data use cases:
✅ Source best-fit talent for your recruitment needs
Coresignal's Employee Data can help source the best-fit talent for your recruitment needs by providing the most up-to-date information on qualified candidates globally.
✅ Fuel your lead generation pipeline
Enhance lead generation with 712M+ up-to-date employee records from the largest professional network. Our Employee Data can help you develop a qualified list of potential clients and enrich your own database.
✅ Analyze talent for investment opportunities
Employee Data can help you generate actionable signals and identify new investment opportunities earlier than competitors or perform deeper analysis of companies you're interested in.
➡️ Why 400+ data-powered businesses choose Coresignal:
Facebook
TwitterCreating a robust employee dataset for data analysis and visualization involves several key fields that capture different aspects of an employee's information. Here's a list of fields you might consider including: Employee ID: A unique identifier for each employee. Name: First name and last name of the employee. Gender: Male, female, non-binary, etc. Date of Birth: Birthdate of the employee. Email Address: Contact email of the employee. Phone Number: Contact number of the employee. Address: Home or work address of the employee. Department: The department the employee belongs to (e.g., HR, Marketing, Engineering, etc.). Job Title: The specific job title of the employee. Manager ID: ID of the employee's manager. Hire Date: Date when the employee was hired. Salary: Employee's salary or compensation. Employment Status: Full-time, part-time, contractor, etc. Employee Type: Regular, temporary, contract, etc. Education Level: Highest level of education attained by the employee. Certifications: Any relevant certifications the employee holds. Skills: Specific skills or expertise possessed by the employee. Performance Ratings: Ratings or evaluations of employee performance. Work Experience: Previous work experience of the employee. Benefits Enrollment: Information on benefits chosen by the employee (e.g., healthcare plan, retirement plan, etc.). Work Location: Physical location where the employee works. Work Hours: Regular working hours or shifts of the employee. Employee Status: Active, on leave, terminated, etc. Emergency Contact: Contact information of the employee's emergency contact person. Employee Satisfaction Survey Responses: Data from employee satisfaction surveys, if applicable.
Code Url: https://github.com/intellisenseCodez/faker-data-generator
Facebook
TwitterThe documentation covers Enterprise Survey panel datasets that were collected in Slovenia in 2009, 2013 and 2019.
The Slovenia ES 2009 was conducted between 2008 and 2009. The Slovenia ES 2013 was conducted between March 2013 and September 2013. Finally, the Slovenia ES 2019 was conducted between December 2018 and November 2019. The objective of the Enterprise Survey is to gain an understanding of what firms experience in the private sector.
As part of its strategic goal of building a climate for investment, job creation, and sustainable growth, the World Bank has promoted improving the business environment as a key strategy for development, which has led to a systematic effort in collecting enterprise data across countries. The Enterprise Surveys (ES) are an ongoing World Bank project in collecting both objective data based on firms' experiences and enterprises' perception of the environment in which they operate.
National
The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must take its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.
As it is standard for the ES, the Slovenia ES was based on the following size stratification: small (5 to 19 employees), medium (20 to 99 employees), and large (100 or more employees).
Sample survey data [ssd]
The sample for Slovenia ES 2009, 2013, 2019 were selected using stratified random sampling, following the methodology explained in the Sampling Manual for Slovenia 2009 ES and for Slovenia 2013 ES, and in the Sampling Note for 2019 Slovenia ES.
Three levels of stratification were used in this country: industry, establishment size, and oblast (region). The original sample designs with specific information of the industries and regions chosen are included in the attached Excel file (Sampling Report.xls.) for Slovenia 2009 ES. For Slovenia 2013 and 2019 ES, specific information of the industries and regions chosen is described in the "The Slovenia 2013 Enterprise Surveys Data Set" and "The Slovenia 2019 Enterprise Surveys Data Set" reports respectively, Appendix E.
For the Slovenia 2009 ES, industry stratification was designed in the way that follows: the universe was stratified into manufacturing industries, services industries, and one residual (core) sector as defined in the sampling manual. Each industry had a target of 90 interviews. For the manufacturing industries sample sizes were inflated by about 17% to account for potential non-response cases when requesting sensitive financial data and also because of likely attrition in future surveys that would affect the construction of a panel. For the other industries (residuals) sample sizes were inflated by about 12% to account for under sampling in firms in service industries.
For Slovenia 2013 ES, industry stratification was designed in the way that follows: the universe was stratified into one manufacturing industry, and two service industries (retail, and other services).
Finally, for Slovenia 2019 ES, three levels of stratification were used in this country: industry, establishment size, and region. The original sample design with specific information of the industries and regions chosen is described in "The Slovenia 2019 Enterprise Surveys Data Set" report, Appendix C. Industry stratification was done as follows: Manufacturing – combining all the relevant activities (ISIC Rev. 4.0 codes 10-33), Retail (ISIC 47), and Other Services (ISIC 41-43, 45, 46, 49-53, 55, 56, 58, 61, 62, 79, 95).
For Slovenia 2009 and 2013 ES, size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.
For Slovenia 2009 ES, regional stratification was defined in 2 regions. These regions are Vzhodna Slovenija and Zahodna Slovenija. The Slovenia sample contains panel data. The wave 1 panel “Investment Climate Private Enterprise Survey implemented in Slovenia” consisted of 223 establishments interviewed in 2005. A total of 57 establishments have been re-interviewed in the 2008 Business Environment and Enterprise Performance Survey.
For Slovenia 2013 ES, regional stratification was defined in 2 regions (city and the surrounding business area) throughout Slovenia.
Finally, for Slovenia 2019 ES, regional stratification was done across two regions: Eastern Slovenia (NUTS code SI03) and Western Slovenia (SI04).
Computer Assisted Personal Interview [capi]
Questionnaires have common questions (core module) and respectfully additional manufacturing- and services-specific questions. The eligible manufacturing industries have been surveyed using the Manufacturing questionnaire (includes the core module, plus manufacturing specific questions). Retail firms have been interviewed using the Services questionnaire (includes the core module plus retail specific questions) and the residual eligible services have been covered using the Services questionnaire (includes the core module). Each variation of the questionnaire is identified by the index variable, a0.
Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.
Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as (-8). b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary. However, there were clear cases of low response.
For 2009 and 2013 Slovenia ES, the survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Up to 4 attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals. Further research is needed on survey non-response in the Enterprise Surveys regarding potential introduction of bias.
For 2009, the number of contacted establishments per realized interview was 6.18. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The relatively low ratio of contacted establishments per realized interview (6.18) suggests that the main source of error in estimates in the Slovenia may be selection bias and not frame inaccuracy.
For 2013, the number of realized interviews per contacted establishment was 25%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 44%.
Finally, for 2019, the number of interviews per contacted establishments was 9.7%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The share of rejections per contact was 75.2%.
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Enhance your workforce insights with comprehensive Employee Dataset, designed to help businesses improve recruitment strategies, track employment trends, and optimize workforce planning. This dataset provides structured and reliable employee data for HR professionals, recruiters, and analysts.
Dataset Features
Employee Profiles: Access detailed public employee data, including names, job titles, industries, locations, experience, and skills. Ideal for talent acquisition, workforce analytics, and competitive hiring strategies. Company Employment Data: Gain insights into company workforce distribution, employee tenure, hiring trends, and organizational structures. Useful for market research, HR benchmarking, and business intelligence. Job Listings & Open Positions: Track job postings, employment trends, and hiring patterns across industries. This data includes job titles, company names, locations, salary ranges, and job descriptions.
Customizable Subsets for Specific Needs Our Employee Dataset is fully customizable, allowing you to filter data based on industry, location, job role, or company size. Whether you need a broad dataset for market analysis or a focused subset for recruitment purposes, we tailor the dataset to your specific needs.
Popular Use Cases
Recruitment & Talent Sourcing: Identify top talent, analyze hiring trends, and enhance recruitment strategies with up-to-date employee data. HR Analytics & Workforce Planning: Optimize workforce management by tracking employee movement, industry hiring patterns, and job market trends. Competitive Intelligence: Monitor hiring activity, employee retention rates, and workforce distribution to gain insights into competitors’ strategies. Market Research & Business Expansion: Analyze employment trends to identify growth opportunities, emerging job markets, and industry shifts. AI & Predictive Analytics: Leverage structured employee data for AI-driven workforce predictions, job market forecasting, and HR automation.
Whether you're looking to improve recruitment, analyze workforce trends, or gain competitive insights, our Employee Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.
Facebook
TwitterDownload Employee Travel Excel SheetThis dataset contains information about the employee travel expenses for the year 2020. Details are provided on the employee (name, title, department), the travel (dates, location, purpose) and the cost (expenses, recoveries). Expenses are broken down in separate tabs by Quarter (Q1, Q2, Q3 and Q4). Updated quarterly when expenses are prepared. Expenses for other years are available in separate datasets.
Facebook
TwitterThe HR dataset is a collection of employee data that includes information on various factors that may impact employee performance. To explore the employee performance factors using Python, we begin by importing the necessary libraries such as Pandas, NumPy, and Matplotlib, then load the HR dataset into a Pandas DataFrame and perform basic data cleaning and preprocessing steps such as handling missing values and checking for duplicates.
The dataset also use various data visualization to explore the relationships between different variables and employee performance. For example, scatterplots to examine the relationship between job satisfaction and performance ratings, or bar charts to compare the average performance ratings across different gender or positions.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The Employee Satisfaction Survey dataset is a comprehensive collection of information regarding employees within a company. It includes essential details such as employee identification numbers, self-reported satisfaction levels, performance evaluations, project involvement, work hours, tenure with the company, work accidents, promotions received in the last 5 years, departmental affiliations, and salary levels. This dataset offers valuable insights into the factors influencing employee satisfaction and can be used to analyze and understand various aspects of the workplace environment.
Facebook
TwitterNational Compensation Survey - Benefits produces comprehensive data on the incidence (the percentage of workers with access to and participation in employer provided benefit plans) and provisions of selected employee benefit plans. The Employee Benefits Survey (EBS) is an annual survey of the incidence and provisions of selected benefits provided by employers. The data are presented as a percentage of employees who participate in a certain benefit, or as an average benefit provision (for example, the average number of paid holidays provided to employees per year). The survey covers paid leave benefits such as holidays and vacations, and person, funeral, jury duty, military, parental, and sick leave; sickness and accident, long-term disability, and life insurance; medical, dental, and vision care plans; defined benefit pension and defined contribution plans; flexible benefits plans; reimbursement accounts; and unpaid parental leave. Also, data are tabulated on the incidence of several other benefits, such as severance pay, child-care assistance, wellness programs, and employee assistance programs. For more information and data visit: https://www.bls.gov/ebs/
Facebook
TwitterList of Arlington County employees in permanent positions at the start of the current Fiscal Year. Data includes demographic information as well as the employee’s Department and Job Title.
Facebook
TwitterThe OpenWeb Ninja Glassdoor Data API provides real-time access to extensive company data and employer reviews data from Glassdoor.
Key company data points included in the dataset: Name, Rating, Website, Salary and Job counts, Company size, Revenue, Stock, Competitors, Awards won, and 30+ more data points.
Key employer review data points included in the dataset: Review summary, Pros / Cons, Employee status, Location, Work-Life balance, CEO rating, and 20+ more data points.
OpenWeb Ninja's Glassdoor Data API Stats & Capabilities: - 2M+ Companies/Employers - 80M+ Employee Reviews - 30+ company data points - 20+ review data points - Company search capability
OpenWeb Ninja's Glassdoor Data API common use cases: - Investors and Market Analysts - Market and Industry Trends - Competitive Analysis - Company Insights
Facebook
TwitterIntroducing Job Posting Datasets: Uncover labor market insights!
Elevate your recruitment strategies, forecast future labor industry trends, and unearth investment opportunities with Job Posting Datasets.
Job Posting Datasets Source:
Indeed: Access datasets from Indeed, a leading employment website known for its comprehensive job listings.
Glassdoor: Receive ready-to-use employee reviews, salary ranges, and job openings from Glassdoor.
StackShare: Access StackShare datasets to make data-driven technology decisions.
Job Posting Datasets provide meticulously acquired and parsed data, freeing you to focus on analysis. You'll receive clean, structured, ready-to-use job posting data, including job titles, company names, seniority levels, industries, locations, salaries, and employment types.
Choose your preferred dataset delivery options for convenience:
Receive datasets in various formats, including CSV, JSON, and more. Opt for storage solutions such as AWS S3, Google Cloud Storage, and more. Customize data delivery frequencies, whether one-time or per your agreed schedule.
Why Choose Oxylabs Job Posting Datasets:
Fresh and accurate data: Access clean and structured job posting datasets collected by our seasoned web scraping professionals, enabling you to dive into analysis.
Time and resource savings: Focus on data analysis and your core business objectives while we efficiently handle the data extraction process cost-effectively.
Customized solutions: Tailor our approach to your business needs, ensuring your goals are met.
Legal compliance: Partner with a trusted leader in ethical data collection. Oxylabs is a founding member of the Ethical Web Data Collection Initiative, aligning with GDPR and CCPA best practices.
Pricing Options:
Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.
Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.
Experience a seamless journey with Oxylabs:
Effortlessly access fresh job posting data with Oxylabs Job Posting Datasets.
Facebook
TwitterPredictLeads Job Openings Data provides real-time hiring insights sourced directly from company websites, ensuring the highest level of accuracy and freshness. Unlike job boards that rely on aggregated listings, our dataset delivers unmatched granularity on job postings, salary trends, and workforce demand - making it a powerful tool for HR, talent acquisition, and market analysis.
Use Cases: ✅ Job Boards Enhancement – Improve job listings with, high-quality postings. ✅ HR Consulting – Analyze hiring trends to guide workforce planning strategies. ✅ Employment Analytics – Track job market shifts, salary benchmarks, and demand for skills. ✅ HR Operations – Optimize recruitment pipelines with direct employer-sourced data. ✅ Competitive Intelligence – Monitor hiring activities of competitors for strategic insights.
Key API Attributes:
PredictLeads Docs: https://docs.predictleads.com/v3/guide/job_openings_dataset
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains salary and bonus details of employees across various roles and industries. It includes key attributes such as base salary, annual bonus, job position, experience level, and department. The data can be used for data science projects related to salary prediction, employee compensation analysis, and financial insights. By exploring trends, correlations, and patterns, users can gain valuable insights into salary distributions and bonus structures. The dataset is suitable for machine learning applications such as regression modeling and classification. It is ideal for students, researchers, and professionals looking to analyze financial compensation trends in the workforce.
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Capture/store/manage end-user work activities at designated times as needed for DOWS sampling. The end-user, e.g., CR, receives a desktop alert when a DOWS sample is requested. User selects appropriate DOWS category from drop-down menu and submits the information. Manager reviews end-user submitted data, adds non-reporter data, and submits report.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
🧑💼 Employee Performance and Salary Dataset
This synthetic dataset simulates employee information in a medium-sized organization, designed specifically for data preprocessing and exploratory data analysis (EDA) tasks in Data Mining and Machine Learning labs.
It includes over 1,000 employee records with realistic variations in age, gender, department, experience, performance score, and salary — along with missing values, duplicates, and outliers to mimic real-world data quality issues.
| Column Name | Description |
|---|---|
| Employee_ID | Unique employee identifier (E0001, E0002, …) |
| Age | Employee age (22–60 years) |
| Gender | Gender of the employee (Male/Female) |
| Department | Department where the employee works (HR, Finance, IT, Marketing, Sales, Operations) |
| Experience_Years | Total years of work experience (contains missing values) |
| Performance_Score | Employee performance score (0–100, contains missing values) |
| Salary | Annual salary in USD (contains outliers) |
Salary → Predict salary based on experience, performance, department, and age. Performance_Score → Predict employee performance based on age, experience, and department.
Predict the employee's salary based on their experience, performance score, and department.
X = ['Age', 'Experience_Years', 'Performance_Score', 'Department', 'Gender'] y = ['Salary']
You can apply:
R², MAE, MSE, RMSE, and residual plots.
Facebook
TwitterFor talent sourcing and HR intelligence
Coresignal’s global Employee Data enables you to make intelligent talent acquisition possible by building or improving AI-based talent-sourcing solutions.
Our Web Data contains information such as employees' interests and activities, recommendations, education, job experience, and more.
Use cases
For lead enrichment
Access to a list of 712M+ employees helps you build the best prospect list. With work experience, location, industry, and connections, among other data points, lead enrichment becomes much more accessible.
Use cases
Combine Employee Data with our Job Postings Data (260M+ records) for improved competitive intelligence and talent analytics.
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Employee payroll data for all Cook County employees excluding Forest Preserves, indicating amount of base salary paid to an employee during the County fiscal quarter. Salaries are paid to employees on a bi-weekly basis.
Any pay period that extended between quarters will be reported to the quarter of the Pay Period End Date. (e.g. If a Pay Period runs 02/21-03/05, that pay period would be reported in the Q2 period, as the end of the pay period falls in March - Q2)
The county fiscal quarters are:
Q1: December - February
Q2: March - May
Q3: June - August
Q4: September - November
The Employee Unique Identifier field is a unique number assigned to each employee for the purpose of this data set, that is not their internal employee ID number, and allows an employee to be identified in the data set over time, in case of a name change or other change. This number will be consistent within the data set, but we reserve the right to regenerate this number over time across the data set.
ISSUE RESOLVED: As of 4/19/2018 there was an issue regarding employee FY2016 and FY2017 payroll in which records were duplicated in the quarterly aggregation, resulting in inflated base pay amounts. Please disregard any data extracted from this dataset prior to the correction date and use this version moving forward.
KNOWN ISSUE: Several records are missing Bureau and Office information. We are working on correcting this and will update the dataset when this issue has been resolved.
For data prior to Fiscal Year 2016, see datasets at https://datacatalog.cookcountyil.gov/browse?tags=payroll
Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. For example:
See the Splitgraph documentation for more information.
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Each year, the City of Boston publishes payroll data for employees. This dataset contains employee names, job details, and earnings information including base salary, overtime, and total compensation for employees of the City.
See the "Payroll Categories" document below for an explanation of what types of earnings are included in each category.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides anonymized sample employee records commonly found in HR information systems. It includes details such as employee ID, name, job title, department, business unit, gender, ethnicity, age, hire date, and annual salary. It is ideal for educational projects, algorithm demonstrations (such as B-tree implementation), HR analytics exploration, salary-related analysis examples, and more.