Facebook
TwitterThis dataset was created by SHAMANTH
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Santhosh Kumar
Released under Apache 2.0
Facebook
TwitterThis dataset was created by Rohan Kayan
Released under Other (specified in description)
Facebook
TwitterThis Dataset indicates average salary by position title and grade for full-time regular employees. Data excludes elected, appointed, non-merit and temporary employees. Underfilled positions are also excluded from the dataset. Update Frequency : Annually
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Annual salary information including gross pay and overtime pay for all active, permanent employees of Montgomery County, MD paid in calendar year 2023. This dataset is a prime candidate for conducting analyses on salary disparities, the relationship between department/division and salary, and the distribution of salaries across gender and grade levels.
Statistical models can be applied to predict base salaries based on factors such as department, grade, and length of service. Machine learning techniques could also be employed to identify patterns and anomalies in the salary data, such as outliers or instances of significant inequity.
Some analysis to be performed with this dataset can include:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The ATO (Australian Tax Office) made a dataset openly available (see links) showing all the Australian Salary and Wages (2002, 2006, 2010, 2014) by detailed occupation (around 1,000) and over 100 SA4 regions. Sole Trader sales and earnings are also provided. This open data (csv) is now packaged into a database (*.sql) with 45 sample SQL queries (backupSQL[date]_public.txt).See more description at related Figshare #datavis record. Versions:V5: Following #datascience course, I have made main data (individual salary and wages) available as csv and Jupyter Notebook. Checksum matches #dataTotals. In 209,xxx rows.Also provided Jobs, and SA4(Locations) description files as csv. More details at: Where are jobs growing/shrinking? Figshare DOI: 4056282 (linked below). Noted 1% discrepancy ($6B) in 2010 wages total - to follow up.#dataTotals - Salary and WagesYearWorkers (M)Earnings ($B) 20028.528520069.4372201010.2481201410.3584#dataTotal - Sole TradersYearWorkers (M)Sales ($B)Earnings ($B)20020.9611320061.0881920101.11122620141.19630#links See ATO request for data at ideascale link below.See original csv open data set (CC-BY) at data.gov.au link below.This database was used to create maps of change in regional employment - see Figshare link below (m9.figshare.4056282).#packageThis file package contains a database (analysing the open data) in SQL package and sample SQL text, interrogating the DB. DB name: test. There are 20 queries relating to Salary and Wages.#analysisThe database was analysed and outputs provided on Nectar(.org.au) resources at: http://118.138.240.130.(offline)This is only resourced for max 1 year, from July 2016, so will expire in June 2017. Hence the filing here. The sample home page is provided here (and pdf), but not all the supporting files, which may be packaged and added later. Until then all files are available at the Nectar URL. Nectar URL now offline - server files attached as package (html_backup[date].zip), including php scripts, html, csv, jpegs.#installIMPORT: DB SQL dump e.g. test_2016-12-20.sql (14.8Mb)1.Started MAMP on OSX.1.1 Go to PhpMyAdmin2. New Database: 3. Import: Choose file: test_2016-12-20.sql -> Go (about 15-20 seconds on MacBookPro 16Gb, 2.3 Ghz i5)4. four tables appeared: jobTitles 3,208 rows | salaryWages 209,697 rows | soleTrader 97,209 rows | stateNames 9 rowsplus views e.g. deltahair, Industrycodes, states5. Run test query under **#; Sum of Salary by SA4 e.g. 101 $4.7B, 102 $6.9B#sampleSQLselect sa4,(select sum(count) from salaryWageswhere year = '2014' and sa4 = sw.sa4) as thisYr14,(select sum(count) from salaryWageswhere year = '2010' and sa4 = sw.sa4) as thisYr10,(select sum(count) from salaryWageswhere year = '2006' and sa4 = sw.sa4) as thisYr06,(select sum(count) from salaryWageswhere year = '2002' and sa4 = sw.sa4) as thisYr02from salaryWages swgroup by sa4order by sa4
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
In the rapidly evolving field of data science, understanding the trends and patterns in salaries is crucial for professionals and organizations alike. This dataset aims to shed light on the landscape of Data Science Salaries from 2020 to 2024. By analyzing salary data over this period, data enthusiasts, researchers, and industry professionals can gain valuable insights into salary trends, regional variations, and potential factors influencing compensation within the data science community.
The dataset encompasses a comprehensive collection of data science salary information, covering a span of five years from 2020 to 2024. The data includes various aspects related to salaries, providing a multifaceted view of compensation in the field.
This dataset (data_science_salaries) covering from 2020 up to 2024 includes the following columns:
| Column Name | Description |
|---|---|
job_title | The job title or role associated with the reported salary. |
experience_level | The level of experience of the individual. |
employment_type | Indicates whether the employment is full-time, part-time, etc. |
work_models | Describes different working models (remote, on-site, hybrid). |
work_year | The specific year in which the salary information was recorded. |
employee_residence | The residence location of the employee. |
salary | The reported salary in the original currency. |
salary_currency | The currency in which the salary is denominated. |
salary_in_usd | The converted salary in US dollars. |
company_location | The geographic location of the employing organization. |
company_size | The size of the company, categorized by the number of employees. |
The primary dataset was retrieved from the ai-jobs.net. I sincerely thank the team for providing the core data used in this dataset.
© Image credit: Freepik
Facebook
TwitterProblem Description: Develop a salary prediction system based on the given dataset.
Data supplied: You are given two data files in CSV: • train_features.csv: Each row represents the metadata for an individual job posting. The “jobId” column represents a unique identifier for the job posting. The remaining columns describe the features of the job posting. • train_salaries.csv: Each row associates a “jobId” with a “salary”. The first row of each file contains headers for the columns. Keep in that the metadata and salary data were crawled from the internet. As such, it’s possible that the data is dirty (it may contain errors).
Questions 1. What steps did you take to prepare the data for the project? Was any cleaning necessary? 2. What algorithmic method did you apply? Why? What other methods did you consider? 3. Describe how the algorithmic method that you chose works? 4. What features did you use? Why? 5. How did you train your model? During training, what issues concerned you? 6. How did you assess the accuracy of your predictions? Why did you choose that method? Would you consider any alternative approaches for assessing accuracy? 7. Which features had the most significant impact on salary? How did you identify these to be most significant? Which features had the least impact on salary? How did you identify these?
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This data set represents the data analyzed and discussed in a research article for the Journal of Altmetrics (JOA) of the same title/name. However, the original survey project was commissioned by Virginia Tech Faculty Senate to assess faculty perceptions of research assessment and salary considerations at Virginia Tech. The project was overseen by the Faculty Senate Research Assessment Committee and the resulting report was submitted to Faculty Senate and presented to the Virginia Tech Board of Visitors at their June 2019 meeting. That report can be found at https://bov.vt.edu/assets/Attachment II_Constituent Reports_June 2019.pdf, pages 12-118 (includes survey questions). The data that are available in this data set represent data that were analyzed and included in the JOA research article. Certain data, including responses to questions about salaries, are not included in this data set, because they were not a part of the analysis for the JOA publication. This data set includes four files: the survey instrument (all questions, PDF format), select quantitative responses (.csv), select qualitative responses (.csv), and the codebook for the questions in the response files (.csv). All response data have been anonymized; please see each file for more details. Racial data and departmental data are eliminated to ensure anonymity. The survey was submitted to the Institutional Review Board of Virginia Tech and was determined to not be research involving human subjects as defined by HHS and FDA regulations, resulting in approval to broadly distribute the survey to Virginia Tech faculty with expectations that responses are kept anonymous and results do not claim to be generalizable knowledge (IRB reference number 19-234). The authors would like to thank Dr. Ivica Ico Bukvic, who provided us with a survey instrument used within the School of Performing Arts (SOPA) at Virginia Tech to determine the types of research and creative works SOPA faculty produce and which indicators they prefer for scholarly evaluation; this project's survey instrument was based in-part on the SOPA survey instrument.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Graduate nominal salaries for those of working age and the young population by gender and industry in 2023.
Facebook
TwitterThis dataset is a listing of all active City of Chicago employees, complete with full names, departments, positions, employment status (part-time or full-time), frequency of hourly employee –where applicable—and annual salaries or hourly rate. Please note that "active" has a specific meaning for Human Resources purposes and will sometimes exclude employees on certain types of temporary leave. For hourly employees, the City is providing the hourly rate and frequency of hourly employees (40, 35, 20 and 10) to allow dataset users to estimate annual wages for hourly employees. Please note that annual wages will vary by employee, depending on number of hours worked and seasonal status. For information on the positions and related salaries detailed in the annual budgets, see https://www.cityofchicago.org/city/en/depts/obm.html
Data Disclosure Exemptions: Information disclosed in this dataset is subject to FOIA Exemption Act, 5 ILCS 140/7 (Link:https://www.ilga.gov/legislation/ilcs/documents/000501400K7.htm)
Facebook
TwitterAs part of its commitment to greater transparency, the Government will publish salary information for the Senior Civil Service and organograms for all government departments.
The information provides a snapshot of the department as at 30 September 2012. The Cabinet Office is presenting the information in a standardised format and so our organogram does not always reflect our new departmental structure. *
What we are publishing:
Datasets
*The exercise is concerned with providing a snapshot of the department as at 30 September 2012 and as such; this information does not necessarily reflect our current make-up. The information will also be published on the Cabinet Office website. (DCMS will publish the information on its website and data.gov.uk, after the initial publication has taken place).
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This file contains detailed information about data professionals, including their salaries, designations, departments, and more. The data can be used for salary prediction, trend analysis, and HR analytics.
FIRST NAME: First name of the data professional (String)
LAST NAME: Last name of the data professional (String)
SEX: Gender of the data professional (String: 'F' for Female, 'M' for Male)
DOJ (Date of Joining): The date when the data professional joined the company (Date in MM/DD/YYYY format)
CURRENT DATE: The current date or the snapshot date of the data (Date in MM/DD/YYYY format)
DESIGNATION: The job role or designation of the data professional (String: e.g., Analyst, Senior Analyst, Manager)
AGE: Age of the data professional (Integer)
SALARY: Annual salary of the data professional (Float)
UNIT: Business unit or department the data professional works in (String: e.g., IT, Finance, Marketing)
LEAVES USED: Number of leaves used by the data professional (Integer)
LEAVES REMAINING: Number of leaves remaining for the data professional (Integer)
RATINGS: Performance ratings of the data professional (Float)
PAST EXP: Past work experience in years before joining the current company (Float)
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is accessed from https://www.kaggle.com/jessemostipak/college-tuition-diversity-and-pay and was downloaded on August 4, 2021.
The following excerpt is from Kaggle regarding the sources of this dataset:
The data this week comes from many different sources but originally came from the US Department of Education.
Tuition and fees by college/university for 2018-2019, along with school type, degree length, state, in-state vs out-of-state from the Chronicle of Higher Education. Diversity by college/university for 2014, along with school type, degree length, state, in-state vs out-of-state from the Chronicle of Higher Education. Example diversity graphics from Priceonomics. Average net cost by income bracket from TuitionTracker.org. Example price trend and graduation rates from TuitionTracker.org Salary potential data comes from payscale.com.
This dataset included the following files:
diversity_school.csv
historical_tuition.csv
salary_potential.csv
tuition_cost.csv
tuition_income.csv
After data cleaning, the data in diversity_school.csv and tuition_cost.csv were merged and the data in salary_potential.csv and tuition_income.csv were merged. The combined datasets were then split based on the US Census Regions into West, Midwest, Northeast and South (https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf).
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Median nominal and real salaries by different demographics time series 2007 - 2022(By gender, age group, and graduate type)
Facebook
TwitterIntroducing Job Posting Datasets: Uncover labor market insights!
Elevate your recruitment strategies, forecast future labor industry trends, and unearth investment opportunities with Job Posting Datasets.
Job Posting Datasets Source:
Indeed: Access datasets from Indeed, a leading employment website known for its comprehensive job listings.
Glassdoor: Receive ready-to-use employee reviews, salary ranges, and job openings from Glassdoor.
StackShare: Access StackShare datasets to make data-driven technology decisions.
Job Posting Datasets provide meticulously acquired and parsed data, freeing you to focus on analysis. You'll receive clean, structured, ready-to-use job posting data, including job titles, company names, seniority levels, industries, locations, salaries, and employment types.
Choose your preferred dataset delivery options for convenience:
Receive datasets in various formats, including CSV, JSON, and more. Opt for storage solutions such as AWS S3, Google Cloud Storage, and more. Customize data delivery frequencies, whether one-time or per your agreed schedule.
Why Choose Oxylabs Job Posting Datasets:
Fresh and accurate data: Access clean and structured job posting datasets collected by our seasoned web scraping professionals, enabling you to dive into analysis.
Time and resource savings: Focus on data analysis and your core business objectives while we efficiently handle the data extraction process cost-effectively.
Customized solutions: Tailor our approach to your business needs, ensuring your goals are met.
Legal compliance: Partner with a trusted leader in ethical data collection. Oxylabs is a founding member of the Ethical Web Data Collection Initiative, aligning with GDPR and CCPA best practices.
Pricing Options:
Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.
Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.
Experience a seamless journey with Oxylabs:
Effortlessly access fresh job posting data with Oxylabs Job Posting Datasets.
Facebook
TwitterInformation on posts occupied by:
Also included are details of the salary and responsibility attached to each senior post (where available) and the estimated cost of the teams supporting the delivery of these responsibilities. The estimated team cost uses the mid-point of the salary range and the mid-point of the three locational pay ranges the department has for its junior posts in the team.
This information can also be viewed as an http://reference.data.gov.uk/gov-structure/organogram/?dept=dh">organogram on the data.gov.uk website.
Note: the organogram can be viewed in the most up-to-date browsers (Internet Explorer 8 or above), but will not display clearly in older browsers. CSV files can be read by most spreadsheets, word processors and text editors.
Facebook
TwitterDfE salary data and organograms showing the costs associated with each of our directorates. We update and republish the data quarterly.
The latest files include:
DfE’s organisation and costs are also available as a series of organograms on the https://www.data.gov.uk/dataset/5a1f3831-86d6-4979-9164-99e982361ca4/organogram-department-for-education">data.gov.uk site.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Is the Gini Coefficient Enough? A Microeconomic Data Decomposition StudyIvan Skliarov, Lukasz Goczek (2023).List of data files:1. theil_raw.csv - data obtained from LISSY using the lis_theil.R script.*2. scv_raw.csv - data obtained from LISSY using the scv_theil.R script.*3. hdi.csv - Human Development Index and its components.4. gini.csv - Gini coefficient from SWIID 9.4.5. wdi.csv - World Development Indicators from the World Bank.6. wgi.csv - World Governance Indicators from the World Bank.7. govcon.csv - government consumption (% of GDP) from UNCTAD.8. theil_fin.csv - final dataset (1, 3-7 combined), which is used in lis_analysis.do.9. scv_fin.csv - final dataset (2-7 combined), which is used in lis_analysis.do.10. indexes.csv - only within and between-cohort components of the Theil index and SCV with imputed values (see lis_analysis.do) for Georgia and Lithuania, which is used in lis_plot.R. * LISSY is the remote-execution system allowing access to the Luxembourg Income Study database: https://www.lisdatacenter.org/data-access/lissy/.For questions about this research please contact:Ivan Skliarov, MA: Faculty of Economic Sciences, University of Warsaw, Poland, Długa 44/50, Warsaw 00-241, Poland, i.skliarov@student.uw.edu.pl.Lukasz Goczek, PhD: Faculty of Economic Sciences, University of Warsaw, Poland, Długa 44/50, Warsaw 00-241, Poland, lgoczek@wne.uw.edu.pl.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Organogram (organisation chart) showing all staff roles. Names and salaries are also listed for senior civil servants. Organogram data is released by all central government departments and their agencies since 2010. Snapshots for 31 March and 30 September are published by 6 June and 6 December each year. The published data is validated and released in CSV format and under Open Government Licence (OGL) for reuse. Not all staff are listed within a Senior Civil Service (SCS1) deputy director led team. These staff are listed on the final row of the transparency return. Staff may not have an allocated SCS1 team lead due to a number of reasons, which could include the deputy director post being vacant, the member of staff being new, their record being processed, and so on. Additionally, some staff have a nil salary due to being in unpaid positions or being 'free loans' from other government departments.
Facebook
TwitterThis dataset was created by SHAMANTH