4 datasets found
  1. d

    Quantifying industry spending on promotional events using Open Payments...

    • search.dataone.org
    • borealisdata.ca
    Updated Jul 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Held, Fabian (2024). Quantifying industry spending on promotional events using Open Payments data: Event classification script [Dataset]. http://doi.org/10.5683/SP3/0KR09P
    Explore at:
    Dataset updated
    Jul 3, 2024
    Dataset provided by
    Borealis
    Authors
    Held, Fabian
    Description

    We conducted a cross-sectional study of the publicly available 2022 Open Payments data to characterize and quantify sponsored events (available for download at: https://www.cms.gov/priorities/key-initiatives/open-payments/data/dataset-downloads). Data sources We downloaded the 2022 dataset ZIP files from the Open Payments website on June 30th, 2023. We included all records for nurse practitioners, clinical nurse specialists, certified registered nurse anesthetists, and certified nurse-midwives (hereafter advanced practiced registered nurses (APRNs)); and allopathic and osteopathic physicians (hereafter, ‘physicians’). To ensure consistency in provider classification, we linked Payments data to the National Plan and Provider Enumeration System data (June 2023) by National Provider Identifier (NPI) and the National Uniform Claim Committee (NUCC) and excluded individuals with an ambiguous provider type. Event-centric analysis of Open Payments records: Creating an event typology We included only payments classified as “food and beverage” to reliably identify distinct sponsored events. We reasoned that food and beverage would be consumed on the same day in the same place, thus assumed that records for food and beverage associated with the same event would share the date of payment and location. We also assumed that the reported value of a food and beverage payment is the total cost of the hospitality divided by the number of attendees, thus grouped payment records with the same amount, rounded to the nearest dollar. Inferring which Open Payment records relate to the same sponsored event requires analytic decisions regarding the selection and representation of variables that define an event. To understand the impact of these choices, we undertook a sensitivity analysis to explore alternative ways to group Open Payments records for food and beverage, to determine how combination of variables, including date (specific date or within the same calendar week), amount (rounded to nearest dollar), and recipient’s state, affected the identification of sponsored events in the Open Payments data set. We chose to define a sponsored event as a cluster of three or more individual payment records for food and beverage (nature of payment) with the following matching Open Payments record variables: • Submitting applicable manufacturer (name) • Product category or therapeutic area • Name of drug or biological or device or medical supply • Recipient state • Total amount of payment (USD, rounded to nearest dollar) • Date of payment (exact) After examining the distribution of the data, we classified events in terms of size (≥20 attendees as “large” and 3-<20 as “small”) and amount per person. We categorized events <$10 as “coffee”, $10-<$30 as “lunch”, $30-<$150 as “dinner”, and ≥$150 as “banquet”.

  2. Employees' Payroll in Los Angeles

    • kaggle.com
    Updated Jun 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DSFelix (2022). Employees' Payroll in Los Angeles [Dataset]. https://www.kaggle.com/datasets/dsfelix/employees-payroll-in-los-angeles/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 23, 2022
    Dataset provided by
    Kaggle
    Authors
    DSFelix
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Los Angeles
    Description

    🧑‍💼 Employees' Payroll in Los Angeles 🧑‍💼


    Payroll of 685,000 Employees in Los Angeles (01-2013 to 06-2022)

    🔐 Dataset File 🔐

    payroll.csv


    ❓ Description ❓

    This dataset contains information about 685,000 Los Angeles's employees .

    The data was extracted between January 1th 2013 and June 317th in 2022.


    📝 Variables 📝

    The file payroll.csv contains a huge table of Los Angeles's employees' information in the following columns:


    • Record_NBR: unique number to identify an employee;
    • Pay_Year: tax year employee was paid. This is not Fiscal Year;
    • Department_NO: department Number in City Payroll System;
    • Department_Title: title of City Department;
    • Job_Class_PGrade: Job Class and Pay Grade;
    • Job_Title: Job Title;
    • Employment_Type: employment type - Full Time, Part Time, or Per Event;
    • Job_Status: employee's job status at the time the data was uploaded;
    • MOU: Memorandum of Understanding;
    • MOU_Title: title of Memorandum of Understanding;
    • Regular_Pay: regular work hours payment;
    • Overtime_Pay: payments attributable to hours worked beyond regular work schedule;
    • All_Other_Pay: any payments other than Regular and Overtime. This includes bonuses, adjustments, and lump sum payouts. Examples of bonuses include Permanent, Longevity, and Temporary Bonuses. Lump Sum Pay includes significant one-time payouts due to retirement, lawsuit settlements, or other adjustments;
    • Total_Pay: sum of regular, overtime and all other payments;
    • City_Retirement_Contributions: estimated payments made by the City towards employee's retirement;
    • Benefit_Pay: city contribution for the employee's health care, dental care, vision care, and life insurance;
    • Gender: gender as self-reported by employee;
    • Ethnicity: ethnicity as self-reported by employee.



    🔐 Dataset File 🔐

    processed-payroll.csv


    ❓ Description ❓

    This dataset consists in the payroll.csv without rows with Null values (all columns) and negative values (REGULAR_PAY, OVERTIME_PAY, ALL_OTHER_PAY, TOTAL_PAY, CITY_RETIREMENT_CONTRIBUTIONS and BENEFIT_PAY columns).


    📝 Variables 📝

    The file processed-payroll.csv contains the same variables of the payroll.csv file.



    🔐 Dataset File 🔐

    train-test-payroll.csv


    ❓ Description ❓

    This dataset consists in the processed-payroll.csv containing just Active and Full-Time Employees, Labelled Categorical Variables and with a few less columns.


    📝 Variables 📝

    • Regular_Pay: regular work hours payment;
    • Pay_Year: tax year employee was paid. This is not Fiscal Year;
    • Department_NO: department Number in City Payroll System;
    • Job_Class_PGrade: Job Class and Pay Grade;
    • Job_Title: Job Title;
    • Overtime_Pay: payments attributable to hours worked beyond regular work schedule;
    • All_Other_Pay: any payments other than Regular and Overtime. This includes bonuses, adjustments, and lump sum payouts. Examples of bonuses include Permanent, Longevity, and Temporary Bonuses. Lump Sum Pay includes significant one-time payouts due to retirement, lawsuit settlements, or other adjustments;
    • City_Retirement_Contributions: estimated payments made by the City towards employee's retirement;
    • Benefit_Pay: city contribution for the employee's health care, dental care, vision care, and life insurance;
    • Gender: gender as self-reported by employee;
    • Ethnicity: ethnicity as self-reported by employee.



    🎉 Acknowledgements 🎉

    Thanks to:

    1. Control Paneel L.A. for providing the data.
  3. CMS Open Payments Dataset 2013

    • kaggle.com
    Updated Nov 17, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Medicare & Medicaid Services (2019). CMS Open Payments Dataset 2013 [Dataset]. https://www.kaggle.com/datasets/cms/cms-open-payments-dataset-2013/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 17, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Centers for Medicare & Medicaid Services
    Description

    Context

    Open Payments is a national disclosure program created by the Affordable Care Act (ACA) and managed by Centers for Medicare & Medicaid Services (CMS). The purpose of the program is to promote transparency into the financial relationships between pharmaceutical and medical device industries, and physicians and teaching hospitals. The financial relationships may include consulting fees, research grants, travel reimbursements, and payments from industry to medical practitioners.

    Content

    There are 3 datasets that represent 3 different payment types:

    • General Payments: Payments not made in connection with a research agreement. This dataset contains 65 variables.

    • Research Payments: Payments made in connection with a research agreement. This dataset contains 166 variables.

    • Physician Ownership or Investment Interest: Information about physicians who hold ownership or investment interest in the manufacturer/GPO or who have an immediate family member holding such interest. This dataset contains 29 variables.

    • Deleted/Removed Records: Contains any deleted/removed records.

    A comprehensive methodology overview and data dictionary for each dataset can be found here.

    Acknowledgements

    The original datasets can be found here.

    Inspiration

    • Using the General Payments dataset, can you determine any trends in the total amount of payment to hospitals and physicians across the medical specialties or by the form/nature of the payments?

    • According to the Research Payments dataset, which area(s) of research or the type of drug/medical device receive the most amount of payment?

  4. Data for study "Factors influencing intention to use BLIK mobile payments by...

    • zenodo.org
    Updated Jul 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oliwia Mizielska; Oliwia Mizielska (2025). Data for study "Factors influencing intention to use BLIK mobile payments by Polish bank customers" [Dataset]. http://doi.org/10.5281/zenodo.16437186
    Explore at:
    Dataset updated
    Jul 26, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Oliwia Mizielska; Oliwia Mizielska
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file contains anonymous responses from 206 participants to a survey on the use of BLIK mobile payments in Poland. The data were collected using an online questionnaire designed to analyze factors influencing the acceptance and usage of mobile payments via the BLIK system. Each row represents a single respondent, and each column corresponds to a specific survey question or demographic variable.

    Data structure:

    • Number of records (rows): 206

    • Number of variables (columns): 27

    • File format: Microsoft Excel (.xlsx)

    • Survey language: Polish

    Data scope:

    The dataset includes responses to questions regarding attitudes toward mobile payments, skills, ease of use, social norms, satisfaction, and behavioral intentions related to BLIK usage. It also contains basic demographic information such as gender, age, education, employment status, place of residence, the way and frequency of using BLIK, and the bank where mobile payments are conducted.

    Example columns:

    • Do you use mobile payments via BLIK?

    • I believe that BLIK mobile payment is useful in everyday life.

    • Using BLIK mobile payment helps me perform tasks more quickly.

    • I have the necessary skills to use BLIK mobile payment.

    • Gender, Age, Education, Employment status, Current place of residence

    • How often do you use BLIK?

    • In which of the following banks do you conduct BLIK mobile payments?

    Response format:

    Most closed-ended questions are measured on a Likert scale (e.g., 1–7). Some questions are categorical or descriptive.

    Potential uses:

    The dataset can be used for research in the fields of electronic banking, fintech, consumer behavior, technology acceptance modeling, and demographic analyses of mobile payment users in Poland.

    Notes:

    The data are fully anonymized and do not contain any information that could identify respondents. If you use these data in publications, please cite the source appropriately.

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Held, Fabian (2024). Quantifying industry spending on promotional events using Open Payments data: Event classification script [Dataset]. http://doi.org/10.5683/SP3/0KR09P

Quantifying industry spending on promotional events using Open Payments data: Event classification script

Explore at:
Dataset updated
Jul 3, 2024
Dataset provided by
Borealis
Authors
Held, Fabian
Description

We conducted a cross-sectional study of the publicly available 2022 Open Payments data to characterize and quantify sponsored events (available for download at: https://www.cms.gov/priorities/key-initiatives/open-payments/data/dataset-downloads). Data sources We downloaded the 2022 dataset ZIP files from the Open Payments website on June 30th, 2023. We included all records for nurse practitioners, clinical nurse specialists, certified registered nurse anesthetists, and certified nurse-midwives (hereafter advanced practiced registered nurses (APRNs)); and allopathic and osteopathic physicians (hereafter, ‘physicians’). To ensure consistency in provider classification, we linked Payments data to the National Plan and Provider Enumeration System data (June 2023) by National Provider Identifier (NPI) and the National Uniform Claim Committee (NUCC) and excluded individuals with an ambiguous provider type. Event-centric analysis of Open Payments records: Creating an event typology We included only payments classified as “food and beverage” to reliably identify distinct sponsored events. We reasoned that food and beverage would be consumed on the same day in the same place, thus assumed that records for food and beverage associated with the same event would share the date of payment and location. We also assumed that the reported value of a food and beverage payment is the total cost of the hospitality divided by the number of attendees, thus grouped payment records with the same amount, rounded to the nearest dollar. Inferring which Open Payment records relate to the same sponsored event requires analytic decisions regarding the selection and representation of variables that define an event. To understand the impact of these choices, we undertook a sensitivity analysis to explore alternative ways to group Open Payments records for food and beverage, to determine how combination of variables, including date (specific date or within the same calendar week), amount (rounded to nearest dollar), and recipient’s state, affected the identification of sponsored events in the Open Payments data set. We chose to define a sponsored event as a cluster of three or more individual payment records for food and beverage (nature of payment) with the following matching Open Payments record variables: • Submitting applicable manufacturer (name) • Product category or therapeutic area • Name of drug or biological or device or medical supply • Recipient state • Total amount of payment (USD, rounded to nearest dollar) • Date of payment (exact) After examining the distribution of the data, we classified events in terms of size (≥20 attendees as “large” and 3-<20 as “small”) and amount per person. We categorized events <$10 as “coffee”, $10-<$30 as “lunch”, $30-<$150 as “dinner”, and ≥$150 as “banquet”.

Search
Clear search
Close search
Google apps
Main menu