3 datasets found
  1. o

    Exploratory Data Analysis and Dataset Introduction Of Arkansas' Registered...

    • explore.openaire.eu
    Updated Dec 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robert McGough; Joshua Edelmann; Benjamin Feder (2023). Exploratory Data Analysis and Dataset Introduction Of Arkansas' Registered Apprenticeship Partners Information Management Data System [Dataset]. http://doi.org/10.5281/zenodo.10407358
    Explore at:
    Dataset updated
    Dec 19, 2023
    Authors
    Robert McGough; Joshua Edelmann; Benjamin Feder
    Description

    This is a Jupyter Notebook demonstrating the Exploratory Data Analysis (EDA) process on the primary dataset available for this training program: Arkansas’ Registered Apprenticeship Partners Information Management Data System (RAPIDS) Data. EDA is a vital first step as it provides numerical and visual summaries of the data. This notebook was developed for the Summer 2022 Applied Data Analytics training facilitated by the State of Arkansas and Coleridge Initiative.

  2. McKinsey Solve Assessment Data (2018–2025)

    • kaggle.com
    Updated May 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oluwademilade Adeniyi (2025). McKinsey Solve Assessment Data (2018–2025) [Dataset]. http://doi.org/10.34740/kaggle/dsv/11720554
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 7, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Oluwademilade Adeniyi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    McKinsey Solve Global Assessment Dataset (2018–2025)

    🧠 Context

    McKinsey's Solve is a gamified problem-solving assessment used globally in the consulting firm’s recruitment process. This dataset simulates assessment results across geographies, education levels, and roles over a 7-year period. It aims to provide deep insights into performance trends, candidate readiness, resume quality, and cognitive task outcomes.

    📌 Inspiration & Purpose

    Inspired by McKinsey’s real-world assessment framework, this dataset was designed to enable: - Exploratory Data Analysis (EDA) - Recruitment trend analysis - Gamified performance modelling - Dashboard development in Excel / Power BI - Resume and education impact evaluation - Regional performance benchmarking - Data storytelling for portfolio projects

    Whether you're building dashboards or training models, this dataset offers practical and relatable data for HR analytics and consulting use cases.

    🔍 Dataset Source

    • Data generated by Oluwademilade Adeniyi (Demibolt) with the assistance of ChatGPT by OpenAI Structure and logic inspired by McKinsey’s public-facing Solve information, including role categories, game types (Ecosystem, Redrock, Seawolf), education tiers, and global office locations The entire dataset is synthetic and designed for analytical learning, ethical use, and professional development

    🧾 Dataset Structure

    This dataset includes 4,000 rows and the following columns: - Testtaker ID: Unique identifier - Country / Region: Geographic segmentation - Gender / Age: Demographics - Year: Assessment year (2018–2025) - Highest Level of Education: From high school to PhD / MBA - School or University Attended: Mapped to country and education level - First-generation University Student: Yes/No - Employment Status: Student, Employed, Unemployed - Role Applied For and Department / Interest: Business/tech disciplines - Past Test Taker: Indicates repeat attempts - Prepared with Online Materials: Indicates test prep involvement - Desired Office Location: Mapped to McKinsey's international offices - Ecosystem / Redrock / Seawolf (%): Game performance scores - Time Spent on Each Game (mins) - Total Product Score: Average of the 3 game scores - Process Score: A secondary assessment component - Resume Score: Scored based on education prestige, role fit, and clarity - Total Assessment Score (%): Final decision metric - Status (Pass/Fail): Based on total score ≥ 75%

    ✅ Why Use This Dataset

    • Benchmark educational and regional trends in global assessments
    • Build KPI cards, donut charts, histograms, or speedometer visuals
    • Train pass/fail classifiers or regression models
    • Segment job applicants by role, location, or game behaviour
    • Showcase portfolio skills across Excel, SQL, Power BI, Python, or R
    • Test dashboards or predictive logic in a business-relevant scenario

    💡 Credit & Collaboration

    • Data Creator: Oluwademilade Adeniyi (Me) (LinkedIn, Twitter, GitHub, Medium)
    • Collaborator: ChatGPT by OpenAI
    • Inspired by: McKinsey & Company’s Solve Assessment
  3. O

    Analytic_Provenance

    • opendatalab.com
    zip
    Updated Jan 17, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Texas A&M University (2018). Analytic_Provenance [Dataset]. https://opendatalab.com/OpenDataLab/Analytic_Provenance
    Explore at:
    zip(321803532 bytes)Available download formats
    Dataset updated
    Jan 17, 2018
    Dataset provided by
    Texas A&M University
    Description

    Analytic provenance is a data repository that can be used to study human analysis activity, thought processes, and software interaction with visual analysis tools during exploratory data analysis. It was collected during a series of user studies involving exploratory data analysis scenario with textual and cyber security data. Interactions logs, think-alouds, videos and all coded data in this study are available online for research purposes. Analysis sessions are segmented in multiple sub-task steps based on user think-alouds, video and audios captured during the studies. These analytic provenance datasets can be used for research involving tools and techniques for analyzing interaction logs and analysis history.

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Robert McGough; Joshua Edelmann; Benjamin Feder (2023). Exploratory Data Analysis and Dataset Introduction Of Arkansas' Registered Apprenticeship Partners Information Management Data System [Dataset]. http://doi.org/10.5281/zenodo.10407358

Exploratory Data Analysis and Dataset Introduction Of Arkansas' Registered Apprenticeship Partners Information Management Data System

Explore at:
22 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Dec 19, 2023
Authors
Robert McGough; Joshua Edelmann; Benjamin Feder
Description

This is a Jupyter Notebook demonstrating the Exploratory Data Analysis (EDA) process on the primary dataset available for this training program: Arkansas’ Registered Apprenticeship Partners Information Management Data System (RAPIDS) Data. EDA is a vital first step as it provides numerical and visual summaries of the data. This notebook was developed for the Summer 2022 Applied Data Analytics training facilitated by the State of Arkansas and Coleridge Initiative.

Search
Clear search
Close search
Google apps
Main menu