2 datasets found
  1. Resume Dataset

    • kaggle.com
    Updated Aug 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Snehaan Bhawal (2021). Resume Dataset [Dataset]. https://www.kaggle.com/datasets/snehaanbhawal/resume-dataset/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Snehaan Bhawal
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

    Content

    Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv.

    Inside the CSV: - ID: Unique identifier and file name for the respective pdf. - Resume_str : Contains the resume text only in string format. - Resume_html : Contains the resume data in html format as present while web scrapping. - Category : Category of the job the resume was used to apply.

    Present categories are HR, Designer, Information-Technology, Teacher, Advocate, Business-Development, Healthcare, Fitness, Agriculture, BPO, Sales, Consultant, Digital-Media, Automobile, Chef, Finance, Apparel, Engineering, Accountant, Construction, Public-Relations, Banking, Arts, Aviation

    Acknowledgements

    Data was obtained by scrapping individual resume examples from www.livecareer.com website. Web Scrapping code present in my Github Repo.

  2. h

    resumes

    • huggingface.co
    Updated May 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Phillipe Pouti (2024). resumes [Dataset]. https://huggingface.co/datasets/opensporks/resumes
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 14, 2024
    Authors
    Phillipe Pouti
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for Resume Dataset

      Dataset Summary
    
    
    
    
    
      Context
    

    A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

      Content
    

    Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv. Inside the… See the full description on the dataset page: https://huggingface.co/datasets/opensporks/resumes.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Snehaan Bhawal (2021). Resume Dataset [Dataset]. https://www.kaggle.com/datasets/snehaanbhawal/resume-dataset/suggestions
Organization logo

Resume Dataset

A collection of Resumes in PDF as well as String format for data extraction.

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 8, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Snehaan Bhawal
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

Content

Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv.

Inside the CSV: - ID: Unique identifier and file name for the respective pdf. - Resume_str : Contains the resume text only in string format. - Resume_html : Contains the resume data in html format as present while web scrapping. - Category : Category of the job the resume was used to apply.

Present categories are HR, Designer, Information-Technology, Teacher, Advocate, Business-Development, Healthcare, Fitness, Agriculture, BPO, Sales, Consultant, Digital-Media, Automobile, Chef, Finance, Apparel, Engineering, Accountant, Construction, Public-Relations, Banking, Arts, Aviation

Acknowledgements

Data was obtained by scrapping individual resume examples from www.livecareer.com website. Web Scrapping code present in my Github Repo.

Search
Clear search
Close search
Google apps
Main menu