2 datasets found

Resume Dataset
kaggle.com
Updated Aug 8, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Snehaan Bhawal (2021). Resume Dataset [Dataset]. https://www.kaggle.com/datasets/snehaanbhawal/resume-dataset/suggestions
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 8, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Snehaan Bhawal
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

Content

Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv.

Inside the CSV: - ID: Unique identifier and file name for the respective pdf. - Resume_str : Contains the resume text only in string format. - Resume_html : Contains the resume data in html format as present while web scrapping. - Category : Category of the job the resume was used to apply.

Present categories are HR, Designer, Information-Technology, Teacher, Advocate, Business-Development, Healthcare, Fitness, Agriculture, BPO, Sales, Consultant, Digital-Media, Automobile, Chef, Finance, Apparel, Engineering, Accountant, Construction, Public-Relations, Banking, Arts, Aviation

Acknowledgements

Data was obtained by scrapping individual resume examples from www.livecareer.com website. Web Scrapping code present in my Github Repo.
h
resumes
huggingface.co
Updated May 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Phillipe Pouti (2024). resumes [Dataset]. https://huggingface.co/datasets/opensporks/resumes
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 14, 2024
Authors
Phillipe Pouti
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Dataset Card for Resume Dataset

Dataset Summary Context

A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

Content

Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv. Inside the… See the full description on the dataset page: https://huggingface.co/datasets/opensporks/resumes.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Snehaan Bhawal (2021). Resume Dataset [Dataset]. https://www.kaggle.com/datasets/snehaanbhawal/resume-dataset/suggestions

Resume Dataset

A collection of Resumes in PDF as well as String format for data extraction.

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Aug 8, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Snehaan Bhawal

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.

Content

Contains 2400+ Resumes in string as well as PDF format. PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv.

Inside the CSV: - ID: Unique identifier and file name for the respective pdf. - Resume_str : Contains the resume text only in string format. - Resume_html : Contains the resume data in html format as present while web scrapping. - Category : Category of the job the resume was used to apply.

Present categories are HR, Designer, Information-Technology, Teacher, Advocate, Business-Development, Healthcare, Fitness, Agriculture, BPO, Sales, Consultant, Digital-Media, Automobile, Chef, Finance, Apparel, Engineering, Accountant, Construction, Public-Relations, Banking, Arts, Aviation

Acknowledgements

Data was obtained by scrapping individual resume examples from www.livecareer.com website. Web Scrapping code present in my Github Repo.

Clear search

Close search

Google apps

Main menu

Resume Dataset

Context

Content

Acknowledgements

resumes

Resume Dataset

A collection of Resumes in PDF as well as String format for data extraction.

Context

Content

Acknowledgements