6 datasets found
  1. s

    Physician Clinical Notes - De-identified Dictation Notes

    • id.shaip.com
    • gd.shaip.com
    • +73more
    json
    Updated Oct 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shaip (2024). Physician Clinical Notes - De-identified Dictation Notes [Dataset]. https://id.shaip.com/resources/sample-datasets/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Oct 6, 2024
    Dataset authored and provided by
    Shaip
    License

    https://www.shaip.comhttps://www.shaip.com

    Description

    A set of formatted clinical documents as dictated by the physicians to train medical AI models.

  2. h

    Bitext-retail-banking-llm-chatbot-training-dataset

    • huggingface.co
    Updated Sep 14, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bitext (2024). Bitext-retail-banking-llm-chatbot-training-dataset [Dataset]. https://huggingface.co/datasets/bitext/Bitext-retail-banking-llm-chatbot-training-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 14, 2024
    Dataset authored and provided by
    Bitext
    License

    https://choosealicense.com/licenses/cdla-sharing-1.0/https://choosealicense.com/licenses/cdla-sharing-1.0/

    Description

    Bitext - Retail Banking Tagged Training Dataset for LLM-based Virtual Assistants

      Overview
    

    This hybrid synthetic dataset is designed to be used to fine-tune Large Language Models such as GPT, Mistral and OpenELM, and has been generated using our NLP/NLG technology and our automated Data Labeling (DAL) tools. The goal is to demonstrate how Verticalization/Domain Adaptation for the [Retail Banking] sector can be easily achieved using our two-step approach to LLM… See the full description on the dataset page: https://huggingface.co/datasets/bitext/Bitext-retail-banking-llm-chatbot-training-dataset.

  3. P

    Customer Service Chatbots Dataset

    • paperswithcode.com
    Updated Mar 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Customer Service Chatbots Dataset [Dataset]. https://paperswithcode.com/dataset/customer-service-chatbots
    Explore at:
    Dataset updated
    Mar 7, 2025
    Description

    Problem Statement

    👉 Download the case studies here

    A leading e-commerce platform faced challenges in providing timely and efficient customer support. The volume of inquiries often overwhelmed support teams, resulting in delayed responses, increased operational costs, and dissatisfied customers. The company needed a solution to automate responses to common customer queries while maintaining personalized service

    Challenge

    Implementing an automated customer support solution came with specific hurdles:

    Handling diverse inquiries, including product details, order status, and returns, with high accuracy.

    Integrating the chatbot with existing systems like CRM and order management.

    Ensuring a seamless customer experience while maintaining conversational quality and natural interactions.

    Solution Provided

    An AI-driven demand forecasting system was developed, utilizing time series forecasting models and advanced analytics platforms to predict product demand accurately. The solution was designed to:

    Address common customer inquiries with pre-trained conversational models.

    Redirect complex queries to human agents seamlessly.

    Operate 24/7 across multiple communication channels, including the website, mobile app, and social media.

    Development Steps

    Data Collection

    Compiled historical customer inquiries and support responses to build a robust dataset for training the chatbot.

    Preprocessing

    Cleaned and categorized data to create intent libraries and FAQs for training NLP models.

    Model Training

    Trained the chatbot using NLP algorithms to recognize intents and entities, ensuring accurate responses. Enhanced the model with machine learning to adapt to customer-specific language and trends.

    Integration

    Integrated the chatbot with CRM, order management, and support ticketing systems to provide real-time information on orders and account details.

    Deployment

    Rolled out the chatbot on the e-commerce platform’s website, app, and social media channels, enabling round-the-clock support.

    Continuous Improvement

    Established a feedback loop to monitor chatbot performance, user satisfaction, and areas for improvement, refining the system continuously.

    Results

    24/7 Customer Support Availability

    The chatbot provided uninterrupted customer support, addressing inquiries outside regular business hours.

    Reduced Response Times

    Automated responses decreased average response times by 50%, ensuring prompt assistance for customers.

    Lowered Operational Costs

    The chatbot reduced dependency on human agents for routine inquiries, cutting support costs significantly.

    Improved Customer Satisfaction

    Timely and accurate responses enhanced the customer experience, leading to positive feedback and increased brand loyalty.

    Scalable Solution

    The chatbot system scaled effortlessly to handle growing customer volumes during peak periods, such as holiday sales.

  4. h

    lmsys-chat-1m

    • huggingface.co
    • opendatalab.com
    Updated Sep 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Large Model Systems Organization (2023). lmsys-chat-1m [Dataset]. https://huggingface.co/datasets/lmsys/lmsys-chat-1m
    Explore at:
    Dataset updated
    Sep 17, 2023
    Dataset authored and provided by
    Large Model Systems Organization
    Description

    LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

    This dataset contains one million real-world conversations with 25 state-of-the-art LLMs. It is collected from 210K unique IP addresses in the wild on the Vicuna demo and Chatbot Arena website from April to August 2023. Each sample includes a conversation ID, model name, conversation text in OpenAI API JSON format, detected language tag, and OpenAI moderation API tag. User consent is obtained through the "Terms of… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/lmsys-chat-1m.

  5. h

    alpaca

    • huggingface.co
    • opendatalab.com
    Updated Mar 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatsu Lab (2023). alpaca [Dataset]. https://huggingface.co/datasets/tatsu-lab/alpaca
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 14, 2023
    Dataset authored and provided by
    Tatsu Lab
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Dataset Card for Alpaca

      Dataset Summary
    

    Alpaca is a dataset of 52,000 instructions and demonstrations generated by OpenAI's text-davinci-003 engine. This instruction data can be used to conduct instruction-tuning for language models and make the language model follow instruction better. The authors built on the data generation pipeline from Self-Instruct framework and made the following modifications:

    The text-davinci-003 engine to generate the instruction data instead… See the full description on the dataset page: https://huggingface.co/datasets/tatsu-lab/alpaca.

  6. r

    ExpBot - A dataset of 79 dialogs with an experimental customer service...

    • radar-service.eu
    • radar.kit.edu
    tar
    Updated Jun 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Mädche; Jasper Feine; Stefan Morana; Ulrich Gnewuch (2023). ExpBot - A dataset of 79 dialogs with an experimental customer service chatbot [Dataset]. http://doi.org/10.35097/1210
    Explore at:
    tar(251904 bytes)Available download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    Karlsruhe Institute of Technology
    Mädche, Alexander
    Gnewuch, Ulrich
    Feine, Jasper
    Morana, Stefan
    Authors
    Alexander Mädche; Jasper Feine; Stefan Morana; Ulrich Gnewuch
    Description

    This dataset consists of 79 dialogs between a human user and a chatbot in English language. This data was collected during an online experiment conducted by the research group "Information Systems & Service Design" at the Karlsruhe Institute of Technology (KIT). Experimental task: Participants were asked to interact with a chatbot to find out whether they could save money by switching to a better mobile phone plan. Additionally, there were shown a fictitious copy of last month's mobile phone bill. During the conversation, the chatbot asked about the participant's usage patterns (e.g., how much data was used) and recommended a randomly generated plan that better met the participant’s requirements. For more information, see Gnewuch et al. (2018). If you have any questions, please contact us via email (info@chatbotresearch.com) or visit https://chatbotresearch.com. WARNING! Some dialogs contain profanity and/or offensive language. Profanity was not removed because it is important for calculating sentiment scores. PUBLICATIONS / REFERENCES Gnewuch, U., Morana, S., Adam, M. T. P., and Maedche, A. 2018. “Faster Is Not Always Better: Understanding the Effect of Dynamic Response Delays in Human-Chatbot Interaction,” in Proceedings of the 26th European Conference on Information Systems (ECIS 2018), Portsmouth, United Kingdom. Feine, J., Morana, S., and Gnewuch, U. 2019. “Measuring Service Encounter Satisfaction with Customer Service Chatbots using Sentiment Analysis,” in Proceedings of the 14th International Conference on Wirtschaftsinformatik (WI 2019), Siegen, Germany, February 24–27.

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Shaip (2024). Physician Clinical Notes - De-identified Dictation Notes [Dataset]. https://id.shaip.com/resources/sample-datasets/

Physician Clinical Notes - De-identified Dictation Notes

Explore at:
jsonAvailable download formats
Dataset updated
Oct 6, 2024
Dataset authored and provided by
Shaip
License

https://www.shaip.comhttps://www.shaip.com

Description

A set of formatted clinical documents as dictated by the physicians to train medical AI models.

Search
Clear search
Close search
Google apps
Main menu