7 datasets found
  1. Bitext-restaurants-llm-chatbot-training-dataset

    • huggingface.co
    Updated Aug 16, 2024
    + more versions
    Cite
    Bitext (2024). Bitext-restaurants-llm-chatbot-training-dataset [Dataset]. https://huggingface.co/datasets/bitext/Bitext-restaurants-llm-chatbot-training-dataset
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 16, 2024
    Dataset authored and provided by
    Bitext
    License

    https://choosealicense.com/licenses/cdla-sharing-1.0/

    Description

    Bitext - Restaurants Tagged Training Dataset for LLM-based Virtual Assistants

      Overview
    

    This hybrid synthetic dataset is designed to be used to fine-tune Large Language Models such as GPT, Mistral and OpenELM, and has been generated using our NLP/NLG technology and our automated Data Labeling (DAL) tools. The goal is to demonstrate how Verticalization/Domain Adaptation for the [restaurants] sector can be easily achieved using our two-step approach to LLM Fine-Tuning. An… See the full description on the dataset page: https://huggingface.co/datasets/bitext/Bitext-restaurants-llm-chatbot-training-dataset.

  2. whatscooking.restaurants

    • huggingface.co
    Updated Feb 14, 2024
    Cite
    MongoDB (2024). whatscooking.restaurants [Dataset]. https://huggingface.co/datasets/MongoDB/whatscooking.restaurants
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 14, 2024
    Dataset authored and provided by
    MongoDB
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Whatscooking.restaurants

      Overview
    

    This dataset provides detailed information about various restaurants, including their location, cuisine, ratings, and other attributes. It is particularly useful for applications in food and beverage industry analysis, recommendation systems, and geographical studies.

      Dataset Structure
    

    Each record in the dataset represents a single restaurant and contains the following fields:

    _id: A unique identifier for the restaurant… See the full description on the dataset page: https://huggingface.co/datasets/MongoDB/whatscooking.restaurants.

  3. Forecast: Full-Service Restaurants Industry Gross Output in the US 2024 -...

    • reportlinker.com
    Updated Apr 11, 2024
    Cite
    ReportLinker (2024). Forecast: Full-Service Restaurants Industry Gross Output in the US 2024 - 2028 [Dataset]. https://www.reportlinker.com/dataset/3d0462fed43427405ad1c80519eab420211e531b
    Explore at:
    Dataset updated
    Apr 11, 2024
    Dataset authored and provided by
    ReportLinker
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Forecast: Full-Service Restaurants Industry Gross Output in the US 2024 - 2028 Discover more data with ReportLinker!

  4. philly_restaurants

    • huggingface.co
    Updated Jan 14, 2022
    Cite
    Daniel Pappalardo (2022). philly_restaurants [Dataset]. https://huggingface.co/datasets/danielpappa/philly_restaurants
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 14, 2022
    Authors
    Daniel Pappalardo
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Area covered
    Philadelphia
    Description

    Dataset Card for Dataset Name

    Dataset from Yelp containing restaurant reviews and locations in Philadelphia.

      Dataset Details
    
      Dataset Description
    

    More than 600 restaurants across Philadelphia with five reviews each, including both text and stars (1-5).

  5. Niagara County

    • health.data.ny.gov
    application/rdfxml +5
    Updated Jun 29, 2025
    Cite
    New York State Department of Health (2025). Niagara County [Dataset]. https://health.data.ny.gov/widgets/7kyb-jvd4
    Explore at:
    Available download formats: application/rssxml, tsv, json, xml, csv, application/rdfxml
    Dataset updated
    Jun 29, 2025
    Authors
    New York State Department of Health
    Area covered
    Niagara County
    Description

    This data includes the name and location of active food service establishments and the violations that were found at the time of the inspection. Active food service establishments include only establishments that are currently operating. This dataset excludes inspections conducted in New York City (https://data.cityofnewyork.us/Health/Restaurant-Inspection-Results/4vkw-7nck), Suffolk County (http://apps.suffolkcountyny.gov/health/Restaurant/intro.html) and Erie County (http://www.healthspace.com/erieny). Inspections are a “snapshot” in time and are not always reflective of the day-to-day operations and overall condition of an establishment. Occasionally, remediation may not appear until the following month due to the timing of the updates. Update frequencies and availability of historical inspection data may vary from county to county. Some counties provide this information on their own websites and information found there may be updated more frequently. This dataset is refreshed on a monthly basis. The inspection data contained in this dataset was not collected in a manner intended for use as a restaurant grading system, and should not be construed or interpreted as such. Any use of this data to develop a restaurant grading system is not supported or endorsed by the New York State Department of Health. For more information, visit http://www.health.ny.gov/regulations/nycrr/title_10/part_14/subpart_14-1.htm or go to the “About” tab.

  6. ClintonCounty

    • health.data.ny.gov
    application/rdfxml +5
    Updated Jul 12, 2025
    + more versions
    Cite
    New York State Department of Health (2025). ClintonCounty [Dataset]. https://health.data.ny.gov/Health/ClintonCounty/4hcx-sgvu
    Explore at:
    Available download formats: application/rdfxml, tsv, csv, xml, json, application/rssxml
    Dataset updated
    Jul 12, 2025
    Authors
    New York State Department of Health
    Description

    This data includes the name and location of active food service establishments and the violations that were found at the time of the inspection. Active food service establishments include only establishments that are currently operating. This dataset excludes inspections conducted in New York City (https://data.cityofnewyork.us/Health/Restaurant-Inspection-Results/4vkw-7nck), Suffolk County (http://apps.suffolkcountyny.gov/health/Restaurant/intro.html) and Erie County (http://www.healthspace.com/erieny). Inspections are a “snapshot” in time and are not always reflective of the day-to-day operations and overall condition of an establishment. Occasionally, remediation may not appear until the following month due to the timing of the updates. Update frequencies and availability of historical inspection data may vary from county to county. Some counties provide this information on their own websites and information found there may be updated more frequently. This dataset is refreshed on a monthly basis. The inspection data contained in this dataset was not collected in a manner intended for use as a restaurant grading system, and should not be construed or interpreted as such. Any use of this data to develop a restaurant grading system is not supported or endorsed by the New York State Department of Health. Historical inspection data through 2005 is also available. Inactive (closed) establishments can be found at: https://health.data.ny.gov/Health/Food-Service-Establishment-Inspections-Beginning-2/aaxz-j6pj. For more information, visit http://www.health.ny.gov/regulations/nycrr/title_10/part_14/subpart_14-1.htm or go to the “About” tab.

  7. Cayuga

    • cayugacounty.us
    • health.data.ny.gov
    Updated May 5, 2025
    + more versions
    Cite
    New York State Department of Health (2025). Cayuga [Dataset]. https://www.cayugacounty.us/442/Food-Service-Inspections
    Explore at:
    Available download formats: application/rssxml, xml, application/geo+json, application/rdfxml, csv, kmz, kml, tsv
    Dataset updated
    May 5, 2025
    Authors
    New York State Department of Health
    Description

    This data includes the name and location of food service establishments and the violations that were found at the time of their last inspection. This dataset excludes inspections conducted in New York City (see: https://nycopendata.socrata.com/), Suffolk County (http://apps.suffolkcountyny.gov/health/Restaurant/intro.html) and Erie County (http://www.healthspace.com/erieny). Inspections are a “snapshot” in time and are not always reflective of the day-to-day operations and overall condition of an establishment. Occasionally, remediation may not appear until the following month due to the timing of the updates. Some counties provide this information on their own websites and information found there may be updated more frequently. This dataset is refreshed on a monthly basis.

    Last inspection data is the most recently submitted and available data. Historical inspection data through 2005 is also available. Active establishments can be found at: https://health.data.ny.gov/Health/Food-Service-Establishment-Inspections-Beginning-2/2hcc-shji. Inactive (closed) establishments can be found at: https://health.data.ny.gov/Health/Food-Service-Establishment-Inspections-Beginning-2/aaxz-j6pj

    For more information, check out http://www.health.ny.gov/regulations/nycrr/title_10/part_14/subpart_14-1.htm, or go to the "About" tab.
