6 datasets found
  1. The Great American Coffee Taste Test Dataset

    • kaggle.com
    Updated May 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Umer Haddii (2024). The Great American Coffee Taste Test Dataset [Dataset]. https://www.kaggle.com/datasets/umerhaddii/the-great-american-coffee-taste-test-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 20, 2024
    Dataset provided by
    Kaggle
    Authors
    Umer Haddii
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Context

    World champion barista James Hoffmann and Cometeer partnered to conduct a first-of-its-kind coffee taste test. Cometeer shipped 5000 coffee kits across America. Kits contained four different coffees - pre-extracted and flash frozen. Tasters melted and diluted the coffee capsules for a largely identical tasting experience. Tasting and ratings were conducted blind [1]. After survey responses were collected (provided data), some attributes of the coffee were revealed.

    In October 2023, World champion barista James Hoffmann and coffee company Cometeer held the "Great American Coffee Taste Test" on YouTube, during which viewers were asked to fill out a survey about 4 coffees they ordered from Cometeer for the tasting. Data blogger Robert McKeon Aloe analyzed the data the following month.

    Content

    Geography: US

    Time-period: 2023

    Unit of Analysis: The Great American Coffee Taste Test

    Variables

    • submission_id = Submission ID
    • age = What is your age?
    • cups = How many cups of coffee do you typically drink per day?
    • where_drink = Where do you typically drink coffee?
    • brew = How do you brew coffee at home?
    • brew_other = How else do you brew coffee at home?
    • purchase = On the go, where do you typically purchase coffee?
    • purchase_other = Where else do you purchase coffee?
    • favorite = What is your favorite coffee drink?
    • favorite_specify = Please specify what your favorite coffee drink is
    • additions = Do you usually add anything to your coffee?
    • additions_other = What else do you add to your coffee?
    • dairy = What kind of dairy do you add?
    • sweetener = What kind of sugar or sweetener do you add?
    • style = Before today's tasting, which of the following best described what kind of coffee you like?
      -**strength** = How strong do you like your coffee?
    • roast_level = What roast level of coffee do you prefer?
    • caffeine = How much caffeine do you like in your coffee?
    • expertise = Lastly, how would you rate your own coffee expertise?
    • coffee_a_bitterness = Coffee A - Bitterness
    • coffee_a_acidity = Coffee A - Acidity
    • coffee_a_personal_preference = Coffee A - Personal Preference
    • coffee_a_notes = Coffee A - Notes
    • coffee_b_bitterness = Coffee B - Bitterness
    • coffee_b_acidity = Coffee B - Acidity
    • coffee_b_personal_preference = Coffee B - Personal Preference
    • coffee_b_notes = Coffee B - Notes
    • coffee_c_bitterness = Coffee C - Bitterness
    • coffee_c_acidity = Coffee C - Acidity
    • coffee_c_personal_preference = Coffee C - Personal Preference
    • coffee_c_notes = Coffee C - Notes
    • coffee_d_bitterness = Coffee D - Bitterness
    • coffee_d_acidity = Coffee D - Acidity
    • coffee_d_personal_preference = Coffee D - Personal Preference
    • coffee_d_notes = Coffee D - Notes
    • prefer_abc = Between Coffee A, Coffee B, and Coffee C which did you prefer?
    • prefer_ad = Between Coffee A and Coffee D, which did you prefer?
    • prefer_overall = Lastly, what was your favorite overall coffee?
    • wfh = Do you work from home or in person?
    • total_spend = In total, how much money do you typically spend on coffee in a month?
    • why_drink = Why do you drink coffee?
    • why_drink_other = Other reason for drinking coffee
    • taste = Do you like the taste of coffee?
    • know_source = Do you know where your coffee comes from?
    • most_paid = What is the most you've ever paid for a cup of coffee?
    • most_willing = What is the most you'd ever be willing to pay for a cup of coffee?
    • value_cafe = Do you feel like you’re getting good value for your money when you buy coffee at a cafe?
    • spent_equipment = Approximately how much have you spent on coffee equipment in the past 5 years?
    • value_equipment = Do you feel like you’re getting good value for your money when you buy coffee at a cafe?
    • gender = Gender
    • gender_specify = Gender (please specify)
    • education_level = Education Level
    • ethnicity_race = Ethnicity/Race
    • ethnicity_race_specify = Ethnicity/Race (please specify)
    • employment_status = Employment Status
    • number_children = Number of Children
    • political_affiliation = Political Affiliation

    Acknowledgement

    Datasource: The data is collected thorugh a survey called The Great American Coffee Taste Test held by James Haffmann

    Inspiration: [Great American Coffee...

  2. Global coffee consumption 2012/13-2023/4

    • statista.com
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Global coffee consumption 2012/13-2023/4 [Dataset]. https://www.statista.com/statistics/292595/global-coffee-consumption/
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    Coffee is one of the most widely consumed beverages in the world. In 2023/24, approximately *** million 60 kilogram bags of coffee were consumed worldwide, a slight increase from about *** million bags in the previous year. Coffee Brewing Innovations The coffee industry is regularly coming up with innovative new methods for brewing coffee and serving it. Single cup brewers are a relatively new innovation which offer a quick and mess-free coffee brewing method for when one only wants a single cup of coffee without the hassle of brewing an entire pot. Cold Brew Coffee Cold brew coffee is made by leaving coffee to brew in cold or room temperature water for 12 to 24 hours and results in a mellower, less acidic tasting coffee. It has become a popular menu item at coffee shops and cafes, but it also easy to make at home. The market for cold brew coffee was valued at *** million U.S. dollars in 2017, and is expected to reach a staggering **** billion U.S. dollars in market value by 2023.

  3. Coffee consumption in the U.S. 2013/14-2024/2025

    • statista.com
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Coffee consumption in the U.S. 2013/14-2024/2025 [Dataset]. https://www.statista.com/statistics/804271/domestic-coffee-consumption-in-the-us/
    Explore at:
    Dataset updated
    Nov 29, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States, North America
    Description

    Coffee consumption in the United States amounted to over ** million 60-kilogram bags in the 2024/2025 fiscal year. This is a slight increase from the total U.S. coffee consumption in the previous fiscal year. Coffee production The coffee plant has its origins in Ethiopia and is now grown all over the world. Most of the world’s coffee is cultivated in South America, followed by Asia and Oceania. In 2024, over ***million 60-kilogram bags of coffee were produced in South America. The majority of South America’s coffee production is attributed to Brazil. In the 2023/2024 fiscal year, global coffee production reached *** million 60-kilogram bags. Coffee brewing in the United States Americans love their coffee and have dozens of different methods and gadgets for brewing and preparing coffee. A 2025 survey of U.S. consumers found that the most commonly used coffee preparation methods were drip coffee makers and single-cup brewers. However, drip coffee makers have become less popular over time. In 2012, ***percent of coffee drinkers used drip coffee makers, while in 2025 this share had dropped to ***percent.

  4. Coffee Shop Daily Revenue Prediction Dataset

    • kaggle.com
    zip
    Updated Feb 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Himel Sarder (2025). Coffee Shop Daily Revenue Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/himelsarder/coffee-shop-daily-revenue-prediction-dataset
    Explore at:
    zip(30259 bytes)Available download formats
    Dataset updated
    Feb 7, 2025
    Authors
    Himel Sarder
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Overview

    This dataset contains 2,000 rows of data from coffee shops, offering detailed insights into factors that influence daily revenue. It includes key operational and environmental variables that provide a comprehensive view of how business activities and external conditions affect sales performance. Designed for use in predictive analytics and business optimization, this dataset is a valuable resource for anyone looking to understand the relationship between customer behavior, operational decisions, and revenue generation in the food and beverage industry.

    Columns & Variables

    The dataset features a variety of columns that capture the operational details of coffee shops, including customer activity, store operations, and external factors such as marketing spend and location foot traffic.

    1. Number of Customers Per Day

      • The total number of customers visiting the coffee shop on any given day.
      • Range: 50 - 500 customers.
    2. Average Order Value ($)

      • The average dollar amount spent by each customer during their visit.
      • Range: $2.50 - $10.00.
    3. Operating Hours Per Day

      • The total number of hours the coffee shop is open for business each day.
      • Range: 6 - 18 hours.
    4. Number of Employees

      • The number of employees working on a given day. This can influence service speed, customer satisfaction, and ultimately, sales.
      • Range: 2 - 15 employees.
    5. Marketing Spend Per Day ($)

      • The amount of money spent on marketing campaigns or promotions on any given day.
      • Range: $10 - $500 per day.
    6. Location Foot Traffic (people/hour)

      • The number of people passing by the coffee shop per hour, a variable indicative of the shop's location and its potential to attract customers.
      • Range: 50 - 1000 people per hour.

    Target Variable

    • Daily Revenue ($)
      • This is the dependent variable representing the total revenue generated by the coffee shop each day.
      • It is calculated as a combination of customer visits, average spending, and other operational factors like marketing spend and staff availability.
      • Range: $200 - $10,000 per day.

    Data Distribution & Insights

    The dataset spans a wide variety of operational scenarios, from small neighborhood coffee shops with limited traffic to larger, high-traffic locations with extensive marketing budgets. This variety allows for exploring different predictive modeling strategies. Key insights that can be derived from the data include:

    • The effect of marketing spend on daily revenue.
    • The correlation between customer count and daily sales.
    • The relationship between staffing levels and revenue generation.
    • The influence of foot traffic and operating hours on customer behavior.

    Use Cases & Applications

    The dataset offers a wide range of applications, especially in predictive analytics, business optimization, and forecasting:

    • Predictive Modeling: Use machine learning models such as regression, decision trees, or neural networks to predict daily revenue based on operational data.
    • Business Strategy Development: Analyze how changes in marketing spend, staff numbers, or operating hours can optimize revenue and improve efficiency.
    • Customer Insights: Identify patterns in customer behavior related to shop operations and external factors like foot traffic and marketing campaigns.
    • Resource Allocation: Determine optimal staffing levels and marketing budgets based on predicted sales, improving overall profitability.

    Real-World Applications in the Food & Beverage Industry

    For coffee shop owners, managers, and analysts in the food and beverage industry, this dataset provides an essential tool for refining daily operations and boosting profitability. Insights gained from this data can help:

    • Optimize Marketing Campaigns: Evaluate the effectiveness of daily or seasonal marketing campaigns on revenue.
    • Staff Scheduling: Predict busy days and ensure that the right number of employees are scheduled to maximize efficiency.
    • Revenue Forecasting: Provide accurate revenue projections that can assist with financial planning and decision-making.
    • Operational Efficiency: Discover the most profitable operating hours and adjust business hours accordingly.

    This dataset is also ideal for aspiring data scientists and machine learning practitioners looking to apply their skills to real-world business problems in the food and beverage sector.

    Conclusion

    The Coffee Shop Revenue Prediction Dataset is a versatile and comprehensive resource for understanding the dynamics of daily sales performance in coffee shops. With a focus on key operational factors, it is perfect for building predictive models, ...

  5. Coffee Tastings [Survey Analysis]

    • kaggle.com
    zip
    Updated Nov 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sujay Kapadnis (2023). Coffee Tastings [Survey Analysis] [Dataset]. https://www.kaggle.com/datasets/sujaykapadnis/lets-do-some-coffee-tasting
    Explore at:
    zip(444226 bytes)Available download formats
    Dataset updated
    Nov 20, 2023
    Authors
    Sujay Kapadnis
    Description

    Last month, British YouTuber (and former World Barista Champion) James Hoffman virtually hosted the Great American Coffee Taste Test, during which thousands of people simultaneously blind-tasted the same four coffees. Hoffman has published a video summarizing the results, as well as a spreadsheet of anonymized survey responses from 4,000+ participants. It includes tasters’ demographics, general coffee drinking habits and preferences, assessments of the four coffees, and more CR: https://bit.ly/gacttCSV+

  6. The ORBIT (Object Recognition for Blind Image Training)-India Dataset

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones (2025). The ORBIT (Object Recognition for Blind Image Training)-India Dataset [Dataset]. http://doi.org/10.5281/zenodo.12608444
    Explore at:
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    India
    Description

    The ORBIT (Object Recognition for Blind Image Training) -India Dataset is a collection of 105,243 images of 76 commonly used objects, collected by 12 individuals in India who are blind or have low vision. This dataset is an "Indian subset" of the original ORBIT dataset [1, 2], which was collected in the UK and Canada. In contrast to the ORBIT dataset, which was created in a Global North, Western, and English-speaking context, the ORBIT-India dataset features images taken in a low-resource, non-English-speaking, Global South context, a home to 90% of the world’s population of people with blindness. Since it is easier for blind or low-vision individuals to gather high-quality data by recording videos, this dataset, like the ORBIT dataset, contains images (each sized 224x224) derived from 587 videos. These videos were taken by our data collectors from various parts of India using the Find My Things [3] Android app. Each data collector was asked to record eight videos of at least 10 objects of their choice.

    Collected between July and November 2023, this dataset represents a set of objects commonly used by people who are blind or have low vision in India, including earphones, talking watches, toothbrushes, and typical Indian household items like a belan (rolling pin), and a steel glass. These videos were taken in various settings of the data collectors' homes and workspaces using the Find My Things Android app.

    The image dataset is stored in the ‘Dataset’ folder, organized by folders assigned to each data collector (P1, P2, ...P12) who collected them. Each collector's folder includes sub-folders named with the object labels as provided by our data collectors. Within each object folder, there are two subfolders: ‘clean’ for images taken on clean surfaces and ‘clutter’ for images taken in cluttered environments where the objects are typically found. The annotations are saved inside a ‘Annotations’ folder containing a JSON file per video (e.g., P1--coffee mug--clean--231220_084852_coffee mug_224.json) that contains keys corresponding to all frames/images in that video (e.g., "P1--coffee mug--clean--231220_084852_coffee mug_224--000001.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, "P1--coffee mug--clean--231220_084852_coffee mug_224--000002.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, ...). The ‘object_not_present_issue’ key is True if the object is not present in the image, and the ‘pii_present_issue’ key is True, if there is a personally identifiable information (PII) present in the image. Note, all PII present in the images has been blurred to protect the identity and privacy of our data collectors. This dataset version was created by cropping images originally sized at 1080 × 1920; therefore, an unscaled version of the dataset will follow soon.

    This project was funded by the Engineering and Physical Sciences Research Council (EPSRC) Industrial ICASE Award with Microsoft Research UK Ltd. as the Industrial Project Partner. We would like to acknowledge and express our gratitude to our data collectors for their efforts and time invested in carefully collecting videos to build this dataset for their community. The dataset is designed for developing few-shot learning algorithms, aiming to support researchers and developers in advancing object-recognition systems. We are excited to share this dataset and would love to hear from you if and how you use this dataset. Please feel free to reach out if you have any questions, comments or suggestions.

    REFERENCES:

    1. Daniela Massiceti, Lida Theodorou, Luisa Zintgraf, Matthew Tobias Harris, Simone Stumpf, Cecily Morrison, Edward Cutrell, and Katja Hofmann. 2021. ORBIT: A real-world few-shot dataset for teachable object recognition collected from people who are blind or low vision. DOI: https://doi.org/10.25383/city.14294597

    2. microsoft/ORBIT-Dataset. https://github.com/microsoft/ORBIT-Dataset

    3. Linda Yilin Wen, Cecily Morrison, Martin Grayson, Rita Faia Marques, Daniela Massiceti, Camilla Longden, and Edward Cutrell. 2024. Find My Things: Personalized Accessibility through Teachable AI for People who are Blind or Low Vision. In Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI EA '24). Association for Computing Machinery, New York, NY, USA, Article 403, 1–6. https://doi.org/10.1145/3613905.3648641

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Umer Haddii (2024). The Great American Coffee Taste Test Dataset [Dataset]. https://www.kaggle.com/datasets/umerhaddii/the-great-american-coffee-taste-test-dataset
Organization logo

The Great American Coffee Taste Test Dataset

James Hoffmann and Cometeer survey America's coffee taste preferences

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 20, 2024
Dataset provided by
Kaggle
Authors
Umer Haddii
License

http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

Description

Context

World champion barista James Hoffmann and Cometeer partnered to conduct a first-of-its-kind coffee taste test. Cometeer shipped 5000 coffee kits across America. Kits contained four different coffees - pre-extracted and flash frozen. Tasters melted and diluted the coffee capsules for a largely identical tasting experience. Tasting and ratings were conducted blind [1]. After survey responses were collected (provided data), some attributes of the coffee were revealed.

In October 2023, World champion barista James Hoffmann and coffee company Cometeer held the "Great American Coffee Taste Test" on YouTube, during which viewers were asked to fill out a survey about 4 coffees they ordered from Cometeer for the tasting. Data blogger Robert McKeon Aloe analyzed the data the following month.

Content

Geography: US

Time-period: 2023

Unit of Analysis: The Great American Coffee Taste Test

Variables

  • submission_id = Submission ID
  • age = What is your age?
  • cups = How many cups of coffee do you typically drink per day?
  • where_drink = Where do you typically drink coffee?
  • brew = How do you brew coffee at home?
  • brew_other = How else do you brew coffee at home?
  • purchase = On the go, where do you typically purchase coffee?
  • purchase_other = Where else do you purchase coffee?
  • favorite = What is your favorite coffee drink?
  • favorite_specify = Please specify what your favorite coffee drink is
  • additions = Do you usually add anything to your coffee?
  • additions_other = What else do you add to your coffee?
  • dairy = What kind of dairy do you add?
  • sweetener = What kind of sugar or sweetener do you add?
  • style = Before today's tasting, which of the following best described what kind of coffee you like?
    -**strength** = How strong do you like your coffee?
  • roast_level = What roast level of coffee do you prefer?
  • caffeine = How much caffeine do you like in your coffee?
  • expertise = Lastly, how would you rate your own coffee expertise?
  • coffee_a_bitterness = Coffee A - Bitterness
  • coffee_a_acidity = Coffee A - Acidity
  • coffee_a_personal_preference = Coffee A - Personal Preference
  • coffee_a_notes = Coffee A - Notes
  • coffee_b_bitterness = Coffee B - Bitterness
  • coffee_b_acidity = Coffee B - Acidity
  • coffee_b_personal_preference = Coffee B - Personal Preference
  • coffee_b_notes = Coffee B - Notes
  • coffee_c_bitterness = Coffee C - Bitterness
  • coffee_c_acidity = Coffee C - Acidity
  • coffee_c_personal_preference = Coffee C - Personal Preference
  • coffee_c_notes = Coffee C - Notes
  • coffee_d_bitterness = Coffee D - Bitterness
  • coffee_d_acidity = Coffee D - Acidity
  • coffee_d_personal_preference = Coffee D - Personal Preference
  • coffee_d_notes = Coffee D - Notes
  • prefer_abc = Between Coffee A, Coffee B, and Coffee C which did you prefer?
  • prefer_ad = Between Coffee A and Coffee D, which did you prefer?
  • prefer_overall = Lastly, what was your favorite overall coffee?
  • wfh = Do you work from home or in person?
  • total_spend = In total, how much money do you typically spend on coffee in a month?
  • why_drink = Why do you drink coffee?
  • why_drink_other = Other reason for drinking coffee
  • taste = Do you like the taste of coffee?
  • know_source = Do you know where your coffee comes from?
  • most_paid = What is the most you've ever paid for a cup of coffee?
  • most_willing = What is the most you'd ever be willing to pay for a cup of coffee?
  • value_cafe = Do you feel like you’re getting good value for your money when you buy coffee at a cafe?
  • spent_equipment = Approximately how much have you spent on coffee equipment in the past 5 years?
  • value_equipment = Do you feel like you’re getting good value for your money when you buy coffee at a cafe?
  • gender = Gender
  • gender_specify = Gender (please specify)
  • education_level = Education Level
  • ethnicity_race = Ethnicity/Race
  • ethnicity_race_specify = Ethnicity/Race (please specify)
  • employment_status = Employment Status
  • number_children = Number of Children
  • political_affiliation = Political Affiliation

Acknowledgement

Datasource: The data is collected thorugh a survey called The Great American Coffee Taste Test held by James Haffmann

Inspiration: [Great American Coffee...

Search
Clear search
Close search
Google apps
Main menu