100+ datasets found
  1. c

    Wine Dataset

    • cubig.ai
    Updated May 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Wine Dataset [Dataset]. https://cubig.ai/store/products/210/wine-dataset
    Explore at:
    Dataset updated
    May 2, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Wine Dataset is derived from a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The dataset includes 13 attributes such as alcohol, malic acid, ash, and color intensity, providing a comprehensive overview for understanding wine characteristics and aiding in classification tasks.

    2) Data Utilization (1) Wine data has characteristics that: • It includes detailed measurements of wine attributes, allowing for analysis of chemical composition, comparison between different wine types, and identification of patterns in wine quality and flavor profiles. (2) Wine data can be used to: • Wine Industry: Assists winemakers and analysts in understanding the chemical properties that influence wine quality, helping to improve production processes and quality control. • Research: Supports academic studies and the development of classification models for wine quality prediction and analysis.

  2. P

    Wine Dataset

    • paperswithcode.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan N. van Rijn; Jonathan K. Vis, Wine Dataset [Dataset]. https://paperswithcode.com/dataset/wine
    Explore at:
    Authors
    Jan N. van Rijn; Jonathan K. Vis
    Description

    These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wines.

  3. Data from: Red wine DataSet

    • kaggle.com
    Updated Aug 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suraj_kumar_Gupta (2023). Red wine DataSet [Dataset]. https://www.kaggle.com/datasets/soorajgupta7/red-wine-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 21, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Suraj_kumar_Gupta
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    Datasets Description:

    The datasets under discussion pertain to the red and white variants of Portuguese "Vinho Verde" wine. Detailed information is available in the reference by Cortez et al. (2009). These datasets encompass physicochemical variables as inputs and sensory variables as outputs. Notably, specifics regarding grape types, wine brand, and selling prices are absent due to privacy and logistical concerns.

    Classification and Regression Tasks: One can interpret these datasets as being suitable for both classification and regression analyses. The classes are ordered, albeit imbalanced. For instance, the dataset contains a more significant number of normal wines compared to excellent or poor ones.

    Dataset Contents: For a comprehensive understanding, readers are encouraged to review the work by Cortez et al. (2009). The input variables, derived from physicochemical tests, include: 1. Fixed acidity 2. Volatile acidity 3. Citric acid 4. Residual sugar 5. Chlorides 6. Free sulfur dioxide 7. Total sulfur dioxide 8. Density 9. pH 10. Sulphates 11. Alcohol

    The output variable, based on sensory data, is denoted by: 12. Quality (score ranging from 0 to 10)

    Usage Tips: A practical suggestion involves setting a threshold for the dependent variable, defining wines with a quality score of 7 or higher as 'good/1' and the rest as 'not good/0.' This facilitates meaningful experimentation with hyperparameter tuning using decision tree algorithms and analyzing ROC curves and AUC values.

    Operational Workflow: To efficiently utilize the dataset, the following steps are recommended: 1. Utilize a File Reader (for csv) to a linear correlation node and an interactive histogram for basic Exploratory Data Analysis (EDA). 2. Employ a File Reader to a Rule Engine Node for transforming the 10-point scale to a dichotomous variable indicating 'good wine' and 'rest.' 3. Implement a Rule Engine Node output to an input of Column Filter node to filter out the original 10-point feature, thus preventing data leakage. 4. Apply a Column Filter Node output to the input of Partitioning Node to execute a standard train/test split (e.g., 75%/25%, choosing 'random' or 'stratified'). 5. Feed the Partitioning Node train data split output into the input of Decision Tree Learner node. 6. Connect the Partitioning Node test data split output to the input of Decision Tree predictor Node. 7. Link the Decision Tree Learner Node output to the input of Decision Tree Node. 8. Finally, connect the Decision Tree output to the input of ROC Node for model evaluation based on the AUC value.

    Tools and Acknowledgments: For an efficient analysis, consider using KNIME, a valuable graphical user interface (GUI) tool. Additionally, the dataset is available on the UCI machine learning repository, and proper acknowledgment and citation of the dataset source by Cortez et al. (2009) are essential for use.

  4. Spanish Wine Quality Dataset

    • kaggle.com
    Updated Apr 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    fedesoriano (2022). Spanish Wine Quality Dataset [Dataset]. https://www.kaggle.com/datasets/fedesoriano/spanish-wine-quality-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 26, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    fedesoriano
    Description

    Similar Datasets

    • CERN Proton Collision Dataset: LINK
    • Airfoil Self-Noise Dataset: LINK
    • CERN Electron Collision Data: LINK
    • Wind Speed Prediction Dataset: LINK
    • Stellar Classification Dataset - SDSS17: LINK

    Context

    This dataset is related to red variants of spanish wines. The dataset describes several popularity and description metrics their effect on it's quality. The datasets can be used for classification or regression tasks. The classes are ordered and not balanced (i.e. the quality goes from almost 5 to 4 points). The task is to predict either the quality of wine or the prices using the given data.

    Content

    The dataset contains 7500 different types of red wines from Spain with 11 features that describe their price, rating, and even some flavor description. The was collected by me using web scraping from different sources (from wine specialized pages to supermarkets). Please acknowledge the hard work to obtain and create this dataset, you can upvote it if you find it useful to use on your projects :)

    If the dataset becomes popular I will probably try to create a bigger version with wines from other countries and a wider spectrum of ratings.

    Attribute Information

    1. winery: Winery name
    2. wine: Name of the wine
    3. year: Year in which the grapes were harvested
    4. rating: Average rating given to the wine by the users [from 1-5]
    5. num_reviews: Number of users that reviewed the wine
    6. country: Country of origin [Spain]
    7. region: Region of the wine
    8. price: Price in euros [€]
    9. type: Wine variety
    10. body: Body score, defined as the richness and weight of the wine in your mouth [from 1-5]
    11. acidity: Acidity score, defined as wine's “pucker” or tartness; it's what makes a wine refreshing and your tongue salivate and want another sip [from 1-5]

    Citation Request

    If you want to cite this data:

    fedesoriano. (April 2022). Spanish Wine Quality Dataset. Retrieved [Date Retrieved] from https://www.kaggle.com/datasets/fedesoriano/spanish-wine-quality-dataset

  5. Wine Quality Full

    • figshare.com
    txt
    Updated Jul 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepchecks Data (2022). Wine Quality Full [Dataset]. http://doi.org/10.6084/m9.figshare.20223303.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 4, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Deepchecks Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description
  6. T

    wine_quality

    • tensorflow.org
    • beta.dataverse.org
    • +1more
    Updated Nov 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). wine_quality [Dataset]. https://www.tensorflow.org/datasets/catalog/wine_quality
    Explore at:
    Dataset updated
    Nov 23, 2022
    Description

    Two datasets were created, using red and white wine samples. The inputs include objective tests (e.g. PH values) and the output is based on sensory data (median of at least 3 evaluations made by wine experts). Each expert graded the wine quality between 0 (very bad) and 10 (very excellent). Several data mining methods were applied to model these datasets under a regression approach. The support vector machine model achieved the best results. Several metrics were computed: MAD, confusion matrix for a fixed error tolerance (T), etc. Also, we plot the relative importances of the input variables (as measured by a sensitivity analysis procedure).

    The two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. For more details, consult: http://www.vinhoverde.pt/en/ or the reference [Cortez et al., 2009]. Due to privacy and logistic issues, only physicochemical (inputs) and sensory (the output) variables are available (e.g. there is no data about grape types, wine brand, wine selling price, etc.).

    Number of Instances: red wine - 1599; white wine - 4898

    Input variables (based on physicochemical tests):

    1. fixed acidity
    2. volatile acidity
    3. citric acid
    4. residual sugar
    5. chlorides
    6. free sulfur dioxide
    7. total sulfur dioxide
    8. density
    9. pH
    10. sulphates
    11. alcohol

    Output variable (based on sensory data):

    1. quality (score between 0 and 10)

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('wine_quality', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  7. Data from: Wine Quality

    • kaggle.com
    Updated Jul 9, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raj Parmar (2018). Wine Quality [Dataset]. https://www.kaggle.com/rajyellow46/wine-quality/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 9, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Raj Parmar
    Description

    Data Set Information:

    The dataset was downloaded from the UCI Machine Learning Repository.

    The two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. The reference [Cortez et al., 2009]. Due to privacy and logistic issues, only physicochemical (inputs) and sensory (the output) variables are available (e.g. there is no data about grape types, wine brand, wine selling price, etc.).

    These datasets can be viewed as classification or regression tasks. The classes are ordered and not balanced (e.g. there are munch more normal wines than excellent or poor ones). Outlier detection algorithms could be used to detect the few excellent or poor wines. Also, we are not sure if all input variables are relevant. So it could be interesting to test feature selection methods.

    Two datasets were combined and few values were randomly removed.

    Attribute Information:

    For more information, read [Cortez et al., 2009]. Input variables (based on physicochemical tests): 1 - fixed acidity 2 - volatile acidity 3 - citric acid 4 - residual sugar 5 - chlorides 6 - free sulfur dioxide 7 - total sulfur dioxide 8 - density 9 - pH 10 - sulphates 11 - alcohol Output variable (based on sensory data): 12 - quality (score between 0 and 10)

    Acknowledgements:

    P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

  8. Wine_Test Prediction | 1600 data | yashaswi

    • kaggle.com
    Updated May 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ayushman Yashaswi (2025). Wine_Test Prediction | 1600 data | yashaswi [Dataset]. https://www.kaggle.com/datasets/ayushmanyashaswi/wine-test-prediction-1600-data-yashaswi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 19, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ayushman Yashaswi
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Sure! Here's the updated Kaggle dataset description with your data visualization work included:

    📊 Wine Quality - Red Wine Dataset

    This dataset contains physicochemical attributes of red variants of Portuguese "Vinho Verde" wine, along with their quality score (rated between 0 to 10). The goal is to predict wine quality using various classification models based on the chemical properties of the wine.

    🧪 Features Overview (12 columns):

    • fixed acidity: most acids involved with wine are fixed/nonvolatile
    • volatile acidity: amount of acetic acid (can affect taste)
    • citric acid: adds freshness and flavor
    • residual sugar: sugar left after fermentation
    • chlorides: salt content
    • free sulfur dioxide: protects wine from microbes
    • total sulfur dioxide: total SO₂ content
    • density: wine density
    • pH: acidity level
    • sulphates: preservative and antimicrobial
    • alcohol: alcohol percentage
    • quality (target): wine quality score (0–10)

    🤖 Model Performance Summary:

    Multiple machine learning models were trained to predict wine quality. The following accuracy scores were observed:

    ModelTraining AccuracyTesting Accuracy
    Logistic Regression87.91%87.0%
    Random Forest100%94.0%
    Decision Tree100%88.5%
    Support Vector Machine (SVM)86.41%86.5%

    📈 Data Visualization:

    A comparison plot of model performance was created to visually represent the accuracy of each algorithm. This helps in understanding which models generalized well and which ones may have overfit to the training data.

    📁 File Info:

    • Filename: winequality-red.csv
    • Size: ~100 KB
    • Rows: 1,599
    • Columns: 12

    📌 Ideal For:

    • Classification model evaluation
    • Feature correlation analysis
    • EDA and visualization
    • ML model tuning and comparison
  9. A

    ‘Wine Dataset for Clustering’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Wine Dataset for Clustering’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-wine-dataset-for-clustering-14e0/9640816e/
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Wine Dataset for Clustering’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/harrywang/wine-dataset-for-clustering on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    This dataset is adapted from the Wine Data Set from https://archive.ics.uci.edu/ml/datasets/wine by removing the information about the types of wine for unsupervised learning.

    The following descriptions are adapted from the UCI webpage:

    These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wines.

    The attributes are:

    • Alcohol
    • Malic acid
    • Ash
    • Alcalinity of ash
    • Magnesium
    • Total phenols
    • Flavanoids
    • Nonflavanoid phenols
    • Proanthocyanins
    • Color intensity
    • Hue
    • OD280/OD315 of diluted wines
    • Proline

    --- Original source retains full ownership of the source dataset ---

  10. UCI Wine Dataset (edit)

    • kaggle.com
    Updated Sep 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aaron Tanjaya (2020). UCI Wine Dataset (edit) [Dataset]. https://www.kaggle.com/aarontanjaya/uci-wine-dataset-edit/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 8, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aaron Tanjaya
    Description

    This data is taken from UCI's Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Wine For a slightly edited csv version (added column names): https://www.kaggle.com/aarontanjaya/uci-wine-dataset-edit

    the data is donated by:

    Original Owners:

    Forina, M. et al, PARVUS - An Extendible Package for Data Exploration, Classification and Correlation. Institute of Pharmaceutical and Food Analysis and Technologies, Via Brigata Salerno, 16147 Genoa, Italy.

    Donor:

    Stefan Aeberhard, email: stefan '@' coral.cs.jcu.edu.au

    Data Set Information:

    These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wines.

    I think that the initial data set had around 30 variables, but for some reason I only have the 13 dimensional version. I had a list of what the 30 or so variables were, but a.) I lost it, and b.), I would not know which 13 variables are included in the set.

    The attributes are (dontated by Riccardo Leardi, riclea '@' anchem.unige.it ) 1) Alcohol 2) Malic acid 3) Ash 4) Alcalinity of ash 5) Magnesium 6) Total phenols 7) Flavanoids 8) Nonflavanoid phenols 9) Proanthocyanins 10)Color intensity 11)Hue 12)OD280/OD315 of diluted wines 13)Proline

    In a classification context, this is a well posed problem with "well behaved" class structures. A good data set for first testing of a new classifier, but not very challenging.

  11. h

    wine-labels

    • huggingface.co
    Updated Mar 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zuppichini (2023). wine-labels [Dataset]. https://huggingface.co/datasets/Francesco/wine-labels
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 30, 2023
    Authors
    Zuppichini
    License

    https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/

    Description

    Dataset Card for wine-labels

    ** The original COCO dataset is stored at dataset.tar.gz**

      Dataset Summary
    

    wine-labels

      Supported Tasks and Leaderboards
    

    object-detection: The dataset can be used to train a model for Object Detection.

      Languages
    

    English

      Dataset Structure
    
    
    
    
    
      Data Instances
    

    A data point comprises an image and its object annotations. { 'image_id': 15, 'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB… See the full description on the dataset page: https://huggingface.co/datasets/Francesco/wine-labels.

  12. Average wine bottle price in the U.S. in 2023, by origin state

    • statista.com
    • ai-chatbox.pro
    Updated Nov 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2024). Average wine bottle price in the U.S. in 2023, by origin state [Dataset]. https://www.statista.com/topics/1541/wine-market/
    Explore at:
    Dataset updated
    Nov 29, 2024
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    United States
    Description

    Wine originating from the state of Oregon had the highest average price for a 750ml bottle in the United States in 2023 at 17.37 U.S. dollars. In comparison, wine from California averaged 8.48 dollars per bottle.

  13. F

    France Wines: Approved for Circulation: AOC & VDQS: Dept: Sevres (Deux)

    • ceicdata.com
    Updated Apr 24, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2018). France Wines: Approved for Circulation: AOC & VDQS: Dept: Sevres (Deux) [Dataset]. https://www.ceicdata.com/en/france/wine-statistics
    Explore at:
    Dataset updated
    Apr 24, 2018
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2017 - Feb 1, 2018
    Area covered
    France
    Variables measured
    Agricultural, Fishery and Forestry Inventory
    Description

    Wines: Approved for Circulation: AOC & VDQS: Dept: Sevres (Deux) data was reported at 465.000 hl in Apr 2018. This records a decrease from the previous number of 511.000 hl for Mar 2018. Wines: Approved for Circulation: AOC & VDQS: Dept: Sevres (Deux) data is updated monthly, averaging 18,398.000 hl from Aug 2002 (Median) to Apr 2018, with 188 observations. The data reached an all-time high of 120,078.000 hl in Feb 2013 and a record low of 154.000 hl in Jul 2013. Wines: Approved for Circulation: AOC & VDQS: Dept: Sevres (Deux) data remains active status in CEIC and is reported by General Directorate of Customs and Excise. The data is categorized under Global Database’s France – Table FR.B013: Wine Statistics.

  14. d

    Compendium of Grape and Wine Data for Australia's Wine Regions, 1999-2008

    • data.sa.gov.au
    • data.wu.ac.at
    Updated Jun 12, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2014). Compendium of Grape and Wine Data for Australia's Wine Regions, 1999-2008 [Dataset]. https://data.sa.gov.au/data/dataset/compendium-of-grape-and-wine-data-for-australia-s-wine-regions-1999-2008
    Explore at:
    Dataset updated
    Jun 12, 2014
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Australia
    Description

    A compendium of data on wine and grape production in winegrape bearing regions in Australia. There are four sheets that record data by region: 1) wine variables; 2) yield for 2006 and 2008; 3) time series data from 1999-2008; and 4) data on water usage by state. The data include, for example, statistics on grape and wine employment and value of grape and wine output. Dataset to be attributed to The University of Adelaide.

  15. n

    Data from: Wine lactone

    • webbook.nist.gov
    Updated May 10, 2009
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2009). Wine lactone [Dataset]. https://webbook.nist.gov/cgi/cbook.cgi?InChI=1/C10H14O2/c1-6-3-4-8-7(2)10(11)12-9(8)5-6/h5,7-9H,3-4H2,1-2H3/t7-,8-,9-/m1/s1
    Explore at:
    Dataset updated
    May 10, 2009
    Dataset provided by
    National Institute of Standards and Technology
    License

    https://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRDhttps://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRD

    Description

    This page, "Wine lactone", is part of the NIST Chemistry WebBook. This site and its contents are part of the NIST Standard Reference Data Program.

  16. Z

    Data from: Dataset on the characterization of the flavor of two red wine...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guérin, Laurence (2020). Dataset on the characterization of the flavor of two red wine varieties using sensory descriptive analysis, volatile organic compounds quantitative analysis by GC-MS and odorant composition by GC-MS-O [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_1213609
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Prost, Carole
    Thomas-Danguin, Thierry
    Roche, Alice
    Courcoux, Philippe
    Villière, Angélique
    Symoneaux, Ronan
    Guérin, Laurence
    Perrot, Nathalie
    Eslami, Aïda
    Le Fur, Yves
    Vigneau, Evelyne
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains data that were collected on 2 sets of 8 French red wines from two grape varieties, Pinot Noir (PN) and Cabernet Franc (CF). It provides, for the 16 wines, (i) sensory descriptive data obtained with a trained panel, (ii) volatile organic compounds (VOC) quantification data obtained by Gas Chromatography–Mass Spectrometry (GC-MS) and (iii) odorant composition obtained by Gas Chromatography–Mass Spectrometry–Olfactometry (GC-MS-O).

    The dataset is a Microsoft Excel Worksheet containing 8 sheets.

    • Sheet 1: Information

    Gives information about the sheets contained in this .xlsx file

    • Sheet 2: Experimental_factors

    Each row represents a wine

    Each column corresponds to an experimental factors of the wines (Grape variety, Vintage and Protected Designation of Origin)

    • Sheet 3: List_sensory_descriptors

    Lists the 33 sensory descriptors used for the sensory descriptive analysis of the wines

    • Sheet 4: Sensory_descriptive_analysis

    Each row represents a wine

    Each column corresponds to a condition (2640 columns)

    Senso_(ortho or retro)_(Panelist1 to Panelist 16)_(1 to 33 Sensory descriptors)_(1 to 3 repetitions for ortho and 1 to 2 repetitions for retro)

    For the ortho (orthonasal) measurements, there is 16 panelists, 33 sensory descriptors and 3 repetitions = 1584 columns

    For the retro (retronasal) measurements, there is 16 panelists, 33 sensory descriptors and 2 repetitions = 1056 columns

    Each cell contains a sensory measurement for the corresponding condition in the corresponding wine

    • Sheet 5: List_VOC

    Lists the 45 VOC quantified in the wines with their corresponding CAS number

    VOC: Volatil Organic Compounds

    • Sheet 6: VOC_quantification

    Each row represents a wine

    Each column corresponds to a VOC (45 columns)

    Each cell contains the quantification of the corresponding VOC in the corresponding wine

    • Sheet 7: List_GC-MS-O

    Lists the 49 odor-active compounds identified with their corresponding CAS number and the 34 compounds identified by their apex indice

    • Sheet 8: GC-MS-O

    Each row represents a wine

    Each column corresponds to an odor-active compound identified by its CAS number or by its Apex indice if the compound was not identify (81 odor-active compounds) + the number of judges who smelled the compound and its description (by 8 judges) = 9 columns per odor-active compound for a total of 729 columns

  17. m

    Data from: Electronic nose dataset for detection of wine spoilage thresholds...

    • data.mendeley.com
    • search.datacite.org
    Updated Apr 2, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JUAN CARLOS RODRIGUEZ GAMBOA (2019). Electronic nose dataset for detection of wine spoilage thresholds [Dataset]. http://doi.org/10.17632/vpc887d53s.3
    Explore at:
    Dataset updated
    Apr 2, 2019
    Authors
    JUAN CARLOS RODRIGUEZ GAMBOA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is related to the research paper "Wine quality rapid detection using a compact electronic nose system: application focused on spoilage thresholds by acetic acid" published in LWT journal (available online from April 1, 2019, https://doi.org/10.1016/j.lwt.2019.03.074), and the data paper "Electronic nose dataset for detection of wine spoilage thresholds" submitted to Data in Brief journal. For more details read the mentioned articles and cite our work whether found useful.

    The recorded time series was acquired at the sampling frequency of 18.5Hz during 180 seconds, resulting in 3330 data points per sensor.

    Each file in the dataset has eight columns: relative humidity (%), temperature (°C), and the resistance readings in kΩ of the six gas sensors: MQ-3, MQ-4, MQ-6, MQ-3, MQ-4, MQ-6.

    We organized the database in three folders for the wines: AQ_Wines, HQ_Wines, LQ_Wines; and one folder for the ethanol: Ethanol. Each folder contains text files that correspond to different measurements.

    The filename identify the wine measurement as follows: the first 2 characters of the filename are an identifier of the spoilage wine threshold (AQ: average-quality, HQ: high-quality, LQ: low-quality); characters 4-9 indicate the wine brand; characters 11-13 indicate the bottle, and the last 3 characters indicate the repetition (another sample of the same bottle). For example, file LQ_Wine01-B01_R01 contains the time series recorded when low-quality wine of the brand 01, bottle 01, sample 01 was measured.

    The filenames into the Ethanol folder identify the measurements at different concentrations: the first 2 characters of the filename are an identifier of Ethanol (Ea); characters 4-5 indicate the concentration in v/v (C1: 1%, C2: 2.5%, C3: 5%, C4: 10%, C5: 15%, C6: 20%); and the last 3 characters indicate the repetition. For example, file Ea-C1_R01 contains time series acquired when Ethanol at 1% v/v of concentration, sample 01 was measured.

  18. French wine dataset to mapping the expected harvest value by county

    • zenodo.org
    bin, csv
    Updated Jun 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martial Phélippé-Guinvarc'h; Martial Phélippé-Guinvarc'h (2024). French wine dataset to mapping the expected harvest value by county [Dataset]. http://doi.org/10.5281/zenodo.12548961
    Explore at:
    bin, csvAvailable download formats
    Dataset updated
    Jun 28, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martial Phélippé-Guinvarc'h; Martial Phélippé-Guinvarc'h
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jun 26, 2024
    Area covered
    French, France
    Description

    This database is built from open data as described in the paper entitled ‘French wine: Combination of multiple open data sources to mapping the expected harvest value’ (2024).

    CODE_CULTU

    Crop code of the graphic land registry database

    CodeCdC

    Crop code in Multi Perils Crop Insurance specification

    Harvest Value B

    Harvest value (€/ha organic wine)

    Harvest Value C

    Harvest value (€/ha no-organic wine)

    IDA

    ID of geographical areas of INAO

    Insee_Com

    County code (INSEE)

    Label_CdC

    Crop label in Multi Perils Crop Insurance specification

    Label_Dpt

    Department

    Label_Insee_com

    County

    Label_RA

    Agricultural Region (AGRESTE)

    Label_appellation

    Appellation (INAO)

    Label_code3

    Crop (FADN)

    Label_cvi

    Wine name (vineyard register of customs services)

    Label_idGeo

    Geographical ID of Quality Sign (INAO)

    PxBaremAOP

    Price listed in Multi Perils Crop Insurance specification (€/hl no-organic)

    PxBaremAOPBio

    Price listed in Multi Perils Crop Insurance specification (€/hl organic)

    RdtMOAOP

    Harvest wine yield (hl/ha)

    SurfaceModel

    Surface of wine as fitted by model

    code3

    Crop code (FADN)

    code_dept

    Department code

    code_regag

    Code of Agricultural Region (AGRESTE)

    cvi

    Wine code (vineyard register of customs services)

    id_appellation

    Appellation code (INAO)

    id_denomination_geo

    Geographical ID of Quality Sign (INAO)

    Find here the relative research paper :

    https://univ-lemans.hal.science/hal-04627672

    Please find below the list of the sites where used data could be found (lasted view the June 26, 2024).

    https://agreste.agriculture.gouv.fr/agreste-web/methodon/Z.1/!searchurl/listeTypeMethodon/

    https://www.casd.eu/source/reseau-dinformation-comptable-agricole/?tab=16

    https://www.douane.gouv.fr/la-douane/opendata?f%5B0%5D=categorie_opendata_facet%3A467

    www.inao.gouv.fr

    https://www.data.gouv.fr/fr/datasets/?q=inao

    https://maisons-champagne.com/fr/appellation/aire-geographique/

    https://info.agriculture.gouv.fr/boagri/document_administratif-4b9ef75e-29a7-449d-9e40-7e5253bfd642/telechargement

    https://agreste.agriculture.gouv.fr/agreste-web/download/publication/publie/Dos2203/2Pages%20de%20Dossier2022-3_CCAN_ChapitreII.pdf

  19. o

    Data from: BrainGut_WineUp daily lifelike images [Dataset]

    • explore.openaire.eu
    Updated Jan 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Begoña Bartolomé; M. Victoria Moreno-Arribas; Lara Lloret Iglesias; Fernando Aguilar; Miriam Cobo Cano; Daniel García Díaz; Ignacio Heredia; Silvia Yuste; Patricia Pérez-Matute; María-José Motilva (2022). BrainGut_WineUp daily lifelike images [Dataset] [Dataset]. http://doi.org/10.20350/digitalcsic/14816
    Explore at:
    Dataset updated
    Jan 1, 2022
    Authors
    Begoña Bartolomé; M. Victoria Moreno-Arribas; Lara Lloret Iglesias; Fernando Aguilar; Miriam Cobo Cano; Daniel García Díaz; Ignacio Heredia; Silvia Yuste; Patricia Pérez-Matute; María-José Motilva
    Description

    The DATASET compiles 1.945 files corresponding to individual images of glasses containing red wine. Each file name is unique and contains information of the parameters under which the photograph was taken (see DATA-SPECIFIC INFOR-MATION for details). For example: The file Rea_Rio_C_Bor_175_nd_nd_fr10_nd_nd_ar2 corresponds to an almost real image (Rea), taken in La Rioja (Rio), of a “crianza wine”(C), in a Bourgogne wine glass (Bor), with a volume of 175 mL (175), taken at none defined time (nd) and undefined lighting (nd), with a real back-ground (fr10) and without reference (nd) nor distance (nd) considerations, and upper angle (ar2). The DATASET compiles 1.945 files corresponding to individual images of glasses containing red wine. The photographs of glasses containing wine were acquired by researchers with different smartphones equipped with high-resolution cameras (12 or 48 MP). Photographs were previously designed considering usual photographic parameters. Each file name is unique and contains information of the parameters under which the photograph was taken. The photographs of glasses containing wine were acquired by researchers with different smartphones equipped with high-resolution cameras (12 or 48 MP). Photographs were previously designed considering usual photographic parameters (see DATA-SPECIFIC INFORMATION for details).-- The data have not been processed. This study was supported by MCIN (Ministerio de Ciencia e Innovación)/AEI (Agencia Estatal de Investigación)/10 .13039 /501100011033through the projects PID2019-108851RB-C21 & PID2019-108851RB-C22. The authors would like to thank CSIC Interdisciplinary Thematic Platform (PTI+) Digital Science and Innovation. Peer reviewed

  20. S

    Data from: WINERIES

    • data.ny.gov
    application/rdfxml +5
    Updated Oct 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York State Liquor Authority (2023). WINERIES [Dataset]. https://data.ny.gov/w/t8fn-3ifp/caer-yrtv?cur=lK5TmBdz-Wj
    Explore at:
    application/rssxml, tsv, csv, xml, application/rdfxml, jsonAvailable download formats
    Dataset updated
    Oct 6, 2023
    Authors
    New York State Liquor Authority
    Description

    Liquor Authority quarterly list of all active licensees in NYS filtered by Winery and Brewery specific License Types.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
CUBIG (2025). Wine Dataset [Dataset]. https://cubig.ai/store/products/210/wine-dataset

Wine Dataset

Explore at:
Dataset updated
May 2, 2025
Dataset authored and provided by
CUBIG
License

https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

Measurement technique
Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
Description

1) Data Introduction • The Wine Dataset is derived from a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The dataset includes 13 attributes such as alcohol, malic acid, ash, and color intensity, providing a comprehensive overview for understanding wine characteristics and aiding in classification tasks.

2) Data Utilization (1) Wine data has characteristics that: • It includes detailed measurements of wine attributes, allowing for analysis of chemical composition, comparison between different wine types, and identification of patterns in wine quality and flavor profiles. (2) Wine data can be used to: • Wine Industry: Assists winemakers and analysts in understanding the chemical properties that influence wine quality, helping to improve production processes and quality control. • Research: Supports academic studies and the development of classification models for wine quality prediction and analysis.

Search
Clear search
Close search
Google apps
Main menu