46 datasets found

h
amazon-product-data-filter
huggingface.co
Updated Nov 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Iftach Arbel (2023). amazon-product-data-filter [Dataset]. https://huggingface.co/datasets/iarbel/amazon-product-data-filter
Explore at:
Dataset updated
Nov 14, 2023
Authors
Iftach Arbel
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset Card for "amazon-product-data-filter"

Dataset Summary

The Amazon Product Dataset contains product listing data from the Amazon US website. It can be used for various NLP and classification tasks, such as text generation, product type classification, attribute extraction, image recognition and more.

Languages

The text in the dataset is in English.

Dataset Structure Data Instances

Each data point provides product information, such… See the full description on the dataset page: https://huggingface.co/datasets/iarbel/amazon-product-data-filter.
h
amazon-appliances-data-subset
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dhruv Srivastava, amazon-appliances-data-subset [Dataset]. https://huggingface.co/datasets/grudgie/amazon-appliances-data-subset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Dhruv Srivastava
Description
grudgie/amazon-appliances-data-subset dataset hosted on Hugging Face and contributed by the HF Datasets community
M
K-12 Education Digital Signage Market By Key Players (Amazon AWS, UCView,...
marketresearchstore.com
pdf
Updated Jul 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Store (2025). K-12 Education Digital Signage Market By Key Players (Amazon AWS, UCView, Samsung Electronics, AVI Systems); Global Report by Size, Share, Industry Analysis, Growth Trends, Regional Outlook, and Forecast 2024-2032 [Dataset]. https://www.marketresearchstore.com/market-insights/k-12-education-digital-signage-market-818677
Explore at:
pdfAvailable download formats
Dataset updated
Jul 2, 2025
Dataset authored and provided by
Market Research Store
License
https://www.marketresearchstore.com/privacy-statementhttps://www.marketresearchstore.com/privacy-statement
Time period covered
2022 - 2030
Area covered
Global
Description
[Keywords] Market include TouchIT Technologies, UCView, Mvix, AVI Systems, Samsung Electronics
h
Data from: Amazon-beauty
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MultifacetedNLPDatasets, Amazon-beauty [Dataset]. https://huggingface.co/datasets/recmeapp/Amazon-beauty
Explore at:
Authors
MultifacetedNLPDatasets
Description
recmeapp/Amazon-beauty dataset hosted on Hugging Face and contributed by the HF Datasets community
Data from: The structure of the Mini-K and K-SF-42: a psychological network...
zenodo.org
datadryad.org
csv, txt
Updated Jun 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joseph Manson; Joseph Manson; Kristine Chua; Aaron Lukaszewski; Kristine Chua; Aaron Lukaszewski (2022). Data from: The structure of the Mini-K and K-SF-42: a psychological network approach [Dataset]. http://doi.org/10.5068/d1q378
Explore at:
txt, csvAvailable download formats
Unique identifier
https://doi.org/10.5068/d1q378
Dataset updated
Jun 2, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Joseph Manson; Joseph Manson; Kristine Chua; Aaron Lukaszewski; Kristine Chua; Aaron Lukaszewski
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Study-1-data and Study-2-data comprise responses to the Mini-K (Figueredo et al. 2006). The Study 1 participants were Amazon's Mechanincal Turk workers. The Study 2 participants were undergraduates at Oklahoma State University. Study-3-data comprises reponses to the K-SF-42 (Figueredo et al. 2017). Participants were Amazon's Mechanincal Turk workers. See the paper for additional information.

R-code contains the code used to run the network analyses described in the paper. "Datafile" represents the file name of the data set being analyzed.
Automatic weather station data from AWS9 collected during 2020 at the...
doi.pangaea.de
html, tsv
Updated Nov 2, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul C J P Smeets; Michiel R van den Broeke; Wim Boot; Giorgio Cover; Mark Eijkelboom; Wouter Greuell; Carleen H Tijm-Reijmer; Henk Snellen; Roderik S W van de Wal (2022). Automatic weather station data from AWS9 collected during 2020 at the Greenland ice sheet along the K-transect, West-Greenland [Dataset]. http://doi.org/10.1594/PANGAEA.950110
Explore at:
html, tsvAvailable download formats
Unique identifier
https://doi.org/10.1594/PANGAEA.950110
Dataset updated
Nov 2, 2022
Dataset provided by
PANGAEA
Authors
Paul C J P Smeets; Michiel R van den Broeke; Wim Boot; Giorgio Cover; Mark Eijkelboom; Wouter Greuell; Carleen H Tijm-Reijmer; Henk Snellen; Roderik S W van de Wal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 2020 - Dec 31, 2020
Area covered

Variables measured
DATE/TIME, Wind speed, Identification, Logger voltage, Wind direction, Height, relative, Temperature, air, Humidity, relative, Pressure, atmospheric, Temperature, technical, and 5 more
Description
The timeseries constitutes two different types of AWS. The first is a standard type modular AWS described extensively in https://doi.org/10.1080/15230430.2017.1420954.
The second type (operational from Aug2016 at AWS5, Aug2015 at AWS6, Aug2014 at AWS9, Aug2014 at AWS10) is a very compact IMAU design AWS consisting of one integrated module containing the datalogger, energy system and multiple sensors. In addition to the datalogger unit there are also 2 independent sensors dedicated to wind speed/direction and radiation (a Young prop/vane, CNR4 radiation sensor). All three units are mounted 3 to 4m above the surface at one mast boom. […]
t
Solid phase sediment and pore water data (Al, Fe, K, nitrate, organic...
service.tib.eu
doi.pangaea.de
Updated Nov 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Solid phase sediment and pore water data (Al, Fe, K, nitrate, organic carbon) from the Amazon continental shelf (cruise M147) [Dataset]. https://service.tib.eu/ldmservice/dataset/png-doi-10-1594-pangaea-950248
Explore at:
Dataset updated
Nov 29, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Sediment and pore water samples were collected during the M147 cruise of Research Vessel Meteor in April and May 2018. Additional sediment samples (GeoB 4417-5 and GeoB 4409-2) were collected during the M38-2 cruise in March 1997. Total element concentrations (Fe, Al, K) of the solid phase were measured after acid digestion (HF, HNO3 and HClO4) by inductively coupled plasma optical emission spectrometry (Varian ICP 720-ES). Solid phase iron speciation data were measured following single step sodium dithionite extraction (FeD) or sequential Fe extraction (FeAc, FeDith, FeOxal) by inductively coupled plasma optical emission spectrometry (Varian ICP 720-ES). Solid phase pyrite concentrations (FePy) were calculated stoichiometrically from photometrically measured S2- released via chromium(II) chloride reduction. Total organic carbon (TOC) of the sediment samples was measured in an Elemental Analyzer (Euro EA). Prior to analysis carbon bound to carbonate minerals was removed by leaching the sediment with 0.25 N HCl. Pore water nitrate concentrations were measured on board with a SEAL QuAAtro continuous flow auto analyzer. Pore water samples for dissolved element analysis were acidified with HCl to pH < 2 after sampling. Depending on the concentration range, pore water K and Fe was measured by inductively coupled plasma optical emission spectrometry (Varian 720 ES) or inductively coupled plasma mass spectrometry (Agilent 7500).
P
Amazon Review Dataset
paperswithcode.com
Updated Apr 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Amazon Review Dataset [Dataset]. https://paperswithcode.com/dataset/amazon-review
Explore at:
Dataset updated
Apr 9, 2023
Description
Amazon Review is a dataset to tackle the task of identifying whether the sentiment of a product review is positive or negative. This dataset includes reviews from four different merchandise categories: Books (B) (2834 samples), DVDs (D) (1199 samples), Electronics (E) (1883 samples), and Kitchen and housewares (K) (1755 samples).
h
amazon-products
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CK, amazon-products [Dataset]. https://huggingface.co/datasets/ckandemir/amazon-products
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
CK
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset Creation and Processing Overview

This dataset underwent a comprehensive process of loading, cleaning, processing, and preparing, incorporating a range of data manipulation and NLP techniques to optimize its utility for machine learning models, particularly in natural language processing.

Data Loading and Initial Cleaning

Source: Loaded from the Hugging Face dataset repository bprateek/amazon_product_description. Conversion to Pandas DataFrame: For ease of data… See the full description on the dataset page: https://huggingface.co/datasets/ckandemir/amazon-products.
h
amazon-review-description
huggingface.co
Updated Oct 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TPP-LLM (2024). amazon-review-description [Dataset]. https://huggingface.co/datasets/tppllm/amazon-review-description
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 21, 2024
Dataset authored and provided by
TPP-LLM
License
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Description
Amazon Review Description Dataset

This dataset contains Amazon reviews from January 1, 2018, to June 30, 2018. It includes 2,245 sequences with 127,054 events across 18 category types. The original data is available at Amazon Review Data with citation information provided on the page. The detailed data preprocessing steps used to create this dataset can be found in the TPP-LLM paper and TPP-LLM-Embedding paper. If you find this dataset useful, we kindly invite you to cite the… See the full description on the dataset page: https://huggingface.co/datasets/tppllm/amazon-review-description.
E
Fine litterfall production and nutrient composition data from a fertilized...
catalogue.ceh.ac.uk
hosted-metadata.bgs.ac.uk
+2more
zip
Updated Feb 13, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
A.C.M. Moraes; C.A. Quesada; K. Andersen; I.P. Hartley; M.P. Martins (2020). Fine litterfall production and nutrient composition data from a fertilized site in Central Amazon, Brazil [Dataset]. http://doi.org/10.5285/c0294ec9-45d6-464c-b543-ce9ece9fd968
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5285/c0294ec9-45d6-464c-b543-ce9ece9fd968
Dataset updated
Feb 13, 2020
Dataset provided by
NERC EDS Environmental Information Data Centre
Authors
A.C.M. Moraes; C.A. Quesada; K. Andersen; I.P. Hartley; M.P. Martins
Time period covered
Jul 1, 2017 - Jul 31, 2019
Area covered

Description
The data consists of litterfall production in a fertilised old growth forest in Central Amazon. Data was collected in a full factorial nutrient addition experiment (nitrogen, phosphorus and cation treatments). Within each plot we have installed five litter traps of 50 cm x 50 cm, 1 m above ground, occupying an area of 1.25 m2 per plot, and ensuring litter reaching the trap was produced within the experimental plot area. The study was funded by NERC, BDFFP (logistical support) and the Brazilian government (students scholarship).
o
E-commerce Headphone Sentiment Dataset
opendatabay.com
.undefined
Updated Jul 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Datasimple (2025). E-commerce Headphone Sentiment Dataset [Dataset]. https://www.opendatabay.com/data/ai-ml/eed974c6-d221-4eb3-85f6-51e99839a040
Explore at:
.undefinedAvailable download formats
Dataset updated
Jul 5, 2025
Dataset authored and provided by
Datasimple
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Reviews & Ratings
Description
This dataset contains a collection of Amazon headphone reviews, processed for sentiment analysis. It is a small subset intended to assist in understanding customer opinions and evaluating product perceptions. The data supports analysis of review usefulness, factors influencing helpfulness, and the detection of atypical or potentially misleading reviews.

Columns

Customer_Name: The name of the customer who provided the review.

REVIEW_TITLE: A short summary or title of the customer's review.

Color: The colour of the headphone product being reviewed.

REVIEW_DATE: The specific date when the customer submitted their review.

COMMENTS: Detailed comments from the customer expressing their feelings or observations about the product.

RATINGS: The customer's rating for the product, given on a scale of 1 to 5 stars.

Distribution

This dataset is typically provided in a CSV file format. It comprises approximately 1,500 individual reviews. The structure includes 6 distinct columns, making it readily available for analytical tasks.

Usage

This dataset is ideally suited for: * Conducting sentiment analysis on product reviews. * Exploring factors that influence the perceived helpfulness of a review. * Identifying unusual review patterns or potential outliers. * Applications in Natural Language Processing (NLP), text mining, and exploratory data analysis.

Coverage

The data spans a time range from 28 May 2021 to 13 June 2022. It covers various customer names, including "Amazon Customer" and "Rahul", alongside a large proportion of "Other" customers. Product colours predominantly include "White" and "Black". The ratings are distributed across several ranges, from 1.00-1.40 up to 4.60-5.00. The geographical scope of the data is global.

License

CCO

Who Can Use It

This dataset is beneficial for data scientists, machine learning engineers, business analysts, and researchers interested in: * Developing sentiment analysis models. * Understanding consumer feedback and product performance. * Performing text-based data analysis. * Exploring e-commerce review patterns.

Dataset Name Suggestions

Amazon Headphone Reviews for Sentiment Analysis

Headphone Customer Review Data

E-commerce Headphone Sentiment Dataset

Product Review Analysis Data (Headphones)

Attributes

Original Data Source: HEADPHONE DATASET REVIEW ANALYSIS
Amazon Alexa skills available in selected countries as of January 2021
statista.com
Updated Jan 15, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2021). Amazon Alexa skills available in selected countries as of January 2021 [Dataset]. https://www.statista.com/statistics/917900/selected-countries-amazon-alexa-skill-count/
Explore at:
Dataset updated
Jan 15, 2021
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jan 2021
Area covered
Canada, France, Australia, United Kingdom, Germany, United States, India, Italy, Japan
Description
The total number of Amazon Alexa skills continues to grow at a steady pace in selected countries. As of ************, the skill count for Amazon Alexa has grown to ****** in the United States. The most noticeable jump in the number of skills was noticed in Spain at ****** with the last year just at *****.
h
amazon-qa
huggingface.co
Updated Apr 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sentence Transformers (2025). amazon-qa [Dataset]. https://huggingface.co/datasets/sentence-transformers/amazon-qa
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 7, 2025
Dataset authored and provided by
Sentence Transformers
Description
Dataset Card for Amazon QA

This dataset is a collection of question-answer pairs collected from Amazon QA. See Amazon QA for additional information. This dataset can be used directly with Sentence Transformers to train embedding models.

Dataset Subsets pair subset

Columns: "query", "answer" Column types: str, str Examples:{ 'query': 'What size are the tiles and how thick and what material?', 'answer': 'Tiles are 12" x 12", about 1/2 inch thick and made of… See the full description on the dataset page: https://huggingface.co/datasets/sentence-transformers/amazon-qa.
Data from: LBA-ECO CD-06 CO2 Exchange in River Systems Across the Amazon...
s.cnmilf.com
search.dataone.org
+5more
Updated Jul 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ORNL_DAAC (2025). LBA-ECO CD-06 CO2 Exchange in River Systems Across the Amazon Basin: 2004-2007 [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/lba-eco-cd-06-co2-exchange-in-river-systems-across-the-amazon-basin-2004-2007-4e63b
Explore at:
Dataset updated
Jul 3, 2025
Dataset provided by
Oak Ridge National Laboratory Distributed Active Archive Center
Description
This data set provides measurements of carbon dioxide flux rates (FCO2), gas transfer velocity (k), and partial pressures (pCO2) at 75 sites on rivers and streams of the Amazon River system in South America for the period beginning July 1, 2004, and ending January 23, 2007. Several fieldwork campaigns occurred between June 2004 and January 2007 in the Amazon River basin, with discharge conditions ranging from low to high flow. The sampled areas span the spectrum of chemical characteristics observed across the entire basin, including, for example, both low and high pH values and suspended sediment loads. There is one comma-delimited data file in this data set.
d
Data from: Cold waves in the Amazon rainforest and their ecological impact
search.dataone.org
data.niaid.nih.gov
+1more
Updated Dec 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kim L. Holzmann; Pedro Alonso-Alonso; Yenny Correa-Carmona; Andrea Pinos-Leon; Felipe Yon; Gunnar Brehm; Alexander Keller; Ingolf Steffan-Dewenter; Marcell K. Peters (2024). Cold waves in the Amazon rainforest and their ecological impact [Dataset]. http://doi.org/10.5061/dryad.ns1rn8q31
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.ns1rn8q31
Dataset updated
Dec 25, 2024
Dataset provided by
Dryad Digital Repository
Authors
Kim L. Holzmann; Pedro Alonso-Alonso; Yenny Correa-Carmona; Andrea Pinos-Leon; Felipe Yon; Gunnar Brehm; Alexander Keller; Ingolf Steffan-Dewenter; Marcell K. Peters
Area covered
Amazon Rainforest
Description
Cold waves crossing the Amazon rainforest are a rare phenomenon predicted to increase in intensity under climate change. We here describe an extensive cold wave occurring in June 2023 in Amazonian-Andean forests, compared environmental temperatures to experimentally tested thermal tolerances and its impact on lowland animal communities (insects and wild mammals). While we found strong reductions in abundance of all animal groups under the cold wave, tropical lowland animals showed thermal tolerance limits below the lowest environmental temperatures measured during the cold wave, and abundances of most studied taxa recovered over the next season; nevertheless, small thermal safety margins suggest that an increased intensity of cold waves in the future could imperil animal communities in the Amazon., Temperature data Air temperature was measured at each plot along the elevation gradient with iButton sensors (Analog Devices, Inc, Wilmington, USA) at 1.5 m height every four hours, hanging from a horizontal branch. The sensors were shielded with white plastic dishes (diameter ca. 18 cm) to protect them from rain and direct sunlight [13]. In addition, each plot was equipped with a TOMST4-temperature and soil humidity logger (TOMST s.r.o., Prague, Czech Republic), continuously measuring temperature and soil humidity at 6 cm depth, as well as temperature at the surface and in 15 cm height. Insect data (1) Malaise: At each plot in each field season, one malaise trap was operated for seven days. Malaise traps were based on the Townes Malaise trap model, albeit with a black roof and a slightly smaller size (dimensions of the capture area: height front: 0.90â€‰m; height rear: 0.60â€‰m; length: 1.60â€‰m); Ethanol (96 %) was used as the capture fluid to ensure the preservation of specimens. For each ..., , # Cold waves in the Amazon rainforest and their ecological impact

https://doi.org/10.5061/dryad.ns1rn8q31

Description of the data and file structure

Climate and biodiversity were monitored at three study locations ("plots") in the Peruvian rainforest. We used thermal sensors, pitfall traps, malaise traps, manual netting and camera traps. Thermal tolerance experiments were conducted with a programmable thermoblock.

Files and variables

File: Malaise.xlsx

Description:Â Malaise traps for community biomass of mainly flying insects

Variables

Plot: ID of study site

Field_season: number of sampling roundÂ

dateon: Date of setting up the trap

dateout: Date of removing the trap

Note: special observations

Mass: wet insect biomass in gram

File: dailyTemp_iB.xlsx

Description:Â temperature measured by iButton loggers

Variables

Tmean: daily mean temperature

Tmin: daily minimum temperature

Tmax:...
f
The distribution of tribe by tooth type in the data set.
plos.figshare.com
xls
Updated Jun 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gregory J. Matthews; George K. Thiruvathukal; Maxwell P. Luetkemeier; Juliet K. Brophy (2023). The distribution of tribe by tooth type in the data set. [Dataset]. http://doi.org/10.1371/journal.pone.0179757.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0179757.t002
Dataset updated
Jun 19, 2023
Dataset provided by
PLOS ONE
Authors
Gregory J. Matthews; George K. Thiruvathukal; Maxwell P. Luetkemeier; Juliet K. Brophy
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The distribution of tribe by tooth type in the data set.
A collection of fully-annotated soundscape recordings from the Southwestern...
zenodo.org
data.niaid.nih.gov
csv, pdf, txt, zip
Updated Jul 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
W. Alexander Hopping; W. Alexander Hopping; Stefan Kahl; Stefan Kahl; Holger Klinck; Holger Klinck (2024). A collection of fully-annotated soundscape recordings from the Southwestern Amazon Basin [Dataset]. http://doi.org/10.5281/zenodo.7079124
Explore at:
csv, txt, pdf, zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7079124
Dataset updated
Jul 16, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
W. Alexander Hopping; W. Alexander Hopping; Stefan Kahl; Stefan Kahl; Holger Klinck; Holger Klinck
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This collection contains 21 hour-long soundscape recordings, which have been annotated with 14,798 bounding box labels for 132 different bird species from the Southwestern Amazon Basin. The data were recorded in 2019 in the Inkaterra Reserva Amazonica, Madre de Dios, Peru. This collection has partially been featured as test data in the 2020 BirdCLEF competition and can primarily be used for training and evaluation of machine learning algorithms.

Data collection

This acoustic data was collected at the Inkaterra Reserva Amazonica (ITRA) between January 14th and February 2nd, 2019, during the rainy season. ITRA is a 2 km² lowland rainforest reserve on the banks of the Madre de Dios river, approximately 20 km east of the frontier town of Puerto Maldonado. The region's extraordinary biodiversity is threatened by accelerating rates of deforestation, degradation, and fragmentation, which are driven primarily by expanding road networks, mining, agriculture, and an increasing population. The acoustic data from this site were collected as part of a study designed to assess spatio-temporal variation in avian species richness and vocal activity levels across intact, degraded, and edge forest, and between different days at the same point locations.

Ten SWIFT recording units, provided by the K. Lisa Yang Center for Conservation Bioacoustics at the Cornell Lab of Ornithology, were placed at separate sites spanning edge habitat, degraded forest, and intact forest within the reserve. These omnidirectional recorders were set to record uncompressed WAVE files continuously for the duration of their deployment, with a sampling rate of 48 kHz. The sensitivity of the used microphones was -44 (+/-3) dB re 1 V/Pa. The microphone's frequency response was not measured but is assumed to be flat (+/- 3 dB) in the frequency range 100 Hz to 7.5 kHz. The analog signal was amplified by 35 dB and digitized (16-bit resolution) using an analog-to-digital converter (ADC) with a clipping level of -/+ 0.9 V. For this collection, recordings were resampled at 32 kHz and converted to FLAC. Recorders were placed at a consistent height of approximately 1.5 m above the ground. To minimize background noise, all sites used for data analysis were located at a minimum distance of 450 m from the river.

Sampling and annotation protocol

A total of 21 dawn-hours, from 05:00-06:00 PET (10:00-11:00 UTC), representing 7 of the 10 sites on three randomly-selected dates, were manually annotated. Many neotropical bird species sing almost exclusively during the dawn hour, so this time window was selected to maximize the number of species present in the recordings. A single annotator boxed every bird call he could identify and ignored those that were too faint. Raven Pro software was used to annotate the data. Provided labels contain full bird calls that are boxed in time and frequency. The annotator was allowed to combine multiple consecutive calls of one species into one bounding box label if pauses between calls were shorter than five seconds. In this collection, we use eBird species codes as labels, following the 2021 eBird taxonomy (Clements list). Parts of this dataset have previously been featured in the 2020 BirdCLEF competition.

Files in this collection

Audio recordings can be accessed by downloading and extracting the “soundscape_data.zip” file. Soundscape recording filenames contain a sequential file ID, recording site, date, and timestamp in UTC. As an example, the file “PER_001_S01_20190116_100007Z.flac” has sequential ID 001 and was recorded at site S01 on Jan 16th, 2019 at 10:00:07 UTC. Ground truth annotations are listed in “annotations.csv” where each line specifies the corresponding filename, start and end time in seconds, low and high frequency in Hertz, and an eBird species code. These species codes can be assigned to scientific and common name of a species with the “species.csv” file. Unidentifiable calls have been marked with “????” and are included in the ground truth annotations. The approximate recording location and a short habitat description for all sites can be found in the “recording_location.txt” file.

Acknowledgements

We would like to thank the Inkaterra Association (ITA) staff for providing logistical support and excellent field station facilities, particularly Noe Huaraca, Dennis Osorio, and Kevin Jiménez Gonzales, who helped set up recorders. Noe Huaraca, John Fitzpatrick, Fernando Angulo, Will Sweet, Ken Rosenburg, and Alex Wiebe helped identify unknown vocalizations. Funding for equipment was provided by the K. Lisa Yang Center for Conservation Bioacoustics at the Cornell Lab of Ornithology, with support from Innóvate Perú, CORBIDI, and the Inkaterra Association. Travel expenses were funded by the Cornell Lab of Ornithology.
o
E-commerce Customer Feedback Dataset
opendatabay.com
.undefined
Updated Jul 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Datasimple (2025). E-commerce Customer Feedback Dataset [Dataset]. https://www.opendatabay.com/data/dataset/6051da05-ace4-44ca-baca-85efdd809836
Explore at:
.undefinedAvailable download formats
Dataset updated
Jul 2, 2025
Dataset authored and provided by
Datasimple
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Reviews & Ratings
Description
This dataset provides a collection of randomly selected customer reviews and ratings for various Amazon products. It comprises nearly 1.6 thousand individual reviews, making it a valuable resource for understanding consumer feedback. The primary aim of using this dataset is to identify the main topics within these reviews, enabling better classification for improved search functionality. It is particularly suited for developing algorithms that can differentiate topics based on a body of review text.

Columns

The dataset includes the following fields: * id: A unique identifier for each entry. * asins: The product identification number. * brand: The manufacturer or brand of the product. * categories: The categorisation of the product. * colors: The colour of the product. * dateAdded: The date when the product was first listed or added to the dataset. * dateUpdated: The date when the product's information was last updated. * dimension: The physical dimensions of the product. * ean: The European Article Number (EAN) for the product. * keys: A special assigned key associated with the product.

Distribution

The dataset contains approximately 1.6 thousand reviews. The data is structured in a tabular format, suitable for analysis. Key distributions observed within the dataset include: * Brands: A significant majority of products (99%) are from Amazon, with a smaller portion (1%) from Moshi. * Categories: A notable 34% of products fall under categories such as Amazon Devices, Smart Home, and Voice Assistants, with another 12% simply categorised as Amazon Devices. Other categories account for 54% of the data. * Colours: About 52% of entries have null values for colour, while 42% are recorded as Black. Other colours make up the remaining 6%. * Dates: The date range for products added or updated spans from 17 January 2015 to 13 August 2017, with varying counts of entries across different periods. * Dimensions: 65% of the entries have null dimensions, while 34% specify a dimension of 4.8 inches by 6.6 inches by 3.2 inches.

Usage

This dataset is ideal for a range of applications, including: * Developing and evaluating Topic Modelling Algorithms to categorise customer reviews. * Performing Natural Language Processing (NLP) tasks such as sentiment analysis or keyword extraction from product reviews. * Gaining insights into consumer behaviour and product feedback in the e-commerce sector. * Supporting data clean-up and exploratory data analysis for textual datasets.

Coverage

The dataset's coverage is global, encompassing reviews from various customers. The time range of the data spans from 17 January 2015 to 13 August 2017. No specific demographic details about the customers are provided.

License

CCO

Who Can Use It

This dataset is suitable for: * Data Scientists and Machine Learning Engineers focused on NLP and topic modelling. * Researchers in fields such as e-commerce, consumer studies, and computational linguistics. * Students and beginners in data science looking for a practical dataset for learning and experimentation. * Businesses aiming to understand customer feedback and improve product categorisation.

Dataset Name Suggestions

Amazon Product Reviews Corpus

E-commerce Customer Feedback Dataset

Amazon Ratings and Reviews Data

Product Review Topic Analysis Dataset

Customer Review Dataset for E-commerce

Attributes

Original Data Source: Amazon Product Reviews Dataset
f
Data from: SPATIAL VARIABILITY IN LEAF ANALYSIS AND PRODUCTIVITY OF...
scielo.figshare.com
jpeg
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Felipe O. Ribeiro; Antonio R. Fernandes; Gilson S. B. de Matos; Marcelo M. Lindolfo; Rafael S. Guedes; Graziele R. Rodrigues (2023). SPATIAL VARIABILITY IN LEAF ANALYSIS AND PRODUCTIVITY OF FERTIRRIGATED AÇAÍ [Dataset]. http://doi.org/10.6084/m9.figshare.14279769.v1
Explore at:
jpegAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.14279769.v1
Dataset updated
Jun 1, 2023
Dataset provided by
SciELO journals
Authors
Felipe O. Ribeiro; Antonio R. Fernandes; Gilson S. B. de Matos; Marcelo M. Lindolfo; Rafael S. Guedes; Graziele R. Rodrigues
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ABSTRACT This study aimed to define management zones (MZs) for fertirrigated açaí cultivation, based on spatial variability of the foliar nutrients and productivity data. The work was carried out in an area of 5.75 ha of a 7-year crop, with 80 georeferenced sample points. Fresh fruit productivity and nutrient (N, P, K, Ca, Mg, S, B, Cu, Fe, Mn, and Zn) contents were determined. The average contents of macronutrients were considered adequate for adult açaí plants, and their spatial dependence associated with fruit productivity allowed the representation of their distributions through maps of variability. Through multivariate analysis, three main components were highlighted. These components explained 51.5 % of the total variability of the data, where PC1 showed a higher correlation with Ca, Mg, K, and P. In addition, three MZs were obtained, out of which one with the highest productivity showed the best Ca, Mg, S, B, and Fe leaf contents. Principal component analysis and determination of MZs emphasized Ca and Mg nutrition as being more related to spatial variability and açaí fruit productivity.

Facebook

Twitter

Click to copy link

Link copied

Cite

Iftach Arbel (2023). amazon-product-data-filter [Dataset]. https://huggingface.co/datasets/iarbel/amazon-product-data-filter

amazon-product-data-filter

iarbel/amazon-product-data-filter

Explore at:

Dataset updated

Nov 14, 2023

Authors

Iftach Arbel

License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Dataset Card for "amazon-product-data-filter"

  Dataset Summary

The Amazon Product Dataset contains product listing data from the Amazon US website. It can be used for various NLP and classification tasks, such as text generation, product type classification, attribute extraction, image recognition and more.

  Languages

The text in the dataset is in English.

  Dataset Structure





  Data Instances

Each data point provides product information, such… See the full description on the dataset page: https://huggingface.co/datasets/iarbel/amazon-product-data-filter.

Clear search

Close search

Google apps

Main menu

amazon-product-data-filter

amazon-appliances-data-subset

K-12 Education Digital Signage Market By Key Players (Amazon AWS, UCView,...

Data from: Amazon-beauty

Data from: The structure of the Mini-K and K-SF-42: a psychological network...

Automatic weather station data from AWS9 collected during 2020 at the...

Solid phase sediment and pore water data (Al, Fe, K, nitrate, organic...

Amazon Review Dataset

amazon-products

amazon-review-description

Fine litterfall production and nutrient composition data from a fertilized...

E-commerce Headphone Sentiment Dataset

Columns

Distribution

Usage

Coverage

License

Who Can Use It

Dataset Name Suggestions

Attributes

Amazon Alexa skills available in selected countries as of January 2021

amazon-qa

Data from: LBA-ECO CD-06 CO2 Exchange in River Systems Across the Amazon...

Data from: Cold waves in the Amazon rainforest and their ecological impact

Description of the data and file structure

Files and variables

File: Malaise.xlsx

Variables

File: dailyTemp_iB.xlsx

Variables

The distribution of tribe by tooth type in the data set.

A collection of fully-annotated soundscape recordings from the Southwestern...

E-commerce Customer Feedback Dataset

Columns

Distribution

Usage

Coverage

License

Who Can Use It

Dataset Name Suggestions

Attributes

Data from: SPATIAL VARIABILITY IN LEAF ANALYSIS AND PRODUCTIVITY OF...

amazon-product-data-filterSee More Versions

iarbel/amazon-product-data-filter

amazon-product-data-filter