100+ datasets found

Amazon Customer Database
kaggle.com
Updated Jul 28, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DEBJYOTI SAHA (2021). Amazon Customer Database [Dataset]. https://www.kaggle.com/datasets/debjyotisaha/amazon-customer-database
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 28, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
DEBJYOTI SAHA
Description
Dataset

This dataset was created by DEBJYOTI SAHA

Contents
Retail Transactions Dataset
kaggle.com
Updated May 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prasad Patil (2024). Retail Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/prasad22/retail-transactions-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 18, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Prasad Patil
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset was created to simulate a market basket dataset, providing insights into customer purchasing behavior and store operations. The dataset facilitates market basket analysis, customer segmentation, and other retail analytics tasks. Here's more information about the context and inspiration behind this dataset:

Context:

Retail businesses, from supermarkets to convenience stores, are constantly seeking ways to better understand their customers and improve their operations. Market basket analysis, a technique used in retail analytics, explores customer purchase patterns to uncover associations between products, identify trends, and optimize pricing and promotions. Customer segmentation allows businesses to tailor their offerings to specific groups, enhancing the customer experience.

Inspiration:

The inspiration for this dataset comes from the need for accessible and customizable market basket datasets. While real-world retail data is sensitive and often restricted, synthetic datasets offer a safe and versatile alternative. Researchers, data scientists, and analysts can use this dataset to develop and test algorithms, models, and analytical tools.

Dataset Information:

The columns provide information about the transactions, customers, products, and purchasing behavior, making the dataset suitable for various analyses, including market basket analysis and customer segmentation. Here's a brief explanation of each column in the Dataset:

Transaction_ID: A unique identifier for each transaction, represented as a 10-digit number. This column is used to uniquely identify each purchase.

Date: The date and time when the transaction occurred. It records the timestamp of each purchase.

Customer_Name: The name of the customer who made the purchase. It provides information about the customer's identity.

Product: A list of products purchased in the transaction. It includes the names of the products bought.

Total_Items: The total number of items purchased in the transaction. It represents the quantity of products bought.

Total_Cost: The total cost of the purchase, in currency. It represents the financial value of the transaction.

Payment_Method: The method used for payment in the transaction, such as credit card, debit card, cash, or mobile payment.

City: The city where the purchase took place. It indicates the location of the transaction.

Store_Type: The type of store where the purchase was made, such as a supermarket, convenience store, department store, etc.

Discount_Applied: A binary indicator (True/False) representing whether a discount was applied to the transaction.

Customer_Category: A category representing the customer's background or age group.

Season: The season in which the purchase occurred, such as spring, summer, fall, or winter.

Promotion: The type of promotion applied to the transaction, such as "None," "BOGO (Buy One Get One)," or "Discount on Selected Items."

Use Cases:

Market Basket Analysis: Discover associations between products and uncover buying patterns.

Customer Segmentation: Group customers based on purchasing behavior.

Pricing Optimization: Optimize pricing strategies and identify opportunities for discounts and promotions.

Retail Analytics: Analyze store performance and customer trends.

Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.
d
Replication Data for: \"A Topic-based Segmentation Model for Identifying...
search.dataone.org
dataverse.harvard.edu
Updated Sep 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kim, Sunghoon; Lee, Sanghak; McCulloch, Robert (2024). Replication Data for: \"A Topic-based Segmentation Model for Identifying Segment-Level Drivers of Star Ratings from Unstructured Text Reviews\" [Dataset]. http://doi.org/10.7910/DVN/EE3DE2
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/EE3DE2
Dataset updated
Sep 25, 2024
Dataset provided by
Harvard Dataverse
Authors
Kim, Sunghoon; Lee, Sanghak; McCulloch, Robert
Description
We provide instructions, codes and datasets for replicating the article by Kim, Lee and McCulloch (2024), "A Topic-based Segmentation Model for Identifying Segment-Level Drivers of Star Ratings from Unstructured Text Reviews." This repository provides a user-friendly R package for any researchers or practitioners to apply A Topic-based Segmentation Model with Unstructured Texts (latent class regression with group variable selection) to their datasets. First, we provide a R code to replicate the illustrative simulation study: see file 1. Second, we provide the user-friendly R package with a very simple example code to help apply the model to real-world datasets: see file 2, Package_MixtureRegression_GroupVariableSelection.R and Dendrogram.R. Third, we provide a set of codes and instructions to replicate the empirical studies of customer-level segmentation and restaurant-level segmentation with Yelp reviews data: see files 3-a, 3-b, 4-a, 4-b. Note, due to the dataset terms of use by Yelp and the restriction of data size, we provide the link to download the same Yelp datasets (https://www.kaggle.com/datasets/yelp-dataset/yelp-dataset/versions/6). Fourth, we provided a set of codes and datasets to replicate the empirical study with professor ratings reviews data: see file 5. Please see more details in the description text and comments of each file. [A guide on how to use the code to reproduce each study in the paper] 1. Full codes for replicating Illustrative simulation study.txt -- [see Table 2 and Figure 2 in main text]: This is R source code to replicate the illustrative simulation study. Please run from the beginning to the end in R. In addition to estimated coefficients (posterior means of coefficients), indicators of variable selections, and segment memberships, you will get dendrograms of selected groups of variables in Figure 2. Computing time is approximately 20 to 30 minutes 3-a. Preprocessing raw Yelp Reviews for Customer-level Segmentation.txt: Code for preprocessing the downloaded unstructured Yelp review data and preparing DV and IVs matrix for customer-level segmentation study. 3-b. Instruction for replicating Customer-level Segmentation analysis.txt -- [see Table 10 in main text; Tables F-1, F-2, and F-3 and Figure F-1 in Web Appendix]: Code for replicating customer-level segmentation study with Yelp data. You will get estimated coefficients (posterior means of coefficients), indicators of variable selections, and segment memberships. Computing time is approximately 3 to 4 hours. 4-a. Preprocessing raw Yelp reviews_Restaruant Segmentation (1).txt: R code for preprocessing the downloaded unstructured Yelp data and preparing DV and IVs matrix for restaurant-level segmentation study. 4-b. Instructions for replicating restaurant-level segmentation analysis.txt -- [see Tables 5, 6 and 7 in main text; Tables E-4 and E-5 and Figure H-1 in Web Appendix]: Code for replicating restaurant-level segmentation study with Yelp. you will get estimated coefficients (posterior means of coefficients), indicators of variable selections, and segment memberships. Computing time is approximately 10 to 12 hours. [Guidelines for running Benchmark models in Table 6] Unsupervised Topic model: 'topicmodels' package in R -- after determining the number of topics(e.g., with 'ldatuning' R package), run 'LDA' function in the 'topicmodels'package. Then, compute topic probabilities per restaurant (with 'posterior' function in the package) which can be used as predictors. Then, conduct prediction with regression Hierarchical topic model (HDP): 'gensimr' R package -- 'model_hdp' function for identifying topics in the package (see https://radimrehurek.com/gensim/models/hdpmodel.html or https://gensimr.news-r.org/). Supervised topic model: 'lda' R package -- 'slda.em' function for training and 'slda.predict' for prediction. Aggregate regression: 'lm' default function in R. Latent class regression without variable selection: 'flexmix' function in 'flexmix' R package. Run flexmix with a certain number of segments (e.g., 3 segments in this study). Then, with estimated coefficients and memberships, conduct prediction of dependent variable per each segment. Latent class regression with variable selection: 'Unconstraind_Bayes_Mixture' function in Kim, Fong and DeSarbo(2012)'s package. Run the Kim et al's model (2012) with a certain number of segments (e.g., 3 segments in this study). Then, with estimated coefficients and memberships, we can do prediction of dependent variables per each segment. The same R package ('KimFongDeSarbo2012.zip') can be downloaded at: https://sites.google.com/scarletmail.rutgers.edu/r-code-packages/home 5. Instructions for replicating Professor ratings review study.txt -- [see Tables G-1, G-2, G-4 and G-5, and Figures G-1 and H-2 in Web Appendix]: Code to replicate the Professor ratings reviews study. Computing time is approximately 10 hours. [A list of the versions of R, packages, and computer...
Real World Customer Churn Dataset
kaggle.com
Updated Oct 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lasal Jayawardena (2023). Real World Customer Churn Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/6787676
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/6787676
Dataset updated
Oct 24, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Lasal Jayawardena
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
World
Description
60,000+ Real Anonymized Customer Usage Data for Churn Prediction!

Dataset Information

Dataset Name: Real World Customer Churn Dataset in Telco Domain

Snapshot Period: January 1, 2023, to March 31, 2023

Source: One of the Largest Telco Companies in Sri Lanka

Data Anonymization: The Dataset is Anonymized to Protect Customer Privacy.

Overview

The "Real World Customer Churn Dataset in Telco Domain" is a comprehensive collection of anonymized data that provides insights into customer behavior and churn prediction within the telecommunications industry.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F6361330%2F860271e0362e6c10503889f289201402%2FCustomer-churn.jpg?generation=1698182677600097&alt=media" alt="Dataset Image">

Usage Categories

The dataset contains data on over 60,000 customers across more than 10+ distinct usage categories. Some of the key usage categories include:

usage_app_youtube_daily: YouTube Traffic in MBs.

usage_app_facebook_daily: Facebook Traffic in MBs.

usage_app_tiktok_daily: TikTok Traffic in MBs.

usage_app_whatsapp_daily: WhatsApp Traffic in MBs.

usage_app_helakuru_daily: Helakuru App traffic in MBs.

usage_voice_o2o_outgoing: Outgoing call volume in minutes between the same operator.

usage_voice_o2op_outgoing: Outgoing call volume in minutes between operator and other operators.

usage_voice_o2o_incoming: Incoming call volume in minutes between the same operator.

usage_voice_op2o_incoming: Incoming call volume in minutes between other operator to operator.

usage_pack_data: Spend in LKR for data package purchasing.

usage_pack_vas: Spend in LKR for value-added service rentals or usage.

Dataset Files

The dataset consists of the following key files:

main.csv: An aggregated dataset that compiles usage data from all usage categories, providing a holistic view of customer behavior.

raw_dump folder: The raw data export, preserving the original source data for detailed exploration.

test and train folders: These folders contain customer IDs and corresponding Churn Labels, facilitating model training and testing.

usage_profiles folder: It comprises broken-down data frames for each customer under specific usage categories, allowing in-depth analysis of individual customer behavior within various usage categories.

Potential Use Cases

The "Real World Customer Churn Dataset in Telco Domain" offers a range of potential use cases, including:

Customer Churn Prediction: Leveraging customer usage patterns to predict and reduce churn.

Targeted Marketing: Designing customized marketing campaigns based on customer preferences.

Service Quality Enhancement: Identifying areas for service improvement, such as network quality.

Revenue Optimization: Maximizing revenue through the analysis of data package spending and value-added service usage.

Dataset Importance

This dataset's real-world aspect is of significant importance. It reflects actual customer interactions with a major telecommunications company in Sri Lanka, offering insights that can be directly applied to real-world scenarios. The dataset is sourced from one of the largest telco companies in the country, adding credibility and relevance to the insights it provides.

Understanding customer churn and usage behavior is pivotal for the telecommunications industry, and this dataset empowers researchers, data scientists, and businesses to gain deeper insights into these aspects.

Disclaimer

The dataset is anonymized to protect customer privacy, and all data used is in compliance with privacy regulations and agreements. Users are encouraged to explore and contribute to the "Real World Customer Churn Dataset in Telco Domain."

Thank you for your valuable contributions to this dataset.
d
815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL...
datarade.ai
.json, .csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RampedUp Global Data Solutions, 815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL - RampedUp [Dataset]. https://datarade.ai/data-products/global-contact-data-personal-and-professional-840-million-rampedup-global-data-solutions
Explore at:
.json, .csvAvailable download formats
Dataset authored and provided by
RampedUp Global Data Solutions
Area covered
Ireland, Pakistan, Chad, Greece, Haiti, Uganda, Grenada, United States Minor Outlying Islands, Sint Eustatius and Saba, Bolivia (Plurinational State of)
Description
Sign Up for a free trial: https://rampedup.io/sign-up-%2F-log-in - 7 Days and 50 Credits to test our quality and accuracy.

These are the fields available within the RampedUp Global dataset.

CONTACT DATA: Personal Email Address - We manage over 115 million personal email addresses Professional Email - We manage over 200 million professional email addresses Home Address - We manage over 20 million home addresses Mobile Phones - 65 million direct lines to decision makers Social Profiles - Individual Facebook, Twitter, and LinkedIn Local Address - We manage 65M locations for local office mailers, event-based marketing or face-to-face sales calls.

JOB DATA: Job Title - Standardized titles for ease of use and selection Company Name - The Contact's current employer Job Function - The Company Department associated with the job role Title Level - The Level in the Company associated with the job role Job Start Date - Identify people new to their role as a potential buyer

EMPLOYER DATA: Websites - Company Website, Root Domain, or Full Domain Addresses - Standardized Address, City, Region, Postal Code, and Country Phone - E164 phone with country code Social Profiles - LinkedIn, CrunchBase, Facebook, and Twitter

FIRMOGRAPHIC DATA: Industry - 420 classifications for categorizing the company’s main field of business Sector - 20 classifications for categorizing company industries 4 Digit SIC Code - 239 classifications and their definitions 6 Digit NAICS - 452 classifications and their definitions Revenue - Estimated revenue and bands from 1M to over 1B Employee Size - Exact employee count and bands Email Open Scores - Aggregated data at the domain level showing relationships between email opens and corporate domains. IP Address -Company level IP Addresses associated to Domains from a DNS lookup

CONSUMER DATA: Education - Alma Mater, Degree, Graduation Date Skills - Accumulated Skills associated with work experience
Interests - Known interests of contact Connections - Number of social connections. Followers - Number of social followers

Download our data dictionary: https://rampedup.io/our-data
E-commerce, customer relation management (CRM) and secure transactions by...
data.europa.eu
Updated Nov 30, 2009
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2009). E-commerce, customer relation management (CRM) and secure transactions by size class of enterprise [Dataset]. https://data.europa.eu/data/datasets/i9yvadmdw9xeyctv8zeswg?locale=en
Explore at:
Dataset updated
Nov 30, 2009
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
Description
The dataset "isoc_bde15dec" has been discontinued since 08/02/2024.
p
Appliances Customer Services in New Jersey, United States - 22 Verified...
poidata.io
csv, excel, json
Updated Aug 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Appliances Customer Services in New Jersey, United States - 22 Verified Listings Database [Dataset]. https://www.poidata.io/report/appliances-customer-service/united-states/new-jersey
Explore at:
csv, json, excelAvailable download formats
Dataset updated
Aug 10, 2025
Dataset provided by
Poidata.io
Area covered
New Jersey, United States
Description
Comprehensive dataset of 22 Appliances customer services in New Jersey, United States as of August, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
p
Appliances Customer Services in United States - 898 Verified Listings...
poidata.io
csv, excel, json
Updated Aug 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Appliances Customer Services in United States - 898 Verified Listings Database [Dataset]. https://www.poidata.io/report/appliances-customer-service/united-states
Explore at:
csv, excel, jsonAvailable download formats
Dataset updated
Aug 13, 2025
Dataset provided by
Poidata.io
Area covered
United States
Description
Comprehensive dataset of 898 Appliances customer services in United States as of August, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Customer Segmentation Data
kaggle.com
Updated Mar 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Raval Smit (2024). Customer Segmentation Data [Dataset]. https://www.kaggle.com/datasets/ravalsmit/customer-segmentation-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 11, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Raval Smit
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset provides comprehensive customer data suitable for segmentation analysis. It includes anonymized demographic, transactional, and behavioral attributes, allowing for detailed exploration of customer segments. Leveraging this dataset, marketers, data scientists, and business analysts can uncover valuable insights to optimize targeted marketing strategies and enhance customer engagement. Whether you're looking to understand customer behavior or improve campaign effectiveness, this dataset offers a rich resource for actionable insights and informed decision-making.

Key Features:

Anonymized demographic, transactional, and behavioral data. Suitable for customer segmentation analysis. Opportunities to optimize targeted marketing strategies. Valuable insights for improving campaign effectiveness. Ideal for marketers, data scientists, and business analysts.

Usage Examples:

Segmenting customers based on demographic attributes. Analyzing purchase behavior to identify high-value customer segments. Optimizing marketing campaigns for targeted engagement. Understanding customer preferences and tailoring product offerings accordingly. Evaluating the effectiveness of marketing strategies and iterating for improvement. Explore this dataset to unlock actionable insights and drive success in your marketing initiatives!
p
Appliances Customer Services in Germany - 1,619 Verified Listings Database
poidata.io
csv, excel, json
Updated Jul 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Appliances Customer Services in Germany - 1,619 Verified Listings Database [Dataset]. https://www.poidata.io/report/appliances-customer-service/germany
Explore at:
csv, excel, jsonAvailable download formats
Dataset updated
Jul 28, 2025
Dataset provided by
Poidata.io
Area covered
Germany
Description
Comprehensive dataset of 1,619 Appliances customer services in Germany as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
d
National Consumer Complaint Database (NCCDB) - National Consumer Complaint...
catalog.data.gov
cloud.csiss.gmu.edu
+6more
Updated Jun 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federal Motor Carrier Safety Administration (2024). National Consumer Complaint Database (NCCDB) - National Consumer Complaint Database [Dataset]. https://catalog.data.gov/dataset/national-consumer-complaint-database-nccdb-national-consumer-complaint-database
Explore at:
Dataset updated
Jun 26, 2024
Dataset provided by
Federal Motor Carrier Safety Administration
Description
NCCDB is a web-based information system for recording and reporting on household goods, safety violation, hazardous material, cargo tank and passenger complaints. NCCDB allows the public and FMCSA staff to submit complaints using an online form. The database contains, among other information, reports on inspection and test of cargo tanks and inventory of tanks. These reports are used in the development and amendment to regulations of cargo security which is the protection of cargo from theft.
g
[Obsolete] Essential public order data – enriched data
gimi9.com
Updated May 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). [Obsolete] Essential public order data – enriched data [Dataset]. https://gimi9.com/dataset/eu_https-data-economie-gouv-fr-explore-dataset-decp_augmente-/
Explore at:
Dataset updated
May 27, 2024
Description
This dataset will no longer be maintained as of 16 November 2023. Please refer to the documentation by following this link The Order of 22 March 2019 on essential data provides for the obligation for all French public buyers (territorial authorities, ministries, public hospitals, public institutions, etc.) to publish the essential data of their public procurement and concession contracts on their buyer profile for a period of five years. It is also possible to publish them on the national open data portal (data.gouv) In order to facilitate the consumption of this data on a national scale, we have developed scripts bringing together, removing and enriching this data from data.gouv.fr, in a single file bringing together markets and concession contracts.This unique file is made available on data.gouv.fr at the [next] link(https://www.data.gouv.fr/fr/datasets/donnees-essentielles-de-la-commande-publique-fichiers-consolides/). The data sources used are as follows: * data from the PES Market of DGFiP * data collected by AIFE’s DUME API * data from buyer profile Achatpublic.com put to * layout via AIFE’s DUME API * data from the buyer profile Dematis facilitating the * download customer data (e-marchespublics.com) * data published on the Open Data portal of Greater Lyon * data published on AWS buyer profile (Marches-publics.info), extracted and published manually by Colin Maudry on data.gouv.fr If you are aware of any data sources that may be aggregated to this dataset, please contact us at demat.daj@finances.gouv.fr.Please refer to the documentation by following this link The Order of 22 March 2019 on essential data provides for the obligation for all French public buyers (territorial authorities, ministries, public hospitals, public institutions, etc.) to publish the essential data of their public procurement and concession contracts on their buyer profile for a period of five years. It is also possible to publish them on the national open data portal (data.gouv) In order to facilitate the consumption of this data on a national scale, we have developed scripts bringing together, removing and enriching this data from data.gouv.fr, in a single file bringing together markets and concession contracts. This unique file is made available on data.gouv.fr at the [next] link(https://www.data.gouv.fr/fr/datasets/donnees-essentielles-de-la-commande-publique-fichiers-consolides/). The data sources used are as follows: * data from the PES Market of DGFiP * data collected by AIFE’s DUME API * data from buyer profile Achatpublic.com put to * layout via AIFE’s DUME API * data from the buyer profile Dematis facilitating the * download customer data (e-marchespublics.com) * data published on the Open Data portal of Greater Lyon * data published on AWS buyer profile (Marches-publics.info), extracted and published manually by Colin Maudry on data.gouv.fr If you are aware of any data sources that may be aggregated to this dataset, please contact us at demat.daj@finances.gouv.fr.
O*NET Database
onetcenter.org
excel, mysql, oracle +2
Updated May 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Center for O*NET Development (2025). O*NET Database [Dataset]. https://www.onetcenter.org/database.html
Explore at:
oracle, sql server, text, mysql, excelAvailable download formats
Dataset updated
May 22, 2025
Dataset provided by
Occupational Information Network
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Dataset funded by
United States Department of Laborhttp://www.dol.gov/
Description
The O*NET Database contains hundreds of standardized and occupation-specific descriptors on almost 1,000 occupations covering the entire U.S. economy. The database, which is available to the public at no cost, is continually updated by a multi-method data collection program. Sources of data include: job incumbents, occupational experts, occupational analysts, employer job postings, and customer/professional association input.
Data content areas include:
Worker Characteristics (e.g., Abilities, Interests, Work Styles)
Worker Requirements (e.g., Education, Knowledge, Skills)
Experience Requirements (e.g., On-the-Job Training, Work Experience)
Occupational Requirements (e.g., Detailed Work Activities, Work Context)
Occupation-Specific Information (e.g., Job Titles, Tasks, Technology Skills)
u
Social Recommendation Data
cseweb.ucsd.edu
json
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCSD CSE Research Project, Social Recommendation Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
Explore at:
jsonAvailable download formats
Dataset authored and provided by
UCSD CSE Research Project
Description
These datasets include ratings as well as social (or trust) relationships between users. Data are from LibraryThing (a book review website) and epinions (general consumer reviews).

Metadata includes

reviews

price paid (epinions)

helpfulness votes (librarything)

flags (librarything)
H
Consumer Expenditure Survey (CE)
dataverse.harvard.edu
Updated May 30, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anthony Damico (2013). Consumer Expenditure Survey (CE) [Dataset]. http://doi.org/10.7910/DVN/UTNJAH
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/UTNJAH
Dataset updated
May 30, 2013
Dataset provided by
Harvard Dataverse
Authors
Anthony Damico
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
analyze the consumer expenditure survey (ce) with r the consumer expenditure survey (ce) is the primo data source to understand how americans spend money. participating households keep a running diary about every little purchase over the year. those diaries are then summed up into precise expenditure categories. how else are you gonna know that the average american household spent $34 (±2) on bacon, $826 (±17) on cellular phones, and $13 (±2) on digital e-readers in 2011? an integral component of the market basket calculation in the consumer price index, this survey recently became available as public-use microdata and they're slowly releasing historical files back to 1996. hooray! for a t aste of what's possible with ce data, look at the quick tables listed on their main page - these tables contain approximately a bazillion different expenditure categories broken down by demographic groups. guess what? i just learned that americans living in households with $5,000 to $9,999 of annual income spent an average of $283 (±90) on pets, toys, hobbies, and playground equipment (pdf page 3). you can often get close to your statistic of interest from these web tables. but say you wanted to look at domestic pet expenditure among only households with children between 12 and 17 years old. another one of the thirteen web tables - the consumer unit composition table - shows a few different breakouts of households with kids, but none matching that exact population of interest. the bureau of labor statistics (bls) (the survey's designers) and the census bureau (the survey's administrators) have provided plenty of the major statistics and breakouts for you, but they're not psychic. if you want to comb through this data for specific expenditure categories broken out by a you-defined segment of the united states' population, then let a little r into your life. fun starts now. fair warning: only analyze t he consumer expenditure survey if you are nerd to the core. the microdata ship with two different survey types (interview and diary), each containing five or six quarterly table formats that need to be stacked, merged, and manipulated prior to a methodologically-correct analysis. the scripts in this repository contain examples to prepare 'em all, just be advised that magnificent data like this will never be no-assembly-required. the folks at bls have posted an excellent summary of what's av ailable - read it before anything else. after that, read the getting started guide. don't skim. a few of the descriptions below refer to sas programs provided by the bureau of labor statistics. you'll find these in the C:\My Directory\CES\2011\docs directory after you run the download program. this new github repository contains three scripts: 2010-2011 - download all microdata.R lo op through every year and download every file hosted on the bls's ce ftp site import each of the comma-separated value files into r with read.csv depending on user-settings, save each table as an r data file (.rda) or stat a-readable file (.dta) 2011 fmly intrvw - analysis examples.R load the r data files (.rda) necessary to create the 'fmly' table shown in the ce macros program documentation.doc file construct that 'fmly' table, using five quarters of interviews (q1 2011 thru q1 2012) initiate a replicate-weighted survey design object perform some lovely li'l analysis examples replicate the %mean_variance() macro found in "ce macros.sas" and provide some examples of calculating descriptive statistics using unimputed variables replicate the %compare_groups() macro found in "ce macros.sas" and provide some examples of performing t -tests using unimputed variables create an rsqlite database (to minimize ram usage) containing the five imputed variable files, after identifying which variables were imputed based on pdf page 3 of the user's guide to income imputation initiate a replicate-weighted, database-backed, multiply-imputed survey design object perform a few additional analyses that highlight the modified syntax required for multiply-imputed survey designs replicate the %mean_variance() macro found in "ce macros.sas" and provide some examples of calculating descriptive statistics using imputed variables repl icate the %compare_groups() macro found in "ce macros.sas" and provide some examples of performing t-tests using imputed variables replicate the %proc_reg() and %proc_logistic() macros found in "ce macros.sas" and provide some examples of regressions and logistic regressions using both unimputed and imputed variables replicate integrated mean and se.R match each step in the bls-provided sas program "integr ated mean and se.sas" but with r instead of sas create an rsqlite database when the expenditure table gets too large for older computers to handle in ram export a table "2011 integrated mean and se.csv" that exactly matches the contents of the sas-produced "2011 integrated mean and se.lst" text file click here to view these three scripts for...
d
Foot Traffic Data | Global Consumer Visitation Insights To Inform Marketing...
datarade.ai
.csv
Updated Jun 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GapMaps (2024). Foot Traffic Data | Global Consumer Visitation Insights To Inform Marketing and Operational Decisions | Mobile Location Data [Dataset]. https://datarade.ai/data-products/gapmaps-foot-traffic-data-by-azira-global-foot-traffic-data-gapmaps
Explore at:
.csvAvailable download formats
Dataset updated
Jun 30, 2024
Dataset authored and provided by
GapMaps
Area covered
Saudi Arabia, Chile, Kuwait, Madagascar, Belize, Burkina Faso, Papua New Guinea, Liberia, Mozambique, Bahrain
Description
GapMaps Foot Traffic Data uses location data on mobile phones sourced by Azira which is collected from smartphone apps when the users have given their permission to track their location. It can shed light on consumer visitation patterns (“where from” and “where to”), frequency of visits, profiles of consumers and much more.

Businesses can utilise foot traffic data to answer key questions including: - What is the demographic profile of customers visiting my locations? - What is my primary catchment? And where within that catchment do most of my customers travel from to reach my locations? - What points of interest drive customers to my locations (ie. work, shopping, recreation, hotel or education facilities that are in the area) ? - How far do customers travel to visit my locations? - Where are the potential gaps in my store network for new developments?
- What is the sales impact on an existing store if a new store is opened nearby? - Is my marketing strategy targeted to the right audience? - Where are my competitor's customers coming from?

Foot Traffic data provides a range of benefits that make it a valuable addition to location intelligence services including: - Real-time - Low-cost at high scale - Accurate - Flexible - Non-proprietary - Empirical

Azira have created robust screening methods to evaluate the quality of Foot Traffic data collected from multiple sources to ensure that their data lake contains only the highest-quality mobile location data.

This includes partnering with trusted location SDK providers that get proper end user consent to track their location when they download an application, can detect device movement/visits and use GPS to determine location co-ordinates.

Data received from partners is put through Azira's data quality algorithm discarding data points that receive a low quality score.

Use cases in Europe will be considered on a case to case basis.
e
Inspire Download Service (predefined ATOM) for data set Building plan “VEP...
data.europa.eu
gimi9.com
atom feed
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LVGL, Inspire Download Service (predefined ATOM) for data set Building plan “VEP Consumer Market Puetzwies” [Dataset]. https://data.europa.eu/data/datasets/b0da0160-2948-0001-a236-098c13e704f4
Explore at:
atom feedAvailable download formats
Dataset authored and provided by
LVGL
Description
Description of the INSPIRE Download Service (predefined Atom): Development plan “VEP Consumer Market Puetzwies” of the district town of Merzig — The link(s) for downloading the records is/are generated dynamically from a DataURL link of a WMS layer

Travel Datasets

brightdata.com

.json, .csv, .xlsx

Updated Feb 15, 2023

Facebook

Twitter

Click to copy link

Link copied

Cite

Bright Data (2023). Travel Datasets [Dataset]. https://brightdata.com/products/datasets/travel

Explore at:

.json, .csv, .xlsxAvailable download formats

Dataset updated

Feb 15, 2023

Dataset authored and provided by

Bright Data

License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered

Worldwide

Description

Our travel datasets provide extensive, structured data covering various aspects of the global travel and hospitality industry. These datasets are ideal for businesses, analysts, and developers looking to gain insights into hotel pricing, short-term rentals, restaurant listings, and travel trends. Whether you're optimizing pricing strategies, analyzing market trends, or enhancing travel-related applications, our datasets offer the depth and accuracy you need.

Key Travel Datasets Available:

  Hotel & Rental Listings: Access detailed data on hotel properties, short-term rentals, and vacation stays from platforms like 
    Airbnb, Booking.com, and other OTAs. This includes property details, pricing, availability, guest reviews, and amenities.

  Real-Time & Historical Pricing Data: Track hotel room pricing, rental occupancy rates, and pricing trends 
    to optimize revenue management and competitive analysis.

  Restaurant Listings & Reviews: Explore restaurant data from Tripadvisor, OpenTable, Zomato, Deliveroo, and Talabat, 
    including restaurant details, customer ratings, menus, and delivery availability.

  Market & Trend Analysis: Use structured datasets to analyze travel demand, seasonal trends, and consumer preferences 
    across different regions.

  Geo-Targeted Data: Get location-specific insights with city, state, and country-level segmentation, 
    allowing for precise market research and localized business strategies.



Use Cases for Travel Datasets:

  Dynamic Pricing & Revenue Optimization: Adjust pricing strategies based on real-time market trends and competitor analysis.
  Market Research & Competitive Intelligence: Identify emerging travel trends, monitor competitor performance, and assess market demand.
  Travel & Hospitality App Development: Enhance travel platforms with accurate, up-to-date data on hotels, restaurants, and rental properties.
  Investment & Financial Analysis: Evaluate travel industry performance for investment decisions and economic forecasting.



  Our travel datasets are available in multiple formats (JSON, CSV, Excel) and can be delivered via 
  API, cloud storage (AWS, Google Cloud, Azure), or direct download. 
  Stay ahead in the travel industry with high-quality, structured data that powers smarter decisions.

Credit card dataset for visualization
kaggle.com
Updated Sep 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peachji (2023). Credit card dataset for visualization [Dataset]. https://www.kaggle.com/datasets/peachji/credit-card-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 30, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Peachji
License
https://cdla.io/permissive-1-0/https://cdla.io/permissive-1-0/
Description
This dataset had adapted from 'Credit Card Churn Prediction: https://www.kaggle.com/datasets/anwarsan/credit-card-bank-churn ' for visualization in our university project. We have modified customer information, spending behavior, and also added revenue targets.

Scenario 🕶️ In 2019, the marketing team launched a campaign to attract millennial customers (born 1980-1996) with the goal of increasing revenue and enhancing the brand's appeal to a younger audience.
As the BI team, your task is to create a dashboard for users. 1. The Vice President of Sales wants to view the performance of the credit business. 2. The marketing team is interested in understanding customer segments and customer spending to measure Customer Lifetime Value (CLV) and Marketing Cost per Acquired Customer (MCAC).

⚠️Note: This is just a suggestion to guide the creation of the dashboard

Example in Tableau

Executive summary https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10099382%2F508a2d2d89dabdfd368743f86c2a71e1%2Fexecutive%20overview.JPG?generation=1696110593484137&alt=media" alt=""> Customer behavior https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10099382%2F1e4a1f62a25eab3c6707d002243894c7%2Fcustomer_behaviour.JPG?generation=1696110689732332&alt=media" alt="">
p
Appliances Customer Services in Mexico - 193 Verified Listings Database
poidata.io
csv, excel, json
Updated Aug 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Poidata.io (2025). Appliances Customer Services in Mexico - 193 Verified Listings Database [Dataset]. https://www.poidata.io/report/appliances-customer-service/mexico
Explore at:
csv, json, excelAvailable download formats
Dataset updated
Aug 1, 2025
Dataset provided by
Poidata.io
Area covered
Mexico
Description
Comprehensive dataset of 193 Appliances customer services in Mexico as of August, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.

Facebook

Twitter

Click to copy link

Link copied

Cite

DEBJYOTI SAHA (2021). Amazon Customer Database [Dataset]. https://www.kaggle.com/datasets/debjyotisaha/amazon-customer-database

Amazon Customer Database

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jul 28, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

DEBJYOTI SAHA

Description

Dataset

This dataset was created by DEBJYOTI SAHA

Clear search

Close search

Google apps

Main menu

Amazon Customer Database

Dataset

Contents

Retail Transactions Dataset

`Context:`

`Inspiration:`

`Dataset Information:`

`Use Cases:`

Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.

Replication Data for: \"A Topic-based Segmentation Model for Identifying...

Real World Customer Churn Dataset

60,000+ Real Anonymized Customer Usage Data for Churn Prediction!

Dataset Information

Overview

Usage Categories

Dataset Files

Potential Use Cases

Dataset Importance

Disclaimer

815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL...

E-commerce, customer relation management (CRM) and secure transactions by...

Appliances Customer Services in New Jersey, United States - 22 Verified...

Appliances Customer Services in United States - 898 Verified Listings...

Customer Segmentation Data

Key Features:

Usage Examples:

Appliances Customer Services in Germany - 1,619 Verified Listings Database

National Consumer Complaint Database (NCCDB) - National Consumer Complaint...

[Obsolete] Essential public order data – enriched data

O*NET Database

Social Recommendation Data

Consumer Expenditure Survey (CE)

Foot Traffic Data | Global Consumer Visitation Insights To Inform Marketing...

Inspire Download Service (predefined ATOM) for data set Building plan “VEP...

Travel Datasets

Credit card dataset for visualization

Appliances Customer Services in Mexico - 193 Verified Listings Database

Amazon Customer Database

Dataset

Contents