73 datasets found

Dating App Behavior Dataset 2025
kaggle.com
Updated Apr 11, 2025
Cite
Keyush nisar (2025). Dating App Behavior Dataset 2025 [Dataset]. https://www.kaggle.com/datasets/keyushnisar/dating-app-behavior-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 11, 2025
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Keyush nisar
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset provides a synthetic representation of user behavior on a fictional dating app. It contains 50,000 records with 19 features capturing demographic details, app usage patterns, swipe tendencies, and match outcomes. The data was generated programmatically to simulate realistic user interactions, making it ideal for exploratory data analysis (EDA), machine learning modeling (e.g., predicting match outcomes), or studying user behavior trends in online dating platforms.

Key features include gender, sexual orientation, location type, income bracket, education level, user interests, app usage time, swipe ratios, likes received, mutual matches, and match outcomes (e.g., "Mutual Match," "Ghosted," "Catfished"). The dataset is designed to be diverse and balanced, with categorical, numerical, and labeled variables for various analytical purposes.

Usage

This dataset can be used for:

Exploratory Data Analysis (EDA): Investigate correlations between demographics, app usage, and match success.

Machine Learning: Build models to predict match outcomes or user engagement levels.

Social Studies: Analyze trends in dating app behavior across different demographics.

Feature Engineering Practice: Experiment with transforming categorical and numerical data.
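
As a minimal sketch of the EDA use case above, the snippet below tallies the share of each match outcome within a demographic group. The column names `gender` and `match_outcome` and the sample rows are assumptions for illustration; the actual CSV headers may differ.

```python
from collections import Counter, defaultdict

# Hypothetical sample rows; the real dataset has 50,000 records and 19 features.
SAMPLE_ROWS = [
    {"gender": "Female", "match_outcome": "Mutual Match"},
    {"gender": "Female", "match_outcome": "Ghosted"},
    {"gender": "Male", "match_outcome": "Mutual Match"},
    {"gender": "Male", "match_outcome": "Catfished"},
    {"gender": "Male", "match_outcome": "Mutual Match"},
]


def outcome_shares(rows, group_col="gender", outcome_col="match_outcome"):
    """Return {group: {outcome: share}}; shares within each group sum to 1."""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row[group_col]][row[outcome_col]] += 1
    return {
        group: {outcome: n / sum(c.values()) for outcome, n in c.items()}
        for group, c in counts.items()
    }


shares = outcome_shares(SAMPLE_ROWS)
```

With the downloaded file, the same function could be fed rows from `csv.DictReader`, once the real column names have been checked.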
Is Demography Destiny? Application of Machine Learning Techniques to...
plos.figshare.com
figshare.com
docx
Updated Jun 3, 2023
Cite
Wei Luo; Thin Nguyen; Melanie Nichols; Truyen Tran; Santu Rana; Sunil Gupta; Dinh Phung; Svetha Venkatesh; Steve Allender (2023). Is Demography Destiny? Application of Machine Learning Techniques to Accurately Predict Population Health Outcomes from a Minimal Demographic Dataset [Dataset]. http://doi.org/10.1371/journal.pone.0125602
Explore at:
Available download formats: docx
Unique identifier
https://doi.org/10.1371/journal.pone.0125602
Dataset updated
Jun 3, 2023
Dataset provided by
PLOS ONE
Authors
Wei Luo; Thin Nguyen; Melanie Nichols; Truyen Tran; Santu Rana; Sunil Gupta; Dinh Phung; Svetha Venkatesh; Steve Allender
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
For years, we have relied on population surveys to keep track of regional public health statistics, including the prevalence of non-communicable diseases. Because of the cost and limitations of such surveys, we often do not have up-to-date data on the health outcomes of a region. In this paper, we examined the feasibility of inferring regional health outcomes from socio-demographic data that are widely available and updated in a timely manner through national censuses and community surveys. Using data for 50 American states (excluding Washington DC) from 2007 to 2012, we constructed a machine-learning model to predict the prevalence of six non-communicable disease (NCD) outcomes (four NCDs and two major clinical risk factors), based on population socio-demographic characteristics from the American Community Survey. We found that regional prevalence estimates for non-communicable diseases can be reasonably predicted. The predictions were highly correlated with the observed data, in both the states included in the derivation model (median correlation 0.88) and those excluded from the development for use as a completely separate validation sample (median correlation 0.85), demonstrating that the model had sufficient external validity to make good predictions, based on demographics alone, for areas not included in the model development. This highlights both the utility of this sophisticated approach to model development, and the vital importance of simple socio-demographic characteristics as both indicators and determinants of chronic disease.
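
The "median correlation" summary above can be illustrated with a short sketch. It assumes Pearson correlation between predicted and observed prevalence per outcome (the description does not say which coefficient was used), and the function names are hypothetical.

```python
import math
from statistics import median


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def median_correlation(observed_by_outcome, predicted_by_outcome):
    """Median of the per-outcome correlations (one value per health outcome)."""
    return median(
        pearson(observed_by_outcome[name], predicted_by_outcome[name])
        for name in observed_by_outcome
    )
```

Each dictionary would map an outcome name to a list of per-state prevalence values, with the held-out states evaluated separately from the derivation states.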
Factori USA Consumer Graph Data | socio-demographic, location, interest and...
datarade.ai
.json, .csv
Updated Jul 23, 2022
Cite
Factori (2022). Factori USA Consumer Graph Data | socio-demographic, location, interest and intent data | E-Commere |Mobile Apps | Online Services [Dataset]. https://datarade.ai/data-products/factori-usa-consumer-graph-data-socio-demographic-location-factori
Explore at:
Available download formats: .json, .csv
Dataset updated
Jul 23, 2022
Dataset authored and provided by
Factori
Area covered
United States of America
Description
Our consumer data is gathered and aggregated via surveys, digital services, and public data sources. We use powerful profiling algorithms to collect and ingest only fresh and reliable data points.

Our comprehensive data enrichment solution includes a variety of data sets that can help you address gaps in your customer data, gain a deeper understanding of your customers, and power superior client experiences.

Geography - City, State, ZIP, County, CBSA, Census Tract, etc.

Demographics - Gender, Age Group, Marital Status, Language etc.

Financial - Income Range, Credit Rating Range, Credit Type, Net worth Range, etc

Persona - Consumer type, Communication preferences, Family type, etc

Interests - Content, Brands, Shopping, Hobbies, Lifestyle etc.

Household - Number of Children, Number of Adults, IP Address, etc.

Behaviours - Brand Affinity, App Usage, Web Browsing etc.

Firmographics - Industry, Company, Occupation, Revenue, etc

Retail Purchase - Store, Category, Brand, SKU, Quantity, Price etc.

Auto - Car Make, Model, Type, Year, etc.

Housing - Home type, Home value, Renter/Owner, Year Built etc.

Consumer Graph Schema & Reach: Our data reach represents the total number of counts available within various categories and comprises attributes such as country location, MAU, DAU & Monthly Location Pings:

Data Export Methodology: Since we collect data dynamically, we provide the most updated data and insights via a best-suited method on a suitable interval (daily/weekly/monthly).

Consumer Graph Use Cases:

360-Degree Customer View: Get a comprehensive image of customers by means of internal and external data aggregation.

Data Enrichment: Leverage online-to-offline consumer profiles to build holistic audience segments and improve campaign targeting through user data enrichment.

Fraud Detection: Use multiple digital (web and mobile) identities to verify real users and detect anomalies or fraudulent activity.

Advertising & Marketing: Understand audience demographics, interests, lifestyle, hobbies, and behaviors to build targeted marketing campaigns.

Using Factori Consumer Data graph you can solve use cases like:

Acquisition Marketing: Expand your reach to new users and customers using lookalike modeling with your first-party audiences to extend to other potential consumers with similar traits and attributes.

Lookalike Modeling: Build lookalike audience segments using your first-party audiences as a seed to extend your reach for running marketing campaigns to acquire new users or customers.

And also: CRM Data Enrichment, Consumer Data Enrichment, B2B Data Enrichment, B2C Data Enrichment, Customer Acquisition, Audience Segmentation, 360-Degree Customer View, Consumer Profiling, Consumer Behaviour Data.
Data and code for: Generation and applications of simulated datasets to...
zenodo.org
data.niaid.nih.gov
bin, zip
Updated Mar 12, 2023
Cite
Matthew Silk; Olivier Gimenez (2023). Data and code for: Generation and applications of simulated datasets to integrate social network and demographic analyses [Dataset]. http://doi.org/10.5061/dryad.m0cfxpp7s
Explore at:
Available download formats: zip, bin
Unique identifier
https://doi.org/10.5061/dryad.m0cfxpp7s
Dataset updated
Mar 12, 2023
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Matthew Silk; Olivier Gimenez
License
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Social networks are tied to population dynamics; interactions are driven by population density and demographic structure, while social relationships can be key determinants of survival and reproductive success. However, difficulties integrating models used in demography and network analysis have limited research at this interface. We introduce the R package genNetDem for simulating integrated network-demographic datasets. It can be used to create longitudinal social networks and/or capture-recapture datasets with known properties. It incorporates the ability to generate populations and their social networks, generate grouping events using these networks, simulate social network effects on individual survival, and flexibly sample these longitudinal datasets of social associations. By generating co-capture data with known statistical relationships it provides functionality for methodological research. We demonstrate its use with case studies testing how imputation and sampling design influence the success of adding network traits to conventional Cormack-Jolly-Seber (CJS) models. We show that incorporating social network effects in CJS models generates qualitatively accurate results, but with downward-biased parameter estimates when network position influences survival. Biases are greater when fewer interactions are sampled or fewer individuals are observed in each interaction. While our results indicate the potential of incorporating social effects within demographic models, they show that imputing missing network measures alone is insufficient to accurately estimate social effects on survival, pointing to the importance of incorporating network imputation approaches. genNetDem provides a flexible tool to aid these methodological advancements and help researchers test other sampling considerations in social network studies.
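
To make the idea of a capture-recapture dataset "with known properties" concrete, here is a minimal Python sketch that simulates capture histories from a fixed survival probability and detection probability, the two parameters a basic CJS model estimates. It is not the genNetDem API (that is an R package) and it omits network effects entirely; all names and parameters are hypothetical.

```python
import random


def simulate_capture_histories(n_individuals, n_occasions,
                               survival_prob, detection_prob, seed=0):
    """Return one 0/1 capture history per individual.

    Every individual is marked on the first occasion. Between occasions it
    survives with `survival_prob`; if alive, it is seen with `detection_prob`.
    """
    rng = random.Random(seed)  # seeded so the simulated data are reproducible
    histories = []
    for _ in range(n_individuals):
        alive = True
        history = [1]
        for _ in range(n_occasions - 1):
            alive = alive and rng.random() < survival_prob
            seen = alive and rng.random() < detection_prob
            history.append(1 if seen else 0)
        histories.append(history)
    return histories


histories = simulate_capture_histories(
    n_individuals=100, n_occasions=5, survival_prob=0.8, detection_prob=0.6
)
```

Because the generating parameters are known, estimates from a fitted model can be compared against them, which is the kind of check the case studies in the description rely on.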
Guatemala Geodemographic Information Dataset
app.mobito.io
Updated Mar 10, 2023
Cite
(2023). Guatemala Geodemographic Information Dataset [Dataset]. https://app.mobito.io/data-product/guatemala-geodemographic-information-dataset
Explore at:
Dataset updated
Mar 10, 2023
Area covered
Guatemala
Description
This dataset offers valuable insights into the demographic profile of a specific population, with data on factors such as age, income, and gender distribution. The data is geocoded using geohash7 (152.9m x 152.4m), providing a more accurate representation of the population distribution. This information is a valuable resource for companies, researchers, and policymakers looking to gain a deeper understanding of the economic and social landscape of a community. Utilizing this data, they can make informed decisions related to resource allocation, planning, and policy development, and tailor initiatives to effectively address the challenges and opportunities facing the population. The dataset can be provided by country, department, municipality, zone, polygon, etc.
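
For readers unfamiliar with geohash7, the sketch below shows the standard geohash encoding at 7 characters and the resulting cell size in degrees. The function names are my own, and the metres-per-degree constant is an assumed equatorial approximation, so the metre figure only approximates the cell size quoted above.

```python
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"  # geohash alphabet (no a, i, l, o)


def geohash_encode(lat, lon, precision=7):
    """Encode a latitude/longitude pair as a geohash string."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0
    bit_count = 0
    use_lon = True  # bits alternate, starting with longitude
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits = (bits << 1) | 1
                lon_lo = mid
            else:
                bits <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits = (bits << 1) | 1
                lat_lo = mid
            else:
                bits <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:  # every 5 bits become one base-32 character
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0
    return "".join(chars)


def geohash_cell_degrees(precision=7):
    """Return (latitude_span, longitude_span) of one cell, in degrees."""
    total_bits = 5 * precision
    lon_bits = (total_bits + 1) // 2  # longitude goes first, so it gets the odd bit
    lat_bits = total_bits // 2
    return 180.0 / 2 ** lat_bits, 360.0 / 2 ** lon_bits


METERS_PER_DEGREE_AT_EQUATOR = 111_320  # assumed approximation
lat_deg, lon_deg = geohash_cell_degrees(7)
cell_width_m = lon_deg * METERS_PER_DEGREE_AT_EQUATOR  # roughly 150 m
```

At 7 characters the 35 bits split into 18 for longitude and 17 for latitude, which is why both spans come out equal in degrees; the east-west width in metres shrinks away from the equator.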
Mexico Geodemographic Information Dataset
app.mobito.io
Updated Feb 23, 2023
Cite
(2023). Mexico Geodemographic Information Dataset [Dataset]. https://app.mobito.io/data-product/mexico-geodemographic-information-dataset
Explore at:
Dataset updated
Feb 23, 2023
Area covered
Mexico
Description
This dataset offers valuable insights into the demographic profile of a specific population, with data on factors such as age, income, and gender distribution, as well as number of homes and spending habits categorized into major expenditure categories such as food, transportation, and healthcare. The data is geocoded using geohash7 (152.9m x 152.4m), providing a more accurate representation of the population distribution. This information is a valuable resource for companies, researchers, and policymakers looking to gain a deeper understanding of the economic and social landscape of a community. Utilizing this data, they can make informed decisions related to resource allocation, planning, and policy development, and tailor initiatives to effectively address the challenges and opportunities facing the population. The dataset can be provided by country, state, municipality, colony, zone, polygon, etc.
App User Dataset
kaggle.com
Updated Sep 7, 2022
Cite
Kalle Fischer (2022). App User Dataset [Dataset]. https://www.kaggle.com/datasets/kallefischer/app-user-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 7, 2022
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Kalle Fischer
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
About Dataset

This dataset contains 6 columns and 10k rows about the demographics of the users of an app.

UID - User ID, unique identifier for every app user.

reg_date - Date that each user registered.

device - Operating system of the user.

Gender - Gender of the user.

Country - Country where the user downloaded the app.

Age - Age of the user.
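
A minimal sketch of loading rows with the six columns listed above into typed records. The column names come from the description; the ISO date format, the sample values, and the `AppUser` class are assumptions for illustration.

```python
import csv
import io
from dataclasses import dataclass
from datetime import date


@dataclass
class AppUser:
    uid: str
    reg_date: date
    device: str
    gender: str
    country: str
    age: int


def parse_users(csv_text):
    """Parse CSV text with the documented header into AppUser records."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        AppUser(
            uid=row["UID"],
            reg_date=date.fromisoformat(row["reg_date"]),  # assumes YYYY-MM-DD
            device=row["device"],
            gender=row["Gender"],
            country=row["Country"],
            age=int(row["Age"]),
        )
        for row in reader
    ]


# Hypothetical sample rows, not taken from the dataset.
SAMPLE = (
    "UID,reg_date,device,Gender,Country,Age\n"
    "1001,2022-01-15,iOS,F,DE,29\n"
    "1002,2022-02-03,Android,M,US,34\n"
)
users = parse_users(SAMPLE)
```

If the real file stores dates in another format, only the `reg_date` conversion needs to change.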
Lovoo v3 Dating App User Profiles and Statistics
kaggle.com
Updated Jan 15, 2023
Cite
The Devastator (2023). Lovoo v3 Dating App User Profiles and Statistics [Dataset]. https://www.kaggle.com/datasets/thedevastator/lovoo-v3-dating-app-user-profiles-and-statistics/discussion?sort=undefined
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 15, 2023
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
Lovoo v3 Dating App User Profiles and Statistics

Revealing popular user traits and behavior

By Jeffrey Mvutu Mabilama [source]

About this dataset

When dating apps like Tinder began to become more popular, users wanted to create the best profiles possible in order to maximize their chances of being noticed and gain more potential encounters. Unlike traditional dating platforms, these new ones required mutual attraction before allowing two people to chat, making it all the more important for users to create a great profile that would give them an advantage over others.

It was amidst this scene that we humans began paying attention to how charismatic and inspiring people presented themselves online. The most charismatic individuals tended to be the ones with the most followers or friends on social networks. This made us question what makes a great user profile and how one could make a lasting first impression in order to find true love or even just some new friendships. How do we recognize a truly charismatic person from their presentation on social media? Is there any way of quantifying charisma?

In 2015 I set out to research all this using Lovoo's newest dating app version, V3 (the iOS version), gathering user profile data such as age demographics, interest types (friendship, chatting or dating), language preferences etc., as well as usually unavailable metrics like number of profile visits, kisses received etc. I was also able to collect pictures of those user profiles in order to discern any correlations between appeal and reputation that may have existed at that time amongst Lovoo's population base.

My goal is for this dataset to help you answer those questions, related not just to romantic success but also to popularity/charisma, census/demographic studies, and even the detection of influential figures both within and outside Lovoo's platform. A starter analysis accompanies this dataset and can be used as a reference point when working with the data. Using this dataset, you can run your own investigations into:

* What type of person has attracted more visitors or potential matches than others?
* Which criteria can be used when determining someone's charm/likability among others?
* How does one optimize his/her dating app profile visibility so he/she won't remain unseen among other users?

Grab this amazing opportunity now! Kick-start your journey towards understanding the inner workings behind success in online relationships today!

How to use the dataset

To get started with this dataset, you first need to download it from Kaggle. Once downloaded, take a look at the column names to get an idea of what information is available. This data includes fields such as gender, age, name (and nickname), number of pictures uploaded, profile visits, kisses, fans and gifts received, and flirt interests (chatting or making friends). It also contains language specifics like detected languages for each user, as well as country and city of residence.

The most interesting section for your research is likely the number of details that have been filled in for each user, such as whether they are interested in chatting or making friends. Usually these data points allow us to infer more about a person's character, from jokester to serious individualist (or anything else!). The same holds true for their language preferences, which might reveal aspects of their cultural orientation or habits.

You may also want the collected data that was left out here: the imagery associated with users' profiles. Please contact JfreexDatasets_bot on Telegram if you would like access to this imagery, which has not yet been uploaded to Kaggle but is an integral part of understanding what makes a user profile attractive on these platforms, according to aesthetics theory applied in an authentic way when considering how each image adds sentimental appeal through its perspective and content focus, be it visually descriptive, emotive narrative, or personality coupled with expression and mood association, etcetera. Or simply download the relevant images yourself using the ready-made automated scripts via the website Grammak, where a GitHub repo exists: https://github.com/grammak580542008/Lovoo-v3-Profiles-Data # 1 year ago...

Finally, keep in mind that there are other ways data can be gathered besides just downloading it from Kaggle, such as Messenger bots or Customer Relationship Management systems which help companies serve...
Social Services Hoosier Health - Dataset - The Indiana Data Hub
hub.mph.in.gov
Updated Jul 21, 2021
Cite
(2021). Social Services Hoosier Health - Dataset - The Indiana Data Hub [Dataset]. https://hub.mph.in.gov/dataset/social-services-hoosier-health
Explore at:
Dataset updated
Jul 21, 2021
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Indiana
Description
Archived as of 5/30/2025: The datasets will no longer receive updates but the historical data will continue to be available for download. In August 2018, 10 optional questions were added to all online applications through the state for health coverage, the Supplemental Nutrition Assistance Program (SNAP), and Temporary Assistance for Needy Families (TANF). The data do not represent anyone who applied in person, by telephone, by mail, or by any other method. In 2019, 79% of those who applied for SNAP, TANF, or health coverage applied online. The assessment does not impact eligibility for SNAP, TANF, or health coverage. Applications are filed at a household level and may represent several individuals. The application includes demographic information for the person who applied and not all members of the household. An individual may complete an assessment every time they apply for health coverage, SNAP or TANF. If an individual completed the survey more than once with multiple applications for assistance, each set of survey responses is represented on the dashboard. If an individual completes more than one assessment when applying for multiple programs, only one assessment will be represented in the data. To ensure personally identifiable information is protected, all data are presented in aggregate and data representing 20 or fewer individuals in any county will not be displayed (the demographic field will show as 0). Because some survey responses are not included in the individual race categories shown here, total counts from the individual race categories add up to less than the total for the "All" race category.
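
The small-cell suppression rule described above (counts of 20 or fewer shown as 0) can be sketched as follows. The function name and the county figures are hypothetical; only the threshold comes from the description.

```python
SUPPRESSION_THRESHOLD = 20  # counts of 20 or fewer are displayed as 0


def suppress_small_counts(county_counts, threshold=SUPPRESSION_THRESHOLD):
    """Replace any count at or below the threshold with 0."""
    return {
        county: (count if count > threshold else 0)
        for county, count in county_counts.items()
    }


# Hypothetical counts for illustration only.
displayed = suppress_small_counts({"County A": 20, "County B": 21, "County C": 5})
```

One consequence for analysis: a displayed 0 may mean anything from 0 to 20 respondents, so sums over counties understate the true totals.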
Bangladesh - Population Counts
data.humdata.org
geotiff
Updated Sep 19, 2021
Cite
WorldPop (2021). Bangladesh - Population Counts [Dataset]. https://data.humdata.org/dataset/worldpop-population-counts-for-bangladesh
Explore at:
Available download formats: geotiff
Dataset updated
Sep 19, 2021
Dataset provided by
WorldPop
Area covered
Bangladesh
Description
WorldPop produces different types of gridded population count datasets, depending on the methods used and end application. Please make sure you have read our Mapping Populations overview page before choosing and downloading a dataset.

Bespoke methods used to produce datasets for specific individual countries are available through the WorldPop Open Population Repository (WOPR) link below. These are 100m resolution gridded population estimates using customized methods ("bottom-up" and/or "top-down") developed for the latest data available from each country. They can also be visualised and explored through the woprVision App.
The remaining datasets in the links below are produced using the "top-down" method, with either the unconstrained or constrained top-down disaggregation method used. Please make sure you read the Top-down estimation modelling overview page to decide on which datasets best meet your needs. Datasets are available to download in Geotiff and ASCII XYZ format at a resolution of 3 and 30 arc-seconds (approximately 100m and 1km at the equator, respectively):

- Unconstrained individual countries 2000-2020 (1km resolution): Consistent 1km resolution population count datasets created using unconstrained top-down methods for all countries of the World for each year 2000-2020.
- Unconstrained individual countries 2000-2020 (100m resolution): Consistent 100m resolution population count datasets created using unconstrained top-down methods for all countries of the World for each year 2000-2020.
- Unconstrained individual countries 2000-2020 UN adjusted (100m resolution): Consistent 100m resolution population count datasets created using unconstrained top-down methods for all countries of the World for each year 2000-2020 and adjusted to match United Nations national population estimates (UN 2019).
- Unconstrained individual countries 2000-2020 UN adjusted (1km resolution): Consistent 1km resolution population count datasets created using unconstrained top-down methods for all countries of the World for each year 2000-2020 and adjusted to match United Nations national population estimates (UN 2019).
- Unconstrained global mosaics 2000-2020 (1km resolution): Mosaiced 1km resolution versions of the "Unconstrained individual countries 2000-2020" datasets.
- Constrained individual countries 2020 (100m resolution): Consistent 100m resolution population count datasets created using constrained top-down methods for all countries of the World for 2020.
- Constrained individual countries 2020 UN adjusted (100m resolution): Consistent 100m resolution population count datasets created using constrained top-down methods for all countries of the World for 2020 and adjusted to match United Nations national population estimates (UN 2019).
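
The "100m" and "1km" labels in the list above are shorthand for 3 and 30 arc-second grids. A quick sketch of the conversion at the equator, assuming an equatorial circumference of about 40,075 km (the constant and function name are my own):

```python
EQUATORIAL_CIRCUMFERENCE_M = 40_075_017  # assumed approximate value, in metres
ARCSEC_PER_CIRCLE = 360 * 3600


def arcsec_to_meters_at_equator(arcsec):
    """Length in metres of an arc of `arcsec` arc-seconds along the equator."""
    return arcsec * EQUATORIAL_CIRCUMFERENCE_M / ARCSEC_PER_CIRCLE


fine_grid_m = arcsec_to_meters_at_equator(3)     # a little under 100 m
coarse_grid_m = arcsec_to_meters_at_equator(30)  # a little under 1 km
```

Away from the equator the east-west extent of a cell shrinks with the cosine of the latitude, so the nominal resolutions are upper bounds on cell width.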

Older datasets produced for specific individual countries and continents, using a set of tailored geospatial inputs and differing "top-down" methods and time periods are still available for download here: Individual countries and Whole Continent.

Data for earlier dates is available directly from WorldPop.

WorldPop (www.worldpop.org - School of Geography and Environmental Science, University of Southampton; Department of Geography and Geosciences, University of Louisville; Departement de Geographie, Universite de Namur) and Center for International Earth Science Information Network (CIESIN), Columbia University (2018). Global High Resolution Population Denominators Project - Funded by The Bill and Melinda Gates Foundation (OPP1134076). https://dx.doi.org/10.5258/SOTON/WP00645
Africa Population Distribution Database
search.dataone.org
Updated Nov 17, 2014
Cite
Deichmann, Uwe; Nelson, Andy (2014). Africa Population Distribution Database [Dataset]. https://search.dataone.org/view/Africa_Population_Distribution_Database.xml
Explore at:
Dataset updated
Nov 17, 2014
Dataset provided by
Regional and Global Biogeochemical Dynamics Data (RGD)
Authors
Deichmann, Uwe; Nelson, Andy
Time period covered
Jan 1, 1960 - Dec 31, 1997
Area covered

Description
The Africa Population Distribution Database provides decadal population density data for African administrative units for the period 1960-1990. The database was prepared for the United Nations Environment Programme / Global Resource Information Database (UNEP/GRID) project as part of an ongoing effort to improve global, spatially referenced demographic data holdings. The database is useful for a variety of applications including strategic-level agricultural research and applications in the analysis of the human dimensions of global change.

This documentation describes the third version of a database of administrative units and associated population density data for Africa. The first version was compiled for UNEP's Global Desertification Atlas (UNEP, 1997; Deichmann and Eklundh, 1991), while the second version represented an update and expansion of this first product (Deichmann, 1994; WRI, 1995). The current work is also related to National Center for Geographic Information and Analysis (NCGIA) activities to produce a global database of subnational population estimates (Tobler et al., 1995), and an improved database for the Asian continent (Deichmann, 1996). The new version for Africa provides considerably more detail: more than 4700 administrative units, compared to about 800 in the first and 2200 in the second version. In addition, for each of these units a population estimate was compiled for 1960, 1970, 1980 and 1990, which provides an indication of past population dynamics in Africa. Population count data files are forthcoming as download options.

African population density data were compiled from a large number of heterogeneous sources, including official government censuses and estimates/projections derived from yearbooks, gazetteers, area handbooks, and other country studies. The political boundaries template (PONET) of the Digital Chart of the World (DCW) was used to delineate national boundaries and coastlines for African countries.

For more information on African population density and administrative boundary data sets, see metadata files at [http://na.unep.net/datasets/datalist.php3] which provide information on file identification, format, spatial data organization, distribution, and metadata reference.

References:

Deichmann, U. 1994. A medium resolution population database for Africa, Database documentation and digital database, National Center for Geographic Information and Analysis, University of California, Santa Barbara.

Deichmann, U. and L. Eklundh. 1991. Global digital datasets for land degradation studies: A GIS approach, GRID Case Study Series No. 4, Global Resource Information Database, United Nations Environment Programme, Nairobi.

UNEP. 1997. World Atlas of Desertification, 2nd Ed., United Nations Environment Programme, Edward Arnold Publishers, London.

WRI. 1995. Africa data sampler, Digital database and documentation, World Resources Institute, Washington, D.C.
Everyone Counts - Demographic Data
hub-cookcountyil.opendata.arcgis.com
Updated Jun 28, 2023
Cite
Cook County Government (2023). Everyone Counts - Demographic Data [Dataset]. https://hub-cookcountyil.opendata.arcgis.com/datasets/everyone-counts-demographic-data
Explore at:
Dataset updated
Jun 28, 2023
Dataset authored and provided by
Cook County Government
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
The demographic data behind the Everyone Counts application covers a broad spectrum of variables that contribute to the creation of informative tables and charts. These visual representations provide valuable insights into crucial population characteristics, including age, gender, race, ethnicity, and language spoken at home. Moreover, the data offers a deeper understanding of socioeconomic factors like educational attainment, income levels, and poverty rates. Furthermore, housing-related information such as housing tenure, household size, and occupancy adds an additional layer of knowledge about communities. This data is the source of the hosted view layer that is consumed by the Everyone Counts application. For full documentation detailing how this dataset was created, see the document here. For a description of all the variables, see the data dictionary here.
TagX Web Browsing clickstream Data - 300K Users North America, EU - GDPR -...
datarade.ai
.json, .csv, .xls
Updated Sep 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TagX (2024). TagX Web Browsing clickstream Data - 300K Users North America, EU - GDPR - CCPA Compliant [Dataset]. https://datarade.ai/data-products/tagx-web-browsing-clickstream-data-300k-users-north-america-tagx
Explore at:
Available download formats: .json, .csv, .xls
Dataset updated
Sep 16, 2024
Dataset authored and provided by
TagX
Area covered
United States
Description
TagX Web Browsing Clickstream Data: Unveiling Digital Behavior Across North America and EU

Unique Insights into Online User Behavior

TagX Web Browsing clickstream Data offers an unparalleled window into the digital lives of 1 million users across North America and the European Union. This comprehensive dataset stands out in the market due to its breadth, depth, and stringent compliance with data protection regulations.

What Makes Our Data Unique?

- Extensive Geographic Coverage: Spanning two major markets, our data provides a holistic view of web browsing patterns in developed economies.
- Large User Base: With 300K active users, our dataset offers statistically significant insights across various demographics and user segments.
- GDPR and CCPA Compliance: We prioritize user privacy and data protection, ensuring that our data collection and processing methods adhere to the strictest regulatory standards.
- Real-time Updates: Our clickstream data is continuously refreshed, providing up-to-the-minute insights into evolving online trends and user behaviors.
- Granular Data Points: We capture a wide array of metrics, including time spent on websites, click patterns, search queries, and user journey flows.

Data Sourcing: Ethical and Transparent

Our web browsing clickstream data is sourced through a network of partnered websites and applications. Users explicitly opt-in to data collection, ensuring transparency and consent. We employ advanced anonymization techniques to protect individual privacy while maintaining the integrity and value of the aggregated data. Key aspects of our data sourcing process include:

- Voluntary user participation through clear opt-in mechanisms
- Regular audits of data collection methods to ensure ongoing compliance
- Collaboration with privacy experts to implement best practices in data anonymization
- Continuous monitoring of regulatory landscapes to adapt our processes as needed

Primary Use Cases and Verticals

TagX Web Browsing clickstream Data serves a multitude of industries and use cases, including but not limited to:

Digital Marketing and Advertising:

Audience segmentation and targeting
Campaign performance optimization
Competitor analysis and benchmarking

E-commerce and Retail:

Customer journey mapping
Product recommendation enhancements
Cart abandonment analysis

Media and Entertainment:

Content consumption trends
Audience engagement metrics
Cross-platform user behavior analysis

Financial Services:

Risk assessment based on online behavior
Fraud detection through anomaly identification
Investment trend analysis

Technology and Software:

User experience optimization
Feature adoption tracking
Competitive intelligence

Market Research and Consulting:

Consumer behavior studies
Industry trend analysis
Digital transformation strategies

Integration with Broader Data Offering

TagX Web Browsing clickstream Data is a cornerstone of our comprehensive digital intelligence suite. It seamlessly integrates with our other data products to provide a 360-degree view of online user behavior:

Social Media Engagement Data: Combine clickstream insights with social media interactions for a holistic understanding of digital footprints.
Mobile App Usage Data: Cross-reference web browsing patterns with mobile app usage to map the complete digital journey.
Purchase Intent Signals: Enrich clickstream data with purchase intent indicators to power predictive analytics and targeted marketing efforts.
Demographic Overlays: Enhance web browsing data with demographic information for more precise audience segmentation and targeting.

By leveraging these complementary datasets, businesses can unlock deeper insights and drive more impactful strategies across their digital initiatives.

Data Quality and Scale

We pride ourselves on delivering high-quality, reliable data at scale:

Rigorous Data Cleaning: Advanced algorithms filter out bot traffic, VPNs, and other non-human interactions.
Regular Quality Checks: Our data science team conducts ongoing audits to ensure data accuracy and consistency.
Scalable Infrastructure: Our robust data processing pipeline can handle billions of daily events, ensuring comprehensive coverage.
Historical Data Availability: Access up to 24 months of historical data for trend analysis and longitudinal studies.
Customizable Data Feeds: Tailor the data delivery to your specific needs, from raw clickstream events to aggregated insights.

Empowering Data-Driven Decision Making

In today's digital-first world, understanding online user behavior is crucial for businesses across all sectors. TagX Web Browsing clickstream Data empowers organizations to make informed decisions, optimize their digital strategies, and stay ahead of the competition. Whether you're a marketer looking to refine your targeting, a product manager seeking to enhance user experience, or a researcher exploring digital trends, our cli...
Mali - National Demographic and Health Data
data.humdata.org
csv
Updated Jun 20, 2025
Cite
The DHS Program (2025). Mali - National Demographic and Health Data [Dataset]. https://data.humdata.org/dataset/dhs-data-for-mali
Explore at:
Available download formats: csv(11572), csv(6234), csv(11447), csv(38782), csv(5869), csv(100634), csv(31361), csv(82347), csv(16517), csv(18697), csv(41841), csv(211020), csv(27311), csv(14660), csv(13030), csv(14428), csv(2155), csv(3780), csv(30444), csv(12372), csv(9854), csv(10627), csv(43422), csv(39898), csv(70100), csv(11197), csv(20241), csv(19966), csv(56415), csv(141414), csv(5103), csv(9949), csv(13404), csv(4098), csv(33999), csv(8611), csv(13991), csv(32777), csv(20352), csv(17790)
Dataset updated
Jun 20, 2025
Dataset provided by
The DHS Program
Description
Contains data from the DHS data portal. There is also a dataset containing Mali - Subnational Demographic and Health Data on HDX.

The DHS Program Application Programming Interface (API) provides software developers access to aggregated indicator data from The Demographic and Health Surveys (DHS) Program. The API can be used to create various applications to help analyze, visualize, explore and disseminate data on population, health, HIV, and nutrition from more than 90 countries.
School information and student demographics
data.ontario.ca
datasets.ai
+1 more
xlsx
Updated May 22, 2025
Cite
Education (2025). School information and student demographics [Dataset]. https://data.ontario.ca/dataset/school-information-and-student-demographics
Explore at:
Available download formats: xlsx(1565910), xlsx(1550796), xlsx(1566878), xlsx(1565304), xlsx(1562805), xlsx(1459001), xlsx(1475787), xlsx(1462006), xlsx(1460629), xlsx(1547704), xlsx(1567330), xlsx(1580734), xlsx(1492217), xlsx(1462064)
Dataset updated
May 22, 2025
Dataset authored and provided by
Education
License
https://www.ontario.ca/page/open-government-licence-ontario
Time period covered
May 1, 2025
Area covered
Ontario
Description
Data includes: board and school information, grade 3 and 6 EQAO student achievements for reading, writing and mathematics, and grade 9 mathematics EQAO and OSSLT. Data excludes private schools, Education and Community Partnership Programs (ECPP), summer, night and continuing education schools.

How Are We Protecting Privacy?

Results for OnSIS and Statistics Canada variables are suppressed based on school population size to better protect student privacy. In order to achieve this additional level of protection, the Ministry has used a methodology that randomly rounds a percentage either up or down depending on school enrolment. In order to protect privacy, the ministry does not publicly report on data when there are fewer than 10 individuals represented.
* Percentages depicted as 0 may not always be 0 values, as in certain situations the values have been randomly rounded down or there are no reported results at a school for the respective indicator.
* Percentages depicted as 100 are not always 100; in certain situations the values have been randomly rounded up.
The school enrolment totals have been rounded to the nearest 5 in order to better protect and maintain student privacy.
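
The two deterministic rules described above (rounding enrolment totals to the nearest 5, and not reporting results when fewer than 10 individuals are represented) could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the function names are hypothetical, and the Ministry's actual procedure, including how percentages are randomly rounded up or down, is not specified here and is not reproduced.

```python
from typing import Optional


def round_enrolment_to_nearest_5(enrolment: int) -> int:
    """Round a non-negative head count to the nearest multiple of 5 (hypothetical helper)."""
    if enrolment < 0:
        raise ValueError("enrolment must be non-negative")
    # Adding 2 before integer division rounds remainders 3 and 4 up and
    # remainders 0, 1 and 2 down. An integer is never exactly halfway
    # between two multiples of 5, so no tie-breaking rule is needed.
    return 5 * ((enrolment + 2) // 5)


def suppress_small_count(count: int, minimum: int = 10) -> Optional[int]:
    """Return None (not reported) when fewer than `minimum` individuals are represented."""
    return None if count < minimum else count


if __name__ == "__main__":
    for n in (7, 12, 13, 248):
        print(n, round_enrolment_to_nearest_5(n), suppress_small_count(n))
```

A consequence for analysis is that a published enrolment total may differ from the true count by up to 2, and a missing value may indicate suppression rather than an absence of data.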

The information in the School Information Finder is the most current available to the Ministry of Education at this time, as reported by schools, school boards, EQAO and Statistics Canada. The information is updated as frequently as possible.

This information is also available on the Ministry of Education's School Information Finder website by individual school.

Descriptions for some of the data types can be found in our glossary.

School/school board and school authority contact information are updated and maintained by school boards and may not be the most current version. For the most recent information please visit: https://data.ontario.ca/dataset/ontario-public-school-contact-information.
Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Data |...
datarade.ai
.csv
Updated Aug 14, 2024
Cite
Dataplex (2024). Dataplex: All CMS Data Feeds | Access 1519 Reports & 26B+ Rows of Data | Perfect for Historical Analysis & Easy Ingestion [Dataset]. https://datarade.ai/data-products/dataplex-all-cms-data-feeds-access-1519-reports-26b-row-dataplex
Explore at:
Available download formats: .csv
Dataset updated
Aug 14, 2024
Dataset authored and provided by
Dataplex
Area covered
United States of America
Description
The All CMS Data Feeds dataset is an expansive resource offering access to 118 unique report feeds, providing in-depth insights into various aspects of the U.S. healthcare system. With over 25.8 billion rows of data meticulously collected since 2007, this dataset is invaluable for healthcare professionals, analysts, researchers, and businesses seeking to understand and analyze healthcare trends, performance metrics, and demographic shifts over time. The dataset is updated monthly, ensuring that users always have access to the most current and relevant data available.

Dataset Overview:

118 Report Feeds:

The dataset includes a wide array of report feeds, each providing unique insights into different dimensions of healthcare. These topics include Medicare and Medicaid service metrics, patient demographics, provider information, financial data, and much more. The breadth of information ensures that users can find relevant data for nearly any healthcare-related analysis.

As CMS releases new report feeds, they are automatically added to this dataset, keeping it current and expanding its utility for users.

25.8 Billion Rows of Data:

With over 25.8 billion rows of data, this dataset provides a comprehensive view of the U.S. healthcare system. This extensive volume of data allows for granular analysis, enabling users to uncover insights that might be missed in smaller datasets. The data is also meticulously cleaned and aligned, ensuring accuracy and ease of use.

Historical Data Since 2007:

The dataset spans from 2007 to the present, offering a rich historical perspective that is essential for tracking long-term trends and changes in healthcare delivery, policy impacts, and patient outcomes. This historical data is particularly valuable for conducting longitudinal studies and evaluating the effects of various healthcare interventions over time.

Monthly Updates:

To ensure that users have access to the most current information, the dataset is updated monthly. These updates include new reports as well as revisions to existing data, making the dataset a continuously evolving resource that stays relevant and accurate.

Data Sourced from CMS:

The data in this dataset is sourced directly from the Centers for Medicare & Medicaid Services (CMS). After collection, the data is meticulously cleaned and its attributes are aligned, ensuring consistency, accuracy, and ease of use for any application. Furthermore, any new updates or releases from CMS are automatically integrated into the dataset, keeping it comprehensive and current.

Use Cases:

Market Analysis:

The dataset is ideal for market analysts who need to understand the dynamics of the healthcare industry. The extensive historical data allows for detailed segmentation and analysis, helping users identify trends, market shifts, and growth opportunities. The comprehensive nature of the data enables users to perform in-depth analyses of specific market segments, making it a valuable tool for strategic decision-making.

Healthcare Research:

Researchers will find the All CMS Data Feeds dataset to be a robust foundation for academic and commercial research. The historical data, combined with the breadth of coverage across various healthcare metrics, supports rigorous, in-depth analysis. Researchers can explore the effects of healthcare policies, study patient outcomes, analyze provider performance, and more, all within a single, comprehensive dataset.

Performance Tracking:

Healthcare providers and organizations can use the dataset to track performance metrics over time. By comparing data across different periods, organizations can identify areas for improvement, monitor the effectiveness of initiatives, and ensure compliance with regulatory standards. The dataset provides the detailed, reliable data needed to track and analyze key performance indicators.

Compliance and Regulatory Reporting:

The dataset is also an essential tool for compliance officers and those involved in regulatory reporting. With detailed data on provider performance, patient outcomes, and healthcare utilization, the dataset helps organizations meet regulatory requirements, prepare for audits, and ensure adherence to best practices. The accuracy and comprehensiveness of the data make it a trusted resource for regulatory compliance.

Data Quality and Reliability:

The All CMS Data Feeds dataset is designed with a strong emphasis on data quality and reliability. Each row of data is meticulously cleaned and aligned, ensuring that it is both accurate and consistent. This attention to detail makes the dataset a trusted resource for high-stakes applications, where data quality is critical.

Integration and Usability:

Ease of Integration:

The dataset is provided in a CSV format, which is widely compatible with most data analysis tools and platforms. This ensures that users can easily integrate the data into their existing wo...
Dataset.
plos.figshare.com
xlsx
Updated Oct 25, 2023
Cite
Jennifer J. Lee; Mavra Ahmed; Rim Mouhaffel; Mary R. L’Abbé (2023). Dataset. [Dataset]. http://doi.org/10.1371/journal.pdig.0000360.s005
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.1371/journal.pdig.0000360.s005
Dataset updated
Oct 25, 2023
Dataset provided by
PLOS Digital Health
Authors
Jennifer J. Lee; Mavra Ahmed; Rim Mouhaffel; Mary R. L’Abbé
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
There has been an increased emphasis on plant-based foods and diets. Although mobile technology has the potential to be a convenient and innovative tool to help consumers adhere to dietary guidelines, little is known about the content and quality of free, popular mobile health (mHealth) plant-based diet apps. The objective of the study was to assess the content and quality of free, popular mHealth apps supporting plant-based diets for Canadians. Free mHealth apps with high user ratings, a high number of user ratings, available on both Apple App and GooglePlay stores, and primarily marketed to help users follow a plant-based diet were included. Using pre-defined search terms, Apple App and GooglePlay App stores were searched on December 22, 2020; the top 100 returns for each search term were screened for eligibility. Included apps were downloaded and assessed for quality by three dietitians/nutrition research assistants using the Mobile App Rating Scale (MARS) and the App Quality Evaluation (AQEL) scale. Of the 998 apps screened, 16 apps (mean user ratings±SEM: 4.6±0.1) met the eligibility criteria, comprising 10 recipe managers and meal planners, 2 food scanners, 2 community builders, 1 restaurant identifier, and 1 sustainability assessor. All included apps targeted the general population and focused on changing behaviors using education (15 apps), skills training (9 apps), and/or goal setting (4 apps). Although MARS (scale: 1–5) revealed overall adequate app quality scores (3.8±0.1), domain-specific assessments revealed high functionality (4.0±0.1) and aesthetic (4.0±0.2), but low credibility scores (2.4±0.1). The AQEL (scale: 0–10) revealed an overall low score in support of knowledge acquisition (4.5±0.4) and adequate scores in other nutrition-focused domains (6.1–7.6).
Despite a variety of free plant-based apps available with different focuses to help Canadians follow plant-based diets, our findings suggest a need for increased credibility and additional resources to complement the low support of knowledge acquisition among currently available plant-based apps. This research received no specific grant from any funding agency.
Hoosier Health and Well-being By County and Date - Dataset - The Indiana...
hub.mph.in.gov
Updated Mar 5, 2021
+ more versions
Cite
(2021). Hoosier Health and Well-being By County and Date - Dataset - The Indiana Data Hub [Dataset]. https://hub.mph.in.gov/dataset/hoosier-health-and-well-being-by-county-and-date
Dataset updated
Mar 5, 2021
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Indiana
Description
In August of 2018, FSSA’s Office of Healthy Opportunities deployed a social risk assessment survey. The 10-question survey was made available to anyone applying online through FSSA for health coverage, the Supplemental Nutrition Assistance Program (SNAP) or Temporary Assistance for Needy Families (TANF). The results of this survey are aggregated and presented below and can help communities better understand the social risk factors affecting the health of those applying for our services. Please read and review the following information regarding the use of this data prior to viewing the tool.

This survey was made available ONLY to individuals who applied online and does not represent anyone who applied in person, by telephone, by mail or by any other method. In 2018, online applications accounted for 79% of those who applied for SNAP, TANF or health coverage. Survey completion is voluntary and does not impact eligibility for SNAP, TANF or health coverage.

Applications are filed at a household level and may represent several individuals. The application process identifies a primary contact person for the household, and that individual’s demographics (for example, the person’s gender, race and education level) are represented on the dashboard. An individual who completes more than one application and survey over any given time period is represented once for each instance, and the survey answers and demographic details are based on each application’s responses. For example, an applicant’s age, education level and survey answers can change over time, and the reporting reflects any such changes.

All information is presented in aggregate to ensure personally identifiable information is protected. To protect the privacy of individuals, data representing 20 or fewer individuals in any county will not be displayed; that is, it will show as blank.
dataset for dating app use and TNSB.sav
figshare.com
bin
Updated Jan 16, 2024
Cite
Yao Yao (2024). dataset for dating app use and TNSB.sav [Dataset]. http://doi.org/10.6084/m9.figshare.25001390.v1
Explore at:
Available download formats: bin
Unique identifier
https://doi.org/10.6084/m9.figshare.25001390.v1
Dataset updated
Jan 16, 2024
Dataset provided by
figshare
Authors
Yao Yao
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This research conducted an online survey to investigate the relationship between dating app use and hookup intention. It measured dating app use, perceived descriptive norms, injunctive norms, fear of negative evaluation, hookup intention, and demographic information including age, gender, sexual orientation, and relationship status.
IMA-AIM data set including Permanent Sample
www-acc.healthinformationportal.eu
healthinformationportal.eu
html
Updated Mar 2, 2022
Cite
IMA-AIM (2022). IMA-AIM data set including Permanent Sample [Dataset]. https://www-acc.healthinformationportal.eu/services/find-data?page=35
Explore at:
Available download formats: html
Dataset updated
Mar 2, 2022
Dataset authored and provided by
IMA-AIM
License
https://aim-ima.be/Donnees-individuelles-realiser-l?lang=fr
Variables measured
sex, title, topics, country, language, data_owners, description, contact_name, geo_coverage, contact_email, and 12 more
Measurement technique
Hospital resources & Healthcare resources
Description
IMA-AIM can provide you with detailed data on the health care system in Belgium. Their data collection includes information on the reimbursed care and medicines of the 11 million citizens insured in our country. The data is collected by the 7 health insurance funds and processed, analysed and made available for research by IMA-AIM.

The seven health insurance funds in Belgium collect a lot of data about their members in order to be able to carry out their tasks. IMA-AIM brings these data together in databases for the purpose of analysis and research. The databases contain three types of data: population data (demographic and socio-economic characteristics), information about reimbursed health care and information about reimbursed medicines.

The Permanent Sample (EPS) is a longitudinal dataset containing data from the Population, Health Care and Pharmanet databases, as well as data on hospitalisations. The data are available in separate datasets per calendar year. The aim of EPS is to make the administrative data of the health insurance funds permanently available to a number of federal and regional partners. More information about the EPS: https://metadata.ima-aim.be/nl/app/bdds/Ps
