7 datasets found

Dating App Fame & Behavior
kaggle.com
Updated May 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Utkarsh Singh (2023). Dating App Fame & Behavior [Dataset]. https://www.kaggle.com/utkarshx27/lovoo-dating-app-dataset/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 16, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Utkarsh Singh
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13364933%2F23694fae55e2e76299358693ba6f32b9%2Flv-share.jpg?generation=1684843825246772&alt=media" alt=""> ➡️ There are total 3 datasets containing valuable information. ➡️ Understand people's fame and behavior's on a dating app platform. | Column Name | Description | |---------------------|------------------------------| | Age | The age of the user. | | Number of Users | The total number of users. | | Percent Want Chats | Percentage of users who want chats. | | Percent Want Friends| Percentage of users who want friendships. | | Percent Want Dates | Percentage of users who want romantic dates. | | Mean Kisses Received| Average number of kisses received by users. | | Mean Visits Received| Average number of profile visits received by users. | | Mean Followers | Average number of followers for each user. | | Mean Languages Known| Average number of languages known by users. | | Total Want Chats | Total count of users interested in chats. | | Total Want Friends | Total count of users looking for friendships. | | Total Want Dates | Total count of users seeking romantic dates. | | Total Kisses Received| Overall count of kisses received by users. | | Total Visits Received| Overall count of profile visits received by users. | | Total Followers | Overall count of followers for all users. | | Total Languages Spoken| Total count of languages spoken by all users. |

SUMMARY

When Dating apps like Tinder were becoming viral, people wanted to have the best profile in order to get more matches and more potential encounters. Unlike other previous dating platforms, those new ones emphasized on the mutuality of attraction before allowing any two people to get in touch and chat. This made it all the more important to create the best profile in order to get the best first impression.

Parallel to that, we Humans have always been in awe before charismatic and inspiring people. The more charismatic people tend to be followed and listened to by more people. Through their metrics such as the number of friends/followers, social networks give some ways of "measuring" the potential charisma of some people.

In regard to all that, one can then think:

what makes a great user profile ? how to make the best first impression in order to get more matches (and ultimately find love, or new friendships) ? what makes a person charismatic ? how do charismatic people present themselves ? In order to try and understand those different social questions, I decided to create a dataset of user profile informations using the social network Lovoo when it came out. By using different methodologies, I was able to gather user profile data, as well as some usually unavailable metrics (such as the number of profile visits).

Content

The dataset contains user profile infos of users of the website Lovoo.

The dataset was gathered during spring 2015 (april, may). At that time, Lovoo was expanding in european countries (among others), while Tinder was trending both in America and in Europe. At that time the iOS version of the Lovoo app was in version 3.

Accessory image data The dataset references pictures (field pictureId) of user profiles. These pictures are also available for a fraction of users but have not been uploaded and should be asked separately.

The idea when gathering the profile pictures was to determine whether some correlations could be identified between a profile picture and the reputation or success of a given profile. Since first impression matters, a sound hypothesis to make is that the profile picture might have a great influence on the number of profile visits, matches and so on. Do not forget that only a fraction of a user's profile is seen when browsing through a list of users.

https://s1.dmcdn.net/v/BnWkG1M7WuJDq2PKP/x480

Details about collection methodology In order to gather the data, I developed a set of tools that would save the data while browsing through profiles and doing searches. Because of this approach (and the constraints that forced me to develop this approach) I could only gather user profiles that were recommended by Lovoo's algorithm for 2 profiles I created for this purpose occasion (male, open to friends & chats & dates). That is why there are only female users in the dataset. Another work could be done to fetch similar data for both genders or other age ranges.

Regarding the number of user profiles It turned out that the recommendation algorithm always seemed to output the same set of user profiles. This meant Lovoo's algorithm was probably heavily relying on settings like location (to recommend more people nearby than people in different places or countries) and maybe cookies. This diminished the number of different user profiles that would be pr...
h
this-person-does-not-exist
huggingface.co
Updated Mar 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tales Leonidas (2024). this-person-does-not-exist [Dataset]. https://huggingface.co/datasets/TLeonidas/this-person-does-not-exist
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 24, 2024
Authors
Tales Leonidas
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This dataset consists of 8892 AI-generated profile pictures downloaded from (https://www.kaggle.com/datasets/pablobedolla/this-person-does-not-exist-data)
Human faces align crop and segment
kaggle.com
Updated Jan 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arnaud ROUGETET (2021). Human faces align crop and segment [Dataset]. https://www.kaggle.com/arnaud58/human-faces-align-crop-and-segment/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 6, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Arnaud ROUGETET
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

A collection of 6.6k+ images useful for multiple use cases such image identifiers, classifier algorithms etc.

The pictures are from Human faces fataset but align, crop and segment with the algorith of minivision

https://raw.githubusercontent.com/minivision-ai/photo2cartoon/master/images/data_process.jpg" alt="operation of align/crop/segment">

Content

A mix of front face, side profile pictures which will help achieve great identifying results and improved range of classifier possibilities.
Instagram Images with Captions
kaggle.com
Updated Mar 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prithvi Jaunjale (2020). Instagram Images with Captions [Dataset]. https://www.kaggle.com/prithvijaunjale/instagram-images-with-captions/activity
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 21, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Prithvi Jaunjale
Description
Context

There's a story behind every dataset and here's your opportunity to share yours.

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Real vs fake faces
kaggle.com
Updated May 4, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Udit Sharma (2022). Real vs fake faces [Dataset]. https://www.kaggle.com/datasets/uditsharma72/real-vs-fake-faces
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 4, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Udit Sharma
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
About Dataset This dataset contains real and fake images of human faces. Real and Fake Face Detection Fake Face Photos by Photoshop Experts Introduction When using social networks, have you ever encountered a 'fake identity'? Anyone can create a fake profile image using image editing tools, or even using deep learning based generators. If you are interested in making the world wide web a better place by recognizing such fake faces, you should check this dataset.
Fake/Authentic User Instagram
kaggle.com
zip
Updated Feb 11, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kristo Radion Purba (2021). Fake/Authentic User Instagram [Dataset]. https://www.kaggle.com/krpurba/fakeauthentic-user-instagram
Explore at:
zip(3451107 bytes)Available download formats
Dataset updated
Feb 11, 2021
Authors
Kristo Radion Purba
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Kindly refer to my paper for more information. Please cite my work if you use my dataset in any work : K. R. Purba, D. Asirvatham and R. K. Murugesan, "Classification of instagram fake users using supervised machine learning algorithms," International Journal of Electrical and Computer Engineering (IJECE), vol. 10, no. 3, pp. 2763-2772, 2020.

The dataset was collected using web scraping from third-party Instagram websites, to capture their metadata and up to 12 latest media posts from each user. The collection process was executed from September 1st, 2019, until September 20th, 2019. The dataset contains authentic users and fake users, which were filtered using human annotators. The authentic users were taken from followers of 24 private university pages (8 Indonesian, 8 Malaysian, 8 Australian) on Instagram. To reduce the number of users, they are picked using proportional random sampling based on their source university. All private users were removed, which is a total of 31,335 out of 63,795 users (49.11%). The final number of public users used in this research was 32,460 users.

Var name | Feature name | Description pos | Num posts | Number of total posts that the user has ever posted. flg | Num following | Number of following flr | Num followers | Number of followers bl | Biography length | Length (number of characters) of the user's biography pic | Picture availability | Value 0 if the user has no profile picture, or 1 if has lin | Link availability | Value 0 if the user has no external URL, or 1 if has cl | Average caption length | The average number of character of captions in media cz | Caption zero | Percentage (0.0 to 1.0) of captions that has almost zero (<=3) length ni | Non image percentage | Percentage (0.0 to 1.0) of non-image media. There are three types of media on an Instagram post, i.e. image, video, carousel erl | Engagement rate (Like) | Engagement rate (ER) is commonly defined as (num likes) divide by (num media) divide by (num followers) erc | Engagement rate (Comm.) | Similar to ER like, but it is for comments lt | Location tag percentage | Percentage (0.0 to 1.0) of posts tagged with location hc | Average hashtag count | Average number of hashtags used in a post pr | Promotional keywords | Average use of promotional keywords in hashtag, i.e. {regrann, contest, repost, giveaway, mention, share, give away, quiz} fo | Followers keywords | Average use of followers hunter keywords in hashtag, i.e. {follow, like, folback, follback, f4f} cs | Cosine similarity | Average cosine similarity of between all pair of two posts a user has pi | Post interval | Average interval between posts (in hours)

Output : 2-class User classes : r (real/authentic user), f (fake user / bought followers) 4-class User classes : r (authentic/real user), a (active fake user), i (inactive fake user), s (spammer fake user) Note that the 3 fake user classes (a, i, s) were judged by human annotators.
Colorimetry standard fruit images
kaggle.com
Updated Dec 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peter McAtee (2021). Colorimetry standard fruit images [Dataset]. https://www.kaggle.com/datasets/petermcatee/colorimetry-standard-fruit-images/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 13, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Peter McAtee
Description
Inspiration

This dataset was generated to provide a benchmark set of images to test colorimetric algorithms against. We hope that it will be useful in the future to progress the ability of the scientific communities to develop algorithms/methodologies that can reduce the density of colour data contained in complex images like fruit in a unbias, reliable, and rapid manner.

Context

Contained within a a set of standard images that can be used in future studies to benchmark algorithms that undertake colorimetric analysis. These images consist of cross sectional fruit and tubers that contain complex colour gradients across the visible spectrum. These images were analyzed in our publication "A data driven approach to assess complex colour profiles in plant tissues"

A selection of 28 species of fruit and tubers was purchased from a local supermarket in Auckland, New Zealand. These fruit and tubers represented different families including Anacardiaceae (mango), Ebenaceae (persimmon), Actinidiaceae (kiwifruit), Lauraceae (avocado), Musaceae (banana), Rosaceae (apple, peach, pear, plum, and strawberry), Rutaceae (grapefruit, lemon, mandarin, and orange), and Solanaceae (potato, tamarillo, and tomato). Each fruit was cross sectioned along its most symmetrical side. Up to three cross sections of the same fruit type were placed face down on the scanner on a predefined 3x1 grid with defined positions to allow image capture of the individual fruit.

The images contained here were captured using a Canon LIDE 220 flatbed scanner (Scanning element sensor: CIS, Light source: 3 colour RGB LED) that was placed in a 2mm black perspex box with a retractable lid that completely blocked ambient light. Parent images with dimensions of 4960 pixels (W) and 7015 pixels (H) was acquired at a resolution of 600 dots-per-inch/pixels-per-inch (DPI/PPI) and output in a TIFF format. Each sibling image was calibrated using the white tile standard on the X-Rite mini-colour checker card that was included in each scanned parent image (McCamy, C. S. et al., 1976) and segmented.

Mccamy, C.S., Marcus, H., and Davidson, J.G. (1976). A Color Rendition Chart. Journal of Applied Photographic Engineering 11, 95-99.

Content

This folder contains 81 fruit images in TIF format.

Each TIF image contains two stacks. Stack-1 is a RGB image of the cross-section and Stack-2 contains a binary image the defines the of the fruit/tuber boundary.

A xlsx containing the calibration of each colour channel (RGB) is also included

Acknowledgements

Peter A. McAtee1*, Simona Nardozza1, Annette Richardson2, Mark Wohlers1, Robert J. Schaffer3,4

The New Zealand Institute for Plant & Food Research (PFR), Private Bag 92169, Auckland 1142, New Zealand

The New Zealand Institute for Plant & Food Research (PFR) , 121 Keri Downs Road, Kerikeri, 0294, New Zealand

The New Zealand Institute for Plant & Food Research (PFR), 55 Old Mill Lane, Motueka, 7198, New Zealand

School of Biological Sciences, University of Auckland, Private Bag 92019, Auckland, 1142, New Zealand
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Utkarsh Singh (2023). Dating App Fame & Behavior [Dataset]. https://www.kaggle.com/utkarshx27/lovoo-dating-app-dataset/discussion

Dating App Fame & Behavior

Understand people's fame and behavior's on a dating app platform

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 16, 2023

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Utkarsh Singh

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13364933%2F23694fae55e2e76299358693ba6f32b9%2Flv-share.jpg?generation=1684843825246772&alt=media" alt=""> ➡️ There are total 3 datasets containing valuable information. ➡️ Understand people's fame and behavior's on a dating app platform. | Column Name | Description | |---------------------|------------------------------| | Age | The age of the user. | | Number of Users | The total number of users. | | Percent Want Chats | Percentage of users who want chats. | | Percent Want Friends| Percentage of users who want friendships. | | Percent Want Dates | Percentage of users who want romantic dates. | | Mean Kisses Received| Average number of kisses received by users. | | Mean Visits Received| Average number of profile visits received by users. | | Mean Followers | Average number of followers for each user. | | Mean Languages Known| Average number of languages known by users. | | Total Want Chats | Total count of users interested in chats. | | Total Want Friends | Total count of users looking for friendships. | | Total Want Dates | Total count of users seeking romantic dates. | | Total Kisses Received| Overall count of kisses received by users. | | Total Visits Received| Overall count of profile visits received by users. | | Total Followers | Overall count of followers for all users. | | Total Languages Spoken| Total count of languages spoken by all users. |

SUMMARY

When Dating apps like Tinder were becoming viral, people wanted to have the best profile in order to get more matches and more potential encounters. Unlike other previous dating platforms, those new ones emphasized on the mutuality of attraction before allowing any two people to get in touch and chat. This made it all the more important to create the best profile in order to get the best first impression.

Parallel to that, we Humans have always been in awe before charismatic and inspiring people. The more charismatic people tend to be followed and listened to by more people. Through their metrics such as the number of friends/followers, social networks give some ways of "measuring" the potential charisma of some people.

In regard to all that, one can then think:

what makes a great user profile ? how to make the best first impression in order to get more matches (and ultimately find love, or new friendships) ? what makes a person charismatic ? how do charismatic people present themselves ? In order to try and understand those different social questions, I decided to create a dataset of user profile informations using the social network Lovoo when it came out. By using different methodologies, I was able to gather user profile data, as well as some usually unavailable metrics (such as the number of profile visits).

Content

The dataset contains user profile infos of users of the website Lovoo.

The dataset was gathered during spring 2015 (april, may). At that time, Lovoo was expanding in european countries (among others), while Tinder was trending both in America and in Europe. At that time the iOS version of the Lovoo app was in version 3.

Accessory image data The dataset references pictures (field pictureId) of user profiles. These pictures are also available for a fraction of users but have not been uploaded and should be asked separately.

The idea when gathering the profile pictures was to determine whether some correlations could be identified between a profile picture and the reputation or success of a given profile. Since first impression matters, a sound hypothesis to make is that the profile picture might have a great influence on the number of profile visits, matches and so on. Do not forget that only a fraction of a user's profile is seen when browsing through a list of users.

https://s1.dmcdn.net/v/BnWkG1M7WuJDq2PKP/x480

Details about collection methodology In order to gather the data, I developed a set of tools that would save the data while browsing through profiles and doing searches. Because of this approach (and the constraints that forced me to develop this approach) I could only gather user profiles that were recommended by Lovoo's algorithm for 2 profiles I created for this purpose occasion (male, open to friends & chats & dates). That is why there are only female users in the dataset. Another work could be done to fetch similar data for both genders or other age ranges.

Regarding the number of user profiles It turned out that the recommendation algorithm always seemed to output the same set of user profiles. This meant Lovoo's algorithm was probably heavily relying on settings like location (to recommend more people nearby than people in different places or countries) and maybe cookies. This diminished the number of different user profiles that would be pr...

Clear search

Close search

Google apps

Main menu

Dating App Fame & Behavior

SUMMARY

Content

this-person-does-not-exist

Human faces align crop and segment

Context

Content

Instagram Images with Captions

Context

Content

Acknowledgements

Inspiration

Real vs fake faces

Fake/Authentic User Instagram

Colorimetry standard fruit images

Inspiration

Context

Content

Acknowledgements

Dating App Fame & Behavior

Understand people's fame and behavior's on a dating app platform

SUMMARY

Content