68 datasets found
  1. 🌟 Emoji Trends Dataset

    • kaggle.com
    Updated Jul 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqar Ali (2024). 🌟 Emoji Trends Dataset [Dataset]. https://www.kaggle.com/datasets/waqi786/emoji-trends-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 31, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Waqar Ali
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset provides a detailed analysis of emoji usage across various social media platforms. It captures how different emojis are used in different contexts, reflecting emotions, trends, and user demographics.

    With emojis becoming a universal digital language, this dataset helps researchers, marketers, and data analysts explore how people express emotions online and identify patterns in social media communication.

    📌 Key Features: 😊 Emoji Details: Emoji 🎭: The specific emoji used in a post, comment, or message. Context 💬: The meaning or emotion associated with the emoji (e.g., Happy, Love, Funny, Sad). Platform 🌐: The social media platform where the emoji was used (e.g., Facebook, Instagram, Twitter). 👤 User Demographics: User Age 🎂: Age of the user who posted the emoji (ranges from 13 to 65 years). User Gender 🚻: Gender of the user (Male/Female). 📈 Additional Insights: Emoji Popularity 🔥: Frequency of each emoji’s usage across platforms. Trends Over Time 📅: How emoji usage changes based on trends or events. Regional Usage Patterns 🌍: How different cultures and regions use emojis differently. 📊 Use Cases & Applications: 🔹 Understanding emoji trends across social media 🔹 Analyzing emotional expression through digital communication 🔹 Exploring demographic differences in emoji usage 🔹 Identifying platform-specific emoji preferences 🔹 Enhancing sentiment analysis models with emoji insights

    ⚠️ Important Note: This dataset is synthetically generated for educational and analytical purposes. It does not contain real user data but is designed to reflect real-world trends in emoji usage.

  2. O

    Data from: Multimodal Emoji Prediction

    • opendatalab.com
    • paperswithcode.com
    zip
    Updated Sep 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IBM T. J. Watson Research Center USA (2022). Multimodal Emoji Prediction [Dataset]. https://opendatalab.com/OpenDataLab/Multimodal_Emoji_Prediction
    Explore at:
    zip(24140764 bytes)Available download formats
    Dataset updated
    Sep 22, 2022
    Dataset provided by
    Universitat Pompeu Fabra
    TALN
    IBM T. J. Watson Research Center USA
    Description

    The twitter emoji dataset obtained from CodaLab comprises of 50 thousand tweets along with the associated emoji label. Each tweet in the dataset has a corresponding numerical label which maps to a specific emoji. The emojis are of the 20 most frequent emojis and hence the labels range from 0 to 19

  3. h

    emoji-dataset

    • huggingface.co
    Updated Jun 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suraj Patil (2025). emoji-dataset [Dataset]. https://huggingface.co/datasets/valhalla/emoji-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 22, 2025
    Authors
    Suraj Patil
    Description

    valhalla/emoji-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. o

    Arabic Emoji Meanings Dataset – 1500+ Emojis with

    • opendatabay.com
    .undefined
    Updated Jun 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). Arabic Emoji Meanings Dataset – 1500+ Emojis with [Dataset]. https://www.opendatabay.com/data/ai-ml/8ab1a0f1-f22c-46a1-bb69-93e08a8f4722
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 15, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Social Media and Networking
    Description

    This dataset contains 1500+ emojis with their Arabic descriptions, sentiment classifications (Positive, Negative, Mixed), and Unicode representations. It is useful for NLP tasks, sentiment analysis, emoji-based chatbots, and AI language models supporting Arabic text. The dataset has been refined to correct Arabic text and remove inappropriate words.

    Key Features:

    📝 1500+ emojis with detailed Arabic meanings 📊 Sentiment labels (Positive, Negative, Mixed) 🔤 Unicode representation for easy integration ✅ Cleaned & filtered to improve readability and avoid inappropriate terms 💡 Useful for AI, machine learning, and chatbot training

    Original Data Source: Arabic Emoji Meanings Dataset – 1500+ Emojis with

  5. Emojis.xlsx

    • figshare.com
    xlsx
    Updated Feb 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohamad Jalalian (2021). Emojis.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.14049614.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 17, 2021
    Dataset provided by
    figshare
    Authors
    Mohamad Jalalian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    some Unicode of emojis

  6. 𝒙 Twemoji Dataset

    • kaggle.com
    Updated Sep 22, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mexwell (2023). 𝒙 Twemoji Dataset [Dataset]. https://www.kaggle.com/datasets/mexwell/twemoji-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 22, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    mexwell
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Collection of 13M tweets divided into training, validation, and test sets for the purposes of predicting emoji based on text and/or images.

    The data provides the tweet status ID and the emoji annotations associated with it. In the case of image-containing subsets, the image URL is also listed.

    The Full, unbalanced dataset consists of a random test and validation sets of 1M tweets, with the remainder in the training set.

    The Balanced testset is a subset of the test set chosen to improve emoji class balance.

    The Image subsets are image-containing tweets.

    Finally, emoji_map_1791.csv provides information regarding the emoji labels and potential metadata.

    URL to get the tweet based on ID: `https://twitter.com/anyuser/status/

  7. Emoji Sentiment Ranking

    • figshare.com
    txt
    Updated May 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Petra Kralj Novak; Jasmina Smailović; Borut Sluban; Igor Mozetic (2023). Emoji Sentiment Ranking [Dataset]. http://doi.org/10.6084/m9.figshare.1600931.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Petra Kralj Novak; Jasmina Smailović; Borut Sluban; Igor Mozetic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages. The Emoji Sentiment Ranking web page at http://kt.ijs.si/data/Emoji_sentiment_ranking/ is automatically generated from the data provided in this repository. The process and analysis of emoji sentiment ranking is described in the paper: P. Kralj Novak, J. Smailović, B. Sluban, I. Mozetič, Sentiment of Emojis, submitted; arXiv preprint, http://arxiv.org/abs/1509.07761, 2015.

  8. h

    emoji-map

    • huggingface.co
    Updated Sep 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Omar Kamali (2024). emoji-map [Dataset]. https://huggingface.co/datasets/omarkamali/emoji-map
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Omar Kamali
    Description

    📊 Dataset Overview

    The emoji-map dataset, created by omarkamali, contains text data in parquet format. It consists of 10K-100K entries, specifically 5.03k rows. The dataset is available in the train split.

      📁 Data Structure
    

    The dataset includes two main columns: emoji and unicode_description. The emoji column contains various emoji characters, while the unicode_description column provides a textual description of each emoji.

      🔍 Sample Data
    

    Examples from the… See the full description on the dataset page: https://huggingface.co/datasets/omarkamali/emoji-map.

  9. f

    Data from: How quickly are face emojis integrated with their surrounding...

    • tandf.figshare.com
    docx
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Kilby; Demian Stoianov; Signy Wegener; Nenagh Kemp; Elisabeth Beyersmann (2025). How quickly are face emojis integrated with their surrounding text? An eye-tracking study [Dataset]. http://doi.org/10.6084/m9.figshare.29318369.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 13, 2025
    Dataset provided by
    Taylor & Francis
    Authors
    Alexander Kilby; Demian Stoianov; Signy Wegener; Nenagh Kemp; Elisabeth Beyersmann
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Emojis are used in online communication to convey expression and emotion. This study investigated whether emoji integration occurs at an early stage of reading or at a late, more conscious stage. Participants' eye movements were monitored as they read informal, text-message-style sentences containing either a contextually congruent face emoji, a contextually incongruent face emoji, or a dash. Comprehension questions were included after each message to encourage reading for comprehension. Three early (skipping rate, first fixation duration, gaze duration) and three late (total reading time, regression in probability, trial dwell time) processing measures were analysed. Results revealed that compared with message-congruent emojis, incongruent emojis incurred significant processing costs on all late measures and one early measure (gaze duration). Further, both emoji conditions showed higher skipping rates and longer reading times relative to the dash trials across most measures, indicating emoji processing costs during both early and late stages of reading.

  10. m

    The Emotional Impact of Emojis on Subsequent Texts

    • data.mendeley.com
    Updated Aug 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuanfu Dai (2023). The Emotional Impact of Emojis on Subsequent Texts [Dataset]. http://doi.org/10.17632/6xv8mkkz88.1
    Explore at:
    Dataset updated
    Aug 22, 2023
    Authors
    Yuanfu Dai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset includes material collection, experimental procedures and experimental data (raw data and data used for analyses).

  11. Emoji data

    • kaggle.com
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fares Hazem (2023). Emoji data [Dataset]. https://www.kaggle.com/datasets/fareshazem/emoji-data/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Fares Hazem
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Fares Hazem

    Released under Apache 2.0

    Contents

  12. m

    Colour Emoji Dataset

    • data.mendeley.com
    Updated Jul 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Declan Forrester (2024). Colour Emoji Dataset [Dataset]. http://doi.org/10.17632/pyt7kzr5f2.1
    Explore at:
    Dataset updated
    Jul 9, 2024
    Authors
    Declan Forrester
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reaction time and accuracy data for colour emoji categorisation task

  13. Data from: Tomographic X-ray data of 3D emoji

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenith Purisha; Zenith Purisha; Alexander Meaney; Samuli Siltanen; Alexander Meaney; Samuli Siltanen (2020). Tomographic X-ray data of 3D emoji [Dataset]. http://doi.org/10.5281/zenodo.1183532
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zenith Purisha; Zenith Purisha; Alexander Meaney; Samuli Siltanen; Alexander Meaney; Samuli Siltanen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the documentation of the tomographic X-ray data of emoji
    phantom made available at http://www.fips.fi/dataset.php. The data can be freely used for scienti c purposes with appropriate references to the data and to this document in http://arxiv.org/. The data set consists of (1) the X-ray sinogram of a single 2D slice of 33 emoji faces (contains 15 different emoji faces) made by small squared ceramic stones and (2) the corresponding static and dynamic measurement matrices modeling the linear operation of the X-ray transform. Each of these sinograms was obtained from a measured 60-projection fan-beam sinogram by down-sampling and taking logarithms. The original (measured) sinogram is also provided in its original form and resolution. The original (measured) sinogram is also provided in its original form and resolution.

  14. i

    Data from: CAO System Emoticon Parts Dataset with Emotion Labels

    • ieee-dataport.org
    Updated Mar 14, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Ptaszynski (2019). CAO System Emoticon Parts Dataset with Emotion Labels [Dataset]. https://ieee-dataport.org/documents/cao-system-emoticon-parts-dataset-emotion-labels
    Explore at:
    Dataset updated
    Mar 14, 2019
    Authors
    Michal Ptaszynski
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    2) emoticon parts automatically divided from raw emoticons into semantic areas representing “mouths” or “eyes”.

  15. Colour emoji Dataset.sav

    • figshare.com
    bin
    Updated Jul 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Declan Forrester (2024). Colour emoji Dataset.sav [Dataset]. http://doi.org/10.6084/m9.figshare.26210690.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Jul 8, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Declan Forrester
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reaction time and accuracy data from a categorisation task of emojis. The emojis were positive, negative, and neutral valence emojis presented on red, green, blue, grey, or white backgrounds.

  16. f

    Data from: Emotion recognition of faces and emoji in individuals with...

    • tandf.figshare.com
    docx
    Updated Jun 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sharice Clough; Emily Morrow; Bilge Mutlu; Lyn Turkstra; Melissa C. Duff (2023). Emotion recognition of faces and emoji in individuals with moderate-severe traumatic brain injury [Dataset]. http://doi.org/10.6084/m9.figshare.22183144.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 7, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Sharice Clough; Emily Morrow; Bilge Mutlu; Lyn Turkstra; Melissa C. Duff
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Facial emotion recognition deficits are common after moderate-severe traumatic brain injury (TBI) and linked to poor social outcomes. We examine whether emotion recognition deficits extend to facial expressions depicted by emoji. Fifty-one individuals with moderate-severe TBI (25 female) and fifty-one neurotypical peers (26 female) viewed photos of human faces and emoji. Participants selected the best-fitting label from a set of basic emotions (anger, disgust, fear, sadness, neutral, surprise, happy) or social emotions (embarrassed, remorseful, anxious, neutral, flirting, confident, proud). We analyzed the likelihood of correctly labeling an emotion by group (neurotypical, TBI), stimulus condition (basic faces, basic emoji, social emoji), sex (female, male), and their interactions. Participants with TBI did not significantly differ from neurotypical peers in overall emotion labeling accuracy. Both groups had poorer labeling accuracy for emoji compared to faces. Participants with TBI (but not neurotypical peers) had poorer accuracy for labeling social emotions depicted by emoji compared to basic emotions depicted by emoji. There were no effects of participant sex. Because emotion representation is more ambiguous in emoji than human faces, studying emoji use and perception in TBI is an important consideration for understanding functional communication and social participation after brain injury.

  17. The data of Emoji Use, Empathy, Attribution of Responsibility, and...

    • zenodo.org
    bin
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous; Anonymous (2023). The data of Emoji Use, Empathy, Attribution of Responsibility, and Forgiveness in Apologies [Dataset]. http://doi.org/10.5281/zenodo.10057150
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous; Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the data for paper: Can Emoji Promote Forgiveness? The Relationship between Emoji Use, Empathy, Attribution of Responsibility, and Forgiveness in Apologies. A total of 323 participants were recruited in that study, and a recall method (Study 1) and scenario simulation method (Study 2) were used to explore the effect of emoji use during apologies on forgiveness, and the mediating role of empathy and attribution of responsibility. The results showed that (a) people chose emoji that resembled real remorseful facial expressions when apologizing; (b) using emoji that expressed remorse when apologizing could promote forgiveness; and (c) empathy mediated the process of emoji promoting forgiveness, while attribution of responsibility did not play a mediating role.

  18. Z

    Italian Tweet Embeddings Used For Emoji Prediction

    • data.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giacomo Zara (2020). Italian Tweet Embeddings Used For Emoji Prediction [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_1467219
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Yaroslav Nechaev
    Giacomo Zara
    Andrei Catalin Coman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains 100d word embeddings trained on 48M Italian tweets using fastText and employed by our team to predict emojis during ITAmoji competition of EVALITA 2018 Evaluation Campaign.

  19. f

    Data from: DataSet "Political communication on TikTok: from the feminisation...

    • figshare.com
    • portalcienciaytecnologia.jcyl.es
    • +1more
    xlsx
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Salvador Gómez García; Raquel Quevedo Redondo (2023). DataSet "Political communication on TikTok: from the feminisation of discourse to incivility expressed in emoji form" [Dataset]. http://doi.org/10.6084/m9.figshare.24599562.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    figshare
    Authors
    Salvador Gómez García; Raquel Quevedo Redondo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In a context where there is permanent electoral campaigning, an increasing number of political communication experts are trying to unravel the resources used by government officials and their parties to influence TikTok users. From a broad perspective, the subject matter is not new, but it is topical; nonetheless, this research discloses a gap in the literature by amalgamating the recognition of idiosyncratic attributes of the feminisation of political discourse on TikTok with the analysis of the reactions (text and emojis) that the audiovisual content imbued by this trend elicits in users. The purpose is to ascertain whether the inclusive tone of the feminised rhetorical style can be extrapolated to TikTok and, if so, whether its particular characteristics mitigate expressions of incivility. To do so, the initial content posted (first seven months) on TikTok by the Spanish political platform Sumar with its leader, Yolanda Díaz, featuring prominently in most of the videos, were selected for scrutiny. A mixed methodology analysis of audiovisual content and comments showed that the anti-polarisation rhetoric and storytelling contributed to neutralising the extreme forms of flaming, although Sumar did not use a strategy tailor-made to suit TikTok.

  20. NCA Calendar

    • kaggle.com
    Updated Dec 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    something4kag (2022). NCA Calendar [Dataset]. https://www.kaggle.com/datasets/something4kag/nca-calendar
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 25, 2022
    Dataset provided by
    Kaggle
    Authors
    something4kag
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The NCA Calendar Dataset is part of the Neural Cellular Automata Emoji Challenge and contains animated gifs used as content for each day in the NCA emojis Advent Calendar Julekalender notebooks

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Waqar Ali (2024). 🌟 Emoji Trends Dataset [Dataset]. https://www.kaggle.com/datasets/waqi786/emoji-trends-dataset
Organization logo

🌟 Emoji Trends Dataset

Insights on Emoji Popularity & Contexts 💬✨

Explore at:
152 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 31, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Waqar Ali
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

This dataset provides a detailed analysis of emoji usage across various social media platforms. It captures how different emojis are used in different contexts, reflecting emotions, trends, and user demographics.

With emojis becoming a universal digital language, this dataset helps researchers, marketers, and data analysts explore how people express emotions online and identify patterns in social media communication.

📌 Key Features: 😊 Emoji Details: Emoji 🎭: The specific emoji used in a post, comment, or message. Context 💬: The meaning or emotion associated with the emoji (e.g., Happy, Love, Funny, Sad). Platform 🌐: The social media platform where the emoji was used (e.g., Facebook, Instagram, Twitter). 👤 User Demographics: User Age 🎂: Age of the user who posted the emoji (ranges from 13 to 65 years). User Gender 🚻: Gender of the user (Male/Female). 📈 Additional Insights: Emoji Popularity 🔥: Frequency of each emoji’s usage across platforms. Trends Over Time 📅: How emoji usage changes based on trends or events. Regional Usage Patterns 🌍: How different cultures and regions use emojis differently. 📊 Use Cases & Applications: 🔹 Understanding emoji trends across social media 🔹 Analyzing emotional expression through digital communication 🔹 Exploring demographic differences in emoji usage 🔹 Identifying platform-specific emoji preferences 🔹 Enhancing sentiment analysis models with emoji insights

⚠️ Important Note: This dataset is synthetically generated for educational and analytical purposes. It does not contain real user data but is designed to reflect real-world trends in emoji usage.

Search
Clear search
Close search
Google apps
Main menu