100+ datasets found
  1. Reddit usage reach in the United States 2024, by age group

    • statista.com
    • ai-chatbox.pro
    Updated Feb 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Reddit usage reach in the United States 2024, by age group [Dataset]. https://www.statista.com/statistics/261766/share-of-us-internet-users-who-use-reddit-by-age-group/
    Explore at:
    Dataset updated
    Feb 17, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Feb 1, 2024 - Jun 10, 2024
    Area covered
    United States
    Description

    According to a survey of adults in the United States in 2024, 46 percent of respondents who used Reddit were aged between 19 and 29 years. Reddit usage tends to be affected by users’ age, with older users reporting lower levels of engagement. Reddit engagement in numbers Reddit is one of the most popular websites in the forum category, allowing users to interact in multiple close-knitted communities organized in sub-threads and divided by topics. In March 2024, Reddit.com registered an average of 2.2 billion monthly visits from desktop and mobile combined. Reddit users are mostly based in North America, with the United States accounting for the biggest share of traffic worldwide by far. The future of Reddit Reddit was created in 2005, was redesigned for the very first time in 2018 to make it more appealing to new users and increase engagement from non-participating guests (jokingly called “lurkers”) who nonetheless enjoy the content. In February 2024, the company announced it was entering the public market by releasing its S-1 registration statement. In 2024, the company generated around 1.3 billion U.S. dollars worldwide in revenues. This translated into an average revenue per user (ARPU) of around 4.21 dollars in the last quarter of 2024.

  2. Reddit user worldwide 2024, by country

    • statista.com
    Updated Mar 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Reddit user worldwide 2024, by country [Dataset]. https://www.statista.com/forecasts/1174696/reddit-user-by-country
    Explore at:
    Dataset updated
    Mar 10, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 1, 2024 - Dec 31, 2024
    Area covered
    Albania
    Description

    Comparing the 132 selected regions regarding the number of Reddit users , the United States is leading the ranking (197.79 million users) and is followed by the United Kingdom with 34.21 million users. At the other end of the spectrum is Gabon with 0.02 million users, indicating a difference of 197.77 million users to the United States. User figures, shown here with regards to the platform reddit, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once. Reddit users encompass both users that are logged in and those that are not.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).

  3. r/AskAnAustralian Answers Reddit Users

    • figshare.com
    txt
    Updated Aug 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Margo Van Poucke (2023). r/AskAnAustralian Answers Reddit Users [Dataset]. http://doi.org/10.6084/m9.figshare.23897952.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 7, 2023
    Dataset provided by
    figshare
    Authors
    Margo Van Poucke
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains responses provided by Reddit users to a set of 70 questions posted on the subreddit r/AskAnAustralian

  4. Reddit: distribution of global audiences 2024, by gender

    • statista.com
    Updated Feb 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Reddit: distribution of global audiences 2024, by gender [Dataset]. https://www.statista.com/statistics/1255182/distribution-of-users-on-reddit-worldwide-gender/
    Explore at:
    Dataset updated
    Feb 13, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    As of the third quarter of 2024, the majority of Reddit users were male, accounting for 59.8 percent of its audience base. Overall, women accounted for roughly 39.1 percent of the website users. Additionally, most of Reddit's desktop users were based in the United States.

  5. reddit user posting behavior (mid-2013)

    • figshare.com
    application/gzip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Randy Olson (2023). reddit user posting behavior (mid-2013) [Dataset]. http://doi.org/10.6084/m9.figshare.874101.v2
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Authors
    Randy Olson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file contains the posting preferences for over 850,000 active reddit users. This sample was taken in mid-2013. This data was used to generate the interactive visualization, "redditviz," and will be analyzed in detail in an upcoming research article. Please cite our paper "Navigating the massive world of reddit" if you use this data in your work. URL: http://arxiv.org/abs/1312.3387 The file is organized as follows: Each line is an entry for an anonymous user. Each user was randomly assigned a unique ID, which is what shows in the first entry of each line. Following the user ID, separated by commas, are the subreddits (i.e., interests) that the user regularly posts in. In order for a user to be considered "active" in that subreddit, they had to post or comment there at least 10 times in their last 1,000 posts and comments.

  6. Reddit users in the United States 2019-2028

    • statista.com
    • ai-chatbox.pro
    Updated Jun 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2024). Reddit users in the United States 2019-2028 [Dataset]. https://www.statista.com/topics/3196/social-media-usage-in-the-united-states/
    Explore at:
    Dataset updated
    Jun 13, 2024
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    United States
    Description

    The number of Reddit users in the United States was forecast to continuously increase between 2024 and 2028 by in total 10.3 million users (+5.21 percent). After the ninth consecutive increasing year, the Reddit user base is estimated to reach 208.12 million users and therefore a new peak in 2028. Notably, the number of Reddit users of was continuously increasing over the past years.User figures, shown here with regards to the platform reddit, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once. Reddit users encompass both users that are logged in and those that are not.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of Reddit users in countries like Mexico and Canada.

  7. Subreddit Interactions for 25,000 Users

    • kaggle.com
    Updated Feb 19, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    colemaclean (2017). Subreddit Interactions for 25,000 Users [Dataset]. https://www.kaggle.com/datasets/colemaclean/subreddit-interactions/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    colemaclean
    Description

    Context

    The dataset is a csv file compiled using a python scrapper developed using Reddit's PRAW API. The raw data is a list of 3-tuples of [username,subreddit,utc timestamp]. Each row represents a single comment made by the user, representing about 5 days worth of Reddit data. Note that the actual comment text is not included, only the user, subreddit and comment timestamp of the users comment. The goal of the dataset is to provide a lens in discovering user patterns from reddit meta-data alone. The original use case was to compile a dataset suitable for training a neural network in developing a subreddit recommender system. That final system can be found here

    A very unpolished EDA for the dataset can be found here. Note the published dataset is only half of the one used in the EDA and recommender system, to meet kaggle's 500MB size limitation.

    Content

    user - The username of the person submitting the comment
    subreddit - The title of the subreddit the user made the comment in
    utc_stamp - the utc timestamp of when the user made the comment

    Acknowledgements

    The dataset was compiled as part of a school project. The final project report, with my collaborators, can be found here

    Inspiration

    We were able to build a pretty cool subreddit recommender with the dataset. A blog post for it can be found here, and the stand alone jupyter notebook for it here. Our final model is very undertuned, so there's definitely improvements to be made there, but I think there are many other cool data projects and visualizations that could be built from this dataset. One example would be to analyze the spread of users through the Reddit ecosystem, whether the average user clusters in close communities, or traverses wide and far to different corners. If you do end up building something on this, please share! And have fun!

    Released under Reddit's API licence

  8. S

    Reddit Statistics By Popular Subreddits, Users and Usage

    • sci-tech-today.com
    Updated May 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sci-Tech Today (2025). Reddit Statistics By Popular Subreddits, Users and Usage [Dataset]. https://www.sci-tech-today.com/stats/reddit-statistics/
    Explore at:
    Dataset updated
    May 2, 2025
    Dataset authored and provided by
    Sci-Tech Today
    License

    https://www.sci-tech-today.com/privacy-policyhttps://www.sci-tech-today.com/privacy-policy

    Time period covered
    2022 - 2032
    Area covered
    Global
    Description

    Introduction

    Reddit Statistics: Reddit is an American news social media platform where users can submit content, text posts, and images. Posts can be upvoted and downvoted. As of October 2023, it is the 18th most visited website in the world. Based on submissions and posts can be voted up or down by other members. Â As a platform, it has faced various criticism for spreading misinformation.

    Nevertheless, it remains one of the most popular social media platforms. Â With the help of Reddit Statistics, we will discuss some interesting aspects.

  9. Reddit users in Israel 2020-2028

    • ai-chatbox.pro
    • statista.com
    Updated Oct 31, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2024). Reddit users in Israel 2020-2028 [Dataset]. https://www.ai-chatbox.pro/?_=%2Ftopics%2F9744%2Fsocial-media-in-israel%2F%23XgboD02vawLbpWJjSPEePEUG%2FVFd%2Bik%3D
    Explore at:
    Dataset updated
    Oct 31, 2024
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    Israel
    Description

    The number of Reddit users in Israel was forecast to increase between 2024 and 2028 by in total 0.01 million users (+0.76 percent). This overall increase does not happen continuously, notably not in 2027. The Reddit user base is estimated to amount to 1.32 million users in 2028. Notably, the number of Reddit users of was continuously increasing over the past years.User figures, shown here with regards to the platform reddit, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once. Reddit users encompass both users that are logged in and those that are not.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of Reddit users in countries like Bahrain and Kuwait.

  10. Reddit usage reach in the United States 2023, by ethnicity

    • statista.com
    Updated Feb 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Reddit usage reach in the United States 2023, by ethnicity [Dataset]. https://www.statista.com/statistics/261770/share-of-us-internet-users-who-use-reddit-by-ethnicity/
    Explore at:
    Dataset updated
    Feb 17, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Feb 1, 2024 - Jun 10, 2024
    Area covered
    United States
    Description

    According to a survey of internet users conducted in the United States between February and June, 2024, 14 percent of Black Americans reported having ever used Reddit. Asian Americans appeared to be more likely than both Black and white Americans to have ever used the social media and community forum, with 36 percent of users in the demographic reporting to have used the popular forum and social media.

  11. Reddit Submissions

    • kaggle.com
    Updated Oct 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmad (2023). Reddit Submissions [Dataset]. https://www.kaggle.com/datasets/pypiahmad/reddit-submissions/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 30, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ahmad
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Reddit Submissions dataset encompasses submissions of Reddit posts, particularly focusing on resubmissions of the same content, along with pertinent metadata. This dataset covers a timespan from July 2008 to January 2013 and provides an insightful view into the dynamics of content sharing and engagement within the Reddit community.

    Basic Statistics: - Number of Submissions (images): 132,308 - Number of Unique Images: 16,736 - Timespan: July 2008 - January 2013

    Metadata: - Timestamps: The time when a post was submitted. - Upvotes/Downvotes: The number of upvotes and downvotes a post received. - Post Title: The title of the submitted post. - Subreddit: The subreddit to which the post was submitted. - Additional metadata such as total votes, Reddit ID, number of comments, and username of the submitter.

    Examples: ```plaintext

    image_id, unixtime, rawtime, title, total_votes, reddit_id,...

    number_of_downvotes, localtime, score, number_of_comments, username 1005, 1335861624, 2012-05-01T15:40:24.968266-07:00, I immediately regret this decision, 27, t296r, 20, pics, 7, 1335886824, 13, 0, ninjaroflmaster 1005, 1336470481, 2012-05-08T16:48:01.418140-07:00, "Pushing your friend into the water, Level: 99", 18, tds4i, 16, funny, 2, 1336495681, 14, 0, hme4 1005, 1339566752, 2012-06-13T12:52:32.371941-07:00, I told him. He Didn't Listen, 6, v0cma, 4, funny, 2, 1339591952, 2, 0, HeyPatWhatsUp 1005, 1342200476, 2012-07-14T00:27:56.857805-07:00, Don't end up as this guy., 16, wjivx, 7, funny, 9, 1342225676, -2, 2, catalyst24 ```

    Download Links: - Resubmissions Data (7.3MB) - Raw HTML of Resubmissions (1.8GB)

    Citation: - Understanding the interplay between titles, content, and communities in social media, Himabindu Lakkaraju, Julian McAuley, Jure Leskovec, ICWSM, 2013. pdf

    Use Cases: 1. Content Resubmission Analysis: Analyzing the pattern and impact of content resubmissions across different subreddits. 2. Community Engagement: Studying how different titles, content, and subreddits influence user engagement in terms of upvotes, downvotes, and comments. 3. Temporal Analysis: Investigating how the popularity of certain content changes over time and how resubmissions are accepted by the community at different time intervals. 4. Subreddit Analysis: Understanding the characteristics of different subreddits in terms of content sharing and resubmissions. 5. User Behavior Analysis: Examining user behavior in terms of content submission, resubmission, and interaction. 6. Social Media Marketing: For marketers, understanding the dynamics of content resubmission could help in optimizing the content sharing strategy on Reddit. 7. Machine Learning: Utilizing the dataset to build models that can predict the success of a post or resubmission based on various factors. 8. NLP Applications: Analyzing text data for sentiment analysis, topic modeling, and other Natural Language Processing (NLP) applications. 9. Spam Detection: Identifying spam or redundant content through the analysis of resubmissions and user behaviors.

    This dataset is valuable for researchers, social media analysts, marketers, and data scientists interested in studying social media dynamics, especially on a platform like Reddit where content resubmission is common.

  12. Reddit app user ratio in the U.S. 2021, by age group

    • statista.com
    Updated Feb 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Reddit app user ratio in the U.S. 2021, by age group [Dataset]. https://www.statista.com/statistics/1125159/reddit-us-app-users-age/
    Explore at:
    Dataset updated
    Feb 26, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2021
    Area covered
    United States
    Description

    As of March 2021, users in their twenties and thirties accounted for almost two-thirds of Reddit active user accounts in the United States. According to recent data, users aged 20 to 29 years, accounted for 28.1 percent of the social news app's user base on the Android platform.

  13. m

    Reddit Ideological and Extreme Bias Dataset - Part 1

    • data.mendeley.com
    Updated Feb 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kamalakkannan Ravi (2024). Reddit Ideological and Extreme Bias Dataset - Part 1 [Dataset]. http://doi.org/10.17632/2tdr9sjd83.3
    Explore at:
    Dataset updated
    Feb 28, 2024
    Authors
    Kamalakkannan Ravi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data 1: Dataset with articles posted in the r/Liberal and r/Conservative subreddits. In total, we collected a corpus of 226,010 articles. We have collected news articles to understand political expression through the shared news articles. Data 2: Dataset with articles posted in the Liberal, Conservative, and Restricted (private or banned) subreddits. In total, we collected a corpus of 1.3 million articles. We have collected news articles to understand radicalized communities through the shared news articles.

    Part 1 has Data 1 (all) and Data 2 (Raw and Labeled Data - Restricted.json) Part 2 has Data 2 (Raw and Labeled Data - Liberal.json, and Conservative.json) and Data 2 (Raw and Unlabeled Data - first 40 of the 76 .json files) Part 3 has Data 2 (Raw and Unlabeled Data - reamaining 36 of the 76 .json files)

  14. C

    Reddit vs. Quora Statistics – Which is the Better Choice? (2025)

    • coolest-gadgets.com
    Updated May 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coolest Gadgets (2025). Reddit vs. Quora Statistics – Which is the Better Choice? (2025) [Dataset]. https://www.coolest-gadgets.com/reddit-vs-quora-statistics/
    Explore at:
    Dataset updated
    May 27, 2025
    Dataset authored and provided by
    Coolest Gadgets
    License

    https://www.coolest-gadgets.com/privacy-policyhttps://www.coolest-gadgets.com/privacy-policy

    Time period covered
    2022 - 2032
    Area covered
    Global
    Description

    Introduction

    Reddit vs. Quora Statistics: Reddit and Quora are well-known websites where people share information and discuss different topics. Reddit works through communities called "subreddits," where users post news, stories or opinions. Quora is more focused on questions and answers, where people ask things and get replies from others who know about the topic.

    By looking at numbers like how many people use the platform, how often they spend time there, how much time they spend there, and how far their reach is worldwide, we can get a better idea of which site is more useful or popular for different needs in 2024- 2025. The article will break down the numbers behind “Reddit vs Quora Statistics- Which is Better?†through this article.

  15. Reddit Sci/Tech Acronyms

    • kaggle.com
    Updated Jun 10, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    salbaroudi (2019). Reddit Sci/Tech Acronyms [Dataset]. https://www.kaggle.com/salbaroudi/reddit-scitech-acronyms/activity
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 10, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    salbaroudi
    License

    https://www.reddit.com/wiki/apihttps://www.reddit.com/wiki/api

    Description

    Introduction:

    140k+ acronyms were mined from science, tech, bio and future leaning subreddits. This was done with PRAW and compiled into a .csv file. This data set was originally mined to be a learning tool, used to illustrate pandas groupings, visualizations and count based time series. If the data set is refined enough, it might be possible to use it for prediction.

    Data Acquisition (Codebook):

    PRAW (a python3 library) script was used to mine the data from a list of subreddits that were hand selected. Science and Tech themed subreddits were focused on, as they tend to have higher quality content. To expand the list, a subreddit graph explorer was used to get a better view of the Sci/Tech subreddit network. Subreddits were excluded according to the following criteria:

    (1) Too few submissions and/or users.

    (2) Too esoteric, niche, or a subset of a much larger subreddit (example: pennystocks is a subset of stocks, in terms of content scope).

    (3) Satirical, politicized, or highly valenced in content (example: pcmasterrace).

    Some of these points are dependent on human interpretation - which may introduce bias into the data. See subreddit.txt file for a list of those selected. For each subreddit: upto 1000 submissions had there comment trees fully populated, and each comment was scanned for acronyms that were 3 to 7 letters in length. Associated information was then compiled, and written to a csv file. The format of the data table is below:

    commID: Reddit Comment ID (base 36 integer) (primary key)

    time: unix system time stamp for comment, that acronym is mentioned in. (float)

    user: username for person making comment. (string)

    subreddit: name of subreddit acronym appears in. (string)

    acronym: The term itself. (string)

    Data Statistics and Facts:

    See the kernel for more details.

    References:

    To reference this data set, use the following information: al-Baroudi, S. (2019, June). Reddit Sci/Tech Acronyms Dataset, Version 1. Retrieved (current date)

  16. MBTI and Birthdays

    • kaggle.com
    Updated Apr 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dakota Gravitt (2020). MBTI and Birthdays [Dataset]. https://www.kaggle.com/datasets/dakotagravitt/mbti-and-birthdays/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 18, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dakota Gravitt
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Dataset

    This dataset was created by Dakota Gravitt

    Released under GPL 2

    Contents

  17. Reddit Datasets

    • promptcloud.com
    csv
    Updated Mar 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PromptCloud (2025). Reddit Datasets [Dataset]. https://www.promptcloud.com/dataset/reddit/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Mar 28, 2025
    Dataset authored and provided by
    PromptCloud
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Extracting Insights from Online DiscussionsReddit is one of the largest social discussion platforms, making it a valuable source for real-time opinions, trends, sentiment analysis, and user interactions across various industries. Scraping Reddit data allows businesses, researchers, and analysts to explore public discussions, track sentiment, and gain actionable insights from user-generated content. Benefits and Impact: Trend […]

  18. Reddit: quarterly number of DAU 2021-2025, by online status

    • statista.com
    Updated Feb 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Reddit: quarterly number of DAU 2021-2025, by online status [Dataset]. https://www.statista.com/statistics/1453133/reddit-quarterly-dau-by-online-status/
    Explore at:
    Dataset updated
    Feb 26, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    During the first quarter of 2025, online forum and news aggregator Reddit saw approximately 108.1 million daily active users (DAU) engaging with its platform. Of these, over 59.4 million users were not logged in and accessed the platform's content without proving they registered to Reddit. This represents an increase of approximately 6.8 percent compared to the previous quarter, when Reddit saw 55.6 million logged-off DAU.

  19. Reddit AskScience Flair Analysis Dataset

    • kaggle.com
    Updated Feb 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sumit Mishra (2025). Reddit AskScience Flair Analysis Dataset [Dataset]. https://www.kaggle.com/sumitm004/reddit-raskscience-flair-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 15, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sumit Mishra
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Context

    Reddit is a massive platform for news, content, and discussions, hosting millions of active users daily. Among its vast number of subreddits, we focus on the r/AskScience community, where users engage in science-related discussions and questions.

    Content

    This dataset is derived from the r/AskScience subreddit, collected between January 1, 2016, and May 20, 2022. It includes 612,668 datapoints across 22 columns, featuring diverse information such as the content of the questions, submission descriptions, associated flairs, NSFW/SFW status, year of submission, and more. The data was extracted using Python and Pushshift's API, followed by some cleaning with NumPy and pandas. Detailed column descriptions are available for clarity.

    Mendeley Data

    Ideas for Usage

    • Flair Prediction:Train models to predict post flairs (e.g., 'Science', 'Ask', 'Discussion') to automate content categorization for platforms like Reddit.
    • NSFW Classification: Classify posts as SFW or NSFW based on textual content, enabling content moderation tools for online forums.
    • Text Mining / NLP Tasks: Apply NLP techniques like Sentiment Analysis, Topic Modeling, and Text Classification to explore the content and themes of science-related discussions.
    • Community Engagement Analysis: Investigate which post types or flairs generate more engagement (e.g., upvotes or comments), offering insights into user interaction.
    • Trend Detection in Science Topics: Identify emerging science topics and analyze shifts in interest areas, which can help predict future trends in scientific discussions.
  20. Data from: The Reddit Politosphere: A Large-Scale Text and Network Resource...

    • zenodo.org
    • data.niaid.nih.gov
    bz2, csv, json
    Updated Jan 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Valentin Hofmann; Valentin Hofmann; Hinrich Schütze; Hinrich Schütze; Janet B. Pierrehumbert; Janet B. Pierrehumbert (2022). The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse [Dataset]. http://doi.org/10.5281/zenodo.5851729
    Explore at:
    bz2, csv, jsonAvailable download formats
    Dataset updated
    Jan 16, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Valentin Hofmann; Valentin Hofmann; Hinrich Schütze; Hinrich Schütze; Janet B. Pierrehumbert; Janet B. Pierrehumbert
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Reddit Politosphere is a large-scale resource of online political discourse covering more than 600 political discussion groups over a period of 12 years. Based on the Pushshift Reddit Dataset, it is to the best of our knowledge the largest and ideologically most comprehensive dataset of its type now available. One key feature of the Reddit Politosphere is that it consists of both text and network data. We also release annotated metadata for subreddits and users.

    Documentation and scripts for easy data access are provided in an associated repository on GitHub.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2025). Reddit usage reach in the United States 2024, by age group [Dataset]. https://www.statista.com/statistics/261766/share-of-us-internet-users-who-use-reddit-by-age-group/
Organization logo

Reddit usage reach in the United States 2024, by age group

Explore at:
38 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Feb 17, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Feb 1, 2024 - Jun 10, 2024
Area covered
United States
Description

According to a survey of adults in the United States in 2024, 46 percent of respondents who used Reddit were aged between 19 and 29 years. Reddit usage tends to be affected by users’ age, with older users reporting lower levels of engagement. Reddit engagement in numbers Reddit is one of the most popular websites in the forum category, allowing users to interact in multiple close-knitted communities organized in sub-threads and divided by topics. In March 2024, Reddit.com registered an average of 2.2 billion monthly visits from desktop and mobile combined. Reddit users are mostly based in North America, with the United States accounting for the biggest share of traffic worldwide by far. The future of Reddit Reddit was created in 2005, was redesigned for the very first time in 2018 to make it more appealing to new users and increase engagement from non-participating guests (jokingly called “lurkers”) who nonetheless enjoy the content. In February 2024, the company announced it was entering the public market by releasing its S-1 registration statement. In 2024, the company generated around 1.3 billion U.S. dollars worldwide in revenues. This translated into an average revenue per user (ARPU) of around 4.21 dollars in the last quarter of 2024.

Search
Clear search
Close search
Google apps
Main menu