8 datasets found
  1. Social Animal 10K articles with text and NLP data

    • kaggle.com
    zip
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex P (2023). Social Animal 10K articles with text and NLP data [Dataset]. https://www.kaggle.com/datasets/socialanimal/social-animal-10k-articles-with-text-and-nlp-data
    Explore at:
    zip(26400310 bytes)Available download formats
    Dataset updated
    Jun 11, 2023
    Authors
    Alex P
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Social Animal 10K Articles with NLP

    Text, Shares and NLP metadata from trending news articles and blog posts around the world.

    The metadata for the 10K articles is present in the articles.csv file. The article text is available as a text file inside the text directory. The file name is available as the content column in the CSV file.

    CSV File Header Documentation

    The following is a documentation of the headers present in the CSV file:

    • url: The URL of the article.
    • domain: The domain or website where the article is published.
    • title: The title of the article.
    • authors: The author(s) of the article.
    • content: The content or text of the article. (File name of the text file in the text directory)
    • content_word_count: The word count of the article's content.
    • published_date: The date when the article was published.
    • created_at: The timestamp indicating when the article entry was created.
    • updated_at: The timestamp indicating when the article entry was last updated.
    • language: The language in which the article is written.
    • article_links: Any links or references present within the article.
    • total_share_count: The total count of shares for the article.
    • article_category: The category or topic to which the article belongs.
    • keywords: The keywords associated with the article.
    • entities: Any entities mentioned in the article (e.g., names of people, organizations).
    • sentiment: The sentiment analysis result for the article.
    • title_sentiment: The sentiment analysis result for the article's title.
    • keywords_ex: Extra or additional keywords related to the article.
    • content_types.listicle: Indicates if the article is a listicle.
    • content_types.infographic: Indicates if the article contains an infographic.
    • content_types.how_to: Indicates if the article is a how-to guide.
    • content_types.case_study: Indicates if the article is a case study.
    • content_types.guest_post: Indicates if the article is a guest post.
    • content_types.review: Indicates if the article is a review.
    • content_types.video: Indicates if the article contains a video.
    • content_types.podcast: Indicates if the article contains a podcast.
    • content_types.webinar: Indicates if the article contains a webinar.
    • content_types.interview: Indicates if the article is an interview.
    • content_types.quote: Indicates if the article contains a quote.
    • content_types.meme: Indicates if the article contains a meme.
    • content_types.give_away: Indicates if the article is a giveaway.
    • content_types.quiz: Indicates if the article is a quiz.
    • locations.cities_and_states: Cities and states mentioned or relevant to the article.
    • locations.countries: Countries mentioned or relevant to the article.
    • locations.geo_locations: Geographical locations associated with the article.

    Reach out if you need more data

    We have more than 450 million articles in our database. We have picked 10K articles from this database and shared it here. We continuously collect text articles from various sources and process them with our NLP pipeline. Our database contains near real time data on trending articles from around the world. Reach out to alex@socialanimal.io if you are interested in getting access to our API or database. www.socialanimal.com

  2. u

    Code book of RTL visualization in Arabic News media

    • rdr.ucl.ac.uk
    xlsx
    Updated Jul 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muna Alebri; No ̈elle Rakotondravony; Lane Harrison (2024). Code book of RTL visualization in Arabic News media [Dataset]. http://doi.org/10.5522/04/26150749.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 3, 2024
    Dataset provided by
    University College London
    Authors
    Muna Alebri; No ̈elle Rakotondravony; Lane Harrison
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In this project, we aimed to map the visualisation design space of visualisation embedded in right-to-left (RTL) scripts. We aimed to expand our knowledge of visualisation design beyond the dominance of research based on left-to-right (LTR) scripts. Through this project, we identify common design practices regarding the chart structure, the text, and the source. We also identify ambiguity, particularly regarding the axis position and direction, suggesting that the community may benefit from unified standards similar to those found on web design for RTL scripts. To achieve this goal, we curated a dataset that covered 128 visualisations found in Arabic news media and coded these visualisations based on the chart composition (e.g., chart type, x-axis direction, y-axis position, legend position, interaction, embellishment type), text (e.g., availability of text, availability of caption, annotation type), and source (source position, attribution to designer, ownership of the visualisation design). Links are also provided to the articles and the visualisations. This dataset is limited for stand-alone visualisations, whether they were single-panelled or included small multiples. We also did not consider infographics in this project, nor any visualisation that did not have an identifiable chart type (e.g., bar chart, line chart). The attached documents also include some graphs from our analysis of the dataset provided, where we illustrate common design patterns and their popularity within our sample.

  3. w

    Global News Apps Market Research Report: By Content Type (Text News, Audio...

    • wiseguyreports.com
    Updated Aug 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Global News Apps Market Research Report: By Content Type (Text News, Audio News, Video News, Infographics, Podcasts), By User Demographics (Teenagers, Young Adults, Middle-Aged Adults, Seniors), By Platform (Mobile Apps, Web Apps, Desktop Apps), By Subscription Model (Free, Freemium, Paid Subscription) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/cn/reports/news-apps-market
    Explore at:
    Dataset updated
    Aug 6, 2025
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Aug 25, 2025
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2023
    REGIONS COVEREDNorth America, Europe, APAC, South America, MEA
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20247.73(USD Billion)
    MARKET SIZE 20258.43(USD Billion)
    MARKET SIZE 203520.0(USD Billion)
    SEGMENTS COVEREDContent Type, User Demographics, Platform, Subscription Model, Regional
    COUNTRIES COVEREDUS, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
    KEY MARKET DYNAMICSincreased smartphone penetration, demand for personalized content, rise of subscription models, competition among platforms, integration of AI technology
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDFlipboard, Snap, Reuters, Hearst, Microsoft, Google, Associated Press, Condé Nast, Apple, Amazon, News Corp, Twitter, Bloomberg, Reddit, Facebook
    MARKET FORECAST PERIOD2025 - 2035
    KEY MARKET OPPORTUNITIESPersonalized content delivery, Integration of AI for insights, Augmented reality news experience, Enhanced multimedia storytelling, Real-time updates and alerts
    COMPOUND ANNUAL GROWTH RATE (CAGR) 9.1% (2025 - 2035)
  4. LNDS Annual Report 2023

    • kaggle.com
    • data.public.lu
    • +1more
    zip
    Updated Jan 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olaf Yunus Laitinen Imanov (2025). LNDS Annual Report 2023 [Dataset]. https://www.kaggle.com/datasets/olaflundstrom/lnds-annual-report-2023
    Explore at:
    zip(3719686 bytes)Available download formats
    Dataset updated
    Jan 23, 2025
    Authors
    Olaf Yunus Laitinen Imanov
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    We are pleased to announce that LNDS has successfully completed its Annual Report of 2023. A fruit of rigorous and detailed work, the Annual Report highlights the achievements, challenges and prospects of its first full year in operation. Transparency is one of the key values of LNDS. As a sign of living its values, in addition to the Annual Report, LNDS has also made available all public data files associated with this report. These data files enable a more in-depth analysis and offer greater transparency on the past year's activities. This is a commendable initiative on the part of LNDS, demonstrating our commitment to openness and accountability. Each data file corresponds to a section of the Annual Report and shows the data that has been used to create the content, and most notably the data visualisations and infographics.

  5. h

    ViInfographicsVQA

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Đoàn Đặng Phương Nam, ViInfographicsVQA [Dataset]. https://huggingface.co/datasets/Namronaldo2004/ViInfographicsVQA
    Explore at:
    Authors
    Đoàn Đặng Phương Nam
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Introduction

    ViInfographicsVQA is a Vietnamese Visual Question Answering (VQA) dataset constructed from infographics sourced from 26 different news platforms. The dataset is designed to support research in multimodal learning by providing diverse questions and answers based on real-world visual data. The detailed distribution of sources is presented in the table below.

    Figure 1: The number of infographics per news source.
    

    Developed by: @Namronaldo2004, @Kiet2302… See the full description on the dataset page: https://huggingface.co/datasets/Namronaldo2004/ViInfographicsVQA.

  6. g

    Coronavirus COVID-19 cases in Austria

    • gimi9.com
    Updated Dec 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Coronavirus COVID-19 cases in Austria [Dataset]. https://gimi9.com/dataset/eu_f8097a9b-2cca-441e-a13a-273209fbdadd
    Explore at:
    Dataset updated
    Dec 15, 2024
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Austria
    Description

    The coronavirus is moving the world. The media landscape is also dominated by this topic. Up-to-date information on the corona situation is given out almost every minute in live tickers. But what is missing are current data visualizations on the cases of illness. Although there are data applications, such as the representations of the WHO, these have the global distribution in view. Static infographics, which are often no longer up-to-date at the time of publication, cannot do justice to the rapidly changing situation. The virus has arrived in Europe, now visualizations that the individual states have in view are in demand and as up-to-date as possible. Our developed data dashboard shows the current distribution on a state and district basis in maps, diagrams and lists. We try to keep the presentations as up-to-date as possible and research current data from the website of the Ministry of Social Affairs, via data.gv.at or search in Austrian news portals.

  7. Population (2011 & 2021) and seats (2020) for the European Parliament

    • figshare.com
    txt
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenruo Lyu (2023). Population (2011 & 2021) and seats (2020) for the European Parliament [Dataset]. http://doi.org/10.6084/m9.figshare.23358152.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Wenruo Lyu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a dataset containing the population of each Member State from the 2011 and 2021 censuses held by Eurostat, and the current distribution of seats in the European Parliament (EP). The population data was downloaded from the official website of Eurostat (2011: https://ec.europa.eu/eurostat/web/population-demography/population-housing-censuses/database; 2021: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Population_and_housing_census_2021_-_population_grids&stable=1#Distribution_of_European_population). The seat data was obtained from the official website of the EP (https://www.europarl.europa.eu/news/en/headlines/eu-affairs/20180126STO94114/infographic-how-many-seats-does-each-country-get-in-in-the-european-parliament).

  8. m

    Stock Illustration Market Size, Share & Industry Analysis 2033

    • marketresearchintellect.com
    Updated Jul 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Stock Illustration Market Size, Share & Industry Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-stock-illustration-market-size-and-forecast/
    Explore at:
    Dataset updated
    Jul 3, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Discover Market Research Intellect's Stock Illustration Market Report, worth USD 1.5 billion in 2024 and projected to hit USD 2.8 billion by 2033, registering a CAGR of 8.5% between 2026 and 2033.Gain in-depth knowledge of emerging trends, growth drivers, and leading companies.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Alex P (2023). Social Animal 10K articles with text and NLP data [Dataset]. https://www.kaggle.com/datasets/socialanimal/social-animal-10k-articles-with-text-and-nlp-data
Organization logo

Social Animal 10K articles with text and NLP data

Text, share count and NLP metadata from trending news articles and blogs

Explore at:
zip(26400310 bytes)Available download formats
Dataset updated
Jun 11, 2023
Authors
Alex P
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Social Animal 10K Articles with NLP

Text, Shares and NLP metadata from trending news articles and blog posts around the world.

The metadata for the 10K articles is present in the articles.csv file. The article text is available as a text file inside the text directory. The file name is available as the content column in the CSV file.

CSV File Header Documentation

The following is a documentation of the headers present in the CSV file:

  • url: The URL of the article.
  • domain: The domain or website where the article is published.
  • title: The title of the article.
  • authors: The author(s) of the article.
  • content: The content or text of the article. (File name of the text file in the text directory)
  • content_word_count: The word count of the article's content.
  • published_date: The date when the article was published.
  • created_at: The timestamp indicating when the article entry was created.
  • updated_at: The timestamp indicating when the article entry was last updated.
  • language: The language in which the article is written.
  • article_links: Any links or references present within the article.
  • total_share_count: The total count of shares for the article.
  • article_category: The category or topic to which the article belongs.
  • keywords: The keywords associated with the article.
  • entities: Any entities mentioned in the article (e.g., names of people, organizations).
  • sentiment: The sentiment analysis result for the article.
  • title_sentiment: The sentiment analysis result for the article's title.
  • keywords_ex: Extra or additional keywords related to the article.
  • content_types.listicle: Indicates if the article is a listicle.
  • content_types.infographic: Indicates if the article contains an infographic.
  • content_types.how_to: Indicates if the article is a how-to guide.
  • content_types.case_study: Indicates if the article is a case study.
  • content_types.guest_post: Indicates if the article is a guest post.
  • content_types.review: Indicates if the article is a review.
  • content_types.video: Indicates if the article contains a video.
  • content_types.podcast: Indicates if the article contains a podcast.
  • content_types.webinar: Indicates if the article contains a webinar.
  • content_types.interview: Indicates if the article is an interview.
  • content_types.quote: Indicates if the article contains a quote.
  • content_types.meme: Indicates if the article contains a meme.
  • content_types.give_away: Indicates if the article is a giveaway.
  • content_types.quiz: Indicates if the article is a quiz.
  • locations.cities_and_states: Cities and states mentioned or relevant to the article.
  • locations.countries: Countries mentioned or relevant to the article.
  • locations.geo_locations: Geographical locations associated with the article.

Reach out if you need more data

We have more than 450 million articles in our database. We have picked 10K articles from this database and shared it here. We continuously collect text articles from various sources and process them with our NLP pipeline. Our database contains near real time data on trending articles from around the world. Reach out to alex@socialanimal.io if you are interested in getting access to our API or database. www.socialanimal.com

Search
Clear search
Close search
Google apps
Main menu