8 datasets found

Social Animal 10K articles with text and NLP data
kaggle.com
zip
Updated Jun 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alex P (2023). Social Animal 10K articles with text and NLP data [Dataset]. https://www.kaggle.com/datasets/socialanimal/social-animal-10k-articles-with-text-and-nlp-data
Explore at:
zip(26400310 bytes)Available download formats
Dataset updated
Jun 11, 2023
Authors
Alex P
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Social Animal 10K Articles with NLP

Text, Shares and NLP metadata from trending news articles and blog posts around the world.

The metadata for the 10K articles is present in the articles.csv file. The article text is available as a text file inside the text directory. The file name is available as the content column in the CSV file.

CSV File Header Documentation

The following is a documentation of the headers present in the CSV file:

url: The URL of the article.

domain: The domain or website where the article is published.

title: The title of the article.

authors: The author(s) of the article.

content: The content or text of the article. (File name of the text file in the text directory)

content_word_count: The word count of the article's content.

published_date: The date when the article was published.

created_at: The timestamp indicating when the article entry was created.

updated_at: The timestamp indicating when the article entry was last updated.

language: The language in which the article is written.

article_links: Any links or references present within the article.

total_share_count: The total count of shares for the article.

article_category: The category or topic to which the article belongs.

keywords: The keywords associated with the article.

entities: Any entities mentioned in the article (e.g., names of people, organizations).

sentiment: The sentiment analysis result for the article.

title_sentiment: The sentiment analysis result for the article's title.

keywords_ex: Extra or additional keywords related to the article.

content_types.listicle: Indicates if the article is a listicle.

content_types.infographic: Indicates if the article contains an infographic.

content_types.how_to: Indicates if the article is a how-to guide.

content_types.case_study: Indicates if the article is a case study.

content_types.guest_post: Indicates if the article is a guest post.

content_types.review: Indicates if the article is a review.

content_types.video: Indicates if the article contains a video.

content_types.podcast: Indicates if the article contains a podcast.

content_types.webinar: Indicates if the article contains a webinar.

content_types.interview: Indicates if the article is an interview.

content_types.quote: Indicates if the article contains a quote.

content_types.meme: Indicates if the article contains a meme.

content_types.give_away: Indicates if the article is a giveaway.

content_types.quiz: Indicates if the article is a quiz.

locations.cities_and_states: Cities and states mentioned or relevant to the article.

locations.countries: Countries mentioned or relevant to the article.

locations.geo_locations: Geographical locations associated with the article.

Reach out if you need more data

We have more than 450 million articles in our database. We have picked 10K articles from this database and shared it here. We continuously collect text articles from various sources and process them with our NLP pipeline. Our database contains near real time data on trending articles from around the world. Reach out to alex@socialanimal.io if you are interested in getting access to our API or database. www.socialanimal.com
u
Code book of RTL visualization in Arabic News media
rdr.ucl.ac.uk
xlsx
Updated Jul 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muna Alebri; No ̈elle Rakotondravony; Lane Harrison (2024). Code book of RTL visualization in Arabic News media [Dataset]. http://doi.org/10.5522/04/26150749.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.5522/04/26150749.v1
Dataset updated
Jul 3, 2024
Dataset provided by
University College London
Authors
Muna Alebri; No ̈elle Rakotondravony; Lane Harrison
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
In this project, we aimed to map the visualisation design space of visualisation embedded in right-to-left (RTL) scripts. We aimed to expand our knowledge of visualisation design beyond the dominance of research based on left-to-right (LTR) scripts. Through this project, we identify common design practices regarding the chart structure, the text, and the source. We also identify ambiguity, particularly regarding the axis position and direction, suggesting that the community may benefit from unified standards similar to those found on web design for RTL scripts. To achieve this goal, we curated a dataset that covered 128 visualisations found in Arabic news media and coded these visualisations based on the chart composition (e.g., chart type, x-axis direction, y-axis position, legend position, interaction, embellishment type), text (e.g., availability of text, availability of caption, annotation type), and source (source position, attribution to designer, ownership of the visualisation design). Links are also provided to the articles and the visualisations. This dataset is limited for stand-alone visualisations, whether they were single-panelled or included small multiples. We also did not consider infographics in this project, nor any visualisation that did not have an identifiable chart type (e.g., bar chart, line chart). The attached documents also include some graphs from our analysis of the dataset provided, where we illustrate common design patterns and their popularity within our sample.

Global News Apps Market Research Report: By Content Type (Text News, Audio...

wiseguyreports.com

Updated Aug 6, 2025

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

(2025). Global News Apps Market Research Report: By Content Type (Text News, Audio News, Video News, Infographics, Podcasts), By User Demographics (Teenagers, Young Adults, Middle-Aged Adults, Seniors), By Platform (Mobile Apps, Web Apps, Desktop Apps), By Subscription Model (Free, Freemium, Paid Subscription) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/cn/reports/news-apps-market

Explore at:

Dataset updated

Aug 6, 2025

License

https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

Time period covered

Aug 25, 2025

Area covered

Global

Description

BASE YEAR	2024
HISTORICAL DATA	2019 - 2023
REGIONS COVERED	North America, Europe, APAC, South America, MEA
REPORT COVERAGE	Revenue Forecast, Competitive Landscape, Growth Factors, and Trends
MARKET SIZE 2024	7.73(USD Billion)
MARKET SIZE 2025	8.43(USD Billion)
MARKET SIZE 2035	20.0(USD Billion)
SEGMENTS COVERED	Content Type, User Demographics, Platform, Subscription Model, Regional
COUNTRIES COVERED	US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
KEY MARKET DYNAMICS	increased smartphone penetration, demand for personalized content, rise of subscription models, competition among platforms, integration of AI technology
MARKET FORECAST UNITS	USD Billion
KEY COMPANIES PROFILED	Flipboard, Snap, Reuters, Hearst, Microsoft, Google, Associated Press, Condé Nast, Apple, Amazon, News Corp, Twitter, Bloomberg, Reddit, Facebook
MARKET FORECAST PERIOD	2025 - 2035
KEY MARKET OPPORTUNITIES	Personalized content delivery, Integration of AI for insights, Augmented reality news experience, Enhanced multimedia storytelling, Real-time updates and alerts
COMPOUND ANNUAL GROWTH RATE (CAGR)	9.1% (2025 - 2035)

LNDS Annual Report 2023
kaggle.com
data.public.lu
+1more
zip
Updated Jan 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Olaf Yunus Laitinen Imanov (2025). LNDS Annual Report 2023 [Dataset]. https://www.kaggle.com/datasets/olaflundstrom/lnds-annual-report-2023
Explore at:
zip(3719686 bytes)Available download formats
Dataset updated
Jan 23, 2025
Authors
Olaf Yunus Laitinen Imanov
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
We are pleased to announce that LNDS has successfully completed its Annual Report of 2023. A fruit of rigorous and detailed work, the Annual Report highlights the achievements, challenges and prospects of its first full year in operation. Transparency is one of the key values of LNDS. As a sign of living its values, in addition to the Annual Report, LNDS has also made available all public data files associated with this report. These data files enable a more in-depth analysis and offer greater transparency on the past year's activities. This is a commendable initiative on the part of LNDS, demonstrating our commitment to openness and accountability. Each data file corresponds to a section of the Annual Report and shows the data that has been used to create the content, and most notably the data visualisations and infographics.
h
ViInfographicsVQA
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Đoàn Đặng Phương Nam, ViInfographicsVQA [Dataset]. https://huggingface.co/datasets/Namronaldo2004/ViInfographicsVQA
Explore at:
Authors
Đoàn Đặng Phương Nam
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Introduction

ViInfographicsVQA is a Vietnamese Visual Question Answering (VQA) dataset constructed from infographics sourced from 26 different news platforms. The dataset is designed to support research in multimodal learning by providing diverse questions and answers based on real-world visual data. The detailed distribution of sources is presented in the table below.

Figure 1: The number of infographics per news source.

Developed by: @Namronaldo2004, @Kiet2302… See the full description on the dataset page: https://huggingface.co/datasets/Namronaldo2004/ViInfographicsVQA.
g
Coronavirus COVID-19 cases in Austria
gimi9.com
Updated Dec 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Coronavirus COVID-19 cases in Austria [Dataset]. https://gimi9.com/dataset/eu_f8097a9b-2cca-441e-a13a-273209fbdadd
Explore at:
Dataset updated
Dec 15, 2024
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Austria
Description
The coronavirus is moving the world. The media landscape is also dominated by this topic. Up-to-date information on the corona situation is given out almost every minute in live tickers. But what is missing are current data visualizations on the cases of illness. Although there are data applications, such as the representations of the WHO, these have the global distribution in view. Static infographics, which are often no longer up-to-date at the time of publication, cannot do justice to the rapidly changing situation. The virus has arrived in Europe, now visualizations that the individual states have in view are in demand and as up-to-date as possible. Our developed data dashboard shows the current distribution on a state and district basis in maps, diagrams and lists. We try to keep the presentations as up-to-date as possible and research current data from the website of the Ministry of Social Affairs, via data.gv.at or search in Austrian news portals.
Population (2011 & 2021) and seats (2020) for the European Parliament
figshare.com
txt
Updated Jun 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wenruo Lyu (2023). Population (2011 & 2021) and seats (2020) for the European Parliament [Dataset]. http://doi.org/10.6084/m9.figshare.23358152.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.23358152.v1
Dataset updated
Jun 8, 2023
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Wenruo Lyu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is a dataset containing the population of each Member State from the 2011 and 2021 censuses held by Eurostat, and the current distribution of seats in the European Parliament (EP). The population data was downloaded from the official website of Eurostat (2011: https://ec.europa.eu/eurostat/web/population-demography/population-housing-censuses/database; 2021: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Population_and_housing_census_2021_-_population_grids&stable=1#Distribution_of_European_population). The seat data was obtained from the official website of the EP (https://www.europarl.europa.eu/news/en/headlines/eu-affairs/20180126STO94114/infographic-how-many-seats-does-each-country-get-in-in-the-european-parliament).
m
Stock Illustration Market Size, Share & Industry Analysis 2033
marketresearchintellect.com
Updated Jul 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Intellect (2025). Stock Illustration Market Size, Share & Industry Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-stock-illustration-market-size-and-forecast/
Explore at:
Dataset updated
Jul 3, 2025
Dataset authored and provided by
Market Research Intellect
License
https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy
Area covered
Global
Description
Discover Market Research Intellect's Stock Illustration Market Report, worth USD 1.5 billion in 2024 and projected to hit USD 2.8 billion by 2033, registering a CAGR of 8.5% between 2026 and 2033.Gain in-depth knowledge of emerging trends, growth drivers, and leading companies.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Alex P (2023). Social Animal 10K articles with text and NLP data [Dataset]. https://www.kaggle.com/datasets/socialanimal/social-animal-10k-articles-with-text-and-nlp-data

Social Animal 10K articles with text and NLP data

Text, share count and NLP metadata from trending news articles and blogs

Explore at:

zip(26400310 bytes)Available download formats

Dataset updated

Jun 11, 2023

Authors

Alex P

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Social Animal 10K Articles with NLP

Text, Shares and NLP metadata from trending news articles and blog posts around the world.

The metadata for the 10K articles is present in the articles.csv file. The article text is available as a text file inside the text directory. The file name is available as the content column in the CSV file.

CSV File Header Documentation

The following is a documentation of the headers present in the CSV file:

url: The URL of the article.
domain: The domain or website where the article is published.
title: The title of the article.
authors: The author(s) of the article.
content: The content or text of the article. (File name of the text file in the text directory)
content_word_count: The word count of the article's content.
published_date: The date when the article was published.
created_at: The timestamp indicating when the article entry was created.
updated_at: The timestamp indicating when the article entry was last updated.
language: The language in which the article is written.
article_links: Any links or references present within the article.
total_share_count: The total count of shares for the article.
article_category: The category or topic to which the article belongs.
keywords: The keywords associated with the article.
entities: Any entities mentioned in the article (e.g., names of people, organizations).
sentiment: The sentiment analysis result for the article.
title_sentiment: The sentiment analysis result for the article's title.
keywords_ex: Extra or additional keywords related to the article.
content_types.listicle: Indicates if the article is a listicle.
content_types.infographic: Indicates if the article contains an infographic.
content_types.how_to: Indicates if the article is a how-to guide.
content_types.case_study: Indicates if the article is a case study.
content_types.guest_post: Indicates if the article is a guest post.
content_types.review: Indicates if the article is a review.
content_types.video: Indicates if the article contains a video.
content_types.podcast: Indicates if the article contains a podcast.
content_types.webinar: Indicates if the article contains a webinar.
content_types.interview: Indicates if the article is an interview.
content_types.quote: Indicates if the article contains a quote.
content_types.meme: Indicates if the article contains a meme.
content_types.give_away: Indicates if the article is a giveaway.
content_types.quiz: Indicates if the article is a quiz.
locations.cities_and_states: Cities and states mentioned or relevant to the article.
locations.countries: Countries mentioned or relevant to the article.
locations.geo_locations: Geographical locations associated with the article.

Reach out if you need more data

We have more than 450 million articles in our database. We have picked 10K articles from this database and shared it here. We continuously collect text articles from various sources and process them with our NLP pipeline. Our database contains near real time data on trending articles from around the world. Reach out to alex@socialanimal.io if you are interested in getting access to our API or database. www.socialanimal.com

Clear search

Close search

Google apps

Main menu

Social Animal 10K articles with text and NLP data

Social Animal 10K Articles with NLP

Text, Shares and NLP metadata from trending news articles and blog posts around the world.

CSV File Header Documentation

Reach out if you need more data

Code book of RTL visualization in Arabic News media

Global News Apps Market Research Report: By Content Type (Text News, Audio...

LNDS Annual Report 2023

ViInfographicsVQA

Coronavirus COVID-19 cases in Austria

Population (2011 & 2021) and seats (2020) for the European Parliament

Stock Illustration Market Size, Share & Industry Analysis 2033

Social Animal 10K articles with text and NLP data

Text, share count and NLP metadata from trending news articles and blogs

Social Animal 10K Articles with NLP

Text, Shares and NLP metadata from trending news articles and blog posts around the world.

CSV File Header Documentation

Reach out if you need more data