100+ datasets found

Daily Google News (update infrequently)
kaggle.com
zip
Updated May 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
crxxom (2025). Daily Google News (update infrequently) [Dataset]. https://www.kaggle.com/datasets/crxxom/daily-google-news
Explore at:
zip(120405348 bytes)Available download formats
Dataset updated
May 21, 2025
Authors
crxxom
Description
This dataset contains metadata of millions of news articles from Google News, including title, publisher, DateTime, link, and category.

This is also an automation project in which data is scraped every day at 4am UTC on 8 major categories. This dataset is expected to have a monthly update, thus the data collected daily will be merged into a single monthly csv file and published on Kaggle at the end of each month. One may expect the value of the dataset to continuously grow through time.

If you find this dataset useful, feel free to drop a like. If you have any requests/suggestions/inquires, feel free to leave it in the comment sections as well.

What does the dataset contain?

As mentioned, each monthly csv file mainly contain 5 columns

1. Title: The title of the news article

2. Publisher: The publisher of the news article

3. DateTime: The DateTime of when the news article is published on Google News

4. Link: A link that will direct users to the corresponding article, one may feel free to dig deeper and scrape extended content by following the links

5. Category: 8 major categories defined by Google News, particularly Business, Entertainment, Headlines, Health, Science, Sports, Technology and WorldWide.
w
Websites using Google News
webtechsurvey.com
csv
Updated Oct 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WebTechSurvey (2025). Websites using Google News [Dataset]. https://webtechsurvey.com/technology/google-news
Explore at:
csvAvailable download formats
Dataset updated
Oct 10, 2025
Dataset authored and provided by
WebTechSurvey
License
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
Time period covered
2025
Area covered
Global
Description
A complete list of live websites using the Google News technology, compiled through global website indexing conducted by WebTechSurvey.
Credibility of Google News in the U.S. 2022
statista.com
Updated Nov 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Credibility of Google News in the U.S. 2022 [Dataset]. https://www.statista.com/statistics/1308030/google-news-credibility-in-the-united-states/
Explore at:
Dataset updated
Nov 27, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Feb 9, 2022 - Feb 10, 2022
Area covered
United States
Description
A total of 20 percent of U.S. adults responding to a survey in February 2022 said that they thought Google News was very credible, and eight percent found the source to be not all credible. Google News' credibility rating was higher among Black and Hispanic respondents than their white counterparts, and Gen Z and millennials were also more likely to consider Google News a very credible source of information than their older peers.
google-news-vectors
kaggle.com
zip
Updated Oct 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Didier Salazar (2024). google-news-vectors [Dataset]. https://www.kaggle.com/datasets/didiersalazar/google-news-vectors
Explore at:
zip(1760926034 bytes)Available download formats
Dataset updated
Oct 23, 2024
Authors
Didier Salazar
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Didier Salazar

Released under Apache 2.0

Contents
w
Websites using Simple Google News De
webtechsurvey.com
csv
Updated Oct 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WebTechSurvey (2025). Websites using Simple Google News De [Dataset]. https://webtechsurvey.com/technology/simple-google-news-de
Explore at:
csvAvailable download formats
Dataset updated
Oct 14, 2025
Dataset authored and provided by
WebTechSurvey
License
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
Time period covered
2025
Area covered
Global
Description
A complete list of live websites using the Simple Google News De technology, compiled through global website indexing conducted by WebTechSurvey.
Data from: Google-News
kaggle.com
zip
Updated Nov 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Healthy_Manish (2024). Google-News [Dataset]. https://www.kaggle.com/datasets/manishdwivedi2/google-news
Explore at:
zip(58183 bytes)Available download formats
Dataset updated
Nov 28, 2024
Authors
Healthy_Manish
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This Dataset consist of Title with it's snippet and publisher name and also the timestamp at which it was being posted. It Had been categorised in 7 columns i.e ['Buisness', 'entertainment', 'world', 'health', 'sport', 'science', 'technology']. Have fun with it!
w
Websites using Google News Automatic Widget
webtechsurvey.com
csv
Updated Oct 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WebTechSurvey (2025). Websites using Google News Automatic Widget [Dataset]. https://webtechsurvey.com/technology/google-news-automatic-widget
Explore at:
csvAvailable download formats
Dataset updated
Oct 13, 2025
Dataset authored and provided by
WebTechSurvey
License
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
Time period covered
2025
Area covered
Global
Description
A complete list of live websites using the Google News Automatic Widget technology, compiled through global website indexing conducted by WebTechSurvey.
o
News Data, Global News, Topic News, and More from Google News
openwebninja.com
json
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
OpenWeb Ninja, News Data, Global News, Topic News, and More from Google News [Dataset]. https://www.openwebninja.com/api/real-time-news-data
Explore at:
jsonAvailable download formats
Dataset authored and provided by
OpenWeb Ninja
Area covered
Global News Coverage
Description
This dataset provides comprehensive access to news articles and headlines from Google News in real-time. Get top news globally or by specific topics, with support for geographic targeting and custom search queries. Perfect for applications requiring news monitoring, media analysis, and content aggregation. The dataset is delivered in a JSON format via REST API.
Google News - Sports
kaggle.com
zip
Updated Aug 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shivam Taneja (2024). Google News - Sports [Dataset]. https://www.kaggle.com/datasets/shivamtaneja2304/google-news-sports
Explore at:
zip(4658953 bytes)Available download formats
Dataset updated
Aug 27, 2024
Authors
Shivam Taneja
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This is a news of sport news titles from Google News. The dataset is updated daily. It has 3 values 1. Headline - That is the headline of the news 2. Sport - The sport in question 3. Date - The day the news was scraped.
t
GoogleNews - Dataset - LDM
service.tib.eu
Updated Dec 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). GoogleNews - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/googlenews
Explore at:
Dataset updated
Dec 2, 2024
Description
The dataset used in this paper is a collection of news articles from Google News.
C
Data on Google News coverage in Brazil, Colombia, Mexico, Portugal and Spain...
dataverse.csuc.cat
tsv, txt
Updated Jul 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Douglas Cordeiro; Douglas Cordeiro; Javier Guallar; Javier Guallar; Carlos Lopezosa; Carlos Lopezosa; Mari Vállez; Mari Vállez (2025). Data on Google News coverage in Brazil, Colombia, Mexico, Portugal and Spain [Dataset]. http://doi.org/10.34810/data1243
Explore at:
tsv(677137), tsv(4985925), txt(1848)Available download formats
Unique identifier
https://doi.org/10.34810/data1243
Dataset updated
Jul 14, 2025
Dataset provided by
CORA.Repositori de Dades de Recerca
Authors
Douglas Cordeiro; Douglas Cordeiro; Javier Guallar; Javier Guallar; Carlos Lopezosa; Carlos Lopezosa; Mari Vállez; Mari Vállez
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains the set of records extracted from the main pages of some version of Google News (Brazil, Colombia, Mexico, Portugal, Spain). The data were extracted using a web scraping computational solution. The acquired data were integrated into a structured database. Google News versions: Brazil, Colombia, Mexico, Portugal, Spain
GoogleNews-vectors-negative300 ( word2vec )
kaggle.com
zip
Updated Jun 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KA-KA-shi (2021). GoogleNews-vectors-negative300 ( word2vec ) [Dataset]. https://www.kaggle.com/adarshsng/googlenewsvectors
Explore at:
zip(1760926034 bytes)Available download formats
Dataset updated
Jun 6, 2021
Authors
KA-KA-shi
Description
word2vec

This repository hosts the word2vec pre-trained Google News corpus (3 billion running words) word vector model (3 million 300-dimension English word vectors).
f
Correlation between scientific production (as captured by Google Scholar and...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Nov 3, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Brigo, Francesco; Durando, Paolo; Toletone, Alessandra; Dini, Guglielmo; Bragazzi, Nicola Luigi (2016). Correlation between scientific production (as captured by Google Scholar and PubMed), news coverage (as captured by Google News), web queries (as captured by Google Trends), access to Wikipedia page and Internet activities (as captured by Twitter and YouTube). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001571613
Explore at:
Dataset updated
Nov 3, 2016
Authors
Brigo, Francesco; Durando, Paolo; Toletone, Alessandra; Dini, Guglielmo; Bragazzi, Nicola Luigi
Area covered
YouTube
Description
Correlation between scientific production (as captured by Google Scholar and PubMed), news coverage (as captured by Google News), web queries (as captured by Google Trends), access to Wikipedia page and Internet activities (as captured by Twitter and YouTube).
Data from: Google news Dataset
kaggle.com
zip
Updated Dec 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahmad Khaled (2025). Google news Dataset [Dataset]. https://www.kaggle.com/datasets/lohpohk/google-news-dataset
Explore at:
zip(1760926034 bytes)Available download formats
Dataset updated
Dec 5, 2025
Authors
Ahmad Khaled
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Ahmad Khaled

Released under MIT

Contents
Monthly app downloads of Google News in Japan 2023
statista.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista, Monthly app downloads of Google News in Japan 2023 [Dataset]. https://www.statista.com/statistics/1398484/japan-monthly-number-of-app-downloads-google-news/
Explore at:
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jan 2023 - Dec 2023
Area covered
Japan
Description
The Google News app was downloaded more than ****** times in Japan in December 2023. The total number of downloads during that year reached more than *******. The news aggregation app was released by Google LLC in 2012.
D
Google News Search Results for Japanese Yen
dataandsons.com
csv, zip
Updated Jan 9, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kirill Konovalov (2022). Google News Search Results for Japanese Yen [Dataset]. https://www.dataandsons.com/categories/markets/google-news-search-results-for-japanese-yen
Explore at:
zip, csvAvailable download formats
Dataset updated
Jan 9, 2022
Dataset provided by
Data & Sons
Authors
Kirill Konovalov
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Time period covered
Apr 7, 2017 - Jan 7, 2022
Area covered
Japan
Description
About this Dataset

Results of scraping Google News search results for "JPY" (2017-2022).

Category

Markets

Keywords

jpy,news,google news,google

Row Count

1233

Price

$1700.00
General characteristics of health news and scientific articles.
plos.figshare.com
figshare.com
xls
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Romana Haneef; Clement Lazarus; Philippe Ravaud; Amélie Yavchitz; Isabelle Boutron (2023). General characteristics of health news and scientific articles. [Dataset]. http://doi.org/10.1371/journal.pone.0140889.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0140889.t001
Dataset updated
Jun 3, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Romana Haneef; Clement Lazarus; Philippe Ravaud; Amélie Yavchitz; Isabelle Boutron
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*[IQR], interquartile rangeGeneral characteristics of health news and scientific articles.
Leading news and magazine apps in Google Play in Germany 2021, by downloads
statista.com
Updated Nov 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Leading news and magazine apps in Google Play in Germany 2021, by downloads [Dataset]. https://www.statista.com/statistics/690712/leading-news-and-magazine-apps-in-google-play-in-germany-by-downloads/
Explore at:
Dataset updated
Nov 27, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Dec 2021
Area covered
Germany
Description
Opera news ranked the top leading news and magazine mobile app in the Google Play Store in Germany as of December 2021, amounting to around 99.1 thousand downloads. Additionally, following that was ZDFheute - Nachrichten with 86.9 thousand.
h
google_news_en
huggingface.co
Updated Jul 30, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carlos Muñoz (2022). google_news_en [Dataset]. https://huggingface.co/datasets/cmunhozc/google_news_en
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 30, 2022
Authors
Carlos Muñoz
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Attributes:

This dataset comprises three attributes: the first corresponds to Headlines 1, the second to Headlines 2, and the third to the target variable. Both sentences are associated with news extracted from Google News, while the target variable indicates whether both sentences are related to the same event (1) or not (0).

Data Source:

The dataset is derived from Google News headlines between July 23, 2022, and July 30, 2022, which were manually annotated.… See the full description on the dataset page: https://huggingface.co/datasets/cmunhozc/google_news_en.
d
Replication Data for: Stereotype Content Dictionary: A semantic space of 3...
dataone.org
dataverse.harvard.edu
Updated Dec 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
qin, xuanlong (2023). Replication Data for: Stereotype Content Dictionary: A semantic space of 3 million words and phrases using Google News word2vec embeddings [Dataset]. http://doi.org/10.7910/DVN/OUXIYW
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/OUXIYW
Dataset updated
Dec 16, 2023
Dataset provided by
Harvard Dataverse
Authors
qin, xuanlong
Description
Stereotype Content Dictionary: A semantic space of 3 million words and phrases using Google News word2vec embeddings

Facebook

Twitter

Click to copy link

Link copied

Cite

crxxom (2025). Daily Google News (update infrequently) [Dataset]. https://www.kaggle.com/datasets/crxxom/daily-google-news

Daily Google News (update infrequently)

Contains millions of daily news metadata scraped from Google News

Explore at:

zip(120405348 bytes)Available download formats

Dataset updated

May 21, 2025

Authors

crxxom

Description

This dataset contains metadata of millions of news articles from Google News, including title, publisher, DateTime, link, and category.

This is also an automation project in which data is scraped every day at 4am UTC on 8 major categories. This dataset is expected to have a monthly update, thus the data collected daily will be merged into a single monthly csv file and published on Kaggle at the end of each month. One may expect the value of the dataset to continuously grow through time.

If you find this dataset useful, feel free to drop a like. If you have any requests/suggestions/inquires, feel free to leave it in the comment sections as well.

What does the dataset contain?

As mentioned, each monthly csv file mainly contain 5 columns

1. Title: The title of the news article

2. Publisher: The publisher of the news article

3. DateTime: The DateTime of when the news article is published on Google News

4. Link: A link that will direct users to the corresponding article, one may feel free to dig deeper and scrape extended content by following the links

5. Category: 8 major categories defined by Google News, particularly Business, Entertainment, Headlines, Health, Science, Sports, Technology and WorldWide.

Clear search

Close search

Google apps

Main menu

Daily Google News (update infrequently)

What does the dataset contain?

Websites using Google News

Credibility of Google News in the U.S. 2022

google-news-vectors

Dataset

Contents

Websites using Simple Google News De

Data from: Google-News

Websites using Google News Automatic Widget

News Data, Global News, Topic News, and More from Google News

Google News - Sports

GoogleNews - Dataset - LDM

Data on Google News coverage in Brazil, Colombia, Mexico, Portugal and Spain...

GoogleNews-vectors-negative300 ( word2vec )

word2vec

Correlation between scientific production (as captured by Google Scholar and...

Data from: Google news Dataset

Dataset

Contents

Monthly app downloads of Google News in Japan 2023

Google News Search Results for Japanese Yen

About this Dataset

Category

Keywords

Row Count

Price

General characteristics of health news and scientific articles.

Leading news and magazine apps in Google Play in Germany 2021, by downloads

google_news_en

Replication Data for: Stereotype Content Dictionary: A semantic space of 3...

Daily Google News (update infrequently)

Contains millions of daily news metadata scraped from Google News

What does the dataset contain?