5 datasets found

The GDELT Project
kaggle.com
zip
Updated Feb 12, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The GDELT Project (2019). The GDELT Project [Dataset]. https://www.kaggle.com/datasets/gdelt/gdelt
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Feb 12, 2019
Dataset authored and provided by
The GDELT Project
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

The GDELT Project is the largest, most comprehensive, and highest resolution open database of human society ever created. Just the 2015 data alone records nearly three quarters of a trillion emotional snapshots and more than 1.5 billion location references, while its total archives span more than 215 years, making it one of the largest open-access spatio-temporal datasets in existance and pushing the boundaries of "big data" study of global human society. Its Global Knowledge Graph connects the world's people, organizations, locations, themes, counts, images and emotions into a single holistic network over the entire planet. How can you query, explore, model, visualize, interact, and even forecast this vast archive of human society?

Content

GDELT 2.0 has a wealth of features in the event database which includes events reported in articles published in 65 live translated languages, measurements of 2,300 emotions and themes, high resolution views of the non-Western world, relevant imagery, videos, and social media embeds, quotes, names, amounts, and more.

You may find these code books helpful:
GDELT Global Knowledge Graph Codebook V2.1 (PDF)
GDELT Event Codebook V2.0 (PDF)

Querying BigQuery tables

You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.github_repos.[TABLENAME]. [Fork this kernel to get started][98] to learn how to safely manage analyzing large BigQuery datasets.

Acknowledgements

You may redistribute, rehost, republish, and mirror any of the GDELT datasets in any form. However, any use or redistribution of the data must include a citation to the GDELT Project and a link to the website (https://www.gdeltproject.org/).
GDELT 2.0 Event Database
console.cloud.google.com
Updated Jul 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
https://console.cloud.google.com/marketplace/browse?filter=partner:The%20GDELT%20Project&hl=JA&inv=1&invt=Ab1ekA (2023). GDELT 2.0 Event Database [Dataset]. https://console.cloud.google.com/marketplace/product/the-gdelt-project/gdelt-2-events?hl=JA
Explore at:
Dataset updated
Jul 18, 2023
Dataset provided by
Googlehttp://google.com/
Description
The GDELT 2.0 Event Database is a global catalog of worldwide activities (“events”) in over 300 categories from protests and military attacks to peace appeals and diplomatic exchanges. Each event record details 58 fields capturing many different attributes of the event. The GDELT 2.0 Event Database currently runs from February 2015 to present, updated every 15 minutes and is comprised of 326 million mentions of 103 million distinct events as of February 19, 2016. This dataset uses machine translation coverage of all monitored content in 65 core languages, with a sample of an additional 35 languages hand translated. It also expands upon GDELT 1.0 by providing a separate MENTIONS table that records every mention of each event, along with the offset, context and confidence of each of those mentions. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery .
h
gdelt-mentions-2025-v2
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Don Branson, gdelt-mentions-2025-v2 [Dataset]. https://huggingface.co/datasets/dwb2023/gdelt-mentions-2025-v2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Don Branson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for dwb2023/gdelt-mentions-2025-v2

This dataset contains the mentions records from the GDELT (Global Database of Events, Language, and Tone) Project, tracking how global events are mentioned across media sources over time.

Dataset Details Dataset Description

The GDELT Mentions table is a component of the GDELT Event Database that tracks each mention of an event across all monitored news sources. Unlike the Event table which records unique events… See the full description on the dataset page: https://huggingface.co/datasets/dwb2023/gdelt-mentions-2025-v2.
h
gdelt-gkg-march2020-v2
huggingface.co
Updated Mar 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Don Branson (2020). gdelt-gkg-march2020-v2 [Dataset]. https://huggingface.co/datasets/dwb2023/gdelt-gkg-march2020-v2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 10, 2020
Authors
Don Branson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for dwb2023/gdelt-gkg-march2020-v2

Dataset Details Dataset Description

This dataset contains GDELT Global Knowledge Graph (GKG) data covering March 10-22, 2020, during the early phase of the COVID-19 pandemic. It captures global event interactions, actor relationships, and contextual narratives to support temporal, spatial, and thematic analysis.

Curated by: dwb2023

Dataset Sources

Repository: http://data.gdeltproject.org/gdeltv2 GKG… See the full description on the dataset page: https://huggingface.co/datasets/dwb2023/gdelt-gkg-march2020-v2.
h
gdelt-event-2025-v2
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Don Branson, gdelt-event-2025-v2 [Dataset]. https://huggingface.co/datasets/dwb2023/gdelt-event-2025-v2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Don Branson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for dwb2023/gdelt-event-2025-v2

This dataset contains global event records from the GDELT (Global Database of Events, Language, and Tone) Project, capturing real-world events and their characteristics across the globe through news media coverage.

Dataset Details Dataset Description

The GDELT Event Database is a comprehensive repository of human societal-scale behavior and beliefs across all countries of the world, connecting every person… See the full description on the dataset page: https://huggingface.co/datasets/dwb2023/gdelt-event-2025-v2.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

The GDELT Project (2019). The GDELT Project [Dataset]. https://www.kaggle.com/datasets/gdelt/gdelt

The GDELT Project

A realtime database of global human society for open research

Explore at:

zip(0 bytes)Available download formats

Dataset updated

Feb 12, 2019

Dataset authored and provided by

The GDELT Project

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

The GDELT Project is the largest, most comprehensive, and highest resolution open database of human society ever created. Just the 2015 data alone records nearly three quarters of a trillion emotional snapshots and more than 1.5 billion location references, while its total archives span more than 215 years, making it one of the largest open-access spatio-temporal datasets in existance and pushing the boundaries of "big data" study of global human society. Its Global Knowledge Graph connects the world's people, organizations, locations, themes, counts, images and emotions into a single holistic network over the entire planet. How can you query, explore, model, visualize, interact, and even forecast this vast archive of human society?

Content

GDELT 2.0 has a wealth of features in the event database which includes events reported in articles published in 65 live translated languages, measurements of 2,300 emotions and themes, high resolution views of the non-Western world, relevant imagery, videos, and social media embeds, quotes, names, amounts, and more.

You may find these code books helpful:
GDELT Global Knowledge Graph Codebook V2.1 (PDF)
GDELT Event Codebook V2.0 (PDF)

Querying BigQuery tables

You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.github_repos.[TABLENAME]. [Fork this kernel to get started][98] to learn how to safely manage analyzing large BigQuery datasets.

Acknowledgements

You may redistribute, rehost, republish, and mirror any of the GDELT datasets in any form. However, any use or redistribution of the data must include a citation to the GDELT Project and a link to the website (https://www.gdeltproject.org/).

Clear search

Close search

Google apps

Main menu

The GDELT Project

Context

Content

Querying BigQuery tables

Acknowledgements

GDELT 2.0 Event Database

gdelt-mentions-2025-v2

gdelt-gkg-march2020-v2

gdelt-event-2025-v2

The GDELT Project

A realtime database of global human society for open research

Context

Content

Querying BigQuery tables

Acknowledgements