36 datasets found

World Religion Project - Global Religion Dataset
thearda.com
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Association of Religion Data Archives, World Religion Project - Global Religion Dataset [Dataset]. http://doi.org/10.17605/OSF.IO/J7BCM
Explore at:
Unique identifier
https://doi.org/10.17605/OSF.IO/J7BCM
Dataset provided by
Association of Religion Data Archives
Dataset funded by
The University of California, Davis
The John Templeton Foundation
Description
The World Religion Project (WRP) aims to provide detailed information about religious adherence worldwide since 1945. It contains data about the number of adherents by religion in each of the states in the international system. These numbers are given for every half-decade period (1945, 1950, etc., through 2010). Percentages of the states' populations that practice a given religion are also provided. (Note: These percentages are expressed as decimals, ranging from 0 to 1, where 0 indicates that 0 percent of the population practices a given religion and 1 indicates that 100 percent of the population practices that religion.) Some of the religions (as detailed below) are divided into religious families. To the extent data are available, the breakdown of adherents within a given religion into religious families is also provided.

The project was developed in three stages. The first stage consisted of the formation of a religion tree. A religion tree is a systematic classification of major religions and of religious families within those major religions. To develop the religion tree we prepared a comprehensive literature review, the aim of which was (i) to define a religion, (ii) to find tangible indicators of a given religion of religious families within a major religion, and (iii) to identify existing efforts at classifying world religions. (Please see the original survey instrument to view the structure of the religion tree.) The second stage consisted of the identification of major data sources of religious adherence and the collection of data from these sources according to the religion tree classification. This created a dataset that included multiple records for some states for a given point in time. It also contained multiple missing data for specific states, specific time periods and specific religions. The third stage consisted of cleaning the data, reconciling discrepancies of information from different sources and imputing data for the missing cases.

The Global Religion Dataset: This dataset uses a religion-by-five-year unit. It aggregates the number of adherents of a given religion and religious group globally by five-year periods.
World Religions Across Regions
kaggle.com
Updated Dec 6, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2022). World Religions Across Regions [Dataset]. https://www.kaggle.com/datasets/thedevastator/a-global-perspective-on-world-religions-1945-201/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 6, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
The Devastator
Area covered
World
Description
World Religions Across Regions

Analyzing Adherence Across Regions, States and the Global System

By Correlates of War Project [source]

About this dataset

The World Religion Project (WRP) is an ambitious endeavor to conduct a comprehensive analysis of religious adherence throughout the world from 1945 to 2010. This cutting-edge project offers unparalleled insight into the religious behavior of people in different countries, regions, and continents during this time period. Its datasets provide important information about the numbers and percentages of adherents across a multitude of different religions, religion families, and non-religious affiliations.

The WRP consists of three distinct datasets: the national religion dataset, regional religion dataset, and global religion dataset. Each is focused on understanding individually specific realms for varied analysis approaches - from individual states to global systems. The national dataset provides data on number of adherents by state as well as percentage population practicing a given faith group in five-year increments; focusing attention to how this number evolves from nation to nation over time. Similarly, regional data is provided at five year intervals highlighting individual region designations with one modification – Pacific Ocean states have been reclassified into their own Oceania category according to Country Code Number 900 or above). Finally at a global level – all states are aggregated in order that we may understand a snapshot view at any five-year interval between 1945‐2010 regarding relationships between religions or religio‐families within one location or transnationally.

This project was developed in three stages: firstly forming a religions tree (a systematic classification), secondly collecting data such as this provided by WRP according to that classification structure – lastly cleaning the data so discrepancies may be reconciled and imported where needed with gaps selected when unknown values were encountered during collection process . We would encourage anyone wishing details undergoing more detailed reading/analysis relating various use applications for these rich datasets - please contact Zeev Maoz (University California Davis) & Errol A Henderson _(Pennsylvania State University)

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

The World Religions Project (WRP) dataset offers a comprehensive look at religious adherence around the world within a single dataset. With this dataset, you can track global religious trends over a period of 65 years and explore how they’ve changed during that time. By exploring the WRP data set, you’ll gain insight into cross-regional and cross-time patterns in religious affiliation around the world.

Research Ideas

Analyzing historical patterns of religious growth and decline across different regions

Creating visualizations to compare religious adherence in various states, countries, or globally

Studying the impact of governmental policies on religious participation over time

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

Columns

File: WRP regional data.csv | Column name | Description | |:-----------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------| | Year | Reference year for data collection. (Integer) | | Region | World region according to Correlates Of War (COW) Regional Systemizations with one modification (Oceania category for COW country code ...
Dataset of Global Religious Composition Estimates for 2010 and 2020
pewresearch.org
Updated 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Conrad Hackett; Marcin Stonawski; Yunping Tong; Stephanie Kramer; Anne Fengyan Shi (2025). Dataset of Global Religious Composition Estimates for 2010 and 2020 [Dataset]. http://doi.org/10.58094/vhrw-k516
Explore at:
Unique identifier
https://doi.org/10.58094/vhrw-k516
Dataset updated
2025
Dataset provided by
Pew Research Centerhttp://pewresearch.org/
datacite
Authors
Conrad Hackett; Marcin Stonawski; Yunping Tong; Stephanie Kramer; Anne Fengyan Shi
License
https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/
Dataset funded by
John Templeton Foundation
Pew Charitable Trusts
Description
This dataset describes the world’s religious makeup in 2020 and 2010. We focus on seven categories: Christians, Muslims, Hindus, Buddhists, Jews, people who belong to other religions, and those who are religiously unaffiliated. This analysis is based on more than 2,700 sources of data, including national censuses, large-scale demographic surveys, general population surveys and population registers. For more information about this data, see the associated Pew Research Center report "How the Global Religious Landscape Changed From 2010 to 2020."
E
World Sites (TimeMap Sample Dataset)
ecaidata.org
Updated Oct 4, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECAI Clearinghouse (2014). World Sites (TimeMap Sample Dataset) [Dataset]. https://ecaidata.org/dataset/ecaiclearinghouse-id-12
Explore at:
Dataset updated
Oct 4, 2014
Dataset provided by
ECAI Clearinghouse
Area covered
World
Description
Initial data source was UNESCO web site, supplemented by individual work on different countires/regions;A database of cultural heritage sites assembled by volunteers at the Archaeological Computing Laboratory, University of Sydney
Religious composition of the world's migrants: Peru case study
pewresearch.org
Updated 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anne Fengyan Shi; Yunping Tong; Stephanie Kramer (2024). Religious composition of the world's migrants: Peru case study [Dataset]. http://doi.org/10.58094/zk7y-q042
Explore at:
Unique identifier
https://doi.org/10.58094/zk7y-q042
Dataset updated
2024
Dataset provided by
Pew Research Centerhttp://pewresearch.org/
datacite
Authors
Anne Fengyan Shi; Yunping Tong; Stephanie Kramer
License
https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/
Area covered
World
Dataset funded by
The Pew Charitable Trustshttps://www.pew.org/
John Templeton Foundationhttp://templeton.org/
Description
This folder consists of files for a case study of the methods used by Pew Research Center to make direct and indirect estimates for our report on The Religious Composition of the World's Migrants. Two subfolders demonstrate the procedures of the algorithm using two statistical programs, which mirror one another.
t
World's Muslims Data Set, 2012
thearda.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
James Bell, World's Muslims Data Set, 2012 [Dataset]. http://doi.org/10.17605/OSF.IO/C2VE5
Explore at:
Unique identifier
https://doi.org/10.17605/OSF.IO/C2VE5
Dataset provided by
The Association of Religion Data Archives
Authors
James Bell
Dataset funded by
The Pew Charitable Trusts
The John Templeton Foundation
Description
"Between October 2011 and November 2012, Pew Research Center, with generous funding from The Pew Charitable Trusts and the John Templeton Foundation, conducted a public opinion survey involving more than 30,000 face-to-face interviews in 26 countries in Africa, Asia, the Middle East and Europe. The survey asked people to describe their religious beliefs and practices, and sought to gauge respondents; knowledge of and attitudes toward other faiths. It aimed to assess levels of political and economic satisfaction, concerns about crime, corruption and extremism, positions on issues such as abortion and polygamy, and views of democracy, religious law and the place of women in society.

"Although the surveys were nationally representative in most countries, the primary goal of the survey was to gauge and compare beliefs and attitudes of Muslims. The findings for Muslim respondents are summarized in the Religion & Public Life Project's reports The World's Muslims: Unity and Diversity and The World's Muslims: Religion, Politics and Society, which are available at www.pewresearch.org. [...] This dataset only contains data for Muslim respondents in the countries surveyed. Please note that this codebook is meant as a guide to the dataset, and is not the survey questionnaire." (2012 Pew Religion Worlds Muslims Codebook)
l
Census 2021 - Religion
data.leicester.gov.uk
csv, excel, json
Updated May 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Census 2021 - Religion [Dataset]. https://data.leicester.gov.uk/explore/dataset/census-2021-leicester-religion/
Explore at:
csv, excel, jsonAvailable download formats
Dataset updated
May 25, 2023
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
The census is undertaken by the Office for National Statistics every 10 years and gives us a picture of all the people and households in England and Wales. The most recent census took place in March of 2021.The census asks every household questions about the people who live there and the type of home they live in. In doing so, it helps to build a detailed snapshot of society. Information from the census helps the government and local authorities to plan and fund local services, such as education, doctors' surgeries and roads.Key census statistics for Leicester are published on the open data platform to make information accessible to local services, voluntary and community groups, and residents.Further information about the census and full datasets can be found on the ONS website - https://www.ons.gov.uk/census/aboutcensus/censusproductsReligionThis dataset provides Census 2021 estimates that classify usual residents in England and Wales by religion. The estimates are as at Census Day, 21 March 2021.Definition: The religion people connect or identify with (their religious affiliation), whether or not they practice or have belief in it.This question was voluntary and the variable includes people who answered the question, including 'No Religion', alongside those who chose not to answer this question.This variable classifies responses into the eight tick-box response options. Write-in responses are classified by their "parent" religious affiliation, including 'No Religion', where applicable.This dataset contains details for Leicester City and England overall. There is also a dashboard that has been produced to show a selection of Census statistics for the city of Leicester which can be viewed here: Census 21 - Leicester dashboard.
w
Dataset of books called All one body : bishops of the Anglican Church speak...
workwithdata.com
Updated Apr 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Dataset of books called All one body : bishops of the Anglican Church speak of Christian faith and action in different parts of the world today [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=All+one+body+%3A+bishops+of+the+Anglican+Church+speak+of+Christian+faith+and+action+in+different+parts+of+the+world+today
Explore at:
Dataset updated
Apr 17, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about books. It has 1 row and is filtered where the book is All one body : bishops of the Anglican Church speak of Christian faith and action in different parts of the world today. It features 7 columns including author, publication date, language, and book publisher.
d
Evolution of Religion and Morality Project Dataset (Wave 1)
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Purzycki, Benjamin; Apicella, Coren; Atkinson, Quentin; Cohen, Emma; Henrich, Joseph; McNamara, Rita; Norenzayan, Ara; Willard, Aiyana; Xygalatas, Dimitris (2023). Evolution of Religion and Morality Project Dataset (Wave 1) [Dataset]. http://doi.org/10.7910/DVN/RT5JTV
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/RT5JTV
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Purzycki, Benjamin; Apicella, Coren; Atkinson, Quentin; Cohen, Emma; Henrich, Joseph; McNamara, Rita; Norenzayan, Ara; Willard, Aiyana; Xygalatas, Dimitris
Description
This dataset includes demographic, behavioral, and religiosity data from eight different populations from around the world. The samples were drawn from: (1) Coastal and (2) Inland Tanna, Vanuatu; (3) Hadzaland, Tanzania; (4) Lovu, Fiji; (5) Pointe aux Piment, Mauritius; (6) Pesqueiro, Brazil; (7) Kyzyl, Tyva Republic; and (8) Yasawa, Fiji. The materials documents includes: a) a codebook for variable definitions, b) images of experimental conditions, and c) data set updates and corrigenda. Also included is a script for R that highlights analyses from Purzycki, et al. (2016). Moralistic Gods, Supernatural Punishment and the Expansion of Human Sociality. Nature, 530(7590): 327-330.
D
Data Collected During the Digital Humanities Project 'Dhimmis & Muslims -...
darus.uni-stuttgart.de
Updated Mar 16, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dorothea Weltecke; Steffen Koch; Ralph Barczok; Max Franke; Bernd Andreas Vest (2022). Data Collected During the Digital Humanities Project 'Dhimmis & Muslims - Analysing Multireligious Spaces in the Medieval Muslim World' [Dataset]. http://doi.org/10.18419/DARUS-2318
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.18419/DARUS-2318
Dataset updated
Mar 16, 2022
Dataset provided by
DaRUS
Authors
Dorothea Weltecke; Steffen Koch; Ralph Barczok; Max Franke; Bernd Andreas Vest
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 600 - Dec 31, 1400
Dataset funded by
VolkswagenFoundation
Description
This repository contains historical data collected in the digital humanities project Dhimmis & Muslims – Analysing Multireligious Spaces in the Medieval Muslim World. The project was funded by the VolkswagenFoundation within the scope of the Mixed Methods initiative. The project was a collaboration between the Institute for Medieval History II of the Goethe University in Frankfurt/Main, Germany, and the Institute for Visualization and Interactive Systems at the University of Stuttgart, and took place there from 2018 to 2021. The objective of this joint project was to develop a novel visualization approach in order to gain new insights on the multi-religious landscapes of the Middle East under Muslim rule during the Middle Ages (7th to 14th century). In particular, information on multi-religious communities were researched and made available in a database accessible through interactive visualization as well as through a pilot web-based geo-temporal multi-view system to analyze and compare information from multiple sources. The code for this visualization system is publicly available on GitHub under the MIT license. The data in this repository is a curated database dump containing data collected from a predetermined set of primary historical sources and literature. The core objective of the data entry was to record historical evidence for religious groups in cities of the Medieval Middle East. In the project, data was collected in a relational PostgreSQL database, the structure of which can be reconstructed from the file schema.sql. An entire database dump including both the database schema and the table contents is located in database.sql. The PDF file database-structure.pdf describes the relationship between tables in a graphical schematic. In the database.json file, the contents of the individual tables are stored in JSON format. At the top level, the JSON file is an object. Each table is stored as a key-value pair, where the key is the database name, and the value is an array of table records. Each table record is itself an object of key-value pairs, where the keys are the table columns, and the values are the corresponding values in the record. The dataset is centered around the evidence, which represents one piece of historical evidence as extracted from one or more sources. An evidence must contain a reference to a place and a religion, and may reference a person and one or more time spans. Instances are used to connect evidences to places, persons, and religions; and additional metadata are stored individually in the instances. Time instances are connected to the evidence via a time group to allow for more than one time span per evidence. An evidence is connected via one or more source instances to one or more sources. Evidences can also be tagged with one or more tags via the tag_evidence table. Places and persons have a type, which are defined in the place type and person type tables. Alternative names for places are stored in the name_var table with a reference to the respective language. For places and persons, references to URIs in other data collections (such as Syriaca.org or the Digital Atlas of the Roman Empire) are also stored, in the external_place_uri and external_person_uri tables. Rules for how to construct the URIs from the fragments stored in the last-mentioned tables are controlled via the uri_namespace and external_database tables. Part of the project was to extract historical evidence from digitized texts, via annotations. Annotations are placed in a document, which is a digital version of a source. An annotation can be one of the four instance types, thereby referencing a place, person, religion, or time group. A reference to the annotation is stored in the instance, and evidences are constructed from annotations by connecting the respective instances in an evidence tuple.
t
The Religion and State Project, Minorities Module, Round 2
thearda.com
Updated Jul 22, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jonathan Fox (2014). The Religion and State Project, Minorities Module, Round 2 [Dataset]. http://doi.org/10.17605/OSF.IO/RHC7G
Explore at:
Unique identifier
https://doi.org/10.17605/OSF.IO/RHC7G
Dataset updated
Jul 22, 2014
Dataset provided by
The Association of Religion Data Archives
Authors
Jonathan Fox
Dataset funded by
The John Templeton Foundation
The Sara and Simha Lainer Chair in Democracy and Civility
Israel Science Foundation
Description
This Religion and State-Minorities (RASM) dataset is supplemental to the Religion and State Round 2 (RAS2) dataset. It codes the RAS religious discrimination variable using the minority as the unit of analysis (RAS2 uses a country as the unit of analysis and, is a general measure of all discrimination in the country). RASM codes religious discrimination by governments against all 566 minorities in 175 countries which make a minimum population cut off. Any religious minority which is at least 0.25 percent of the population or has a population of at least 500,000 (in countries with populations of 200 million or more) are included. The dataset also includes all Christian minorities in Muslim countries and all Muslim minorities in Christian countries for a total of 597 minorities. The data cover 1990 to 2008 with yearly codings.

These religious discrimination variables are designed to examine restrictions the government places on the practice of religion by minority religious groups. It is important to clarify two points. First, these variables focus on restrictions on minority religions. Restrictions that apply to all religions are not coded in this set of variables. This is because the act of restricting or regulating the religious practices of minorities is qualitatively different from restricting or regulating all religions. Second, this set of variables focuses only on restrictions of the practice of religion itself or on religious institutions and does not include other types of restrictions on religious minorities. The reasoning behind this is that there is much more likely to be a religious motivation for restrictions on the practice of religion than there is for political, economic, or cultural restrictions on a religious minority. These secular types of restrictions, while potentially motivated by religion, also can be due to other reasons. That political, economic, and cultural restrictions are often placed on ethnic minorities who share the same religion and the majority group in their state is proof of this.

This set of variables is essentially a list of specific types of religious restrictions which a government may place on some or all minority religions. These variables are identical to those included in the RAS2 dataset, save that one is not included because it focuses on foreign missionaries and this set of variables focuses on minorities living in the country. Each of the items in this category is coded on the following scale:

0. The activity is not restricted or the government does not engage in this practice.
1. The activity is restricted slightly or sporadically or the government engages in a mild form of this practice or a severe form sporadically.
2. The activity is significantly restricted or the government engages in this activity often and on a large scale.

A composite version combining the variables to create a measure of religious discrimination against minority religions which ranges from 0 to 48 also is included.

ARDA Note: This file was revised on October 6, 2017. At the PIs request, we removed the variable reporting on the minority's percentage of a country's population after finding inconsistencies with the reported values. For detailed data on religious demographics, see the "/data-archive?fid=RCSREG2" Target="_blank">Religious Characteristics of States Dataset Project.
Largest Mosques
kaggle.com
zip
Updated Apr 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saquib Hussain (2024). Largest Mosques [Dataset]. https://www.kaggle.com/datasets/saquib7hussain/largest-mosques
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Apr 6, 2024
Authors
Saquib Hussain
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
All mosques from around the world by available capacity, that belong to any Islamic school or branch, that can accommodate at least 15,000 worshippers in all available places of prayer such as prayer halls (musala), courtyards (ṣaḥn) and porticoes (riwāq). All the mosques in this list are congregational mosques – a type of mosque that hosts the Friday prayer (ṣalāt al-jumuʿa) in congregation (jamāʿa).
g
Data from: Joint EVS/WVS 2017-2022 Dataset (Joint EVS/WVS)
search.gesis.org
eprints.soton.ac.uk
+3more
Updated Jun 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gedeshi, Ilir; Rotman, David; Pachulia, Merab; Poghosyan, Gevorg; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Pachulia, Merab; Wolf, Christof; Bréchon, Pierre; Voas, David; Rosta, Gergely; Rovati, Giancarlo; Jónsdóttir, Guðbjörg A.; Petkovska, Antoanela; Ziliukaite, Ruta; Reeskens, Tim; Jenssen, Anders T.; Komar, Olivera; Voicu, Bogdan; Soboleva, Natalia; Marody, Mirosława; Bešić, Miloš; Strapcová, Katarina; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Micó Ibáñez, Joan; Carballo, Marita; McAllister, Ian; Foa, Roberto Stefan (PI Bangladesh); Moreno Morales, Daniel E.; de Oliveira de Castro, Henrique Carlos; Lagos, Marta; Zhong, Yang; Casas, Andres (PI Colombia); Yesilada, Birol (PI Cyprus); Paez, Cristina; Abdel Latif, Abdel Hamid; Jennings, Will (PI Ethiopia); Welzel, Christian; Koniordos. Sokratis; Díaz Argueta, Julio César; Cheng, Edmund; Gravelle, Timothy (PI Indonesia); Stoker, Gerry; Dagher, Munqith; Yamazaki, Seiko; Braizat, Fares; Rakisheva, Botagoz; Bakaloff, Yuri; Haerpfer, Christian (PI Lebanon); Wing-yat Yu, Eilo; Lee, Grace; Moreno, Alejandro; Souvanlasy, Chansada; Perry, Paul; Denton, Carlos (PI Nicaragua); Puranen, Bi (PI Nigeria); Gilani, Bilal; Romero, Catalina; Guerrero, Linda; Hernández Acosta, Javier J.; Voicu, Bogdan; Zavadskaya, Margarita; Veskovic, Nino; Auh, Soo Young; Tsai, Ming-Chang; Olimov, Muzaffar; Bureekul, Thawilwadee; Ben Hafaiedh, Abdelwahab; Esmer, Yilmaz; Inglehart, Ronald; Depouilly, Xavier; Norris, Pippa (PI Zimbabwe); Balakireva, Olga; Lachapelle, Guy; Mathews, Mathew; Mieriņa, Inta; Manasyan, Heghine; Ekstroem, Anna M. (PI Kenya); Swehli, Nedal; Riyaz, Aminath; Tseveen, Tsetsenbileg; Abderebbi, Mhammed; Verhoeven, Piet; Briceno-Leon, Roberto; Moravec, Vaclav; Duffy, Bobby; Stoneman, Paul; Kosnac, Pavol; Zuasnabar, Ignacio; Kumar, Sanjay; Uzbekistan: not specified for security reasons (2024). Joint EVS/WVS 2017-2022 Dataset (Joint EVS/WVS) [Dataset]. http://doi.org/10.4232/1.14320
Explore at:
(13603141), (16565189)Available download formats
Unique identifier
https://doi.org/10.4232/1.14320
Dataset updated
Jun 24, 2024
Dataset provided by
GESIS search
GESIS
Authors
Gedeshi, Ilir; Rotman, David; Pachulia, Merab; Poghosyan, Gevorg; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Pachulia, Merab; Wolf, Christof; Bréchon, Pierre; Voas, David; Rosta, Gergely; Rovati, Giancarlo; Jónsdóttir, Guðbjörg A.; Petkovska, Antoanela; Ziliukaite, Ruta; Reeskens, Tim; Jenssen, Anders T.; Komar, Olivera; Voicu, Bogdan; Soboleva, Natalia; Marody, Mirosława; Bešić, Miloš; Strapcová, Katarina; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Micó Ibáñez, Joan; Carballo, Marita; McAllister, Ian; Foa, Roberto Stefan (PI Bangladesh); Moreno Morales, Daniel E.; de Oliveira de Castro, Henrique Carlos; Lagos, Marta; Zhong, Yang; Casas, Andres (PI Colombia); Yesilada, Birol (PI Cyprus); Paez, Cristina; Abdel Latif, Abdel Hamid; Jennings, Will (PI Ethiopia); Welzel, Christian; Koniordos. Sokratis; Díaz Argueta, Julio César; Cheng, Edmund; Gravelle, Timothy (PI Indonesia); Stoker, Gerry; Dagher, Munqith; Yamazaki, Seiko; Braizat, Fares; Rakisheva, Botagoz; Bakaloff, Yuri; Haerpfer, Christian (PI Lebanon); Wing-yat Yu, Eilo; Lee, Grace; Moreno, Alejandro; Souvanlasy, Chansada; Perry, Paul; Denton, Carlos (PI Nicaragua); Puranen, Bi (PI Nigeria); Gilani, Bilal; Romero, Catalina; Guerrero, Linda; Hernández Acosta, Javier J.; Voicu, Bogdan; Zavadskaya, Margarita; Veskovic, Nino; Auh, Soo Young; Tsai, Ming-Chang; Olimov, Muzaffar; Bureekul, Thawilwadee; Ben Hafaiedh, Abdelwahab; Esmer, Yilmaz; Inglehart, Ronald; Depouilly, Xavier; Norris, Pippa (PI Zimbabwe); Balakireva, Olga; Lachapelle, Guy; Mathews, Mathew; Mieriņa, Inta; Manasyan, Heghine; Ekstroem, Anna M. (PI Kenya); Swehli, Nedal; Riyaz, Aminath; Tseveen, Tsetsenbileg; Abderebbi, Mhammed; Verhoeven, Piet; Briceno-Leon, Roberto; Moravec, Vaclav; Duffy, Bobby; Stoneman, Paul; Kosnac, Pavol; Zuasnabar, Ignacio; Kumar, Sanjay; Uzbekistan: not specified for security reasons
License
https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms
Time period covered
Jan 18, 2017 - Jul 2, 2023
Variables measured
X001 - Sex, X003 - Age, wave - Wave, study - Study, gwght - Weight, year - Year survey, X002 - Year of birth, X007 - Marital status, E035 - Income equality, F050 - Believe in: God, and 221 more
Description
The European Values Study (EVS) and the World Values Survey (WVS) are two large-scale, cross-national and longitudinal survey research programmes. They include a large number of questions on moral, religious, social, political, occupational and family values which have been replicated since the early eighties.

Both organizations agreed to cooperate in joint data collection from 2017. EVS has been responsible for planning and conducting surveys in European countries, using the EVS questionnaire and EVS methodological guidelines. WVSA has been responsible for planning and conducting surveys in countries in the world outside Europe, using the WVS questionnaire and WVS methodological guidelines. Both organisations developed their draft master questionnaires independently. The joint items define the Common Core of both questionnaires.

The Joint EVS/WVS is constructed from the two EVS and WVS source datasets: - European Values Study 2017 Integrated Dataset (EVS 2017), ZA7500 Data file Version 5.0.0, doi:10.4232/1.13897 (https://doi.org/10.4232/1.13897). Haerpfer, C., Inglehart, R., Moreno,A., Welzel,C., Kizilova,K., Diez-Medrano J., M. Lagos, P. Norris, E. Ponarin & B. Puranen et al. (eds.). 2024. World Values Survey: Round Seven–Country-Pooled Datafile. Madrid, Spain & Vienna, Austria: JD Systems Institute & WVSA Secretariat. Version. 6.0.0, doi:10.14281/18241.24.
Z
IndQNER: Indonesian Benchmark Dataset from the Indonesian Translation of the...
data.niaid.nih.gov
Updated Jan 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gusmita, Ria Hari (2024). IndQNER: Indonesian Benchmark Dataset from the Indonesian Translation of the Quran [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7454891
Explore at:
Dataset updated
Jan 27, 2024
Dataset provided by
Firmansyah, Asep Fajar
Gusmita, Ria Hari
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IndQNER

IndQNER is a Named Entity Recognition (NER) benchmark dataset that was created by manually annotating 8 chapters in the Indonesian translation of the Quran. The annotation was performed using a web-based text annotation tool, Tagtog, and the BIO (Beginning-Inside-Outside) tagging format. The dataset contains:

3117 sentences

62027 tokens

2475 named entities

18 named entity categories

Named Entity Classes

The named entity classes were initially defined by analyzing the existing Quran concepts ontology. The initial classes were updated based on the information acquired during the annotation process. Finally, there are 20 classes, as follows:

Allah

Allah's Throne

Artifact

Astronomical body

Event

False deity

Holy book

Language

Angel

Person

Messenger

Prophet

Sentient

Afterlife location

Geographical location

Color

Religion

Food

Fruit

The book of Allah

Annotation Stage

There were eight annotators who contributed to the annotation process. They were informatics engineering students at the State Islamic University Syarif Hidayatullah Jakarta.

Anggita Maharani Gumay Putri

Muhammad Destamal Junas

Naufaldi Hafidhigbal

Nur Kholis Azzam Ubaidillah

Puspitasari

Septiany Nur Anggita

Wilda Nurjannah

William Santoso

Verification Stage

We found many named entity and class candidates during the annotation stage. To verify the candidates, we consulted Quran and Tafseer (content) experts who are lecturers at Quran and Tafseer Department at the State Islamic University Syarif Hidayatullah Jakarta.

Dr. Eva Nugraha, M.Ag.

Dr. Jauhar Azizy, MA

Dr. Lilik Ummi Kultsum, MA

Evaluation

We evaluated the annotation quality of IndQNER by performing experiments in two settings: supervised learning (BiLSTM+CRF) and transfer learning (IndoBERT fine-tuning).

Supervised Learning Setting

The implementation of BiLSTM and CRF utilized IndoBERT to provide word embeddings. All experiments used a batch size of 16. These are the results:

Maximum sequence length Number of e-poch Precision Recall F1 score

256 10 0.94 0.92 0.93

256 20 0.99 0.97 0.98

256 40 0.96 0.96 0.96

256 100 0.97 0.96 0.96

512 10 0.92 0.92 0.92

512 20 0.96 0.95 0.96

512 40 0.97 0.95 0.96

512 100 0.97 0.95 0.96

Transfer Learning Setting

We performed several experiments with different parameters in IndoBERT fine-tuning. All experiments used a learning rate of 2e-5 and a batch size of 16. These are the results:

Maximum sequence length Number of e-poch Precision Recall F1 score

256 10 0.67 0.65 0.65

256 20 0.60 0.59 0.59

256 40 0.75 0.72 0.71

256 100 0.73 0.68 0.68

512 10 0.72 0.62 0.64

512 20 0.62 0.57 0.58

512 40 0.72 0.66 0.67

512 100 0.68 0.68 0.67

This dataset is also part of the NusaCrowd project which aims to collect Natural Language Processing (NLP) datasets for Indonesian and its local languages.

How to Cite

@InProceedings{10.1007/978-3-031-35320-8_12,author="Gusmita, Ria Hariand Firmansyah, Asep Fajarand Moussallem, Diegoand Ngonga Ngomo, Axel-Cyrille",editor="M{\'e}tais, Elisabethand Meziane, Faridand Sugumaran, Vijayanand Manning, Warrenand Reiff-Marganiec, Stephan",title="IndQNER: Named Entity Recognition Benchmark Dataset from the Indonesian Translation of the Quran",booktitle="Natural Language Processing and Information Systems",year="2023",publisher="Springer Nature Switzerland",address="Cham",pages="170--185",abstract="Indonesian is classified as underrepresented in the Natural Language Processing (NLP) field, despite being the tenth most spoken language in the world with 198 million speakers. The paucity of datasets is recognized as the main reason for the slow advancements in NLP research for underrepresented languages. Significant attempts were made in 2020 to address this drawback for Indonesian. The Indonesian Natural Language Understanding (IndoNLU) benchmark was introduced alongside IndoBERT pre-trained language model. The second benchmark, Indonesian Language Evaluation Montage (IndoLEM), was presented in the same year. These benchmarks support several tasks, including Named Entity Recognition (NER). However, all NER datasets are in the public domain and do not contain domain-specific datasets. To alleviate this drawback, we introduce IndQNER, a manually annotated NER benchmark dataset in the religious domain that adheres to a meticulously designed annotation guideline. Since Indonesia has the world's largest Muslim population, we build the dataset from the Indonesian translation of the Quran. The dataset includes 2475 named entities representing 18 different classes. To assess the annotation quality of IndQNER, we perform experiments with BiLSTM and CRF-based NER, as well as IndoBERT fine-tuning. The results reveal that the first model outperforms the second model achieving 0.98 F1 points. This outcome indicates that IndQNER may be an acceptable evaluation metric for Indonesian NER tasks in the aforementioned domain, widening the research's domain range.",isbn="978-3-031-35320-8"}

Contact

If you have any questions or feedback, feel free to contact us at ria.hari.gusmita@uni-paderborn.de or ria.gusmita@uinjkt.ac.id
E
World Sites
ecaidata.org
Updated Oct 4, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECAI Clearinghouse (2014). World Sites [Dataset]. https://ecaidata.org/dataset/ecaiclearinghouse-id-269
Explore at:
Dataset updated
Oct 4, 2014
Dataset provided by
ECAI Clearinghouse
Area covered
World
Description
Initial data source was UNESCO web site, supplemented by individual work on different countires/regions;A database of cultural heritage sites assembled by volunteers at the Archaeological Computing Laboratory, University of Sydney;Database is now availabe online through ECAI and can be updated through a password-controlled web browser interface
India Census: Population: by Religion: Muslim: Urban
ceicdata.com
Updated Mar 15, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2023). India Census: Population: by Religion: Muslim: Urban [Dataset]. https://www.ceicdata.com/en/india/census-population-by-religion/census-population-by-religion-muslim-urban
Explore at:
Dataset updated
Mar 15, 2023
Dataset provided by
CEIC Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Mar 1, 2001 - Mar 1, 2011
Area covered
India
Variables measured
Population
Description
India Census: Population: by Religion: Muslim: Urban data was reported at 68,740,419.000 Person in 2011. This records an increase from the previous number of 49,393,496.000 Person for 2001. India Census: Population: by Religion: Muslim: Urban data is updated yearly, averaging 59,066,957.500 Person from Mar 2001 (Median) to 2011, with 2 observations. The data reached an all-time high of 68,740,419.000 Person in 2011 and a record low of 49,393,496.000 Person in 2001. India Census: Population: by Religion: Muslim: Urban data remains active status in CEIC and is reported by Census of India. The data is categorized under India Premium Database’s Demographic – Table IN.GAE001: Census: Population: by Religion.
P
THAR Dataset Dataset
paperswithcode.com
Updated Mar 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). THAR Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/thar-dataset
Explore at:
Dataset updated
Mar 22, 2024
Description
The increase in religiously motivated hate on social media is clear and ongoing. These platforms have become fertile ground for the dissemination of hate speech directed at religious communities, resulting in tangible repercussions in the real world. Much of the current research concerning the automated identification of hateful content on social media focuses on English-language content. There is comparatively less exploration in low-resource languages such as Hindi. As social media users increasingly utilize their regional languages for expression, it becomes crucial to dedicate appropriate research efforts to hate speech detection in these languages.

Hence, this work aims to fill this research void by introducing a meticulously curated and annotated dataset of YouTube comments in Hindi-English code-mixed language, specifically designed to identify instances of religious hate.

Citation: Sharma, D., Singh, A., & Singh, V. K. (2024). THAR-Targeted Hate Speech Against Religion: A high-quality Hindi-English code-mixed Dataset with the Application of Deep Learning Models for Automatic Detection. ACM Transactions on Asian and Low-Resource Language Information Processing. (https://doi.org/10.1145/3653017)
South and Southeast Asia Survey Dataset
pewresearch.org
Updated 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jonathan Evans (2024). South and Southeast Asia Survey Dataset [Dataset]. http://doi.org/10.58094/rf31-hd47
Explore at:
Unique identifier
https://doi.org/10.58094/rf31-hd47
Dataset updated
2024
Dataset provided by
Pew Research Centerhttp://pewresearch.org/
datacite
Authors
Jonathan Evans
License
https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/
Area covered
South East Asia, Asia
Dataset funded by
The Pew Charitable Trustshttps://www.pew.org/
John Templeton Foundationhttp://templeton.org/
Description
Pew Research Center conducted random, probability-based surveys among 13,122 adults (ages 18 and older) across six South and Southeast Asian countries: Cambodia, Indonesia, Malaysia, Singapore, Sri Lanka and Thailand. Interviewing was carried out under the direction of Langer Research Associates. In Malaysia and Singapore, interviews were conducted via computer-assisted telephone interviewing (CATI) using mobile phones. In Cambodia, Indonesia, Sri Lanka and Thailand, interviews were administered face-to-face using tablet devices, also known as computer-assisted personal interviewing (CAPI). All surveys were conducted between June 1 and Sept. 4, 2022.

This project was produced by Pew Research Center as part of the Pew-Templeton Global Religious Futures project, which analyzes religious change and its impact on societies around the world. Funding for the Global Religious Futures project comes from The Pew Charitable Trusts and the John Templeton Foundation (grant 61640). This publication does not necessarily reflect the views of the John Templeton Foundation.

As of July 2024, one report has been published that focuses on the findings from this data: Buddhism, Islam and Religious Pluralism in South and Southeast Asia: https://www.pewresearch.org/religion/2023/09/12/buddhism-islam-and-religious-pluralism-in-south-and-southeast-asia/
I
India Census: Population: by Religion: Hindu: Male
ceicdata.com
Updated May 28, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
India Census: Population: by Religion: Hindu: Male [Dataset]. https://www.ceicdata.com/en/india/census-population-by-religion/census-population-by-religion-hindu-male
Explore at:
Dataset updated
May 28, 2017
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Mar 1, 2001 - Mar 1, 2011
Area covered
India
Variables measured
Population
Description
India Census: Population: by Religion: Hindu: Male data was reported at 498,306,968.000 Person in 2011. This records an increase from the previous number of 428,678,554.000 Person for 2001. India Census: Population: by Religion: Hindu: Male data is updated yearly, averaging 463,492,761.000 Person from Mar 2001 (Median) to 2011, with 2 observations. The data reached an all-time high of 498,306,968.000 Person in 2011 and a record low of 428,678,554.000 Person in 2001. India Census: Population: by Religion: Hindu: Male data remains active status in CEIC and is reported by Census of India. The data is categorized under India Premium Database’s Demographic – Table IN.GAE001: Census: Population: by Religion.
n
International Data Base
neuinfo.org
dknet.org
+2more
Updated Feb 1, 2001
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2001). International Data Base [Dataset]. http://identifiers.org/RRID:SCR_013139
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_013139
Dataset updated
Feb 1, 2001
Description
A computerized data set of demographic, economic and social data for 227 countries of the world. Information presented includes population, health, nutrition, mortality, fertility, family planning and contraceptive use, literacy, housing, and economic activity data. Tabular data are broken down by such variables as age, sex, and urban/rural residence. Data are organized as a series of statistical tables identified by country and table number. Each record consists of the data values associated with a single row of a given table. There are 105 tables with data for 208 countries. The second file is a note file, containing text of notes associated with various tables. These notes provide information such as definitions of categories (i.e. urban/rural) and how various values were calculated. The IDB was created in the U.S. Census Bureau''s International Programs Center (IPC) to help IPC staff meet the needs of organizations that sponsor IPC research. The IDB provides quick access to specialized information, with emphasis on demographic measures, for individual countries or groups of countries. The IDB combines data from country sources (typically censuses and surveys) with IPC estimates and projections to provide information dating back as far as 1950 and as far ahead as 2050. Because the IDB is maintained as a research tool for IPC sponsor requirements, the amount of information available may vary by country. As funding and research activity permit, the IPC updates and expands the data base content. Types of data include: * Population by age and sex * Vital rates, infant mortality, and life tables * Fertility and child survivorship * Migration * Marital status * Family planning Data characteristics: * Temporal: Selected years, 1950present, projected demographic data to 2050. * Spatial: 227 countries and areas. * Resolution: National population, selected data by urban/rural * residence, selected data by age and sex. Sources of data include: * U.S. Census Bureau * International projects (e.g., the Demographic and Health Survey) * United Nations agencies Links: * ICPSR: http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/08490

Facebook

Twitter

Click to copy link

Link copied

Cite

The Association of Religion Data Archives, World Religion Project - Global Religion Dataset [Dataset]. http://doi.org/10.17605/OSF.IO/J7BCM

World Religion Project - Global Religion Dataset

Explore at:

93 scholarly articles cite this dataset (View in Google Scholar)

Unique identifier

https://doi.org/10.17605/OSF.IO/J7BCM

Dataset provided by

Association of Religion Data Archives

Dataset funded by

The University of California, Davis
The John Templeton Foundation

Description

The World Religion Project (WRP) aims to provide detailed information about religious adherence worldwide since 1945. It contains data about the number of adherents by religion in each of the states in the international system. These numbers are given for every half-decade period (1945, 1950, etc., through 2010). Percentages of the states' populations that practice a given religion are also provided. (Note: These percentages are expressed as decimals, ranging from 0 to 1, where 0 indicates that 0 percent of the population practices a given religion and 1 indicates that 100 percent of the population practices that religion.) Some of the religions (as detailed below) are divided into religious families. To the extent data are available, the breakdown of adherents within a given religion into religious families is also provided.

The project was developed in three stages. The first stage consisted of the formation of a religion tree. A religion tree is a systematic classification of major religions and of religious families within those major religions. To develop the religion tree we prepared a comprehensive literature review, the aim of which was (i) to define a religion, (ii) to find tangible indicators of a given religion of religious families within a major religion, and (iii) to identify existing efforts at classifying world religions. (Please see the original survey instrument to view the structure of the religion tree.) The second stage consisted of the identification of major data sources of religious adherence and the collection of data from these sources according to the religion tree classification. This created a dataset that included multiple records for some states for a given point in time. It also contained multiple missing data for specific states, specific time periods and specific religions. The third stage consisted of cleaning the data, reconciling discrepancies of information from different sources and imputing data for the missing cases.

The Global Religion Dataset: This dataset uses a religion-by-five-year unit. It aggregates the number of adherents of a given religion and religious group globally by five-year periods.

Clear search

Close search

Google apps

Main menu

World Religion Project - Global Religion Dataset

World Religions Across Regions

World Religions Across Regions

Analyzing Adherence Across Regions, States and the Global System

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

Dataset of Global Religious Composition Estimates for 2010 and 2020

World Sites (TimeMap Sample Dataset)

Religious composition of the world's migrants: Peru case study

World's Muslims Data Set, 2012

Census 2021 - Religion

Dataset of books called All one body : bishops of the Anglican Church speak...

Evolution of Religion and Morality Project Dataset (Wave 1)

Data Collected During the Digital Humanities Project 'Dhimmis & Muslims -...

The Religion and State Project, Minorities Module, Round 2

Largest Mosques

Data from: Joint EVS/WVS 2017-2022 Dataset (Joint EVS/WVS)

IndQNER: Indonesian Benchmark Dataset from the Indonesian Translation of the...

World Sites

India Census: Population: by Religion: Muslim: Urban

THAR Dataset Dataset

South and Southeast Asia Survey Dataset

India Census: Population: by Religion: Hindu: Male

International Data Base

World Religion Project - Global Religion DatasetSee More Versions

World Religion Project - Global Religion Dataset