36 datasets found
  1. World Religion Project - Global Religion Dataset

    • thearda.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Association of Religion Data Archives, World Religion Project - Global Religion Dataset [Dataset]. http://doi.org/10.17605/OSF.IO/J7BCM
    Explore at:
    Dataset provided by
    Association of Religion Data Archives
    Dataset funded by
    The University of California, Davis
    The John Templeton Foundation
    Description

    The World Religion Project (WRP) aims to provide detailed information about religious adherence worldwide since 1945. It contains data about the number of adherents by religion in each of the states in the international system. These numbers are given for every half-decade period (1945, 1950, etc., through 2010). Percentages of the states' populations that practice a given religion are also provided. (Note: These percentages are expressed as decimals, ranging from 0 to 1, where 0 indicates that 0 percent of the population practices a given religion and 1 indicates that 100 percent of the population practices that religion.) Some of the religions (as detailed below) are divided into religious families. To the extent data are available, the breakdown of adherents within a given religion into religious families is also provided.

    The project was developed in three stages. The first stage consisted of the formation of a religion tree. A religion tree is a systematic classification of major religions and of religious families within those major religions. To develop the religion tree we prepared a comprehensive literature review, the aim of which was (i) to define a religion, (ii) to find tangible indicators of a given religion of religious families within a major religion, and (iii) to identify existing efforts at classifying world religions. (Please see the original survey instrument to view the structure of the religion tree.) The second stage consisted of the identification of major data sources of religious adherence and the collection of data from these sources according to the religion tree classification. This created a dataset that included multiple records for some states for a given point in time. It also contained multiple missing data for specific states, specific time periods and specific religions. The third stage consisted of cleaning the data, reconciling discrepancies of information from different sources and imputing data for the missing cases.

    The Global Religion Dataset: This dataset uses a religion-by-five-year unit. It aggregates the number of adherents of a given religion and religious group globally by five-year periods.

  2. World Religions Across Regions

    • kaggle.com
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2022). World Religions Across Regions [Dataset]. https://www.kaggle.com/datasets/thedevastator/a-global-perspective-on-world-religions-1945-201/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    The Devastator
    Area covered
    World
    Description

    World Religions Across Regions

    Analyzing Adherence Across Regions, States and the Global System

    By Correlates of War Project [source]

    About this dataset

    The World Religion Project (WRP) is an ambitious endeavor to conduct a comprehensive analysis of religious adherence throughout the world from 1945 to 2010. This cutting-edge project offers unparalleled insight into the religious behavior of people in different countries, regions, and continents during this time period. Its datasets provide important information about the numbers and percentages of adherents across a multitude of different religions, religion families, and non-religious affiliations.

    The WRP consists of three distinct datasets: the national religion dataset, regional religion dataset, and global religion dataset. Each is focused on understanding individually specific realms for varied analysis approaches - from individual states to global systems. The national dataset provides data on number of adherents by state as well as percentage population practicing a given faith group in five-year increments; focusing attention to how this number evolves from nation to nation over time. Similarly, regional data is provided at five year intervals highlighting individual region designations with one modification – Pacific Ocean states have been reclassified into their own Oceania category according to Country Code Number 900 or above). Finally at a global level – all states are aggregated in order that we may understand a snapshot view at any five-year interval between 1945‐2010 regarding relationships between religions or religio‐families within one location or transnationally.

    This project was developed in three stages: firstly forming a religions tree (a systematic classification), secondly collecting data such as this provided by WRP according to that classification structure – lastly cleaning the data so discrepancies may be reconciled and imported where needed with gaps selected when unknown values were encountered during collection process . We would encourage anyone wishing details undergoing more detailed reading/analysis relating various use applications for these rich datasets - please contact Zeev Maoz (University California Davis) & Errol A Henderson _(Pennsylvania State University)

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    The World Religions Project (WRP) dataset offers a comprehensive look at religious adherence around the world within a single dataset. With this dataset, you can track global religious trends over a period of 65 years and explore how they’ve changed during that time. By exploring the WRP data set, you’ll gain insight into cross-regional and cross-time patterns in religious affiliation around the world.

    Research Ideas

    • Analyzing historical patterns of religious growth and decline across different regions
    • Creating visualizations to compare religious adherence in various states, countries, or globally
    • Studying the impact of governmental policies on religious participation over time

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.

    Columns

    File: WRP regional data.csv | Column name | Description | |:-----------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------| | Year | Reference year for data collection. (Integer) | | Region | World region according to Correlates Of War (COW) Regional Systemizations with one modification (Oceania category for COW country code ...

  3. Dataset of Global Religious Composition Estimates for 2010 and 2020

    • pewresearch.org
    Updated 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Conrad Hackett; Marcin Stonawski; Yunping Tong; Stephanie Kramer; Anne Fengyan Shi (2025). Dataset of Global Religious Composition Estimates for 2010 and 2020 [Dataset]. http://doi.org/10.58094/vhrw-k516
    Explore at:
    Dataset updated
    2025
    Dataset provided by
    Pew Research Centerhttp://pewresearch.org/
    datacite
    Authors
    Conrad Hackett; Marcin Stonawski; Yunping Tong; Stephanie Kramer; Anne Fengyan Shi
    License

    https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/

    Dataset funded by
    John Templeton Foundation
    Pew Charitable Trusts
    Description

    This dataset describes the world’s religious makeup in 2020 and 2010. We focus on seven categories: Christians, Muslims, Hindus, Buddhists, Jews, people who belong to other religions, and those who are religiously unaffiliated. This analysis is based on more than 2,700 sources of data, including national censuses, large-scale demographic surveys, general population surveys and population registers. For more information about this data, see the associated Pew Research Center report "How the Global Religious Landscape Changed From 2010 to 2020."

  4. E

    World Sites (TimeMap Sample Dataset)

    • ecaidata.org
    Updated Oct 4, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ECAI Clearinghouse (2014). World Sites (TimeMap Sample Dataset) [Dataset]. https://ecaidata.org/dataset/ecaiclearinghouse-id-12
    Explore at:
    Dataset updated
    Oct 4, 2014
    Dataset provided by
    ECAI Clearinghouse
    Area covered
    World
    Description

    Initial data source was UNESCO web site, supplemented by individual work on different countires/regions;A database of cultural heritage sites assembled by volunteers at the Archaeological Computing Laboratory, University of Sydney

  5. Religious composition of the world's migrants: Peru case study

    • pewresearch.org
    Updated 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anne Fengyan Shi; Yunping Tong; Stephanie Kramer (2024). Religious composition of the world's migrants: Peru case study [Dataset]. http://doi.org/10.58094/zk7y-q042
    Explore at:
    Dataset updated
    2024
    Dataset provided by
    Pew Research Centerhttp://pewresearch.org/
    datacite
    Authors
    Anne Fengyan Shi; Yunping Tong; Stephanie Kramer
    License

    https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/

    Area covered
    World
    Dataset funded by
    The Pew Charitable Trustshttps://www.pew.org/
    John Templeton Foundationhttp://templeton.org/
    Description

    This folder consists of files for a case study of the methods used by Pew Research Center to make direct and indirect estimates for our report on The Religious Composition of the World's Migrants. Two subfolders demonstrate the procedures of the algorithm using two statistical programs, which mirror one another.

  6. t

    World's Muslims Data Set, 2012

    • thearda.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    James Bell, World's Muslims Data Set, 2012 [Dataset]. http://doi.org/10.17605/OSF.IO/C2VE5
    Explore at:
    Dataset provided by
    The Association of Religion Data Archives
    Authors
    James Bell
    Dataset funded by
    The Pew Charitable Trusts
    The John Templeton Foundation
    Description

    "Between October 2011 and November 2012, Pew Research Center, with generous funding from The Pew Charitable Trusts and the John Templeton Foundation, conducted a public opinion survey involving more than 30,000 face-to-face interviews in 26 countries in Africa, Asia, the Middle East and Europe. The survey asked people to describe their religious beliefs and practices, and sought to gauge respondents; knowledge of and attitudes toward other faiths. It aimed to assess levels of political and economic satisfaction, concerns about crime, corruption and extremism, positions on issues such as abortion and polygamy, and views of democracy, religious law and the place of women in society.

    "Although the surveys were nationally representative in most countries, the primary goal of the survey was to gauge and compare beliefs and attitudes of Muslims. The findings for Muslim respondents are summarized in the Religion & Public Life Project's reports The World's Muslims: Unity and Diversity and The World's Muslims: Religion, Politics and Society, which are available at www.pewresearch.org. [...] This dataset only contains data for Muslim respondents in the countries surveyed. Please note that this codebook is meant as a guide to the dataset, and is not the survey questionnaire." (2012 Pew Religion Worlds Muslims Codebook)

  7. l

    Census 2021 - Religion

    • data.leicester.gov.uk
    csv, excel, json
    Updated May 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Census 2021 - Religion [Dataset]. https://data.leicester.gov.uk/explore/dataset/census-2021-leicester-religion/
    Explore at:
    csv, excel, jsonAvailable download formats
    Dataset updated
    May 25, 2023
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    The census is undertaken by the Office for National Statistics every 10 years and gives us a picture of all the people and households in England and Wales. The most recent census took place in March of 2021.The census asks every household questions about the people who live there and the type of home they live in. In doing so, it helps to build a detailed snapshot of society. Information from the census helps the government and local authorities to plan and fund local services, such as education, doctors' surgeries and roads.Key census statistics for Leicester are published on the open data platform to make information accessible to local services, voluntary and community groups, and residents.Further information about the census and full datasets can be found on the ONS website - https://www.ons.gov.uk/census/aboutcensus/censusproductsReligionThis dataset provides Census 2021 estimates that classify usual residents in England and Wales by religion. The estimates are as at Census Day, 21 March 2021.Definition: The religion people connect or identify with (their religious affiliation), whether or not they practice or have belief in it.This question was voluntary and the variable includes people who answered the question, including 'No Religion', alongside those who chose not to answer this question.This variable classifies responses into the eight tick-box response options. Write-in responses are classified by their "parent" religious affiliation, including 'No Religion', where applicable.This dataset contains details for Leicester City and England overall. There is also a dashboard that has been produced to show a selection of Census statistics for the city of Leicester which can be viewed here: Census 21 - Leicester dashboard.

  8. w

    Dataset of books called All one body : bishops of the Anglican Church speak...

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books called All one body : bishops of the Anglican Church speak of Christian faith and action in different parts of the world today [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=All+one+body+%3A+bishops+of+the+Anglican+Church+speak+of+Christian+faith+and+action+in+different+parts+of+the+world+today
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row and is filtered where the book is All one body : bishops of the Anglican Church speak of Christian faith and action in different parts of the world today. It features 7 columns including author, publication date, language, and book publisher.

  9. d

    Evolution of Religion and Morality Project Dataset (Wave 1)

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Purzycki, Benjamin; Apicella, Coren; Atkinson, Quentin; Cohen, Emma; Henrich, Joseph; McNamara, Rita; Norenzayan, Ara; Willard, Aiyana; Xygalatas, Dimitris (2023). Evolution of Religion and Morality Project Dataset (Wave 1) [Dataset]. http://doi.org/10.7910/DVN/RT5JTV
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Purzycki, Benjamin; Apicella, Coren; Atkinson, Quentin; Cohen, Emma; Henrich, Joseph; McNamara, Rita; Norenzayan, Ara; Willard, Aiyana; Xygalatas, Dimitris
    Description

    This dataset includes demographic, behavioral, and religiosity data from eight different populations from around the world. The samples were drawn from: (1) Coastal and (2) Inland Tanna, Vanuatu; (3) Hadzaland, Tanzania; (4) Lovu, Fiji; (5) Pointe aux Piment, Mauritius; (6) Pesqueiro, Brazil; (7) Kyzyl, Tyva Republic; and (8) Yasawa, Fiji. The materials documents includes: a) a codebook for variable definitions, b) images of experimental conditions, and c) data set updates and corrigenda. Also included is a script for R that highlights analyses from Purzycki, et al. (2016). Moralistic Gods, Supernatural Punishment and the Expansion of Human Sociality. Nature, 530(7590): 327-330.

  10. D

    Data Collected During the Digital Humanities Project 'Dhimmis & Muslims -...

    • darus.uni-stuttgart.de
    Updated Mar 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dorothea Weltecke; Steffen Koch; Ralph Barczok; Max Franke; Bernd Andreas Vest (2022). Data Collected During the Digital Humanities Project 'Dhimmis & Muslims - Analysing Multireligious Spaces in the Medieval Muslim World' [Dataset]. http://doi.org/10.18419/DARUS-2318
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 16, 2022
    Dataset provided by
    DaRUS
    Authors
    Dorothea Weltecke; Steffen Koch; Ralph Barczok; Max Franke; Bernd Andreas Vest
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 600 - Dec 31, 1400
    Dataset funded by
    VolkswagenFoundation
    Description

    This repository contains historical data collected in the digital humanities project Dhimmis & Muslims – Analysing Multireligious Spaces in the Medieval Muslim World. The project was funded by the VolkswagenFoundation within the scope of the Mixed Methods initiative. The project was a collaboration between the Institute for Medieval History II of the Goethe University in Frankfurt/Main, Germany, and the Institute for Visualization and Interactive Systems at the University of Stuttgart, and took place there from 2018 to 2021. The objective of this joint project was to develop a novel visualization approach in order to gain new insights on the multi-religious landscapes of the Middle East under Muslim rule during the Middle Ages (7th to 14th century). In particular, information on multi-religious communities were researched and made available in a database accessible through interactive visualization as well as through a pilot web-based geo-temporal multi-view system to analyze and compare information from multiple sources. The code for this visualization system is publicly available on GitHub under the MIT license. The data in this repository is a curated database dump containing data collected from a predetermined set of primary historical sources and literature. The core objective of the data entry was to record historical evidence for religious groups in cities of the Medieval Middle East. In the project, data was collected in a relational PostgreSQL database, the structure of which can be reconstructed from the file schema.sql. An entire database dump including both the database schema and the table contents is located in database.sql. The PDF file database-structure.pdf describes the relationship between tables in a graphical schematic. In the database.json file, the contents of the individual tables are stored in JSON format. At the top level, the JSON file is an object. Each table is stored as a key-value pair, where the key is the database name, and the value is an array of table records. Each table record is itself an object of key-value pairs, where the keys are the table columns, and the values are the corresponding values in the record. The dataset is centered around the evidence, which represents one piece of historical evidence as extracted from one or more sources. An evidence must contain a reference to a place and a religion, and may reference a person and one or more time spans. Instances are used to connect evidences to places, persons, and religions; and additional metadata are stored individually in the instances. Time instances are connected to the evidence via a time group to allow for more than one time span per evidence. An evidence is connected via one or more source instances to one or more sources. Evidences can also be tagged with one or more tags via the tag_evidence table. Places and persons have a type, which are defined in the place type and person type tables. Alternative names for places are stored in the name_var table with a reference to the respective language. For places and persons, references to URIs in other data collections (such as Syriaca.org or the Digital Atlas of the Roman Empire) are also stored, in the external_place_uri and external_person_uri tables. Rules for how to construct the URIs from the fragments stored in the last-mentioned tables are controlled via the uri_namespace and external_database tables. Part of the project was to extract historical evidence from digitized texts, via annotations. Annotations are placed in a document, which is a digital version of a source. An annotation can be one of the four instance types, thereby referencing a place, person, religion, or time group. A reference to the annotation is stored in the instance, and evidences are constructed from annotations by connecting the respective instances in an evidence tuple.

  11. t

    The Religion and State Project, Minorities Module, Round 2

    • thearda.com
    Updated Jul 22, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Fox (2014). The Religion and State Project, Minorities Module, Round 2 [Dataset]. http://doi.org/10.17605/OSF.IO/RHC7G
    Explore at:
    Dataset updated
    Jul 22, 2014
    Dataset provided by
    The Association of Religion Data Archives
    Authors
    Jonathan Fox
    Dataset funded by
    The John Templeton Foundation
    The Sara and Simha Lainer Chair in Democracy and Civility
    Israel Science Foundation
    Description

    This Religion and State-Minorities (RASM) dataset is supplemental to the Religion and State Round 2 (RAS2) dataset. It codes the RAS religious discrimination variable using the minority as the unit of analysis (RAS2 uses a country as the unit of analysis and, is a general measure of all discrimination in the country). RASM codes religious discrimination by governments against all 566 minorities in 175 countries which make a minimum population cut off. Any religious minority which is at least 0.25 percent of the population or has a population of at least 500,000 (in countries with populations of 200 million or more) are included. The dataset also includes all Christian minorities in Muslim countries and all Muslim minorities in Christian countries for a total of 597 minorities. The data cover 1990 to 2008 with yearly codings.

    These religious discrimination variables are designed to examine restrictions the government places on the practice of religion by minority religious groups. It is important to clarify two points. First, these variables focus on restrictions on minority religions. Restrictions that apply to all religions are not coded in this set of variables. This is because the act of restricting or regulating the religious practices of minorities is qualitatively different from restricting or regulating all religions. Second, this set of variables focuses only on restrictions of the practice of religion itself or on religious institutions and does not include other types of restrictions on religious minorities. The reasoning behind this is that there is much more likely to be a religious motivation for restrictions on the practice of religion than there is for political, economic, or cultural restrictions on a religious minority. These secular types of restrictions, while potentially motivated by religion, also can be due to other reasons. That political, economic, and cultural restrictions are often placed on ethnic minorities who share the same religion and the majority group in their state is proof of this.

    This set of variables is essentially a list of specific types of religious restrictions which a government may place on some or all minority religions. These variables are identical to those included in the RAS2 dataset, save that one is not included because it focuses on foreign missionaries and this set of variables focuses on minorities living in the country. Each of the items in this category is coded on the following scale:

    0. The activity is not restricted or the government does not engage in this practice.
    1. The activity is restricted slightly or sporadically or the government engages in a mild form of this practice or a severe form sporadically.
    2. The activity is significantly restricted or the government engages in this activity often and on a large scale.

    A composite version combining the variables to create a measure of religious discrimination against minority religions which ranges from 0 to 48 also is included.

    ARDA Note: This file was revised on October 6, 2017. At the PIs request, we removed the variable reporting on the minority's percentage of a country's population after finding inconsistencies with the reported values. For detailed data on religious demographics, see the "/data-archive?fid=RCSREG2" Target="_blank">Religious Characteristics of States Dataset Project.

  12. Largest Mosques

    • kaggle.com
    zip
    Updated Apr 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saquib Hussain (2024). Largest Mosques [Dataset]. https://www.kaggle.com/datasets/saquib7hussain/largest-mosques
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Apr 6, 2024
    Authors
    Saquib Hussain
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    All mosques from around the world by available capacity, that belong to any Islamic school or branch, that can accommodate at least 15,000 worshippers in all available places of prayer such as prayer halls (musala), courtyards (ṣaḥn) and porticoes (riwāq). All the mosques in this list are congregational mosques – a type of mosque that hosts the Friday prayer (ṣalāt al-jumuʿa) in congregation (jamāʿa).

  13. g

    Data from: Joint EVS/WVS 2017-2022 Dataset (Joint EVS/WVS)

    • search.gesis.org
    • eprints.soton.ac.uk
    • +3more
    Updated Jun 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gedeshi, Ilir; Rotman, David; Pachulia, Merab; Poghosyan, Gevorg; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Pachulia, Merab; Wolf, Christof; Bréchon, Pierre; Voas, David; Rosta, Gergely; Rovati, Giancarlo; Jónsdóttir, Guðbjörg A.; Petkovska, Antoanela; Ziliukaite, Ruta; Reeskens, Tim; Jenssen, Anders T.; Komar, Olivera; Voicu, Bogdan; Soboleva, Natalia; Marody, Mirosława; Bešić, Miloš; Strapcová, Katarina; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Micó Ibáñez, Joan; Carballo, Marita; McAllister, Ian; Foa, Roberto Stefan (PI Bangladesh); Moreno Morales, Daniel E.; de Oliveira de Castro, Henrique Carlos; Lagos, Marta; Zhong, Yang; Casas, Andres (PI Colombia); Yesilada, Birol (PI Cyprus); Paez, Cristina; Abdel Latif, Abdel Hamid; Jennings, Will (PI Ethiopia); Welzel, Christian; Koniordos. Sokratis; Díaz Argueta, Julio César; Cheng, Edmund; Gravelle, Timothy (PI Indonesia); Stoker, Gerry; Dagher, Munqith; Yamazaki, Seiko; Braizat, Fares; Rakisheva, Botagoz; Bakaloff, Yuri; Haerpfer, Christian (PI Lebanon); Wing-yat Yu, Eilo; Lee, Grace; Moreno, Alejandro; Souvanlasy, Chansada; Perry, Paul; Denton, Carlos (PI Nicaragua); Puranen, Bi (PI Nigeria); Gilani, Bilal; Romero, Catalina; Guerrero, Linda; Hernández Acosta, Javier J.; Voicu, Bogdan; Zavadskaya, Margarita; Veskovic, Nino; Auh, Soo Young; Tsai, Ming-Chang; Olimov, Muzaffar; Bureekul, Thawilwadee; Ben Hafaiedh, Abdelwahab; Esmer, Yilmaz; Inglehart, Ronald; Depouilly, Xavier; Norris, Pippa (PI Zimbabwe); Balakireva, Olga; Lachapelle, Guy; Mathews, Mathew; Mieriņa, Inta; Manasyan, Heghine; Ekstroem, Anna M. (PI Kenya); Swehli, Nedal; Riyaz, Aminath; Tseveen, Tsetsenbileg; Abderebbi, Mhammed; Verhoeven, Piet; Briceno-Leon, Roberto; Moravec, Vaclav; Duffy, Bobby; Stoneman, Paul; Kosnac, Pavol; Zuasnabar, Ignacio; Kumar, Sanjay; Uzbekistan: not specified for security reasons (2024). Joint EVS/WVS 2017-2022 Dataset (Joint EVS/WVS) [Dataset]. http://doi.org/10.4232/1.14320
    Explore at:
    (13603141), (16565189)Available download formats
    Dataset updated
    Jun 24, 2024
    Dataset provided by
    GESIS search
    GESIS
    Authors
    Gedeshi, Ilir; Rotman, David; Pachulia, Merab; Poghosyan, Gevorg; Kritzinger, Sylvia; Fotev, Georgy; Kolenović-Đapo, Jadranka; Baloban, Josip; Baloban, Stjepan; Rabušic, Ladislav; Frederiksen, Morten; Saar, Erki; Ketola, Kimmo; Pachulia, Merab; Wolf, Christof; Bréchon, Pierre; Voas, David; Rosta, Gergely; Rovati, Giancarlo; Jónsdóttir, Guðbjörg A.; Petkovska, Antoanela; Ziliukaite, Ruta; Reeskens, Tim; Jenssen, Anders T.; Komar, Olivera; Voicu, Bogdan; Soboleva, Natalia; Marody, Mirosława; Bešić, Miloš; Strapcová, Katarina; Uhan, Samo; Silvestre Cabrera, María; Wallman-Lundåsen, Susanne; Ernst Stähli, Michèle; Ramos, Alice; Micó Ibáñez, Joan; Carballo, Marita; McAllister, Ian; Foa, Roberto Stefan (PI Bangladesh); Moreno Morales, Daniel E.; de Oliveira de Castro, Henrique Carlos; Lagos, Marta; Zhong, Yang; Casas, Andres (PI Colombia); Yesilada, Birol (PI Cyprus); Paez, Cristina; Abdel Latif, Abdel Hamid; Jennings, Will (PI Ethiopia); Welzel, Christian; Koniordos. Sokratis; Díaz Argueta, Julio César; Cheng, Edmund; Gravelle, Timothy (PI Indonesia); Stoker, Gerry; Dagher, Munqith; Yamazaki, Seiko; Braizat, Fares; Rakisheva, Botagoz; Bakaloff, Yuri; Haerpfer, Christian (PI Lebanon); Wing-yat Yu, Eilo; Lee, Grace; Moreno, Alejandro; Souvanlasy, Chansada; Perry, Paul; Denton, Carlos (PI Nicaragua); Puranen, Bi (PI Nigeria); Gilani, Bilal; Romero, Catalina; Guerrero, Linda; Hernández Acosta, Javier J.; Voicu, Bogdan; Zavadskaya, Margarita; Veskovic, Nino; Auh, Soo Young; Tsai, Ming-Chang; Olimov, Muzaffar; Bureekul, Thawilwadee; Ben Hafaiedh, Abdelwahab; Esmer, Yilmaz; Inglehart, Ronald; Depouilly, Xavier; Norris, Pippa (PI Zimbabwe); Balakireva, Olga; Lachapelle, Guy; Mathews, Mathew; Mieriņa, Inta; Manasyan, Heghine; Ekstroem, Anna M. (PI Kenya); Swehli, Nedal; Riyaz, Aminath; Tseveen, Tsetsenbileg; Abderebbi, Mhammed; Verhoeven, Piet; Briceno-Leon, Roberto; Moravec, Vaclav; Duffy, Bobby; Stoneman, Paul; Kosnac, Pavol; Zuasnabar, Ignacio; Kumar, Sanjay; Uzbekistan: not specified for security reasons
    License

    https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms

    Time period covered
    Jan 18, 2017 - Jul 2, 2023
    Variables measured
    X001 - Sex, X003 - Age, wave - Wave, study - Study, gwght - Weight, year - Year survey, X002 - Year of birth, X007 - Marital status, E035 - Income equality, F050 - Believe in: God, and 221 more
    Description

    The European Values Study (EVS) and the World Values Survey (WVS) are two large-scale, cross-national and longitudinal survey research programmes. They include a large number of questions on moral, religious, social, political, occupational and family values which have been replicated since the early eighties.

    Both organizations agreed to cooperate in joint data collection from 2017. EVS has been responsible for planning and conducting surveys in European countries, using the EVS questionnaire and EVS methodological guidelines. WVSA has been responsible for planning and conducting surveys in countries in the world outside Europe, using the WVS questionnaire and WVS methodological guidelines. Both organisations developed their draft master questionnaires independently. The joint items define the Common Core of both questionnaires.

    The Joint EVS/WVS is constructed from the two EVS and WVS source datasets: - European Values Study 2017 Integrated Dataset (EVS 2017), ZA7500 Data file Version 5.0.0, doi:10.4232/1.13897 (https://doi.org/10.4232/1.13897). Haerpfer, C., Inglehart, R., Moreno,A., Welzel,C., Kizilova,K., Diez-Medrano J., M. Lagos, P. Norris, E. Ponarin & B. Puranen et al. (eds.). 2024. World Values Survey: Round Seven–Country-Pooled Datafile. Madrid, Spain & Vienna, Austria: JD Systems Institute & WVSA Secretariat. Version. 6.0.0, doi:10.14281/18241.24.

  14. Z

    IndQNER: Indonesian Benchmark Dataset from the Indonesian Translation of the...

    • data.niaid.nih.gov
    Updated Jan 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gusmita, Ria Hari (2024). IndQNER: Indonesian Benchmark Dataset from the Indonesian Translation of the Quran [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7454891
    Explore at:
    Dataset updated
    Jan 27, 2024
    Dataset provided by
    Firmansyah, Asep Fajar
    Gusmita, Ria Hari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IndQNER

    IndQNER is a Named Entity Recognition (NER) benchmark dataset that was created by manually annotating 8 chapters in the Indonesian translation of the Quran. The annotation was performed using a web-based text annotation tool, Tagtog, and the BIO (Beginning-Inside-Outside) tagging format. The dataset contains:

    3117 sentences

    62027 tokens

    2475 named entities

    18 named entity categories

    Named Entity Classes

    The named entity classes were initially defined by analyzing the existing Quran concepts ontology. The initial classes were updated based on the information acquired during the annotation process. Finally, there are 20 classes, as follows:

    Allah

    Allah's Throne

    Artifact

    Astronomical body

    Event

    False deity

    Holy book

    Language

    Angel

    Person

    Messenger

    Prophet

    Sentient

    Afterlife location

    Geographical location

    Color

    Religion

    Food

    Fruit

    The book of Allah

    Annotation Stage

    There were eight annotators who contributed to the annotation process. They were informatics engineering students at the State Islamic University Syarif Hidayatullah Jakarta.

    Anggita Maharani Gumay Putri

    Muhammad Destamal Junas

    Naufaldi Hafidhigbal

    Nur Kholis Azzam Ubaidillah

    Puspitasari

    Septiany Nur Anggita

    Wilda Nurjannah

    William Santoso

    Verification Stage

    We found many named entity and class candidates during the annotation stage. To verify the candidates, we consulted Quran and Tafseer (content) experts who are lecturers at Quran and Tafseer Department at the State Islamic University Syarif Hidayatullah Jakarta.

    Dr. Eva Nugraha, M.Ag.

    Dr. Jauhar Azizy, MA

    Dr. Lilik Ummi Kultsum, MA

    Evaluation

    We evaluated the annotation quality of IndQNER by performing experiments in two settings: supervised learning (BiLSTM+CRF) and transfer learning (IndoBERT fine-tuning).

    Supervised Learning Setting

    The implementation of BiLSTM and CRF utilized IndoBERT to provide word embeddings. All experiments used a batch size of 16. These are the results:

    Maximum sequence length Number of e-poch Precision Recall F1 score

    256 10 0.94 0.92 0.93

    256 20 0.99 0.97 0.98

    256 40 0.96 0.96 0.96

    256 100 0.97 0.96 0.96

    512 10 0.92 0.92 0.92

    512 20 0.96 0.95 0.96

    512 40 0.97 0.95 0.96

    512 100 0.97 0.95 0.96

    Transfer Learning Setting

    We performed several experiments with different parameters in IndoBERT fine-tuning. All experiments used a learning rate of 2e-5 and a batch size of 16. These are the results:

    Maximum sequence length Number of e-poch Precision Recall F1 score

    256 10 0.67 0.65 0.65

    256 20 0.60 0.59 0.59

    256 40 0.75 0.72 0.71

    256 100 0.73 0.68 0.68

    512 10 0.72 0.62 0.64

    512 20 0.62 0.57 0.58

    512 40 0.72 0.66 0.67

    512 100 0.68 0.68 0.67

    This dataset is also part of the NusaCrowd project which aims to collect Natural Language Processing (NLP) datasets for Indonesian and its local languages.

    How to Cite

    @InProceedings{10.1007/978-3-031-35320-8_12,author="Gusmita, Ria Hariand Firmansyah, Asep Fajarand Moussallem, Diegoand Ngonga Ngomo, Axel-Cyrille",editor="M{\'e}tais, Elisabethand Meziane, Faridand Sugumaran, Vijayanand Manning, Warrenand Reiff-Marganiec, Stephan",title="IndQNER: Named Entity Recognition Benchmark Dataset from the Indonesian Translation of the Quran",booktitle="Natural Language Processing and Information Systems",year="2023",publisher="Springer Nature Switzerland",address="Cham",pages="170--185",abstract="Indonesian is classified as underrepresented in the Natural Language Processing (NLP) field, despite being the tenth most spoken language in the world with 198 million speakers. The paucity of datasets is recognized as the main reason for the slow advancements in NLP research for underrepresented languages. Significant attempts were made in 2020 to address this drawback for Indonesian. The Indonesian Natural Language Understanding (IndoNLU) benchmark was introduced alongside IndoBERT pre-trained language model. The second benchmark, Indonesian Language Evaluation Montage (IndoLEM), was presented in the same year. These benchmarks support several tasks, including Named Entity Recognition (NER). However, all NER datasets are in the public domain and do not contain domain-specific datasets. To alleviate this drawback, we introduce IndQNER, a manually annotated NER benchmark dataset in the religious domain that adheres to a meticulously designed annotation guideline. Since Indonesia has the world's largest Muslim population, we build the dataset from the Indonesian translation of the Quran. The dataset includes 2475 named entities representing 18 different classes. To assess the annotation quality of IndQNER, we perform experiments with BiLSTM and CRF-based NER, as well as IndoBERT fine-tuning. The results reveal that the first model outperforms the second model achieving 0.98 F1 points. This outcome indicates that IndQNER may be an acceptable evaluation metric for Indonesian NER tasks in the aforementioned domain, widening the research's domain range.",isbn="978-3-031-35320-8"}

    Contact

    If you have any questions or feedback, feel free to contact us at ria.hari.gusmita@uni-paderborn.de or ria.gusmita@uinjkt.ac.id

  15. E

    World Sites

    • ecaidata.org
    Updated Oct 4, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ECAI Clearinghouse (2014). World Sites [Dataset]. https://ecaidata.org/dataset/ecaiclearinghouse-id-269
    Explore at:
    Dataset updated
    Oct 4, 2014
    Dataset provided by
    ECAI Clearinghouse
    Area covered
    World
    Description

    Initial data source was UNESCO web site, supplemented by individual work on different countires/regions;A database of cultural heritage sites assembled by volunteers at the Archaeological Computing Laboratory, University of Sydney;Database is now availabe online through ECAI and can be updated through a password-controlled web browser interface

  16. India Census: Population: by Religion: Muslim: Urban

    • ceicdata.com
    Updated Mar 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2023). India Census: Population: by Religion: Muslim: Urban [Dataset]. https://www.ceicdata.com/en/india/census-population-by-religion/census-population-by-religion-muslim-urban
    Explore at:
    Dataset updated
    Mar 15, 2023
    Dataset provided by
    CEIC Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2001 - Mar 1, 2011
    Area covered
    India
    Variables measured
    Population
    Description

    India Census: Population: by Religion: Muslim: Urban data was reported at 68,740,419.000 Person in 2011. This records an increase from the previous number of 49,393,496.000 Person for 2001. India Census: Population: by Religion: Muslim: Urban data is updated yearly, averaging 59,066,957.500 Person from Mar 2001 (Median) to 2011, with 2 observations. The data reached an all-time high of 68,740,419.000 Person in 2011 and a record low of 49,393,496.000 Person in 2001. India Census: Population: by Religion: Muslim: Urban data remains active status in CEIC and is reported by Census of India. The data is categorized under India Premium Database’s Demographic – Table IN.GAE001: Census: Population: by Religion.

  17. P

    THAR Dataset Dataset

    • paperswithcode.com
    Updated Mar 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). THAR Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/thar-dataset
    Explore at:
    Dataset updated
    Mar 22, 2024
    Description

    The increase in religiously motivated hate on social media is clear and ongoing. These platforms have become fertile ground for the dissemination of hate speech directed at religious communities, resulting in tangible repercussions in the real world. Much of the current research concerning the automated identification of hateful content on social media focuses on English-language content. There is comparatively less exploration in low-resource languages such as Hindi. As social media users increasingly utilize their regional languages for expression, it becomes crucial to dedicate appropriate research efforts to hate speech detection in these languages.

    Hence, this work aims to fill this research void by introducing a meticulously curated and annotated dataset of YouTube comments in Hindi-English code-mixed language, specifically designed to identify instances of religious hate.

    Citation: Sharma, D., Singh, A., & Singh, V. K. (2024). THAR-Targeted Hate Speech Against Religion: A high-quality Hindi-English code-mixed Dataset with the Application of Deep Learning Models for Automatic Detection. ACM Transactions on Asian and Low-Resource Language Information Processing. (https://doi.org/10.1145/3653017)

  18. South and Southeast Asia Survey Dataset

    • pewresearch.org
    Updated 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Evans (2024). South and Southeast Asia Survey Dataset [Dataset]. http://doi.org/10.58094/rf31-hd47
    Explore at:
    Dataset updated
    2024
    Dataset provided by
    Pew Research Centerhttp://pewresearch.org/
    datacite
    Authors
    Jonathan Evans
    License

    https://www.pewresearch.org/about/terms-and-conditions/https://www.pewresearch.org/about/terms-and-conditions/

    Area covered
    South East Asia, Asia
    Dataset funded by
    The Pew Charitable Trustshttps://www.pew.org/
    John Templeton Foundationhttp://templeton.org/
    Description

    Pew Research Center conducted random, probability-based surveys among 13,122 adults (ages 18 and older) across six South and Southeast Asian countries: Cambodia, Indonesia, Malaysia, Singapore, Sri Lanka and Thailand. Interviewing was carried out under the direction of Langer Research Associates. In Malaysia and Singapore, interviews were conducted via computer-assisted telephone interviewing (CATI) using mobile phones. In Cambodia, Indonesia, Sri Lanka and Thailand, interviews were administered face-to-face using tablet devices, also known as computer-assisted personal interviewing (CAPI). All surveys were conducted between June 1 and Sept. 4, 2022.

    This project was produced by Pew Research Center as part of the Pew-Templeton Global Religious Futures project, which analyzes religious change and its impact on societies around the world. Funding for the Global Religious Futures project comes from The Pew Charitable Trusts and the John Templeton Foundation (grant 61640). This publication does not necessarily reflect the views of the John Templeton Foundation.

    As of July 2024, one report has been published that focuses on the findings from this data: Buddhism, Islam and Religious Pluralism in South and Southeast Asia: https://www.pewresearch.org/religion/2023/09/12/buddhism-islam-and-religious-pluralism-in-south-and-southeast-asia/

  19. I

    India Census: Population: by Religion: Hindu: Male

    • ceicdata.com
    Updated May 28, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    India Census: Population: by Religion: Hindu: Male [Dataset]. https://www.ceicdata.com/en/india/census-population-by-religion/census-population-by-religion-hindu-male
    Explore at:
    Dataset updated
    May 28, 2017
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2001 - Mar 1, 2011
    Area covered
    India
    Variables measured
    Population
    Description

    India Census: Population: by Religion: Hindu: Male data was reported at 498,306,968.000 Person in 2011. This records an increase from the previous number of 428,678,554.000 Person for 2001. India Census: Population: by Religion: Hindu: Male data is updated yearly, averaging 463,492,761.000 Person from Mar 2001 (Median) to 2011, with 2 observations. The data reached an all-time high of 498,306,968.000 Person in 2011 and a record low of 428,678,554.000 Person in 2001. India Census: Population: by Religion: Hindu: Male data remains active status in CEIC and is reported by Census of India. The data is categorized under India Premium Database’s Demographic – Table IN.GAE001: Census: Population: by Religion.

  20. n

    International Data Base

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Feb 1, 2001
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2001). International Data Base [Dataset]. http://identifiers.org/RRID:SCR_013139
    Explore at:
    Dataset updated
    Feb 1, 2001
    Description

    A computerized data set of demographic, economic and social data for 227 countries of the world. Information presented includes population, health, nutrition, mortality, fertility, family planning and contraceptive use, literacy, housing, and economic activity data. Tabular data are broken down by such variables as age, sex, and urban/rural residence. Data are organized as a series of statistical tables identified by country and table number. Each record consists of the data values associated with a single row of a given table. There are 105 tables with data for 208 countries. The second file is a note file, containing text of notes associated with various tables. These notes provide information such as definitions of categories (i.e. urban/rural) and how various values were calculated. The IDB was created in the U.S. Census Bureau''s International Programs Center (IPC) to help IPC staff meet the needs of organizations that sponsor IPC research. The IDB provides quick access to specialized information, with emphasis on demographic measures, for individual countries or groups of countries. The IDB combines data from country sources (typically censuses and surveys) with IPC estimates and projections to provide information dating back as far as 1950 and as far ahead as 2050. Because the IDB is maintained as a research tool for IPC sponsor requirements, the amount of information available may vary by country. As funding and research activity permit, the IPC updates and expands the data base content. Types of data include: * Population by age and sex * Vital rates, infant mortality, and life tables * Fertility and child survivorship * Migration * Marital status * Family planning Data characteristics: * Temporal: Selected years, 1950present, projected demographic data to 2050. * Spatial: 227 countries and areas. * Resolution: National population, selected data by urban/rural * residence, selected data by age and sex. Sources of data include: * U.S. Census Bureau * International projects (e.g., the Demographic and Health Survey) * United Nations agencies Links: * ICPSR: http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/08490

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
The Association of Religion Data Archives, World Religion Project - Global Religion Dataset [Dataset]. http://doi.org/10.17605/OSF.IO/J7BCM
Organization logo

World Religion Project - Global Religion Dataset

Explore at:
93 scholarly articles cite this dataset (View in Google Scholar)
Dataset provided by
Association of Religion Data Archives
Dataset funded by
The University of California, Davis
The John Templeton Foundation
Description

The World Religion Project (WRP) aims to provide detailed information about religious adherence worldwide since 1945. It contains data about the number of adherents by religion in each of the states in the international system. These numbers are given for every half-decade period (1945, 1950, etc., through 2010). Percentages of the states' populations that practice a given religion are also provided. (Note: These percentages are expressed as decimals, ranging from 0 to 1, where 0 indicates that 0 percent of the population practices a given religion and 1 indicates that 100 percent of the population practices that religion.) Some of the religions (as detailed below) are divided into religious families. To the extent data are available, the breakdown of adherents within a given religion into religious families is also provided.

The project was developed in three stages. The first stage consisted of the formation of a religion tree. A religion tree is a systematic classification of major religions and of religious families within those major religions. To develop the religion tree we prepared a comprehensive literature review, the aim of which was (i) to define a religion, (ii) to find tangible indicators of a given religion of religious families within a major religion, and (iii) to identify existing efforts at classifying world religions. (Please see the original survey instrument to view the structure of the religion tree.) The second stage consisted of the identification of major data sources of religious adherence and the collection of data from these sources according to the religion tree classification. This created a dataset that included multiple records for some states for a given point in time. It also contained multiple missing data for specific states, specific time periods and specific religions. The third stage consisted of cleaning the data, reconciling discrepancies of information from different sources and imputing data for the missing cases.

The Global Religion Dataset: This dataset uses a religion-by-five-year unit. It aggregates the number of adherents of a given religion and religious group globally by five-year periods.

Search
Clear search
Close search
Google apps
Main menu