A cross-national data set of 21 variables was assembled for 212 countries from three sources (Barro and Lee 1994; Gordon 2005; CIA World Fact Book 2005). Our data set includes several proxy measures for national wealth, cultural diversity, social instability (both at national and international levels), and demography. Separate diversity measures were calculated for three different cultural domains, namely language, religion and ethnic groups . In addition, wealth variables (per capita GDP, and GINI, the coefficient of income inequality) were assembled, along with indicators of societal functioning drawn from the literature (especially Barro and Lee 1994), including indices of political rights (PRIGHTSB), revolutions and coups d'états (REVCOUP), and political instability (PINSTAB). Measures of international conflict were extracted from the social science literature, and the following were used: the proportion of the time between 1960-85 the country was involved in an external war (WARTIME), the number of international disputes in which the country was involved (TOTINTDISP), and an index of total military expenditure (TOTMILITEXP). Possible confounding variables such as population size (POPSIZE) and the number of international borders (NBINTBORDERS) were also included.
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
This dataset shows the data corresponding to the cultural and social aspect of the Democratic Republic of the Congo with ethnics, tribes, the spoken languages, the culinary culture, .....
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Data on ethnic or cultural origin by gender and age for the population in private households in Canada, provinces and territories, census metropolitan areas, census agglomerations and parts.
https://www.statcan.gc.ca/eng/reference/licencehttps://www.statcan.gc.ca/eng/reference/licence
Statistics Canada Census Data from 2021. This dataset includes the ethnic or cultural origin data for the male population provided by Statistics Canada joined with the census tracts. Each topic covered by the census was exported as a separate table. Each table contains the total, male, and female characteristics as fields for each census tract. Topics range from population, age and sex, immigration, language, family and households, income, education, and labour. For more information on definitions of terms used in the tables and other notes, refer to Statistics Canada's 2021 Census.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These data were assembled to study the causes of domestic conflicts. They are based on a project of the University of Illinois (Social, Political, Economic Event Database, SPEED), which leverages six decades of journalist output (1945-2005), and contains 62,141 newsworthy disruptive events worldwide. It also distinguishes between events rooted in class-based conflicts and anti-government sentiment, a total of eight "origins", in addition to socio-cultural animosities (SCA). We explored the dataset to assess the impact of communal identities as compared to other cleavages that unavoidably emerge in all societies. We added background variables from other open-source databases, such as Quality of Government, and Varieties of Democracy. The data merging was carried out in two ways: (i) adding country features to the original event-level SPEED data; and (ii) adding aggregated SPEED information to a country-level dataset compiled from the other sources. Thus two compiled datasets are deposited. The original sources, all open access data, are listed below with their original web addresses. SPEED Global Random Sample 1945-2005 ("spp_public.xls”), Cline Center for Advanced Social Research, University of Illinois, data produced within the frames of the Social, Political, and Economic Event Database (SPEED) Project, https://clinecenter.illinois.edu/project/human-loop-event-data-projects/SPEED. (The exact location of the Excel file used is at https://uofi.app.box.com/s/l2qc5rnn7tjpbwodhi6jc50fk2vlwjbv.) Historical Index of Ethnic Fractionalization (HIEF) data. The fractionalization indexes were calculated by Lenka Drazanova (https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.16), based on the ethnic distribution data compiled by the Composition of Religious and Ethnic Groups (CREG) Project, also of the Cline Center for Advanced Social Research, https://clinecenter.illinois.edu/project/Religious-Ethnic-Identity/composition-religious-and-ethnic-.... Teorell, Jan, Aksel Sundström, Sören Holmberg, Bo Rothstein, Natalia Alvarado Pachon, Cem Mert Dalli & Yente Meijers. 2023. The Quality of Government Standard Dataset, version Jan23. University of Gothenburg: The Quality of Government Institute, https://www.gu.se/en/quality-government doi:10.18157/qogstdjan23. We used the time series version. Coppedge, Michael, John Gerring, Carl Henrik Knutsen, Staffan I. Lindberg, Jan Teorell, David Altman, Michael Bernhard, Agnes Cornell, M. Steven Fish, Lisa Gastaldi, Haakon Gjerløw, Adam Glynn, Ana Good God, Sandra Grahn, Allen Hicken, Katrin Kinzelbach, Joshua Krusell, Kyle L. Marquardt, Kelly McMann, Valeriya Mechkova, Juraj Medzihorsky, Natalia Natsika, Anja Neundorf, Pamela Paxton, Daniel Pemstein, Josefine Pernes, Oskar Rydén, Johannes von Römer, Brigitte Seim, Rachel Sigman, Svend-Erik Skaaning, Jeffrey Staton, Aksel Sundström, Eitan Tzelgov, Yi-ting Wang, Tore Wig, Steven Wilson and Daniel Ziblatt. 2023. "V-Dem [Country-Year] Dataset v13" Varieties of Democracy (V-Dem) Project. https://doi.org/10.23696/vdemds23.
This table is part of a series of tables that present a portrait of Canada based on the various census topics. The tables range in complexity and levels of geography. Content varies from a simple overview of the country to complex cross-tabulations; the tables may also cover several censuses.
This table provides statistical information about people in Canada by their demographic, social and economic characteristics as well as provide information about the housing units in which they live.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 1 row and is filtered where the book is Cultural awareness in the human services : a multi-ethnic approach. It features 7 columns including author, publication date, language, and book publisher.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This study constructed a dataset of online media in Gansu Province from 2013 to 2022, with data from six major online media platforms in Linxia Hui Autonomous Prefecture and Gannan Tibetan Autonomous Prefecture, including Linxia Prefecture Government Website, Ethnic Daily, China Linxia Website, Shambhala Online, and China Gannan Website. The dataset covers a wide range of social, cultural, and linguistic aspects of the ethnic areas in Gansu, spanning a decade, and all the data are Chinese-language news reports and commentaries. Neologism extraction was carried out for each year's dataset, and the extracted neologisms were analyzed for their characteristics in terms of word frequency, lexicality, word number, cohesion, degrees of freedom, and neologism probability. The dataset was constructed with strict quality control measures, including manual proofreading, noise filtering, de-emphasis processing and language annotation, to ensure the accuracy and completeness of the data. This dataset is an important basic data for the study of language use, social and cultural dynamics and bilingual education development in ethnic areas, and has the value of being widely used in policy analysis, social opinion monitoring and language policy research.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
We show that cultural and ethnolinguistic diversity on their own are not enough to describe ethnic political organization, but that co-ethnics need to reliably use ethnicity as a signal of cultural alignment. Using Benin and Senegal as a case study, we show that the overlap between cultural fractionalization and ethnolinguistic fractionalization in the two countries are statistically different from one another. Evidence from 2000 simulations and the Komolgrov-Smirnov test suggests that the degree to which cultural and ethnolinguistic diversity overlap serves as a first step in explaining why we observe political organization around ethnicity in Benin and not in Senegal--even though the two have statistically indistinguishable levels of ethnolinguistic and cultural diversity. This work informs the broader question of why ethnic politics emerge in some ethnically diverse settings and not in others.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
SAP2022T2T2ED - Usually Resident Population by Ethnic or Cultural Background. Published by Central Statistics Office. Available under the license Creative Commons Attribution 4.0 (CC-BY-4.0).Usually Resident Population by Ethnic or Cultural Background...
Data on visible minority by ethnic or cultural origin, age and gender for the population in private households in Canada, provinces and territories, census metropolitan areas, census agglomerations and parts.
Statistics Canada Census Data from 2021. This dataset includes the ethnic or cultural origin data for the female population provided by Statistics Canada joined with the census tracts. Each topic covered by the census was exported as a separate table. Each table contains the total, male, and female characteristics as fields for each census tract. Topics range from population, age and sex, immigration, language, family and households, income, education, and labour. For more information on definitions of terms used in the tables and other notes, refer to Statistics Canada's 2021 Census.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Columbus population by race and ethnicity. The dataset can be utilized to understand the racial distribution of Columbus.
The dataset will have the following datasets when applicable
Please note that in case when either of Hispanic or Non-Hispanic population doesnt exist, the respective dataset will not be available (as there will not be a population subset applicable for the same)
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population Aged 3 Years and Over Usually Resident and Present in the State (Number) by Ethnic or Cultural Background, Irish Speakers and Non-Irish Speakers, CensusYear and Age Group
View data using web pages
Download .px file (Software required)
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Dataset Card for SwitchLingua_text
Dataset Summary
SwitchLingua is a comprehensive multilingual and multicultural code-switching dataset designed to advance research in automatic speech recognition, natural language processing, and conversational AI. The textual data for SwitchLingua was first generated using the proposed LinguaMaster framework, and the audio data was recorded by 174 bilingual speakers from diverse linguistic and cultural backgrounds to ensure high… See the full description on the dataset page: https://huggingface.co/datasets/Shelton1013/SwitchLingua_text.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This table is part of a series of tables that present a portrait of Canada based on the various census topics. The tables range in complexity and levels of geography. Content varies from a simple overview of the country to complex cross-tabulations; the tables may also cover several censuses.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Data on long-form data quality indicators for 2021 Census ethnic or cultural origin, population group and religion content, Canada, provinces and territories, census metropolitan areas, census agglomerations and census subdivisions.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Data on ethnic or cultural origin by generation status, age and gender for the population in private households in Canada, provinces and territories, census metropolitan areas, census agglomerations and parts.
Please be advised that there are issues with the Small Area boundary dataset generalised to 20m which affect Small Area 268014010 in Ballygall D, Dublin City. The Small Area boundary dataset generalised to 20m is in the process of being revised and the updated datasets will be available as soon as the boundaries are amended. This feature layer was created using Census 2016 data produced by the Central Statistics Office (CSO) and Small Areas national boundary data (generalised to 20m) produced by Tailte Éireann. The layer represents Census 2016 theme 2.2, the population usually resident in Ireland by ethnic or cultural background. Attributes include population breakdown by ethnicity or cultural background (e.g. Asian or Asian Irish, White Irish). Census 2016 theme 2 represents Migration, Ethnicity and Religion. The Census is carried out every five years by the CSO to determine an account of every person in Ireland. The results provide information on a range of themes, such as, population, housing and education. The data were sourced from the CSO.The Small Area Boundaries were created with the following credentials. National boundary dataset. Consistent sub-divisions of an ED. Created not to cross some natural features. Defined area with a minimum number of GeoDirectory building address points. Defined area initially created with minimum of 65 – approx. average of around 90 residential address points. Generated using two bespoke algorithms which incorporated the ED and Townland boundaries, ortho-photography, large scale vector data and GeoDirectory data. Before the 2011 census they were split in relation to motorways and dual carriageways. After the census some boundaries were merged and other divided to maintain privacy of the residential area occupants. They are available as generalised and non generalised boundary sets.
A cross-national data set of 21 variables was assembled for 212 countries from three sources (Barro and Lee 1994; Gordon 2005; CIA World Fact Book 2005). Our data set includes several proxy measures for national wealth, cultural diversity, social instability (both at national and international levels), and demography. Separate diversity measures were calculated for three different cultural domains, namely language, religion and ethnic groups . In addition, wealth variables (per capita GDP, and GINI, the coefficient of income inequality) were assembled, along with indicators of societal functioning drawn from the literature (especially Barro and Lee 1994), including indices of political rights (PRIGHTSB), revolutions and coups d'états (REVCOUP), and political instability (PINSTAB). Measures of international conflict were extracted from the social science literature, and the following were used: the proportion of the time between 1960-85 the country was involved in an external war (WARTIME), the number of international disputes in which the country was involved (TOTINTDISP), and an index of total military expenditure (TOTMILITEXP). Possible confounding variables such as population size (POPSIZE) and the number of international borders (NBINTBORDERS) were also included.