24 datasets found

Number of native Spanish speakers worldwide 2024, by country
statista.com
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Number of native Spanish speakers worldwide 2024, by country [Dataset]. https://www.statista.com/statistics/991020/number-native-spanish-speakers-country-worldwide/
Explore at:
Dataset updated
Jan 15, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
World
Description
Mexico is the country with the largest number of native Spanish speakers in the world. As of 2024, 132.5 million people in Mexico spoke Spanish with a native command of the language. Colombia was the nation with the second-highest number of native Spanish speakers, at around 52.7 million. Spain came in third, with 48 million, and Argentina fourth, with 46 million. Spanish, a world language As of 2023, Spanish ranked as the fourth most spoken language in the world, only behind English, Chinese, and Hindi, with over half a billion speakers. Spanish is the official language of over 20 countries, the majority on the American continent, nonetheless, it's also one of the official languages of Equatorial Guinea in Africa. Other countries have a strong influence, like the United States, Morocco, or Brazil, countries included in the list of non-Hispanic countries with the highest number of Spanish speakers. The second most spoken language in the U.S. In the most recent data, Spanish ranked as the language, other than English, with the highest number of speakers, with 12 times more speakers as the second place. Which comes to no surprise following the long history of migrations from Latin American countries to the Northern country. Moreover, only during the fiscal year 2022. 5 out of the top 10 countries of origin of naturalized people in the U.S. came from Spanish-speaking countries.
Census Data - Languages spoken in Chicago, 2008 – 2012
data.cityofchicago.org
healthdata.gov
+4more
csv, xlsx, xml
Updated Sep 12, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2014). Census Data - Languages spoken in Chicago, 2008 – 2012 [Dataset]. https://data.cityofchicago.org/Health-Human-Services/Census-Data-Languages-spoken-in-Chicago-2008-2012/a2fk-ec6q
Explore at:
xlsx, xml, csvAvailable download formats
Dataset updated
Sep 12, 2014
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
U.S. Census Bureau
Area covered
Chicago
Description
This dataset contains estimates of the number of residents aged 5 years or older in Chicago who “speak English less than very well,” by the non-English language spoken at home and community area of residence, for the years 2008 – 2012. See the full dataset description for more information at: https://data.cityofchicago.org/api/views/fpup-mc9v/files/dK6ZKRQZJ7XEugvUavf5MNrGNW11AjdWw0vkpj9EGjg?download=true&filename=P:\EPI\OEPHI\MATERIALS\REFERENCES\ECONOMIC_INDICATORS\Dataset_Description_Languages_2012_FOR_PORTAL_ONLY.pdf
Ranking of languages spoken at home in the U.S. 2023
statista.com
Updated Apr 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Ranking of languages spoken at home in the U.S. 2023 [Dataset]. https://www.statista.com/statistics/183483/ranking-of-languages-spoken-at-home-in-the-us-in-2008/
Explore at:
Dataset updated
Apr 14, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2023
Area covered
United States
Description
In 2023, around 43.37 million people in the United States spoke Spanish at home. In comparison, approximately 998,179 people were speaking Russian at home during the same year. The distribution of the U.S. population by ethnicity can be accessed here. A ranking of the most spoken languages across the world can be accessed here.
n
Data from: Language Spoken at Home
linc.osbm.nc.gov
ncosbm.opendatasoft.com
csv, excel, geojson +1
Updated Oct 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Language Spoken at Home [Dataset]. https://linc.osbm.nc.gov/explore/dataset/language-spoken-at-home/
Explore at:
geojson, csv, json, excelAvailable download formats
Dataset updated
Oct 3, 2024
Description
Language spoken at home and the ability to speak English for the population age 5 and over as reported by the US Census Bureau's, American Community Survey (ACS) 5-year estimates table C16001.
The most spoken languages worldwide 2025
statista.com
Updated Apr 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). The most spoken languages worldwide 2025 [Dataset]. https://www.statista.com/statistics/266808/the-most-spoken-languages-worldwide/
Explore at:
Dataset updated
Apr 14, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2025
Area covered
World
Description
In 2025, there were around 1.53 billion people worldwide who spoke English either natively or as a second language, slightly more than the 1.18 billion Mandarin Chinese speakers at the time of survey. Hindi and Spanish accounted for the third and fourth most widespread languages that year. Languages in the United States The United States does not have an official language, but the country uses English, specifically American English, for legislation, regulation, and other official pronouncements. The United States is a land of immigration, and the languages spoken in the United States vary as a result of the multicultural population. The second most common language spoken in the United States is Spanish or Spanish Creole, which over than 43 million people spoke at home in 2023. There were also 3.5 million Chinese speakers (including both Mandarin and Cantonese),1.8 million Tagalog speakers, and 1.57 million Vietnamese speakers counted in the United States that year. Different languages at home The percentage of people in the United States speaking a language other than English at home varies from state to state. The state with the highest percentage of population speaking a language other than English is California. About 45 percent of its population was speaking a language other than English at home in 2023.
d
Population of the Limited English Proficient (LEP) Speakers by Community...
catalog.data.gov
data.cityofnewyork.us
+1more
Updated Jan 19, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.cityofnewyork.us (2024). Population of the Limited English Proficient (LEP) Speakers by Community District [Dataset]. https://catalog.data.gov/dataset/population-of-the-limited-english-proficient-lep-speakers-by-community-district
Explore at:
Dataset updated
Jan 19, 2024
Dataset provided by
data.cityofnewyork.us
Description
Many residents of New York City speak more than one language; a number of them speak and understand non-English languages more fluently than English. This dataset, derived from the Census Bureau's American Community Survey (ACS), includes information on over 1.7 million limited English proficient (LEP) residents and a subset of that population called limited English proficient citizens of voting age (CVALEP) at the Community District level. There are 59 community districts throughout NYC, with each district being represented by a Community Board.
O
2017 San Diego County Demographics - Language Spoken at Home for the...
data.sandiegocounty.gov
application/rdfxml +5
Updated Feb 22, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
County of San Diego (2020). 2017 San Diego County Demographics - Language Spoken at Home for the Population 5 Years and Ability to Speak English [Dataset]. https://data.sandiegocounty.gov/Demographics/2017-San-Diego-County-Demographics-Language-Spoken/69ct-7r4j
Explore at:
application/rssxml, csv, xml, application/rdfxml, json, tsvAvailable download formats
Dataset updated
Feb 22, 2020
Dataset authored and provided by
County of San Diego
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Area covered
San Diego County
Description
*Asian/Pacific Islander

Language questions were only asked of persons 5 years and older. The language question is about current use of a non-English language at home, not about ability to speak another language or the use of such a language in the past. People who speak a language other than English outside of the home are not reported as speaking a language other than English. Similarly, people whose mother tongue is a non-English language but who do not currently use the language at home do not report the language.

Source: U.S. Census Bureau; 2013-2017 American Community Survey 5-Year Estimates, Table DP02.
a
What language do people speak at home other than English?
hub.arcgis.com
Updated Oct 30, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ArcGIS Living Atlas Team (2018). What language do people speak at home other than English? [Dataset]. https://hub.arcgis.com/maps/9fd7fbc0f731409abd57cfd15d963683
Explore at:
Dataset updated
Oct 30, 2018
Dataset authored and provided by
ArcGIS Living Atlas Team
Area covered

Description
This map shows the predominant non-English language spoken at home by the population age 5+. The pattern is seen at state, county, and tract geographies. The data shown is current-year American Community Survey (ACS) data from the US Census. The data is updated each year when the ACS releases its new 5-year estimates. For more information about this data, visit this page.To learn more about when the ACS releases data updates, click here.
Common languages used for web content 2025, by share of websites
statista.com
Updated Feb 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Common languages used for web content 2025, by share of websites [Dataset]. https://www.statista.com/statistics/262946/most-common-languages-on-the-internet/
Explore at:
Dataset updated
Feb 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Feb 2025
Area covered
Worldwide
Description
As of February 2025, English was the most popular language for web content, with over 49.4 percent of websites using it. Spanish ranked second, with six percent of web content, while the content in the German language followed, with 5.6 percent. English as the leading online language United States and India, the countries with the most internet users after China, are also the world's biggest English-speaking markets. The internet user base in both countries combined, as of January 2023, was over a billion individuals. This has led to most of the online information being created in English. Consequently, even those who are not native speakers may use it for convenience. Global internet usage by regions As of October 2024, the number of internet users worldwide was 5.52 billion. In the same period, Northern Europe and North America were leading in terms of internet penetration rates worldwide, with around 97 percent of its populations accessing the internet.
a
Predominant Language Spoken at Home - ACS 2016-Copy-Copy
umn.hub.arcgis.com
Updated Dec 24, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University of Minnesota (2019). Predominant Language Spoken at Home - ACS 2016-Copy-Copy [Dataset]. https://umn.hub.arcgis.com/maps/75844925ac7d472ea49c62e518686577
Explore at:
Dataset updated
Dec 24, 2019
Dataset authored and provided by
University of Minnesota
Area covered

Description
This map shows the predominant language spoken at home by the US population aged 5+. This is shown by Census Tract and County centroids. The data values are from the 2012-2016 American Community Survey 5-year estimates in the S1601 Table for Language Spoken at Home. The popup in the map provides a breakdown of the population age 5+ by the language spoken at home. Data values for other age groups are also available within the data's table. The color of the symbols represent the most common language spoken at home. This predominance map style compares the count of people age 5+ based on what language is spoken at home, and returns the value with the highest count. The census breaks down the population 5+ by the following language options:English OnlyNon-English - SpanishNon-English - Asian and Pacific Islander LanguagesNon-English - Indo European LanguagesNon-English - OtherThe size of the symbols represents how many people are 5 years or older, which helps highlight the quantity of people that live within an area that were sampled for this language categorization. The strength of the color represents HOW predominant an language is within an area. If the symbol is a strong color, it makes up a larger portion of the population. This map is designed for a dark basemap such as the Human Geography Basemap or the Dark Gray Canvas Basemap. See the web map to see the pattern at both the county and tract level. This map helps to show the most common language spoken at home at both a regional and local level. The tract pattern shows how distinct neighborhoods are clustered by which language they speak. The county pattern shows how language is used throughout the country. This pattern is shown by census tracts at large scales, and counties at smaller scales.This data was downloaded from the United States Census Bureau American Fact Finder on January 16, 2018. It was then joined with 2016 vintage centroid points and hosted to ArcGIS Online and into the Living Atlas.Nationally, the breakdown of education for the population 5+ is as follows:Total EstimateMargin of ErrorPercent EstimateMargin of ErrorPopulation 5 years and over298,691,202+/-3,594(X)(X)Speak only English235,519,143+/-154,40978.90%+/-0.1Spanish39,145,066+/-94,57113.10%+/-0.1Asian and Pacific Island languages10,172,370+/-22,5613.40%+/-0.1Other Indo-European languages10,827,536+/-46,3353.60%+/-0.1Other languages3,027,087+/-23,3021.00%+/-0.1
h
peoples_speech
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MLCommons, peoples_speech [Dataset]. https://huggingface.co/datasets/MLCommons/peoples_speech
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset authored and provided by
MLCommons
License
Attribution 2.0 (CC BY 2.0)https://creativecommons.org/licenses/by/2.0/
License information was derived automatically
Description
Dataset Card for People's Speech

Dataset Summary

The People's Speech Dataset is among the world's largest English speech recognition corpus today that is licensed for academic and commercial usage under CC-BY-SA and CC-BY 4.0. It includes 30,000+ hours of transcribed speech in English languages with a diverse set of speakers. This open dataset is large enough to train speech-to-text systems and crucially is available with a permissive license.

Supported Tasks… See the full description on the dataset page: https://huggingface.co/datasets/MLCommons/peoples_speech.
a
LA County Language Spoken at Home (census tract)
equity-lacounty.hub.arcgis.com
data.lacounty.gov
+3more
Updated Jul 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
County of Los Angeles (2025). LA County Language Spoken at Home (census tract) [Dataset]. https://equity-lacounty.hub.arcgis.com/items/0a1abd82f6024c72aa9f62bdbd5c5603
Explore at:
Dataset updated
Jul 28, 2025
Dataset authored and provided by
County of Los Angeles
Area covered

Description
US Census American Community Survey Custom Tabulation (ST542) by Census Tract. Language spoken at home for population 5 years and over by ability to speak English, summarized by census tract for 114 languages spoken across LA County, 5-year estimates 2019-2023.See also source data tables:Census Tracts: Language Spoken at Home LA County Census TractsLA County: Language Spoken at Home LA County Headings:GEOIDGeography identificationCT20Census tract (2020)NameCensus tract nameCSACountywide Statistical Area (city or community)SPAService Planning AreaSDSupervisorial Districttotal_popPopulation over 5 years old in census tract (universe)total_limited_engPopulation that speaks English less than "very well"total_limited_eng_pctPercent of population that speaks English less than "very well"
u
Bilingual acquisition data: longitudinal corpus_FerFuLice dataset
portaldelaciencia.uva.es
Updated 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fernández Fuertes, Raquel; M. Liceras, Juana; Fernández Fuertes, Raquel; M. Liceras, Juana (2021). Bilingual acquisition data: longitudinal corpus_FerFuLice dataset [Dataset]. https://portaldelaciencia.uva.es/documentos/682afbc04c44bf76b28821ec
Explore at:
Dataset updated
2021
Authors
Fernández Fuertes, Raquel; M. Liceras, Juana; Fernández Fuertes, Raquel; M. Liceras, Juana
Description
This corpus contains spontaneous productions from a longitudinal study of two English/Spanish bilingual identical twins with the pseudonyms of Simon and Leo. They were born 28-DEC-1998 into a middle-class family in Spain. The father is a native speaker of Peninsular Spanish, and the mother is a native speaker of American English. The father always speaks to the children in Spanish and the mother always addresses them in English. The parents generally communicate in Spanish with each other, except on summers when they travel to the United States for approximately two months or when a monolingual English speaker is present. Therefore, we are dealing with bilingual English/Spanish first language acquisition in a monolingual-Spanish social context, a type of bilingualism that is referred to in the literature as individual bilingualism (Bhatia and Ritchie, 2004).
Data from: Caregiver experiences with oral bilingualism (Benítez-Barrera et...
asha.figshare.com
datasetcatalog.nlm.nih.gov
+1more
pdf
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carlos Benítez-Barrera; Lina Reiss; Marjan Majid; Trisha Chau; Johanna Wilson; Erika Figueroa Rico; Ferenc Bunta; Robert M. Raphael; Beatriz de Diego-Lázaro (2023). Caregiver experiences with oral bilingualism (Benítez-Barrera et al., 2023) [Dataset]. http://doi.org/10.23641/asha.21644846.v2
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.23641/asha.21644846.v2
Dataset updated
May 30, 2023
Dataset provided by
American Speech–Language–Hearing Associationhttp://www.asha.org/
Authors
Carlos Benítez-Barrera; Lina Reiss; Marjan Majid; Trisha Chau; Johanna Wilson; Erika Figueroa Rico; Ferenc Bunta; Robert M. Raphael; Beatriz de Diego-Lázaro
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Purpose: Best practices recommend promoting the use of the home language and allowing caregivers to choose the language(s) that they want to use with their child who is deaf or hard of hearing (DHH). We examined whether Spanish-speaking caregivers of children who are DHH receive professional recommendations on oral bilingualism that follow best practices. We also assessed whether professional recommendations, caregiver beliefs, and language practices had an impact on child language(s) proficiency. Method: Sixty caregivers completed a questionnaire on demographic questions, language(s) use and recommendations, beliefs on bilingualism, and child language proficiency measures in English, Spanish, and American Sign Language (ASL). Professional recommendations on oral bilingualism were reported descriptively, and linear regression was used to identify the predictors of child language(s) proficiency. Results: We found that only 23.3% of the caregivers were actively encouraged to raise their child orally bilingual. Language practices predicted child proficiency in each language (English, Spanish, and ASL), but professional recommendations and caregiver beliefs did not. Conclusions: Our results revealed that most caregivers received recommendations that do not follow current best practices. Professional training is still needed to promote bilingualism and increase cultural competence when providing services to caregivers who speak languages different from English. Supplemental Material S1. Survey items and response scoring. Benítez-Barrera, C., Reiss, L., Majid, M., Chau, T., Wilson, J., Rico, E. F., Bunta, F., Raphael, R. M., & de Diego-Lázaro, B. (2023). Caregiver experiences with oral bilingualism in children who are deaf and hard of hearing in the United States: Impact on child language proficiency. Language, Speech, and Hearing Services in Schools, 54(1), 224–240. https://doi.org/10.1044/2022_LSHSS-22-00095
l
What languages are spoken by people who have limited English ability?
visionzero.geohub.lacity.org
Updated Apr 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Urban Observatory by Esri (2023). What languages are spoken by people who have limited English ability? [Dataset]. https://visionzero.geohub.lacity.org/maps/2c15f2d8d81343d883e70a73317a5cb9
Explore at:
Dataset updated
Apr 21, 2023
Dataset authored and provided by
Urban Observatory by Esri
Area covered

Description
This map shows the predominant language(s) spoken by people who have limited English speaking ability. This is shown using American Community Survey data from the US Census Bureau by state, county, and tract.There are 12 different language/language groupings: SpanishFrench, Haitian, or CajunKoreanChinese (including Mandarin and Cantonese)VietnameseTagalog (including Filipino)ArabicGerman or other West GermanicRussian, Polish, or other SlavicOther Indo-European (such as Italian or Portuguese)Other Asian and Pacific Island (such as Japanese or Hmong)Other and unspecified (such as Navajo or Hebrew).This map also uses a feature effect to identify the counties with either 10,000 or 5% of the population having limited English ability. According to the Voting Rights Act, "localities where there are more than 10,000 or over 5 percent of the total voting age citizens in a single political subdivision (usually a county, but a township or municipality in some states) who are members of a single language minority group, have depressed literacy rates, and do not speak English very well" are required to "provide [voting materials] in the language of the applicable minority group as well as in the English language".This map uses these hosted feature layers containing the most recent American Community Survey data. These layers are part of ArcGIS Living Atlas, and are updated every year when the American Community Survey releases new estimates, so values in the map always reflect the newest data available.
h
HausaVG
huggingface.co
Updated Jul 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HausaNLP (2023). HausaVG [Dataset]. https://huggingface.co/datasets/HausaNLP/HausaVG
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 3, 2023
Dataset authored and provided by
HausaNLP
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Multi-modal Machine Translation (MMT) enables the use of visual information to enhance the quality of translations, especially where the full context is not available to enable the unambiguous translation in standard machine translation. Despite the increasing popularity of such technique, it lacks sufficient and qualitative datasets to maximize the full extent of its potential. Hausa, a Chadic language, is a member of the Afro-Asiatic language family. It is estimated that about 100 to 150 million people speak the language, with more than 80 million indigenous speakers. This is more than any of the other Chadic languages. Despite the large number of speakers, the Hausa language is considered as a low resource language in natural language processing (NLP). This is due to the absence of enough resources to implement most of the tasks in NLP. While some datasets exist, they are either scarce, machine-generated or in the religious domain. Therefore, there is the need to create training and evaluation data for implementing machine learning tasks and bridging the research gap in the language. This work presents the Hausa Visual Genome (HaVG), a dataset that contains the description of an image or a section within the image in Hausa and its equivalent in English. The dataset was prepared by automatically translating the English description of the images in the Hindi Visual Genome (HVG). The synthetic Hausa data was then carefully postedited, taking into cognizance the respective images. The data is made of 32,923 images and their descriptions that are divided into training, development, test, and challenge test set. The Hausa Visual Genome is the first dataset of its kind and can be used for Hausa-English machine translation, multi-modal research, image description, among various other natural language processing and generation tasks.
U.S. - children who speak another language than English at home 1979-2019
statista.com
Updated Jul 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). U.S. - children who speak another language than English at home 1979-2019 [Dataset]. https://www.statista.com/statistics/476745/number-of-children-who-speak-another-language-than-english-at-home-in-the-us/
Explore at:
Dataset updated
Jul 5, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
United States
Description
In 2019, about 12.08 million children were speaking another language other than English at home in the United States. This number is fairly consistent with the previous year, where 12.13 million children spoke another language at home.
People Speaking English Less Than "Very Well" GIS
data-sccphd.opendata.arcgis.com
hub.arcgis.com
Updated Aug 24, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Santa Clara County Public Health (2022). People Speaking English Less Than "Very Well" GIS [Dataset]. https://data-sccphd.opendata.arcgis.com/datasets/people-speaking-english-less-than-very-well-gis
Explore at:
Dataset updated
Aug 24, 2022
Dataset provided by
Santa Clara County Public Health Departmenthttps://publichealth.sccgov.org/
Authors
Santa Clara County Public Health
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Table contains count and percentage of county residents ages 5 years and older who speak English less than "very well". Data are presented at county, city, zip code and census tract level. Data are presented for zip codes (ZCTAs) fully within the county. Source: U.S. Census Bureau, 2016-2020 American Community Survey 5-year estimates, Table S1601; data accessed on August 23, 2022 from https://api.census.gov. The 2020 Decennial geographies are used for data summarization.METADATA:notes (String): Lists table title, notes, sourcesgeolevel (String): Level of geographyGEOID (Numeric): Geography IDNAME (String): Name of geographypop_5plus (Numeric): Population ages 5 years and olderspeak_Eng_lt_very_well (Numeric): Number of people ages 5 and older who speak English less than "very well"pct_speak_Eng_lt_very_well (Numeric): Percent of people ages 5 and older who speak English less than "very well"
Z
The Dublin Language Garden Perceptual Dialectology of Irish English...
data.niaid.nih.gov
Updated Nov 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Garnett, Vicky (2020). The Dublin Language Garden Perceptual Dialectology of Irish English Collection [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4247828
Explore at:
Dataset updated
Nov 6, 2020
Dataset provided by
Garnett, Vicky
Lucek, Stephen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Ireland, Dublin
Description
Recommended citation for this dataset: Garnett, Vicky, & Lucek, Stephen. (2020). The Dublin Language Garden Perceptual Dialectology of Irish English Collection (Version 1.0.0) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.4247829

About this Dataset The field of Perceptual Dialectology is an area of sociolinguistic study that investigates how non-linguists view different varieties of language. It often includes hand-drawn map exercises in which participants indicate where they believe various varieties are spoken, and their attitudes towards them.

In 2015, as part of a public linguistics outreach event (the Dublin Language Garden) held at Trinity College Dublin, the authors created an activity for members of the public and collected hand-drawn maps from them that gave responses to the following tasks:

a. Indicate where you come from on the map (using a red dot sticker) b. Draw where you think the Dublin dialect occurs c. Draw the boundaries of any other dialects you believe occur in Ireland d. Tell us what you think are the features of those dialects e. Tell us what you think are the characteristics of the people who speak those dialects.

Participants of all ages were encouraged to take part, but only data from those over 18 were retained after the event and used in this data collection. Participants were all given information on how the data was to be anonymised, processed and published on a clearly displayed poster to read before they were given a map to complete the 5 tasks (listed above). No additional information about the participants, aside from that acquired through Task a, was collected.

File List:

_READ_ME - Dublin Language Garden Perceptual Dialectology of Irish English data.txt Contains a detailed description of this dataset.

DLG_PDIE_KML_data_by_location.zip This zipped folder contains the .kml data of multiple hand-drawn maps organised into folders by their location

DLG_PDIE_KML_data_by_part.zip This zipped folder contains the .kml data of multiple hand-drawn maps organised into folders according to the participants.

These folders have been organised in this way in order to make discoverability easier between the data. Users may wish to analyse the data only by the locations of the varieties identified by the participants. Other users may only be interested in the data given by specific participants, and therefore the folder that organises the data in this way may be of better use to them. Both folders, however, contain the same data, it is simply how they are organised.

Garnett and Lucek DLG_PD_IE Qualitative Data (Nov 2020).xlsx Spreadsheet featuring tabulated qualitative data taken from all maps

Sample Hand-drawn Maps.zip Folder containing 2 sample hand-drawn maps from the participants to help contextualise the data presented here.

Any questions? Any enquiries regarding this dataset should be directed to either Vicky Garnett (garnetv@tcd.ie) or Stephen Lucek (stephen.lucek@ucd.ie).
Census of Population and Housing, 1980: Summary Tape File 3B
archive.ciser.cornell.edu
Updated Feb 13, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bureau of the Census (2020). Census of Population and Housing, 1980: Summary Tape File 3B [Dataset]. http://doi.org/10.6077/j5/gwagmn
Explore at:
Unique identifier
https://doi.org/10.6077/j5/gwagmn
Dataset updated
Feb 13, 2020
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
Bureau of the Census
Variables measured
Individual, HousingUnit
Description
This data collection is a component of Summary Tape File (STF) 3, which consists of four sets of data files containing detailed tabulations of the nation's population and housing characteristics produced from the 1980 Census. The STF 3 files contain sample data inflated to represent the total United States population. The files also contain 100-percent counts and unweighted sample counts of persons and housing units. All files in the STF 3 series are identical, containing 321 substantive data variables organized in the form of 150 "tables," as well as standard geographic identification variables. Population items tabulated for each person include demographic data and information on schooling, Spanish origin, language spoken at home and ability to speak English, labor force status in 1979, residency in 1975, number of children ever born, means of transportation to work, current occupation, industry, and 1979 details on occupation, hours worked, and income. Housing items include size and condition of the housing unit as well as information on value, age, water, sewage and heating, number of vehicles, and monthly owner costs (e.g., sum of payments for real estate taxes, property insurance, utilities, and regular mortgage payments). Selected aggregates and medians are also provided. Each dataset in STF 3 provides different geographic coverage. Summary Tape File 3B provides summaries for each 5-digit ZIP-code area within a state, and for 5-digit ZIP-code areas within states that were contained within Standard Metropolitan Statistical Areas (SMSAs), portions of SMSAs, or within counties, county portions, or county equivalents. All persons and housing units in the United States were sampled. Population and housing items include household relationship, sex, race, age, marital status, Hispanic origin, number of units at address, complete plumbing facilities, number of rooms, whether owned or rented, vacancy status, and value for noncondominiums. The Census Bureau's machine-readable data dictionary for STF 3 is also available through CENSUS OF POPULATION AND HOUSING, 1980 [UNITED STATES]: CENSUS SOFTWARE PACKAGE (CENSPAC) VERSION 3.2 WITH STF4 DATA DICTIONARIES (ICPSR 7789), the software package designed specifically by the Census Bureau for use with the 1980 Census data files. (Source: downloaded from ICPSR 7/13/10)

Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR -- https://doi.org/10.3886/ICPSR08318.v1. We highly recommend using the ICPSR version as they made this dataset available in multiple data formats.

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2025). Number of native Spanish speakers worldwide 2024, by country [Dataset]. https://www.statista.com/statistics/991020/number-native-spanish-speakers-country-worldwide/

Number of native Spanish speakers worldwide 2024, by country

Explore at:

9 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Jan 15, 2025

Dataset authored and provided by

Statistahttp://statista.com/

Area covered

World

Description

Mexico is the country with the largest number of native Spanish speakers in the world. As of 2024, 132.5 million people in Mexico spoke Spanish with a native command of the language. Colombia was the nation with the second-highest number of native Spanish speakers, at around 52.7 million. Spain came in third, with 48 million, and Argentina fourth, with 46 million. Spanish, a world language As of 2023, Spanish ranked as the fourth most spoken language in the world, only behind English, Chinese, and Hindi, with over half a billion speakers. Spanish is the official language of over 20 countries, the majority on the American continent, nonetheless, it's also one of the official languages of Equatorial Guinea in Africa. Other countries have a strong influence, like the United States, Morocco, or Brazil, countries included in the list of non-Hispanic countries with the highest number of Spanish speakers. The second most spoken language in the U.S. In the most recent data, Spanish ranked as the language, other than English, with the highest number of speakers, with 12 times more speakers as the second place. Which comes to no surprise following the long history of migrations from Latin American countries to the Northern country. Moreover, only during the fiscal year 2022. 5 out of the top 10 countries of origin of naturalized people in the U.S. came from Spanish-speaking countries.

Clear search

Close search

Google apps

Main menu

Number of native Spanish speakers worldwide 2024, by country

Census Data - Languages spoken in Chicago, 2008 – 2012

Ranking of languages spoken at home in the U.S. 2023

Data from: Language Spoken at Home

The most spoken languages worldwide 2025

Population of the Limited English Proficient (LEP) Speakers by Community...

2017 San Diego County Demographics - Language Spoken at Home for the...

What language do people speak at home other than English?

Common languages used for web content 2025, by share of websites

Predominant Language Spoken at Home - ACS 2016-Copy-Copy

peoples_speech

LA County Language Spoken at Home (census tract)

Bilingual acquisition data: longitudinal corpus_FerFuLice dataset

Data from: Caregiver experiences with oral bilingualism (Benítez-Barrera et...

What languages are spoken by people who have limited English ability?

HausaVG

U.S. - children who speak another language than English at home 1979-2019

People Speaking English Less Than "Very Well" GIS

The Dublin Language Garden Perceptual Dialectology of Irish English...

Census of Population and Housing, 1980: Summary Tape File 3B

Number of native Spanish speakers worldwide 2024, by country