51 datasets found

Number of native Spanish speakers worldwide 2024, by country
statista.com
boostndoto.org
+5more
Cite
Statista, Number of native Spanish speakers worldwide 2024, by country [Dataset]. https://www.statista.com/statistics/991020/number-native-spanish-speakers-country-worldwide/
Explore at:
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
World
Description
Mexico is the country with the largest number of native Spanish speakers in the world. As of 2024, 132.5 million people in Mexico spoke Spanish with a native command of the language. Colombia had the second-highest number of native Spanish speakers, at around 52.7 million. Spain came third, with 48 million, and Argentina fourth, with 46 million.

Spanish, a world language

As of 2023, Spanish ranked as the fourth most spoken language in the world, behind only English, Chinese, and Hindi, with over half a billion speakers. Spanish is the official language of over 20 countries, most of them in the Americas; it is also one of the official languages of Equatorial Guinea in Africa. Spanish also has a strong presence in other countries, such as the United States, Morocco, and Brazil, which are included in the list of non-Hispanic countries with the highest number of Spanish speakers.

The second most spoken language in the U.S.

In the most recent data, Spanish ranked as the language other than English with the highest number of speakers in the United States, with 12 times as many speakers as the language in second place. This comes as no surprise given the long history of migration from Latin American countries to the United States. Moreover, in fiscal year 2022 alone, 5 of the top 10 countries of origin of naturalized people in the U.S. were Spanish-speaking countries.
Spanish speakers in countries where Spanish is not an official language 2024...
statista.com
Cite
Statista, Spanish speakers in countries where Spanish is not an official language 2024 [Dataset]. https://www.statista.com/statistics/1276290/number-spanish-speakers-non-hispanic-countries-worldwide/
Explore at:
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
World
Description
The United States is the non-Hispanic country with the largest number of native Spanish speakers in the world, with approximately 41.89 million people with a native command of the language in 2024. However, the European Union had the largest group of non-native speakers with limited proficiency in Spanish, at around 28 million people. Furthermore, Mexico is the country with the largest number of native Spanish speakers in the world as of 2024.
Hispanic population U.S. 2023, by state
statista.com
Cite
Statista, Hispanic population U.S. 2023, by state [Dataset]. https://www.statista.com/statistics/259850/hispanic-population-of-the-us-by-state/
Explore at:
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2023
Area covered
United States
Description
In 2023, California had the highest Hispanic population in the United States, with over 15.76 million people claiming Hispanic heritage. Texas, Florida, New York, and Illinois rounded out the top five states for Hispanic residents in that year.

History of Hispanic people

Hispanic people are those whose heritage stems from a former Spanish colony. Spanish colonization of the Americas began in the late 15th century, after Christopher Columbus arrived in 1492. The Spanish Empire expanded its territory throughout Central America and South America, but its colonization of what is now the United States did not include the Northeast. Although the number of Hispanic people living in the United States has increased, the median income of Hispanic households has fluctuated slightly since 1990.

Hispanic population in the United States

Hispanic people are the second-largest ethnic group in the United States, making Spanish the second most common language spoken in the country. In 2021, about one-fifth of Hispanic households in the United States made between 50,000 and 74,999 U.S. dollars. The unemployment rate of Hispanic Americans has fluctuated significantly since 1990, but has been on the decline since 2010, with the exception of 2020 and 2021, due to the impact of the coronavirus (COVID-19) pandemic.
Hispanic population in the U.S. 2023, by origin
statista.com
Updated Oct 21, 2024
Cite
Statista (2024). Hispanic population in the U.S. 2023, by origin [Dataset]. https://www.statista.com/statistics/234852/us-hispanic-population/
Explore at:
Dataset updated
Oct 21, 2024
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2023
Area covered
United States
Description
As of 2023, around 37.99 million people of Mexican descent were living in the United States - the largest of any Hispanic group. Puerto Ricans, Salvadorans, Cubans, and Dominicans rounded out the top five Hispanic groups living in the U.S. in that year.
Number of students learning Spanish worldwide 2024, by country
statista.com
Updated Jan 22, 2025
Cite
Statista (2025). Number of students learning Spanish worldwide 2024, by country [Dataset]. https://www.statista.com/statistics/1276319/number-spanish-language-students-country-worldwide/
Explore at:
Dataset updated
Jan 22, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
Spain, Worldwide
Description
The United States is the country with the largest number of Spanish-language students, at approximately 8.59 million people in 2024. The second country is Brazil, with around 4.05 million students of the Spanish language. Moreover, the United States is also the non-Hispanic country with the largest number of native Spanish speakers in the world.
Spanish Language Datasets | 1.8M+ Sentences | Translation Data | TTS |...
datarade.ai
Updated Jul 11, 2025
Cite
Oxford Languages (2025). Spanish Language Datasets | 1.8M+ Sentences | Translation Data | TTS | Dictionary Display | Translations | EU & LATAM Coverage [Dataset]. https://datarade.ai/data-products/spanish-language-datasets-1-8m-sentences-nlp-tts-dic-oxford-languages
Explore at:
Available download formats: .json, .xml, .csv, .xls, .txt, .mp3, .wav
Dataset updated
Jul 11, 2025
Dataset authored and provided by
Oxford Languages (https://lexico.com/es)
Area covered
Honduras, Chile, Ecuador, Costa Rica, Nicaragua, Colombia, Paraguay, Panama, Bolivia (Plurinational State of), Cuba
Description
Linguistically annotated Spanish language datasets with headwords, definitions, senses, examples, POS tags, semantic metadata, and usage info. Ideal for dictionary tools, NLP, and TTS model training or fine-tuning.

Our Spanish language datasets are carefully compiled and annotated by language and linguistic experts; you can find them available for licensing:

Spanish Monolingual Dictionary Data

Spanish Bilingual Dictionary Data

Spanish Sentences Data

Synonyms and Antonyms Data

Audio Data

Spanish Word List Data

Key Features (approximate numbers):

Spanish Monolingual Dictionary Data

Our Spanish monolingual dictionary data reliably offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.

Words: 73,000

Senses: 123,000

Example sentences: 104,000

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Update frequency: annually

Spanish Bilingual Dictionary Data

The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is reviewed and updated annually by our in-house team of language experts, and it offers significant coverage of the language, providing a large volume of high-quality translated words.

Translations: 221,300

Senses: 103,500

Example sentences: 74,500

Example translations: 83,800

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Update frequency: annually

Spanish Sentences Data

Spanish sentences retrieved from the corpus are ideal for NLP model training, comprising approximately 20 million words. The sentences provide broad coverage of Spanish-speaking countries and are tagged accordingly with a particular country or dialect.

Sentences volume: 1,840,000

Format: XML and JSON format

Delivery: Email (link-based file sharing) and REST API
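
As a rough cross-check of the two approximate figures quoted for the sentences data (about 20 million words across 1,840,000 sentences), the following Python sketch computes the implied average sentence length. The function name `average_sentence_length` is illustrative only and is not part of any Oxford Languages tooling.

```python
# Approximate figures quoted in the listing for the Spanish Sentences Data.
TOTAL_WORDS = 20_000_000
TOTAL_SENTENCES = 1_840_000


def average_sentence_length(words: int, sentences: int) -> float:
    """Return the mean number of words per sentence."""
    if sentences <= 0:
        raise ValueError("sentences must be positive")
    return words / sentences


avg_words = average_sentence_length(TOTAL_WORDS, TOTAL_SENTENCES)
print(f"about {avg_words:.1f} words per sentence")
```

Because both inputs are stated as approximate, the result is only an order-of-magnitude guide to how long the sentences are.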

Spanish Synonyms and Antonyms Data

This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.

Synonyms: 127,700

Antonyms: 9,500

Format: XML format

Delivery: Email (link-based file sharing)

Update frequency: annually

Spanish Audio Data (word-level)

Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.

Audio files: 20,900

Format: XLSX (for index), MP3 and WAV (audio files)

Spanish Word List Data

This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.

Wordforms: 450,000

Format: CSV and TXT formats

Delivery: Email (link-based file sharing)

Use Cases:

We consistently work with our clients on new use cases as language technology continues to evolve. These include NLP applications, TTS, dictionary display tools, games, translation, word embedding, and word sense disambiguation (WSD).

If you have a specific use case in mind that isn't listed here, we’d be happy to explore it with you. Don’t hesitate to get in touch with us at Oxford.Languages@oup.com to start the conversation.

Pricing:

Oxford Languages offers flexible pricing based on use case and delivery format. Our datasets are licensed via term-based IP agreements and tiered pricing for API-delivered data. Whether you’re integrating into a product, training an LLM, or building custom NLP solutions, we tailor licensing to your specific needs.

Contact our team or email us at Oxford.Languages@oup.com to explore pricing options and discover how our language data can support your goals.

About the sample:

The samples offer a brief overview of one or two language datasets (monolingual and/or bilingual dictionary data). To help you explore the structure and features of our datasets, we provide a sample in CSV format for preview purposes only.

If you need the complete original sample or more details about any dataset, please contact us (Growth.OL@oup.com) to request access or further information.
states in U.S. Ranked by Hispanic Other Race Population // 2025 Edition
neilsberg.com
csv, json
Updated Feb 11, 2025
Cite
Neilsberg Research (2025). states in U.S. Ranked by Hispanic Other Race Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/states-in-united-states-by-hispanic-other-race-population/
Explore at:
Available download formats: json, csv
Dataset updated
Feb 11, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Variables measured
Hispanic Other Race Population, Hispanic Other Race Population as Percent of Total Population of states in United States, Hispanic Other Race Population as Percent of Total Hispanic Other Race Population of United States
Measurement technique
To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

This list ranks the 51 states in the United States by Hispanic Some Other Race (SOR) population, as estimated by the United States Census Bureau. It also highlights population changes in each state over the past five years.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

2019-2023 American Community Survey 5-Year Estimates

2018-2022 American Community Survey 5-Year Estimates

2017-2021 American Community Survey 5-Year Estimates

2016-2020 American Community Survey 5-Year Estimates

2015-2019 American Community Survey 5-Year Estimates

Variables / Data Columns

Rank by Hispanic Other Race Population: This column displays the rank of states in the United States by their Hispanic Some Other Race (SOR) population, using the most recent ACS data available.

states: The states for which the rank is shown in the previous column.

Hispanic Other Race Population: The Hispanic Other Race population of the states is shown in this column.

% of Total states Population: This shows what percentage of the state's total population identifies as Hispanic Other Race. Please note that the sum of all percentages may not equal one due to rounding of values.

% of Total U.S. Hispanic Other Race Population: This shows how much of the entire United States Hispanic Other Race population lives in that state. Please note that the sum of all percentages may not equal one due to rounding of values.

5 Year Rank Trend: This column displays the rank trend across the last 5 years.
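
The rank and share columns described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not Neilsberg's actual pipeline: the function name `rank_by_population`, the output keys, and the placeholder geographies and populations are all invented for the example. It follows the stated rule that geographies with no reported population are excluded rather than ranked.

```python
from typing import Optional


def rank_by_population(populations: dict[str, Optional[int]]) -> list[dict]:
    """Rank geographies by population, skipping those with none reported."""
    # Drop geographies with a missing or zero population (no rank assigned).
    reported = {name: pop for name, pop in populations.items() if pop}
    total = sum(reported.values())
    ordered = sorted(reported.items(), key=lambda item: item[1], reverse=True)
    return [
        {
            "rank": position,
            "state": name,
            "population": pop,
            # Share of the total across all ranked geographies, in percent.
            "pct_of_total": round(100 * pop / total, 2),
        }
        for position, (name, pop) in enumerate(ordered, start=1)
    ]


# Placeholder values for illustration only; these are not ACS estimates.
sample = {"State A": 1200, "State B": 300, "State C": None, "State D": 500}
rows = rank_by_population(sample)
for row in rows:
    print(row)
```

Rounding each share independently is also why a column of percentages may not sum exactly to the whole.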

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.

Custom data

If you need custom data for any of your research projects, reports, or presentations, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research's aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
Mexican Spanish General Conversation Speech Dataset for ASR
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Mexican Spanish General Conversation Speech Dataset for ASR [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/general-conversation-spanish-mexico
Explore at:
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
Mexico
Dataset funded by
FutureBeeAI
Description
Introduction
Welcome to the Mexican Spanish General Conversation Speech Dataset — a rich, linguistically diverse corpus purpose-built to accelerate the development of Spanish speech technologies. This dataset is designed to train and fine-tune ASR systems, spoken language understanding models, and generative voice AI tailored to real-world Mexican Spanish communication.
Curated by FutureBeeAI, this 30-hour dataset offers unscripted, spontaneous two-speaker conversations across a wide array of real-life topics. It enables researchers, AI developers, and voice-first product teams to build robust, production-grade Spanish speech models that understand and respond to authentic Mexican accents and dialects.
Speech Data
The dataset comprises 30 hours of high-quality audio, featuring natural, free-flowing dialogue between native speakers of Mexican Spanish. These sessions range from informal daily talks to deeper, topic-specific discussions, ensuring variability and context richness for diverse use cases.
•Participant Diversity:
•Speakers: 60 verified native Mexican Spanish speakers from FutureBeeAI’s contributor community.
•Regions: Representing various provinces of Mexico to ensure dialectal diversity and demographic balance.
•Demographics: A balanced gender ratio (60% male, 40% female) with participant ages ranging from 18 to 70 years.
•Recording Details:
•Conversation Style: Unscripted, spontaneous peer-to-peer dialogues.
•Duration: Each conversation ranges from 15 to 60 minutes.
•Audio Format: Stereo WAV files, 16-bit depth, recorded at 16 kHz sample rate.
•Environment: Quiet, echo-free settings with no background noise.
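
A buyer could verify that a delivered file matches the stated audio format (stereo, 16-bit, 16 kHz WAV) with Python's standard `wave` module. The sketch below is an assumption-laden illustration, not a FutureBeeAI tool: the helper name `matches_listed_format` and the file name `demo.wav` are hypothetical, and the demo writes its own silent file so that it runs without the dataset.

```python
import os
import tempfile
import wave


def matches_listed_format(path: str, channels: int = 2,
                          sample_width_bytes: int = 2,
                          sample_rate: int = 16_000) -> bool:
    """Return True if the WAV header matches the expected layout."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getnchannels() == channels
            and wav.getsampwidth() == sample_width_bytes  # 2 bytes = 16 bits
            and wav.getframerate() == sample_rate
        )


# Demo: write one second of stereo silence, then check its header.
with tempfile.TemporaryDirectory() as tmp_dir:
    demo_path = os.path.join(tmp_dir, "demo.wav")  # hypothetical file name
    with wave.open(demo_path, "wb") as wav:
        wav.setnchannels(2)
        wav.setsampwidth(2)
        wav.setframerate(16_000)
        # channels * bytes per sample * frames in one second
        wav.writeframes(b"\x00" * (2 * 2 * 16_000))
    format_ok = matches_listed_format(demo_path)

print(format_ok)
```

The check reads only the header, so it confirms the container parameters and says nothing about recording quality.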

Topic Diversity
The dataset spans a wide variety of everyday and domain-relevant themes. This topic diversity ensures the resulting models are adaptable to broad speech contexts.
•Sample Topics Include:
•Family & Relationships
•Food & Recipes
•Education & Career
•Healthcare Discussions
•Social Issues
•Technology & Gadgets
•Travel & Local Culture
•Shopping & Marketplace Experiences, and many more.
Transcription
Each audio file is paired with a human-verified, verbatim transcription available in JSON format.
•Transcription Highlights:
•Speaker-segmented dialogues
•Time-coded utterances
•Non-speech elements (pauses, laughter, etc.)
•High transcription accuracy, achieved through double QA pass, average WER < 5%
These transcriptions are production-ready, enabling seamless integration into ASR model pipelines or conversational AI workflows.
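
Word error rate (WER) is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. The Python sketch below implements that standard definition with a word-level edit distance. How FutureBeeAI measures its quoted average is not stated in the listing, so the function `word_error_rate` and the example sentences are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # match or substitution
            ))
        previous = current
    return previous[-1] / len(ref)


# Made-up example: one substitution and one deletion over four words.
wer = word_error_rate("hola como estas hoy", "hola como esta")
print(f"WER = {wer:.2f}")
```

Under this definition, an average WER below 5% means fewer than one word-level error per 20 reference words.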
Metadata
The dataset comes with granular metadata for both speakers and recordings:
•Speaker Metadata: Age, gender, accent, dialect, state/province, and participant ID.
•Recording Metadata: Topic, duration, audio format, device type, and sample rate.
Such metadata helps developers fine-tune model training and supports use-case-specific filtering or demographic analysis.
Usage and Applications
This dataset is a versatile resource for multiple Spanish speech and language AI applications:
•ASR Development: Train accurate speech-to-text systems for Mexican Spanish.
•Voice Assistants: Build smart assistants capable of understanding natural Mexican conversations.
Spanish TTS Speech Dataset for Speech Synthesis
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Spanish TTS Speech Dataset for Speech Synthesis [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/tts-monolgue-spanish-spain
Explore at:
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
The Spanish TTS Monologue Speech Dataset is a professionally curated resource built to train realistic, expressive, and production-grade text-to-speech (TTS) systems. It contains studio-recorded long-form speech by trained native Spanish voice artists, each contributing 1 to 2 hours of clean, uninterrupted monologue audio.
Unlike typical prompt-based datasets with short, isolated phrases, this collection features long-form, topic-driven monologues that mirror natural human narration. It includes content types that are directly useful for real-world applications, like audiobook-style storytelling, educational lectures, health advisories, product explainers, digital how-tos, formal announcements, and more.
All recordings are captured in professional studios using high-end equipment and under the guidance of experienced voice directors.
Recording & Audio Quality
•Audio Format: WAV, 48 kHz, available in 16-bit, 24-bit, and 32-bit depth
•SNR: Minimum 30 dB
•Channel: Mono
•Recording Duration: 20-30 minutes
•Recording Environment: Studio-controlled, acoustically treated rooms
•Per Speaker Volume: 1–2 hours of speech per artist
•Quality Control: Each file is reviewed and cleaned for common acoustic issues, including reverberation, lip smacks, mouth clicks, thumping, hissing, plosives, sibilance, background noise, static interference, clipping, and other artifacts.

Only clean, production-grade audio makes it into the final dataset.
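
To make the 30 dB minimum concrete: signal-to-noise ratio in decibels is commonly computed as 10 · log10 of the ratio between mean signal power and mean noise power. The Python sketch below applies that textbook formula to synthetic samples. The listing does not say how the vendor measures SNR, so the function `snr_db` and the sample values are assumptions made for illustration.

```python
import math


def snr_db(signal: list[float], noise: list[float]) -> float:
    """Return 10 * log10(mean signal power / mean noise power)."""
    if not signal or not noise:
        raise ValueError("signal and noise must be non-empty")
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_power = sum(x * x for x in noise) / len(noise)
    if noise_power == 0:
        raise ValueError("noise power is zero; SNR is unbounded")
    return 10 * math.log10(signal_power / noise_power)


# Synthetic samples: amplitude 1.0 for the signal, 0.01 for the noise.
signal = [1.0, -1.0] * 100
noise = [0.01, -0.01] * 100

value = snr_db(signal, noise)
meets_minimum = value >= 30.0  # threshold quoted in the listing
print(f"SNR = {value:.1f} dB, meets 30 dB minimum: {meets_minimum}")
```

An amplitude ratio of 100 to 1 corresponds to a power ratio of 10,000 to 1, which is 40 dB, so the synthetic example clears the stated minimum.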
Voice Artist Selection
All voice artists are native Spanish speakers with professional training or prior experience in narration. We ensure a diverse pool in terms of age, gender, and region to bring a balanced and rich vocal dataset.
•Artist Profile:
•Gender: Male and Female
•Age Range: 20–60 years
•Regions: Native Spanish-speaking regions of Spain
•Selection Process: All artists are screened, onboarded, and sample-approved using FutureBeeAI’s proprietary Yugo platform.

Script Quality & Coverage
Scripts are not generic or repetitive. Scripts are professionally authored by domain experts to reflect real-world use cases. They avoid redundancy and include modern vocabulary, emotional range, and phonetically rich sentence structures.
•Word Count per Script: 3,000–5,000 words per 30-minute session
•Content Types:
•Storytelling
•Script and book reading
•Informational explainers
•Government service instructions
•E-commerce tutorials
•Motivational content
•Health & wellness guides
•Education & career advice
•Linguistic Design: Balanced punctuation, emotional range, modern syntax, and vocabulary diversity
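
The stated script length (3,000–5,000 words per 30-minute session) implies a speaking rate that can be derived with simple arithmetic. The Python sketch below does that calculation; the helper name `words_per_minute` is illustrative, and the result assumes the whole session is spent reading.

```python
def words_per_minute(words: int, minutes: float) -> float:
    """Return the average speaking rate for a script of a given length."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return words / minutes


SESSION_MINUTES = 30
low_rate = words_per_minute(3_000, SESSION_MINUTES)
high_rate = words_per_minute(5_000, SESSION_MINUTES)
print(f"{low_rate:.0f} to {high_rate:.0f} words per minute")
```

Any pauses or retakes within a session would raise the effective rate while the artist is actually speaking.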

Transcripts & Alignment
While the script is used during the recording, we also provide post-recording updates to ensure the transcript reflects the final spoken audio. Minor edits are made to adjust for skipped or rephrased words.
•Segmentation: Time-stamped at the sentence level, aligned to actual spoken delivery
•Format: Available in plain text and JSON

•Post-processing:
•Corrected for
states in U.S. Ranked by Hispanic White Population // 2025 Edition
neilsberg.com
csv, json
Updated Feb 11, 2025
Cite
Neilsberg Research (2025). states in U.S. Ranked by Hispanic White Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/states-in-united-states-by-hispanic-white-population/
Explore at:
Available download formats: json, csv
Dataset updated
Feb 11, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Variables measured
Hispanic White Population, Hispanic White Population as Percent of Total Population of states in United States, Hispanic White Population as Percent of Total Hispanic White Population of United States
Measurement technique
To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

This list ranks the 51 states in the United States by Hispanic White population, as estimated by the United States Census Bureau. It also highlights population changes in each state over the past five years.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

2019-2023 American Community Survey 5-Year Estimates

2018-2022 American Community Survey 5-Year Estimates

2017-2021 American Community Survey 5-Year Estimates

2016-2020 American Community Survey 5-Year Estimates

2015-2019 American Community Survey 5-Year Estimates

Variables / Data Columns

Rank by Hispanic White Population: This column displays the rank of states in the United States by their Hispanic White population, using the most recent ACS data available.

states: The states for which the rank is shown in the previous column.

Hispanic White Population: The Hispanic White population of the states is shown in this column.

% of Total states Population: This shows what percentage of the state's total population identifies as Hispanic White. Please note that the sum of all percentages may not equal one due to rounding of values.

% of Total U.S. Hispanic White Population: This shows how much of the entire United States Hispanic White population lives in that state. Please note that the sum of all percentages may not equal one due to rounding of values.

5 Year Rank Trend: This column displays the rank trend across the last 5 years.

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.

Custom data

If you need custom data for any of your research projects, reports, or presentations, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research's aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
Largest countries in Latin America, by land area
statista.com
Updated Apr 2, 2019
Cite
Statista (2019). Largest countries in Latin America, by land area [Dataset]. https://www.statista.com/statistics/990519/largest-countries-area-latin-america/
Explore at:
Dataset updated
Apr 2, 2019
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
Americas, Latin America
Description
Based on land area, Brazil is the largest country in Latin America by far, with a total area of over 8.5 million square kilometers. Argentina follows with almost 2.8 million square kilometers. Cuba, whose surface area extends over almost 111,000 square kilometers, is the Caribbean country with the largest territory.

Brazil: a country with a lot to offer

Brazil's borders reach nearly half of the South American subcontinent, making it the fifth-largest country in the world and the third-largest country in the Western Hemisphere. Along with its landmass, Brazil also boasts the largest population and economy in the region. Although Brasília is the capital, the most significant portion of the country's population is concentrated along its coastline in the cities of São Paulo and Rio de Janeiro.

South America: a region of extreme geographic variation

With the Andes mountain range in the West, the Amazon Rainforest in the East, the Equator in the North, and Cape Horn as the Southern-most continental tip, South America has some of the most diverse climatic and ecological terrains in the world. At its core, its biodiversity can largely be attributed to the Amazon, the world's largest tropical rainforest, and the Amazon river, the world's largest river. However, with this incredible wealth of ecology also comes great responsibility. In the past decade, roughly 80,000 square kilometers of the Brazilian Amazon were destroyed. And, as of late 2019, there were at least 1,000 threatened species in Brazil alone.
states in U.S. Ranked by Hispanic Pacific Islander Population // 2025...
neilsberg.com
csv, json
Updated Feb 11, 2025
Cite
Neilsberg Research (2025). states in U.S. Ranked by Hispanic Pacific Islander Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/states-in-united-states-by-hispanic-pacific-islander-population/
Explore at:
Available download formats: csv, json
Dataset updated
Feb 11, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Variables measured
Hispanic Pacific Islander Population, Hispanic Pacific Islander Population as Percent of Total Population of states in United States, Hispanic Pacific Islander Population as Percent of Total Hispanic Pacific Islander Population of United States
Measurement technique
To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

This list ranks the 51 states in the United States by Hispanic Native Hawaiian and Other Pacific Islander (NHPI) population, as estimated by the United States Census Bureau. It also highlights population changes in each state over the past five years.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

2019-2023 American Community Survey 5-Year Estimates

2018-2022 American Community Survey 5-Year Estimates

2017-2021 American Community Survey 5-Year Estimates

2016-2020 American Community Survey 5-Year Estimates

2015-2019 American Community Survey 5-Year Estimates

Variables / Data Columns

Rank by Hispanic Pacific Islander Population: This column displays the rank of states in the United States by their Hispanic Native Hawaiian and Other Pacific Islander (NHPI) population, using the most recent ACS data available.

states: The state for which the rank is shown in the previous column.

Hispanic Pacific Islander Population: The Hispanic Pacific Islander population of the state is shown in this column.

% of Total states Population: This shows what percentage of the state's total population identifies as Hispanic Pacific Islander. Please note that the sum of all percentages may not equal 100% due to rounding of values.

% of Total U.S. Hispanic Pacific Islander Population: This shows how much of the entire United States Hispanic Pacific Islander population lives in that state. Please note that the sum of all percentages may not equal 100% due to rounding of values.

5 Year Rank Trend: This column displays the rank trend across the last 5 years.
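
The columns described above can be reproduced from raw counts. The following is a minimal sketch, not Neilsberg's actual pipeline: the state names and figures are made up for illustration, and the dictionary keys (`rank`, `pct_of_state`, `pct_of_us_group`) are hypothetical names rather than the dataset's real column headers. It follows the stated rule that geographies with no reported population are left unranked.

```python
# Hypothetical sample rows: (state, group population, total state population).
# These figures are invented for illustration and are not ACS estimates.
rows = [
    ("State A", 1200, 5_000_000),
    ("State B", 300, 1_000_000),
    ("State C", 0, 700_000),  # no reported population: excluded from the ranking
    ("State D", 500, 2_500_000),
]


def build_ranking(rows):
    """Rank geographies by group population, skipping those with none reported."""
    reported = [r for r in rows if r[1] > 0]
    national_total = sum(pop for _, pop, _ in reported)
    ordered = sorted(reported, key=lambda r: r[1], reverse=True)
    table = []
    for rank, (state, pop, total_pop) in enumerate(ordered, start=1):
        table.append({
            "rank": rank,
            "state": state,
            "population": pop,
            # Share of the state's own population that belongs to the group.
            "pct_of_state": round(100 * pop / total_pop, 3),
            # Share of the national group total that lives in this state.
            "pct_of_us_group": round(100 * pop / national_total, 1),
        })
    return table


ranking = build_ranking(rows)
for row in ranking:
    print(row)
```

Sorting on the raw count and computing both percentages afterwards keeps the rank independent of rounding, which only affects the displayed shares.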

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.
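
To make the caution above concrete, here is a small sketch of how a published ACS margin of error can be turned into a standard error and a confidence interval. It relies on the Census Bureau convention that ACS margins of error are reported at the 90% confidence level (z = 1.645). The function names and the example figures are illustrative assumptions, and this dataset may not ship margins of error at all.

```python
import math

Z_90 = 1.645  # ACS margins of error are published at the 90% confidence level


def moe_to_interval(estimate, moe, z_target=1.96):
    """Convert a 90% ACS margin of error to a standard error and a new interval.

    Returns (standard_error, lower_bound, upper_bound). The lower bound is
    floored at zero because a population count cannot be negative.
    """
    se = moe / Z_90
    half_width = z_target * se
    return se, max(0.0, estimate - half_width), estimate + half_width


def cv_percent(estimate, moe):
    """Coefficient of variation in percent; larger values mean a less reliable estimate."""
    if estimate == 0:
        return math.inf
    return 100 * (moe / Z_90) / estimate


# Hypothetical small-area estimate: 14 people with a margin of error of 12.
se, low, high = moe_to_interval(14, 12)
print(f"SE={se:.2f}, 95% interval=({low:.1f}, {high:.1f}), CV={cv_percent(14, 12):.0f}%")
```

For very small populations the interval is often wide relative to the estimate, which is why rank changes between neighboring geographies should be read with care.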

Custom data

If you need custom data for a research project, report, or presentation, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
Norway, MI Hispanic or Latino Population Distribution by Their Ancestries
neilsberg.com
csv, json
Updated Aug 18, 2023
Cite
Neilsberg Research (2023). Norway, MI Hispanic or Latino Population Distribution by Their Ancestries [Dataset]. https://www.neilsberg.com/research/datasets/6d7c1bb6-3d85-11ee-9abe-0aa64bf2eeb2/
Available download formats
csv, json
Dataset updated
Aug 18, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Michigan, Norway
Variables measured
Hispanic or Latino population with Cuban ancestry, Hispanic or Latino population with Mexican ancestry, Hispanic or Latino population with Puerto Rican ancestry, Hispanic or Latino population with Other Hispanic or Latino ancestry, Hispanic or Latino population with Cuban ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Mexican ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Puerto Rican ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Other Hispanic or Latino ancestry as Percent of Total Hispanic Population
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) Origin / Ancestry for Hispanic population and (b) respective population as a percentage of the total Hispanic population, we initially analyzed and categorized the data for each of the ancestries across the Hispanic or Latino population. It is ensured that the population estimates used in this dataset pertain exclusively to ancestries for the Hispanic or Latino population. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Hispanic or Latino population of Norway. It includes the distribution of Norway's Hispanic or Latino population by ancestry, as identified by the Census Bureau. The dataset can be used to understand the origins of the Hispanic or Latino population of Norway.

Key observations

Among the Hispanic population in Norway, regardless of race, the largest group is of Mexican origin, with a population of 14 (100% of the total Hispanic population).

[Figure: Norway Non-Hispanic population by race. Image: https://i.neilsberg.com/ch/norway-mi-population-by-race-and-ethnicity.jpeg]

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Origins for the Hispanic or Latino population include:

Mexican

Black or African American

Puerto Rican

Cuban

Other Hispanic or Latino

Variables / Data Columns

Origin: This column displays the origin of the Hispanic or Latino population of Norway.

Population: The population of the specific origin within the Hispanic or Latino population of Norway is shown in this column.

% of Total Hispanic Population: This column displays each Hispanic origin as a percentage of Norway's total Hispanic or Latino population. Please note that the sum of all percentages may not equal 100% due to rounding of values.
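
The rounding note above is easy to see with a short sketch. The helper name `percent_shares` and the three-way split are hypothetical. The first call uses the single-origin case reported under Key observations (14 people, all of Mexican origin).

```python
def percent_shares(counts, ndigits=1):
    """Return each group's share of the total as a rounded percentage."""
    total = sum(counts.values())
    if total == 0:
        return {origin: 0.0 for origin in counts}
    return {origin: round(100 * n / total, ndigits) for origin, n in counts.items()}


# The case reported above: 14 people, all of Mexican origin.
norway = percent_shares({
    "Mexican": 14,
    "Puerto Rican": 0,
    "Cuban": 0,
    "Other Hispanic or Latino": 0,
})

# Hypothetical even three-way split: each share rounds to 33.3,
# so the rounded shares add up to slightly less than 100.
even_split = percent_shares({"Mexican": 1, "Puerto Rican": 1, "Cuban": 1})

print(norway)
print(even_split, round(sum(even_split.values()), 1))
```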

Good to know

Margin of Error

Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.

Custom data

If you need custom data for a research project, report, or presentation, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

The Neilsberg Research team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is part of the main dataset for Norway Population by Race & Ethnicity.
messirve
huggingface.co
Cite
Spanish Info Retrieval, messirve [Dataset]. https://huggingface.co/datasets/spanish-ir/messirve
Available formats
Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Dataset authored and provided by
Spanish Info Retrieval
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
July 2025 UPDATE: We released version 1.1, adding almost 200k new queries 🎉🎉🎉. Use with:

    import datasets

    country = "full"  # "ar", "bo", ...
    version = "1.1"
    dataset = datasets.load_dataset("spanish-ir/messirve", country, revision=version)
    print(dataset)

Dataset Card for MessIRve

MessIRve is a large-scale dataset for Spanish IR, designed to better capture the information needs of Spanish speakers across different countries. Queries are obtained from Google's autocomplete API… See the full description on the dataset page: https://huggingface.co/datasets/spanish-ir/messirve.
Spanish (Spain) Call Center Data for Delivery & Logistics AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Spanish (Spain) Call Center Data for Delivery & Logistics AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/delivery-call-center-conversation-spanish-spain
Available download formats
wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
Spain
Dataset funded by
FutureBeeAI
Description
Introduction
This Spanish Call Center Speech Dataset for the Delivery and Logistics industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Spanish-speaking customers. With over 30 hours of real-world, unscripted call center audio, this dataset captures authentic delivery-related conversations essential for training high-performance ASR models.
Curated by FutureBeeAI, this dataset empowers AI teams, logistics tech providers, and NLP researchers to build accurate, production-ready models for customer support automation in delivery and logistics.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Spanish speakers. Captured across various delivery and logistics service scenarios, these conversations cover everything from order tracking to missed delivery resolutions, offering a rich, real-world training base for AI models.
•Participant Diversity:
•Speakers: 60 native Spanish speakers from our verified contributor pool.
•Regions: Multiple provinces of Spain for accent and dialect diversity.
•Participant Profile: Gender distribution of 60% male and 40% female, with ages ranging from 18 to 70.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted customer-agent dialogues.
•Call Duration: 5 to 15 minutes on average.
•Audio Format: Stereo WAV, 16-bit depth, recorded at 8kHz and 16kHz.
•Recording Environment: Captured in clean, noise-free, echo-free conditions.
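
The recording details listed above (stereo WAV, 16-bit, 8 kHz or 16 kHz) can be checked on delivered files with Python's standard `wave` module. This is a minimal sketch: the helper names `describe_wav`, `matches_spec`, and `write_silence` are hypothetical, and the demo writes one second of silence to a temporary file because no real file from the dataset is available here.

```python
import os
import tempfile
import wave


def describe_wav(path):
    """Read basic PCM parameters from a WAV file header."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "bit_depth": wav.getsampwidth() * 8,
            "sample_rate": rate,
            "duration_s": wav.getnframes() / rate,
        }


def matches_spec(info, allowed_rates=(8000, 16000)):
    """Check a file against the listed format: stereo, 16-bit, 8 kHz or 16 kHz."""
    return (
        info["channels"] == 2
        and info["bit_depth"] == 16
        and info["sample_rate"] in allowed_rates
    )


def write_silence(path, seconds=1, rate=8000):
    """Write a silent stereo 16-bit WAV file, used only to exercise the checks."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(2)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav.setframerate(rate)
        wav.writeframes(b"\x00" * (2 * 2 * rate * seconds))


with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.wav")
    write_silence(demo_path)
    info = describe_wav(demo_path)

print(info, matches_spec(info))
```

Running a check like this over a whole delivery is a cheap way to catch files that were resampled or downmixed before training.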

Topic Diversity
This speech corpus includes both inbound and outbound delivery-related conversations, covering varied outcomes (positive, negative, neutral) to train adaptable voice models.
•Inbound Calls:
•Order Tracking
•Delivery Complaints
•Undeliverable Addresses
•Return Process Enquiries
•Delivery Method Selection
•Order Modifications, and more
•Outbound Calls:
•Delivery Confirmations
•Subscription Offer Calls
•Incorrect Address Follow-ups
•Missed Delivery Notifications
•Delivery Feedback Surveys
•Out-of-Stock Alerts, and others
This comprehensive coverage reflects real-world logistics workflows, helping voice AI systems interpret context and intent with precision.
Transcription
All recordings come with high-quality, human-generated verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, noise)
•High transcription accuracy with word error rate under 5% via dual-layer quality checks.
These transcriptions support fast, reliable model development for Spanish voice AI applications in the delivery sector.
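
For readers unfamiliar with the word error rate quoted above, the sketch below computes it as word-level edit distance divided by the number of reference words. It is a generic illustration and not FutureBeeAI's quality-check procedure. The function name and the Spanish example sentences are made up.

```python
def word_error_rate(reference, hypothesis):
    """Word error rate: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Row-by-row Levenshtein distance over words instead of characters.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: the hypothesis drops one of six reference words.
wer = word_error_rate(
    "mi pedido no ha llegado todavía",
    "mi pedido no llegado todavía",
)
print(f"WER = {wer:.3f}")
```

A rate under 5% corresponds to fewer than one word-level error per twenty reference words.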
Metadata
Detailed metadata is included for each participant and conversation:
•Participant Metadata: ID, age, gender, region, accent, dialect.
•Conversation Metadata: Topic, call type, sentiment, sample rate, and technical attributes.

This metadata aids in training specialized models, filtering demographics, and running advanced analytics.
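
As a rough illustration of filtering on this metadata, the sketch below selects conversations by call type, sample rate, and speaker age. The record layout and every field name (`conversation_id`, `call_type`, `speaker_age`, and so on) are assumptions made for the example. The actual metadata schema is not shown in this listing and may differ.

```python
# Hypothetical metadata records; the real schema may use different field names.
records = [
    {"conversation_id": "c1", "topic": "Order Tracking", "call_type": "inbound",
     "sentiment": "neutral", "sample_rate": 8000, "speaker_age": 34},
    {"conversation_id": "c2", "topic": "Delivery Confirmations", "call_type": "outbound",
     "sentiment": "positive", "sample_rate": 16000, "speaker_age": 52},
    {"conversation_id": "c3", "topic": "Delivery Complaints", "call_type": "inbound",
     "sentiment": "negative", "sample_rate": 16000, "speaker_age": 23},
]


def select(records, **criteria):
    """Keep records whose fields equal every given criterion."""
    return [r for r in records if all(r.get(k) == v for k, v in criteria.items())]


def in_age_range(records, low, high):
    """Keep records whose speaker age falls within [low, high]."""
    return [r for r in records if low <= r["speaker_age"] <= high]


inbound_16k = select(records, call_type="inbound", sample_rate=16000)
young_inbound = in_age_range(select(records, call_type="inbound"), 18, 30)

print([r["conversation_id"] for r in inbound_16k])
print([r["conversation_id"] for r in young_inbound])
```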
Usage and Applications
This dataset
Mexican Spanish Call Center Data for Delivery & Logistics AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Mexican Spanish Call Center Data for Delivery & Logistics AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/delivery-call-center-conversation-spanish-mexico
Available download formats
wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
This Mexican Spanish Call Center Speech Dataset for the Delivery and Logistics industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Spanish-speaking customers. With over 30 hours of real-world, unscripted call center audio, this dataset captures authentic delivery-related conversations essential for training high-performance ASR models.
Curated by FutureBeeAI, this dataset empowers AI teams, logistics tech providers, and NLP researchers to build accurate, production-ready models for customer support automation in delivery and logistics.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Mexican Spanish speakers. Captured across various delivery and logistics service scenarios, these conversations cover everything from order tracking to missed delivery resolutions, offering a rich, real-world training base for AI models.
•Participant Diversity:
•Speakers: 60 native Mexican Spanish speakers from our verified contributor pool.
•Regions: Multiple states of Mexico for accent and dialect diversity.
•Participant Profile: Gender distribution of 60% male and 40% female, with ages ranging from 18 to 70.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted customer-agent dialogues.
•Call Duration: 5 to 15 minutes on average.
•Audio Format: Stereo WAV, 16-bit depth, recorded at 8kHz and 16kHz.
•Recording Environment: Captured in clean, noise-free, echo-free conditions.

Topic Diversity
This speech corpus includes both inbound and outbound delivery-related conversations, covering varied outcomes (positive, negative, neutral) to train adaptable voice models.
•Inbound Calls:
•Order Tracking
•Delivery Complaints
•Undeliverable Addresses
•Return Process Enquiries
•Delivery Method Selection
•Order Modifications, and more
•Outbound Calls:
•Delivery Confirmations
•Subscription Offer Calls
•Incorrect Address Follow-ups
•Missed Delivery Notifications
•Delivery Feedback Surveys
•Out-of-Stock Alerts, and others
This comprehensive coverage reflects real-world logistics workflows, helping voice AI systems interpret context and intent with precision.
Transcription
All recordings come with high-quality, human-generated verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, noise)
•High transcription accuracy with word error rate under 5% via dual-layer quality checks.
These transcriptions support fast, reliable model development for Spanish voice AI applications in the delivery sector.
Metadata
Detailed metadata is included for each participant and conversation:
•Participant Metadata: ID, age, gender, region, accent, dialect.
•Conversation Metadata: Topic, call type, sentiment, sample rate, and technical attributes.

This metadata aids in training specialized models, filtering demographics, and running advanced analytics.
Usage and Applications
LATAM Data Suite | 1.8M+ Sentences | Natural Language Processing (NLP) Data...
datarade.ai
Updated Jul 22, 2025
Cite
Oxford Languages (2025). LATAM Data Suite | 1.8M+ Sentences | Natural Language Processing (NLP) Data | TTS | Dictionary Display | Translation Data | LATAM Coverage [Dataset]. https://datarade.ai/data-products/latam-data-suite-1-8m-sentences-nlp-tts-dictionary-d-oxford-languages
Available download formats
.json, .xml, .csv, .xls, .mp3, .wav
Dataset updated
Jul 22, 2025
Dataset authored and provided by
Oxford Languages https://lexico.com/es
Area covered
Colombia, Bolivia (Plurinational State of), Spain, Panama, Mexico, Dominican Republic, Uruguay, Ecuador, Peru, Puerto Rico
Description
LATAM Data Suite provides high-quality datasets in Spanish, Portuguese, and American English. Ideal for NLP, AI, LLMs, translation, and education, it combines linguistic depth and regional authenticity to power scalable, multilingual language technologies.

Discover our expertly curated language datasets in the LATAM Data Suite. Compiled and annotated by language and linguistic experts, this suite offers high-quality resources tailored to your needs. This suite includes:

Monolingual and Bilingual Dictionary Data: Featuring headwords, definitions, word senses, part-of-speech (POS) tags, and semantic metadata.

Sentences: Curated examples of real-world usage with contextual annotations.

Synonyms & Antonyms: Lexical relations to support semantic search, paraphrasing, and language understanding.

Audio Data: Native speaker recordings for TTS and pronunciation modeling.

Word Lists: Frequency-ranked and thematically grouped lists.

Learn more about the datasets included in the data suite:

Portuguese Monolingual Dictionary Data

Portuguese Bilingual Dictionary Data

Spanish Monolingual Dictionary Data

Spanish Bilingual Dictionary Data

Spanish Sentences Data

Spanish Synonyms and Antonyms Data

Spanish Audio Data

Spanish Word List Data

American English Monolingual Dictionary Data

American English Synonyms and Antonyms Data

American English Pronunciations with Audio

Key Features (approximate numbers):

Portuguese Monolingual Dictionary Data

Our Portuguese monolingual dictionary covers both European and Latin American varieties, featuring clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Portuguese language.

Words: 143,600

Senses: 285,500

Example sentences: 69,300

Format: XML format

Delivery: Email (link-based file sharing)

Portuguese Bilingual Dictionary Data

The bilingual data provides translations in both directions, from English to Portuguese and from Portuguese to English. It is reviewed and updated annually by our in-house team of language experts. It offers comprehensive coverage of the language, providing a substantial volume of high-quality translated words that span both European and Latin American Portuguese varieties.

Translations: 300,000

Senses: 158,000

Example translations: 117,800

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Update frequency: annually

Spanish Monolingual Dictionary Data

Our Spanish monolingual dictionary offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.

Words: 73,000

Senses: 123,000

Example sentences: 104,000

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Update frequency: annually

Spanish Bilingual Dictionary Data

The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is reviewed and updated annually by our in-house team of language experts. It offers significant coverage of the language, providing a large volume of high-quality translated words.

Translations: 221,300

Senses: 103,500

Example sentences: 74,500

Example translations: 83,800

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Update frequency: annually

Spanish Sentences Data

Spanish sentences retrieved from a corpus, totaling approximately 20 million words, are ideal for NLP model training. The sentences provide broad coverage of Spanish-speaking countries and are tagged by country or dialect.

Sentences volume: 1,840,000

Format: XML and JSON formats

Delivery: Email (link-based file sharing) and REST API

Spanish Synonyms and Antonyms Data

This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.

Synonyms: 127,700

Antonyms: 9,500

Format: XML format

Delivery: Email (link-based file sharing)

Update frequency: annually

Spanish Audio Data (word-level)

Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.

Audio files: 20,900

Format: XLSX (for index), MP3 and WAV (audio files)

Spanish Word List Data

This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.

Wordforms: 450,000

Format: CSV and TXT formats

Delivery: Email (link-based file sharing)

American English Monolingual Dictionary Data

Our American English Monolingual Dictionary Data is the foremost au...
Trends in Hispanic Student Percentage (1991-2023): Top Of The World...
publicschoolreview.com
Updated Feb 9, 2025
Cite
Public School Review (2025). Trends in Hispanic Student Percentage (1991-2023): Top Of The World Elementary School vs. California vs. Laguna Beach Unified School District [Dataset]. https://www.publicschoolreview.com/top-of-the-world-elementary-school-profile
Dataset updated
Feb 9, 2025
Dataset authored and provided by
Public School Review
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Laguna Beach Unified School District
Description
This dataset tracks the annual Hispanic student percentage from 1991 to 2023 for Top Of The World Elementary School vs. California and Laguna Beach Unified School District.
The most spoken languages worldwide 2025
statista.com
Cite
Statista, The most spoken languages worldwide 2025 [Dataset]. https://www.statista.com/statistics/266808/the-most-spoken-languages-worldwide/
Dataset authored and provided by
Statista http://statista.com/
Time period covered
2025
Area covered
World
Description
In 2025, around 1.53 billion people worldwide spoke English either natively or as a second language, ahead of the 1.18 billion Mandarin Chinese speakers at the time of survey. Hindi and Spanish were the third and fourth most widely spoken languages that year.

Languages in the United States

The United States does not have an official language, but the country uses English, specifically American English, for legislation, regulation, and other official pronouncements. The United States is a land of immigration, and the languages spoken in the United States vary as a result of the multicultural population. The second most common language spoken in the United States is Spanish or Spanish Creole, which more than 43 million people spoke at home in 2023. There were also 3.5 million Chinese speakers (including both Mandarin and Cantonese), 1.8 million Tagalog speakers, and 1.57 million Vietnamese speakers counted in the United States that year.

Different languages at home

The percentage of people in the United States speaking a language other than English at home varies from state to state. The state with the highest percentage of population speaking a language other than English is California. About 45 percent of its population spoke a language other than English at home in 2023.
Population Estimate, Total, Hispanic or Latino (5-year estimate) in Big...
tradingeconomics.com
csv, excel, json, xml
Updated Mar 11, 2020
Cite
TRADING ECONOMICS (2020). Population Estimate, Total, Hispanic or Latino (5-year estimate) in Big Stone County, MN [Dataset]. https://tradingeconomics.com/united-states/population-estimate-of-hispanic-or-latino-persons-in-big-stone-county-mn-fed-data.html
Available download formats
csv, json, xml, excel
Dataset updated
Mar 11, 2020
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
Minnesota, Big Stone County
Description
Population Estimate, Total, Hispanic or Latino (5-year estimate) in Big Stone County, MN was 130 persons in January 2023, according to the United States Federal Reserve. Historically, Population Estimate, Total, Hispanic or Latino (5-year estimate) in Big Stone County, MN reached a record high of 130 in January 2023 and a record low of 27 in January 2009. Trading Economics provides the current actual value, a historical data chart, and related indicators for Population Estimate, Total, Hispanic or Latino (5-year estimate) in Big Stone County, MN, last updated from the United States Federal Reserve in November 2025.

