100+ datasets found
  1. Spanish Language Datasets | 1.8M+ Sentences | NLP | TTS | Dictionary Display...

    • datarade.ai
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oxford Languages (2025). Spanish Language Datasets | 1.8M+ Sentences | NLP | TTS | Dictionary Display | Game | Translations | European & Latin Amer. Coverage [Dataset]. https://datarade.ai/data-products/spanish-language-datasets-1-8m-sentences-nlp-tts-dic-oxford-languages
    Explore at:
    .csv, .json, .mp3, .txt, .wav, .xls, .xmlAvailable download formats
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Oxford Languageshttps://www.lexico.com/
    Area covered
    Costa Rica, Chile, Honduras, Colombia, Ecuador, Nicaragua, Bolivia (Plurinational State of), Cuba, Paraguay, Panama
    Description

    Our Spanish language datasets are carefully compiled and annotated by language and linguistic experts; you can find them available for licensing:

    1. Spanish Monolingual Dictionary Data
    2. Spanish Bilingual Dictionary Data
    3. Spanish Sentences Data
    4. Synonyms and Antonyms Data
    5. Audio Data
    6. Word list Data

    Key Features (approximate numbers):

    1. Spanish Monolingual Dictionary Data

    Our Spanish monolingual reliably offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.

    • Headwords: 73,000
    • Senses: 123,000
    • Sentence examples: 104,000
    • Format: XML and JSON formats
    • Delivery: Email (link-based file sharing) and REST API
    • Updated frequency: annually
    1. Spanish Bilingual Dictionary Data

    The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is annually reviewed and updated by our in-house team of language experts. Offers significant coverage of the language, providing a large volume of translated words of excellent quality.

    • Translations: 221,300
    • Senses: 103,500
    • Example sentences: 74,500
    • Example translations: 83,800
    • Format: XML and JSON formats
    • Delivery: Email (link-based file sharing) and REST API
    • Updated frequency: annually
    1. Spanish Sentences Data

    Spanish sentences retrieved from the corpus are ideal for NLP model training, presenting approximately 20 million words. The sentences provide a great coverage of Spanish-speaking countries and are accordingly tagged to a particular country or dialect.

    • Sentences volume: 1,840,000
    • Format: XML and JSON format
    • Delivery: Email (link-based file sharing) and REST API
    1. Spanish Synonyms and Antonyms Data

    This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.

    • Synonyms: 127,700
    • Antonyms: 9,500
    • Format: XML format
    • Delivery: Email (link-based file sharing)
    • Updated frequency: annually
    1. Spanish Audio Data (word-level)

    Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.

    • Audio files: 20,900
    • Format: XLSX (for index), MP3 and WAV (audio files)
    1. Spanish Word List Data

    This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.

    • Wordforms: 450,000
    • Format: CSV and TXT formats
    • Delivery: Email (link-based file sharing)

    Use Cases:

    We consistently work with our clients on new use cases as language technology continues to evolve. These include NLP applications, TTS, dictionary display tools, games, translation, word embedding, and word sense disambiguation (WSD).

    If you have a specific use case in mind that isn't listed here, we’d be happy to explore it with you. Don’t hesitate to get in touch with us at Oxford.Languages@oup.com to start the conversation.

    Pricing:

    Oxford Languages offers flexible pricing based on use case and delivery format. Our datasets are licensed via term-based IP agreements and tiered pricing for API-delivered data. Whether you’re integrating into a product, training an LLM, or building custom NLP solutions, we tailor licensing to your specific needs.

    Contact our team or email us at Oxford.Languages@oup.com to explore pricing options and discover how our language data can support your goals.

  2. A

    The top-5000 frequent Spanish words in Twitter for 331 cities in the...

    • data.amerigeoss.org
    csv, json, rdf, xml
    Updated Nov 22, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    www.datos.gov.co (2017). The top-5000 frequent Spanish words in Twitter for 331 cities in the Spanish-speaking world [Dataset]. https://data.amerigeoss.org/id/dataset/the-top-5000-frequent-spanish-words-in-twitter-for-331-cities-in-the-spanish-speaking-world
    Explore at:
    rdf, csv, xml, jsonAvailable download formats
    Dataset updated
    Nov 22, 2017
    Dataset provided by
    www.datos.gov.co
    Description

    More than 250 million tweets in Spanish from 331 Spanish-speaking cities in Latin America, Spain and the United States were compiled from Twitter. In this data set, a column is provided with the 5000 most frequent words and one with their corresponding frequencies (the number of times the word was produced in that city) for each of the 331 cities. The reported data correspond to the years 2009 to 2016.

  3. F

    US Spanish Call Center Data for Realestate AI

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). US Spanish Call Center Data for Realestate AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/realestate-call-center-conversation-spanish-usa
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Area covered
    United States
    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    This US Spanish Call Center Speech Dataset for the Real Estate industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Spanish -speaking Real Estate customers. With over 30 hours of unscripted, real-world audio, this dataset captures authentic conversations between customers and real estate agents ideal for building robust ASR models.

    Curated by FutureBeeAI, this dataset equips voice AI developers, real estate tech platforms, and NLP researchers with the data needed to create high-accuracy, production-ready models for property-focused use cases.

    Speech Data

    The dataset features 30 hours of dual-channel call center recordings between native US Spanish speakers. Captured in realistic real estate consultation and support contexts, these conversations span a wide array of property-related topics from inquiries to investment advice offering deep domain coverage for AI model development.

    Participant Diversity:
    Speakers: 60 native US Spanish speakers from our verified contributor community.
    Regions: Representing different provinces across USA to ensure accent and dialect variation.
    Participant Profile: Balanced gender mix (60% male, 40% female) and age range from 18 to 70.
    Recording Details:
    Conversation Nature: Naturally flowing, unscripted agent-customer discussions.
    Call Duration: Average 5–15 minutes per call.
    Audio Format: Stereo WAV, 16-bit, recorded at 8kHz and 16kHz.
    Recording Environment: Captured in noise-free and echo-free conditions.

    Topic Diversity

    This speech corpus includes both inbound and outbound calls, featuring positive, neutral, and negative outcomes across a wide range of real estate scenarios.

    Inbound Calls:
    Property Inquiries
    Rental Availability
    Renovation Consultation
    Property Features & Amenities
    Investment Property Evaluation
    Ownership History & Legal Info, and more
    Outbound Calls:
    New Listing Notifications
    Post-Purchase Follow-ups
    Property Recommendations
    Value Updates
    Customer Satisfaction Surveys, and others

    Such domain-rich variety ensures model generalization across common real estate support conversations.

    Transcription

    All recordings are accompanied by precise, manually verified transcriptions in JSON format.

    Transcription Includes:
    Speaker-Segmented Dialogues
    Time-coded Segments
    Non-speech Tags (e.g., background noise, pauses)
    High transcription accuracy with word error rate below 5% via dual-layer human review.

    These transcriptions streamline ASR and NLP development for Spanish real estate voice applications.

    Metadata

    Detailed metadata accompanies each participant and conversation:

    Participant Metadata: ID, age, gender, location, accent, and dialect.
    Conversation Metadata: Topic, call type, sentiment, sample rate, and technical details.

    This enables smart filtering, dialect-focused model training, and structured dataset exploration.

    Usage and Applications

    This dataset is ideal for voice AI and NLP systems built for the real estate sector:

  4. u

    Data from: IA Tweets Analysis Dataset (Spanish)

    • produccioncientifica.uca.es
    • explore.openaire.eu
    Updated 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guerrero-Contreras, Gabriel; Balderas-Díaz, Sara; Serrano-Fernández, Alejandro; Muñoz, Andrés; Guerrero-Contreras, Gabriel; Balderas-Díaz, Sara; Serrano-Fernández, Alejandro; Muñoz, Andrés (2024). IA Tweets Analysis Dataset (Spanish) [Dataset]. https://produccioncientifica.uca.es/documentos/67321e53aea56d4af04854c2
    Explore at:
    Dataset updated
    2024
    Authors
    Guerrero-Contreras, Gabriel; Balderas-Díaz, Sara; Serrano-Fernández, Alejandro; Muñoz, Andrés; Guerrero-Contreras, Gabriel; Balderas-Díaz, Sara; Serrano-Fernández, Alejandro; Muñoz, Andrés
    Description

    Cite as

    Guerrero-Contreras, G., Balderas-Díaz, S., Serrano-Fernández, A., & Muñoz, A. (2024, June). Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights. In 2024 International Conference on Intelligent Environments (IE) (pp. 62-69). IEEE.

    General Description

    This dataset comprises 4,038 tweets in Spanish, related to discussions about artificial intelligence (AI), and was created and utilized in the publication "Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights," (10.1109/IE61493.2024.10599899) presented at the 20th International Conference on Intelligent Environments. It is designed to support research on public perception, sentiment, and engagement with AI topics on social media from a Spanish-speaking perspective. Each entry includes detailed annotations covering sentiment analysis, user engagement metrics, and user profile characteristics, among others.

    Data Collection Method

    Tweets were gathered through the Twitter API v1.1 by targeting keywords and hashtags associated with artificial intelligence, focusing specifically on content in Spanish. The dataset captures a wide array of discussions, offering a holistic view of the Spanish-speaking public's sentiment towards AI.

    Dataset Content

    ID: A unique identifier for each tweet.

    text: The textual content of the tweet. It is a string with a maximum allowed length of 280 characters.

    polarity: The tweet's sentiment polarity (e.g., Positive, Negative, Neutral).

    favorite_count: Indicates how many times the tweet has been liked by Twitter users. It is a non-negative integer.

    retweet_count: The number of times this tweet has been retweeted. It is a non-negative integer.

    user_verified: When true, indicates that the user has a verified account, which helps the public recognize the authenticity of accounts of public interest. It is a boolean data type with two allowed values: True or False.

    user_default_profile: When true, indicates that the user has not altered the theme or background of their user profile. It is a boolean data type with two allowed values: True or False.

    user_has_extended_profile: When true, indicates that the user has an extended profile. An extended profile on Twitter allows users to provide more detailed information about themselves, such as an extended biography, a header image, details about their location, website, and other additional data. It is a boolean data type with two allowed values: True or False.

    user_followers_count: The current number of followers the account has. It is a non-negative integer.

    user_friends_count: The number of users that the account is following. It is a non-negative integer.

    user_favourites_count: The number of tweets this user has liked since the account was created. It is a non-negative integer.

    user_statuses_count: The number of tweets (including retweets) posted by the user. It is a non-negative integer.

    user_protected: When true, indicates that this user has chosen to protect their tweets, meaning their tweets are not publicly visible without their permission. It is a boolean data type with two allowed values: True or False.

    user_is_translator: When true, indicates that the user posting the tweet is a verified translator on Twitter. This means they have been recognized and validated by the platform as translators of content in different languages. It is a boolean data type with two allowed values: True or False.

    Potential Use Cases

    This dataset is aimed at academic researchers and practitioners with interests in:

    Sentiment analysis and natural language processing (NLP) with a focus on AI discussions in the Spanish language.

    Social media analysis on public engagement and perception of artificial intelligence among Spanish speakers.

    Exploring correlations between user engagement metrics and sentiment in discussions about AI.

    Data Format and File Type

    The dataset is provided in CSV format, ensuring compatibility with a wide range of data analysis tools and programming environments.

    License

    The dataset is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, permitting sharing, copying, distribution, transmission, and adaptation of the work for any purpose, including commercial, provided proper attribution is given.

  5. Z

    Data from: IA Tweets Analysis Dataset (Spanish)

    • data.niaid.nih.gov
    Updated Aug 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muñoz, Andrés (2024). IA Tweets Analysis Dataset (Spanish) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10821484
    Explore at:
    Dataset updated
    Aug 3, 2024
    Dataset provided by
    Balderas-Díaz, Sara
    Serrano-Fernández, Alejandro
    Muñoz, Andrés
    Guerrero-Contreras, Gabriel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    General Description

    This dataset comprises 4,038 tweets in Spanish, related to discussions about artificial intelligence (AI), and was created and utilized in the publication "Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights," (10.1109/IE61493.2024.10599899) presented at the 20th International Conference on Intelligent Environments. It is designed to support research on public perception, sentiment, and engagement with AI topics on social media from a Spanish-speaking perspective. Each entry includes detailed annotations covering sentiment analysis, user engagement metrics, and user profile characteristics, among others.

    Data Collection Method

    Tweets were gathered through the Twitter API v1.1 by targeting keywords and hashtags associated with artificial intelligence, focusing specifically on content in Spanish. The dataset captures a wide array of discussions, offering a holistic view of the Spanish-speaking public's sentiment towards AI.

    Dataset Content

    ID: A unique identifier for each tweet.

    text: The textual content of the tweet. It is a string with a maximum allowed length of 280 characters.

    polarity: The tweet's sentiment polarity (e.g., Positive, Negative, Neutral).

    favorite_count: Indicates how many times the tweet has been liked by Twitter users. It is a non-negative integer.

    retweet_count: The number of times this tweet has been retweeted. It is a non-negative integer.

    user_verified: When true, indicates that the user has a verified account, which helps the public recognize the authenticity of accounts of public interest. It is a boolean data type with two allowed values: True or False.

    user_default_profile: When true, indicates that the user has not altered the theme or background of their user profile. It is a boolean data type with two allowed values: True or False.

    user_has_extended_profile: When true, indicates that the user has an extended profile. An extended profile on Twitter allows users to provide more detailed information about themselves, such as an extended biography, a header image, details about their location, website, and other additional data. It is a boolean data type with two allowed values: True or False.

    user_followers_count: The current number of followers the account has. It is a non-negative integer.

    user_friends_count: The number of users that the account is following. It is a non-negative integer.

    user_favourites_count: The number of tweets this user has liked since the account was created. It is a non-negative integer.

    user_statuses_count: The number of tweets (including retweets) posted by the user. It is a non-negative integer.

    user_protected: When true, indicates that this user has chosen to protect their tweets, meaning their tweets are not publicly visible without their permission. It is a boolean data type with two allowed values: True or False.

    user_is_translator: When true, indicates that the user posting the tweet is a verified translator on Twitter. This means they have been recognized and validated by the platform as translators of content in different languages. It is a boolean data type with two allowed values: True or False.

    Cite as

    Guerrero-Contreras, G., Balderas-Díaz, S., Serrano-Fernández, A., & Muñoz, A. (2024, June). Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights. In 2024 International Conference on Intelligent Environments (IE) (pp. 62-69). IEEE.

    Potential Use Cases

    This dataset is aimed at academic researchers and practitioners with interests in:

    Sentiment analysis and natural language processing (NLP) with a focus on AI discussions in the Spanish language.

    Social media analysis on public engagement and perception of artificial intelligence among Spanish speakers.

    Exploring correlations between user engagement metrics and sentiment in discussions about AI.

    Data Format and File Type

    The dataset is provided in CSV format, ensuring compatibility with a wide range of data analysis tools and programming environments.

    License

    The dataset is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, permitting sharing, copying, distribution, transmission, and adaptation of the work for any purpose, including commercial, provided proper attribution is given.

  6. F

    Colombian Spanish Call Center Data for Telecom AI

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Colombian Spanish Call Center Data for Telecom AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/telecom-call-center-conversation-spanish-colombia
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    This Colombian Spanish Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Spanish-speaking telecom customers. Featuring over 30 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.

    Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.

    Speech Data

    The dataset contains 30 hours of dual-channel call center recordings between native Colombian Spanish speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.

    Participant Diversity:
    Speakers: 60 native Colombian Spanish speakers from our verified contributor pool.
    Regions: Representing multiple provinces across Colombia to ensure coverage of various accents and dialects.
    Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.
    Recording Details:
    Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.
    Call Duration: Ranges from 5 to 15 minutes.
    Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.
    Recording Environment: Captured in clean conditions with no echo or background noise.

    Topic Diversity

    This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral ensuring broad scenario coverage for telecom AI development.

    Inbound Calls:
    Phone Number Porting
    Network Connectivity Issues
    Billing and Payments
    Technical Support
    Service Activation
    International Roaming Enquiry
    Refund Requests and Billing Adjustments
    Emergency Service Access, and others
    Outbound Calls:
    Welcome Calls & Onboarding
    Payment Reminders
    Customer Satisfaction Surveys
    Technical Updates
    Service Usage Reviews
    Network Complaint Status Calls, and more

    This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.

    Transcription

    All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.

    Transcription Includes:
    Speaker-Segmented Dialogues
    Time-coded Segments
    Non-speech Tags (e.g., pauses, coughs)
    High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.

    These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.

    Metadata

    Rich metadata is available for each participant and conversation:

    Participant Metadata: ID, age, gender, accent, dialect, and location.
    <div style="margin-top:10px; margin-bottom: 10px; padding-left: 30px; display: flex; gap:

  7. 2013 American Community Survey - Table Packages: Detailed Language Spoken in...

    • catalog.data.gov
    Updated Jul 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Census Bureau (2023). 2013 American Community Survey - Table Packages: Detailed Language Spoken in the U.S. [Dataset]. https://catalog.data.gov/dataset/2013-american-community-survey-table-packages-detailed-language-spoken-in-the-u-s
    Explore at:
    Dataset updated
    Jul 19, 2023
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Area covered
    United States
    Description

    This data set uses the 2009-2013 American Community Survey to tabulate the number of speakers of languages spoken at home and the number of speakers of each language who speak English less than very well. These tabulations are available for the following geographies: nation; each of the 50 states, plus Washington, D.C. and Puerto Rico; counties with 100,000 or more total population and 25,000 or more speakers of languages other than English and Spanish; core-based statistical areas (metropolitan statistical areas and micropolitan statistical areas) with 100,000 or more total population and 25,000 or more speakers of languages other than English and Spanish.

  8. F

    Spanish Conversation Chat Dataset for Healthcare Domain

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Spanish Conversation Chat Dataset for Healthcare Domain [Dataset]. https://www.futurebeeai.com/dataset/text-dataset/spanish-healthcare-domain-conversation-text-dataset
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    The dataset comprises over 10,000 chat conversations, each focusing on specific Healthcare related topics. Each conversation provides a detailed interaction between a call center agent and a customer, capturing real-life scenarios and language nuances.

    Participants Details: 150+ native Spanish participants from the FutureBeeAI community.
    Word Count & Length: Chats are diverse, averaging 300 to 700 words and 50 to 150 turns across both speakers.

    Topic Diversity

    The chat dataset covers a wide range of conversations on Healthcare topics, ensuring that the dataset is comprehensive and relevant for training and fine-tuning models for various Healthcare use cases. It offers diversity in terms of conversation topics, chat types, and outcomes, including both inbound and outbound chats with positive, neutral, and negative outcomes.

    Inbound Chats:
    Appointment Scheduling
    New Patient Registration
    Surgery Consultation
    Consultation regarding Diet, and many more
    Outbound Chats:
    Appointment Reminder
    Health & Wellness Subscription Programs
    Lab Test Results
    Health Risk Assessments
    Preventive Care Reminders, and many more

    Language Variety & Nuances

    The conversations in this dataset capture the diverse language styles and expressions prevalent in Spanish Healthcare interactions. This diversity ensures the dataset accurately represents the language used by Spanish speakers in Healthcare contexts.

    The dataset encompasses a wide array of language elements, including:

    Naming Conventions: Chats include a variety of Spanish personal and business names.
    Localized Details: Real-world addresses, emails, phone numbers, and other contact information as according to different Spanish-speaking regions.
    Temporal and Numeric Expressions: Dates, times, currencies, and numbers in Spanish forms, adhering to local conventions.
    Idiomatic Expressions and Slang: It includes local slang, idioms, and informal phrase present in Spanish Healthcare conversations.

    This linguistic authenticity ensures that the dataset equips researchers and developers with a comprehensive understanding of the intricate language patterns, cultural references, and communication styles inherent to Spanish Healthcare interactions.

    Conversational Flow and Interaction Types

    The dataset includes a broad range of conversations, from simple inquiries to detailed discussions, capturing the dynamic nature of Healthcare customer-agent interactions.

    Simple Inquiries
    Detailed Discussions
    Transactional Interactions
    Problem-Solving Dialogues
    Advisory Sessions
    Routine Checks and Follow-Ups

    Each of these conversations contains various aspects of conversation flow like:

    Greetings
    Authentication
    Information gathering
    Resolution identification
    Solution Delivery
    Closing and Follow-ups
    Feedback, etc

    This structured and varied conversational flow enables the creation of advanced NLP models that can effectively manage and respond to a wide range of customer service scenarios.

    Data Format and Structure

    The dataset is available in JSON, CSV, and TXT formats, with each conversation containing attributes like participant identifiers and chat

  9. N

    cities in Morris County Ranked by Hispanic Native American Population //...

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Morris County Ranked by Hispanic Native American Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-morris-county-nj-by-hispanic-native-american-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Morris County, New Jersey
    Variables measured
    Hispanic Native American Population, Hispanic Native American Population as Percent of Total Population of cities in Morris County, NJ, Hispanic Native American Population as Percent of Total Hispanic Native American Population of Morris County, NJ
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 39 cities in the Morris County, NJ by Hispanic American Indian and Alaska Native (AIAN) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Native American Population: This column displays the rank of cities in the Morris County, NJ by their Hispanic American Indian and Alaska Native (AIAN) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Native American Population: The Hispanic Native American population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Native American. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Morris County Hispanic Native American Population: This tells us how much of the entire Morris County, NJ Hispanic Native American population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  10. N

    Durham County, NC Non-Hispanic Population Breakdown By Race Dataset:...

    • neilsberg.com
    csv, json
    Updated Feb 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Durham County, NC Non-Hispanic Population Breakdown By Race Dataset: Non-Hispanic Population Counts and Percentages for 7 Racial Categories as Identified by the US Census Bureau // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/durham-county-nc-population-by-race/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 21, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Durham County, North Carolina
    Variables measured
    Non-Hispanic Asian Population, Non-Hispanic Black Population, Non-Hispanic White Population, Non-Hispanic Some other race Population, Non-Hispanic Two or more races Population, Non-Hispanic American Indian and Alaska Native Population, Non-Hispanic Native Hawaiian and Other Pacific Islander Population, Non-Hispanic Asian Population as Percent of Total Non-Hispanic Population, Non-Hispanic Black Population as Percent of Total Non-Hispanic Population, Non-Hispanic White Population as Percent of Total Non-Hispanic Population, and 4 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) Non-Hispanic population and (b) population as a percentage of the total Non-Hispanic population, we initially analyzed and categorized the data for each of the racial categories idetified by the US Census Bureau. It is ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories, and are part of Non-Hispanic classification. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the Non-Hispanic population of Durham County by race. It includes the distribution of the Non-Hispanic population of Durham County across various race categories as identified by the Census Bureau. The dataset can be utilized to understand the Non-Hispanic population distribution of Durham County across relevant racial categories.

    Key observations

    Of the Non-Hispanic population in Durham County, the largest racial group is White alone with a population of 138,134 (49.49% of the total Non-Hispanic population).

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Racial categories include:

    • White
    • Black or African American
    • American Indian and Alaska Native
    • Asian
    • Native Hawaiian and Other Pacific Islander
    • Some other race
    • Two or more races (multiracial)

    Variables / Data Columns

    • Race: This column displays the racial categories (for Non-Hispanic) for the Durham County
    • Population: The population of the racial category (for Non-Hispanic) in the Durham County is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each race as a proportion of Durham County total Non-Hispanic population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Durham County Population by Race & Ethnicity. You can refer the same here

  11. N

    cities in Coffee County Ranked by Hispanic Other Race Population // 2025...

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Coffee County Ranked by Hispanic Other Race Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-coffee-county-al-by-hispanic-other-race-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Coffee County, Alabama
    Variables measured
    Hispanic Other Race Population, Hispanic Other Race Population as Percent of Total Population of cities in Coffee County, AL, Hispanic Other Race Population as Percent of Total Hispanic Other Race Population of Coffee County, AL
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 5 cities in the Coffee County, AL by Hispanic Some Other Race (SOR) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Other Race Population: This column displays the rank of cities in the Coffee County, AL by their Hispanic Some Other Race (SOR) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Other Race Population: The Hispanic Other Race population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Other Race. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Coffee County Hispanic Other Race Population: This tells us how much of the entire Coffee County, AL Hispanic Other Race population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  12. N

    cities in Story County Ranked by Hispanic White Population // 2025 Edition

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Story County Ranked by Hispanic White Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-story-county-ia-by-hispanic-white-population/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Iowa, Story County
    Variables measured
    Hispanic White Population, Hispanic White Population as Percent of Total Population of cities in Story County, IA, Hispanic White Population as Percent of Total Hispanic White Population of Story County, IA
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 17 cities in the Story County, IA by Hispanic White population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic White Population: This column displays the rank of cities in the Story County, IA by their Hispanic White population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic White Population: The Hispanic White population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic White. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Story County Hispanic White Population: This tells us how much of the entire Story County, IA Hispanic White population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  13. N

    cities in Vermont Ranked by Hispanic Other Race Population // 2025 Edition

    • neilsberg.com
    csv, json
    Updated Feb 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Vermont Ranked by Hispanic Other Race Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-vermont-by-hispanic-other-race-population/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 13, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Vermont
    Variables measured
    Hispanic Other Race Population, Hispanic Other Race Population as Percent of Total Population of cities in Vermont, Hispanic Other Race Population as Percent of Total Hispanic Other Race Population of Vermont
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 278 cities in the Vermont by Hispanic Some Other Race (SOR) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Other Race Population: This column displays the rank of cities in the Vermont by their Hispanic Some Other Race (SOR) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Other Race Population: The Hispanic Other Race population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Other Race. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Vermont Hispanic Other Race Population: This tells us how much of the entire Vermont Hispanic Other Race population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  14. R

    Braille Iberoamericano Dataset

    • universe.roboflow.com
    zip
    Updated Oct 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edwin Hurtado (2024). Braille Iberoamericano Dataset [Dataset]. https://universe.roboflow.com/edwin-hurtado/braille-iberoamericano-dataset/model/8
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 10, 2024
    Dataset authored and provided by
    Edwin Hurtado
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Variables measured
    A Bounding Boxes
    Description

    Spanish Braille Alphabet Dataset
    This dataset will includes over 10,000 images of the Spanish Braille alphabet, featuring lowercase and uppercase letters, numbers, and punctuation marks. While the images may not be of the highest quality, they are valuable for basic Braille recognition tasks and can serve as a foundation for models designed for accessibility, text-to-Braille conversion, and educational purposes. This dataset is an essential resource for projects focused on improving accessibility for the visually impaired in Spanish-speaking regions.

  15. N

    cities in Berks County Ranked by Hispanic White Population // 2025 Edition

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Berks County Ranked by Hispanic White Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-berks-county-pa-by-hispanic-white-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Pennsylvania, Berks County
    Variables measured
    Hispanic White Population, Hispanic White Population as Percent of Total Population of cities in Berks County, PA, Hispanic White Population as Percent of Total Hispanic White Population of Berks County, PA
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 75 cities in the Berks County, PA by Hispanic White population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic White Population: This column displays the rank of cities in the Berks County, PA by their Hispanic White population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic White Population: The Hispanic White population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic White. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Berks County Hispanic White Population: This tells us how much of the entire Berks County, PA Hispanic White population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  16. N

    cities in Roger Mills County Ranked by Hispanic Other Race Population //...

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Roger Mills County Ranked by Hispanic Other Race Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-roger-mills-county-ok-by-hispanic-other-race-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Roger Mills County, Oklahoma
    Variables measured
    Hispanic Other Race Population, Hispanic Other Race Population as Percent of Total Population of cities in Roger Mills County, OK, Hispanic Other Race Population as Percent of Total Hispanic Other Race Population of Roger Mills County, OK
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 7 cities in the Roger Mills County, OK by Hispanic Some Other Race (SOR) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Other Race Population: This column displays the rank of cities in the Roger Mills County, OK by their Hispanic Some Other Race (SOR) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Other Race Population: The Hispanic Other Race population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Other Race. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Roger Mills County Hispanic Other Race Population: This tells us how much of the entire Roger Mills County, OK Hispanic Other Race population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  17. N

    cities in Story County Ranked by Hispanic Other Race Population // 2025...

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Story County Ranked by Hispanic Other Race Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-story-county-ia-by-hispanic-other-race-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Iowa, Story County
    Variables measured
    Hispanic Other Race Population, Hispanic Other Race Population as Percent of Total Population of cities in Story County, IA, Hispanic Other Race Population as Percent of Total Hispanic Other Race Population of Story County, IA
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 17 cities in the Story County, IA by Hispanic Some Other Race (SOR) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Other Race Population: This column displays the rank of cities in the Story County, IA by their Hispanic Some Other Race (SOR) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Other Race Population: The Hispanic Other Race population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Other Race. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Story County Hispanic Other Race Population: This tells us how much of the entire Story County, IA Hispanic Other Race population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  18. N

    cities in Sumter County Ranked by Hispanic Asian Population // 2025 Edition

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Sumter County Ranked by Hispanic Asian Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-sumter-county-al-by-hispanic-asian-population/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Alabama, Sumter County
    Variables measured
    Hispanic Asian Population, Hispanic Asian Population as Percent of Total Population of cities in Sumter County, AL, Hispanic Asian Population as Percent of Total Hispanic Asian Population of Sumter County, AL
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 7 cities in the Sumter County, AL by Hispanic Asian population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Asian Population: This column displays the rank of cities in the Sumter County, AL by their Hispanic Asian population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Asian Population: The Hispanic Asian population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Asian. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Sumter County Hispanic Asian Population: This tells us how much of the entire Sumter County, AL Hispanic Asian population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  19. F

    Mexican Spanish Call Center Data for Travel AI

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Mexican Spanish Call Center Data for Travel AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/travel-call-center-conversation-spanish-mexico
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Area covered
    Mexico
    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    This Mexican Spanish Call Center Speech Dataset for the Travel industry is purpose-built to power the next generation of voice AI applications for travel booking, customer support, and itinerary assistance. With over 30 hours of unscripted, real-world conversations, the dataset enables the development of highly accurate speech recognition and natural language understanding models tailored for Spanish -speaking travelers.

    Created by FutureBeeAI, this dataset supports researchers, data scientists, and conversational AI teams in building voice technologies for airlines, travel portals, and hospitality platforms.

    Speech Data

    The dataset includes 30 hours of dual-channel audio recordings between native Mexican Spanish speakers engaged in real travel-related customer service conversations. These audio files reflect a wide variety of topics, accents, and scenarios found across the travel and tourism industry.

    Participant Diversity:
    Speakers: 60 native Mexican Spanish contributors from our verified pool.
    Regions: Covering multiple Mexico provinces to capture accent and dialectal variation.
    Participant Profile: Balanced representation of age (18–70) and gender (60% male, 40% female).
    Recording Details:
    Conversation Nature: Naturally flowing, spontaneous customer-agent calls.
    Call Duration: Between 5 and 15 minutes per session.
    Audio Format: Stereo WAV, 16-bit depth, at 8kHz and 16kHz.
    Recording Environment: Captured in controlled, noise-free, echo-free settings.

    Topic Diversity

    Inbound and outbound conversations span a wide range of real-world travel support situations with varied outcomes (positive, neutral, negative).

    Inbound Calls:
    Booking Assistance
    Destination Information
    Flight Delays or Cancellations
    Support for Disabled Passengers
    Health and Safety Travel Inquiries
    Lost or Delayed Luggage, and more
    Outbound Calls:
    Promotional Travel Offers
    Customer Feedback Surveys
    Booking Confirmations
    Flight Rescheduling Alerts
    Visa Expiry Notifications, and others

    These scenarios help models understand and respond to diverse traveler needs in real-time.

    Transcription

    Each call is accompanied by manually curated, high-accuracy transcriptions in JSON format.

    Transcription Includes:
    Speaker-Segmented Dialogues
    Time-Stamped Segments
    Non-speech Markers (e.g., pauses, coughs)
    High transcription accuracy by dual-layered transcription review ensures word error rate under 5%.

    Metadata

    Extensive metadata enriches each call and speaker for better filtering and AI training:

    Participant Metadata: ID, age, gender, region, accent, and dialect.
    Conversation Metadata: Topic, domain, call type, sentiment, and audio specs.

    Usage and Applications

    This dataset is ideal for a variety of AI use cases in the travel and tourism space:

    ASR Systems: Train Spanish speech-to-text engines for travel platforms.
    <div style="margin-top:10px; margin-bottom: 10px;

  20. N

    cities in Cleveland County Ranked by Hispanic Native American Population //...

    • neilsberg.com
    csv, json
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). cities in Cleveland County Ranked by Hispanic Native American Population // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/lists/cities-in-cleveland-county-nc-by-hispanic-native-american-population/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Cleveland County, North Carolina
    Variables measured
    Hispanic Native American Population, Hispanic Native American Population as Percent of Total Population of cities in Cleveland County, NC, Hispanic Native American Population as Percent of Total Hispanic Native American Population of Cleveland County, NC
    Measurement technique
    To measure the rank and respective trends, we initially gathered data from the five most recent American Community Survey (ACS) 5-Year Estimates. We then analyzed and categorized the data for each of the racial categories identified by the U.S. Census Bureau. Based on the required racial category classification, we calculated the rank. For geographies with no population reported for the chosen race, we did not assign a rank and excluded them from the list. It is possible that a small population exists but was not reported or captured due to limitations or variations in Census data collection and reporting. We ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories and do not rely on any ethnicity classification, unless explicitly required.For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    This list ranks the 16 cities in the Cleveland County, NC by Hispanic American Indian and Alaska Native (AIAN) population, as estimated by the United States Census Bureau. It also highlights population changes in each cities over the past five years.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 5-Year Estimates, including:

    • 2019-2023 American Community Survey 5-Year Estimates
    • 2018-2022 American Community Survey 5-Year Estimates
    • 2017-2021 American Community Survey 5-Year Estimates
    • 2016-2020 American Community Survey 5-Year Estimates
    • 2015-2019 American Community Survey 5-Year Estimates

    Variables / Data Columns

    • Rank by Hispanic Native American Population: This column displays the rank of cities in the Cleveland County, NC by their Hispanic American Indian and Alaska Native (AIAN) population, using the most recent ACS data available.
    • cities: The cities for which the rank is shown in the previous column.
    • Hispanic Native American Population: The Hispanic Native American population of the cities is shown in this column.
    • % of Total cities Population: This shows what percentage of the total cities population identifies as Hispanic Native American. Please note that the sum of all percentages may not equal one due to rounding of values.
    • % of Total Cleveland County Hispanic Native American Population: This tells us how much of the entire Cleveland County, NC Hispanic Native American population lives in that cities. Please note that the sum of all percentages may not equal one due to rounding of values.
    • 5 Year Rank Trend: TThis column displays the rank trend across the last 5 years.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Oxford Languages (2025). Spanish Language Datasets | 1.8M+ Sentences | NLP | TTS | Dictionary Display | Game | Translations | European & Latin Amer. Coverage [Dataset]. https://datarade.ai/data-products/spanish-language-datasets-1-8m-sentences-nlp-tts-dic-oxford-languages
Organization logo

Spanish Language Datasets | 1.8M+ Sentences | NLP | TTS | Dictionary Display | Game | Translations | European & Latin Amer. Coverage

Explore at:
.csv, .json, .mp3, .txt, .wav, .xls, .xmlAvailable download formats
Dataset updated
Jul 11, 2025
Dataset authored and provided by
Oxford Languageshttps://www.lexico.com/
Area covered
Costa Rica, Chile, Honduras, Colombia, Ecuador, Nicaragua, Bolivia (Plurinational State of), Cuba, Paraguay, Panama
Description

Our Spanish language datasets are carefully compiled and annotated by language and linguistic experts; you can find them available for licensing:

  1. Spanish Monolingual Dictionary Data
  2. Spanish Bilingual Dictionary Data
  3. Spanish Sentences Data
  4. Synonyms and Antonyms Data
  5. Audio Data
  6. Word list Data

Key Features (approximate numbers):

  1. Spanish Monolingual Dictionary Data

Our Spanish monolingual reliably offers clear definitions and examples, a large volume of headwords, and comprehensive coverage of the Spanish language.

  • Headwords: 73,000
  • Senses: 123,000
  • Sentence examples: 104,000
  • Format: XML and JSON formats
  • Delivery: Email (link-based file sharing) and REST API
  • Updated frequency: annually
  1. Spanish Bilingual Dictionary Data

The bilingual data provides translations in both directions, from English to Spanish and from Spanish to English. It is annually reviewed and updated by our in-house team of language experts. Offers significant coverage of the language, providing a large volume of translated words of excellent quality.

  • Translations: 221,300
  • Senses: 103,500
  • Example sentences: 74,500
  • Example translations: 83,800
  • Format: XML and JSON formats
  • Delivery: Email (link-based file sharing) and REST API
  • Updated frequency: annually
  1. Spanish Sentences Data

Spanish sentences retrieved from the corpus are ideal for NLP model training, presenting approximately 20 million words. The sentences provide a great coverage of Spanish-speaking countries and are accordingly tagged to a particular country or dialect.

  • Sentences volume: 1,840,000
  • Format: XML and JSON format
  • Delivery: Email (link-based file sharing) and REST API
  1. Spanish Synonyms and Antonyms Data

This Spanish language dataset offers a rich collection of synonyms and antonyms, accompanied by detailed definitions and part-of-speech (POS) annotations, making it a comprehensive resource for building linguistically aware AI systems and language technologies.

  • Synonyms: 127,700
  • Antonyms: 9,500
  • Format: XML format
  • Delivery: Email (link-based file sharing)
  • Updated frequency: annually
  1. Spanish Audio Data (word-level)

Curated word-level audio data for the Spanish language, which covers all varieties of world Spanish, providing rich dialectal diversity in the Spanish language.

  • Audio files: 20,900
  • Format: XLSX (for index), MP3 and WAV (audio files)
  1. Spanish Word List Data

This language data contains a carefully curated and comprehensive list of 450,000 Spanish words.

  • Wordforms: 450,000
  • Format: CSV and TXT formats
  • Delivery: Email (link-based file sharing)

Use Cases:

We consistently work with our clients on new use cases as language technology continues to evolve. These include NLP applications, TTS, dictionary display tools, games, translation, word embedding, and word sense disambiguation (WSD).

If you have a specific use case in mind that isn't listed here, we’d be happy to explore it with you. Don’t hesitate to get in touch with us at Oxford.Languages@oup.com to start the conversation.

Pricing:

Oxford Languages offers flexible pricing based on use case and delivery format. Our datasets are licensed via term-based IP agreements and tiered pricing for API-delivered data. Whether you’re integrating into a product, training an LLM, or building custom NLP solutions, we tailor licensing to your specific needs.

Contact our team or email us at Oxford.Languages@oup.com to explore pricing options and discover how our language data can support your goals.

Search
Clear search
Close search
Google apps
Main menu