80 datasets found

d
Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset -...
archive.data.gov.my
Updated Jul 24, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2017). Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset - MAMPU [Dataset]. https://archive.data.gov.my/data/dataset/use-of-smartphones-for-phone-calls
Explore at:
Dataset updated
Jul 24, 2017
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Handphone Users Survey - Use of Smartphones for Phone Calls since 2012
d
Call Center Metrics for the Health Service System
catalog.data.gov
data.sfgov.org
+5more
Updated Mar 29, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.sfgov.org (2025). Call Center Metrics for the Health Service System [Dataset]. https://catalog.data.gov/dataset/call-center-metrics-for-the-health-service-system
Explore at:
Dataset updated
Mar 29, 2025
Dataset provided by
data.sfgov.org
Description
This dataset captures monthly data from HSS' phone system and includes metrics pertaining to Calls Answered, Average Speed of Answer, Abandonment Rate, In-person Assistance. This data supports the City's Performance Measures requirements. In April of 2023 HSS switched to a new phone system - WEBEX (Finess).
d
815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL...
datarade.ai
.json, .csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RampedUp Global Data Solutions, 815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL - RampedUp [Dataset]. https://datarade.ai/data-products/global-contact-data-personal-and-professional-840-million-rampedup-global-data-solutions
Explore at:
.json, .csvAvailable download formats
Dataset authored and provided by
RampedUp Global Data Solutions
Area covered
Greece, Haiti, Pakistan, Ireland, Chad, Uganda, Grenada, Sint Eustatius and Saba, Bolivia (Plurinational State of), United States Minor Outlying Islands
Description
Sign Up for a free trial: https://rampedup.io/sign-up-%2F-log-in - 7 Days and 50 Credits to test our quality and accuracy.

These are the fields available within the RampedUp Global dataset.

CONTACT DATA: Personal Email Address - We manage over 115 million personal email addresses Professional Email - We manage over 200 million professional email addresses Home Address - We manage over 20 million home addresses Mobile Phones - 65 million direct lines to decision makers Social Profiles - Individual Facebook, Twitter, and LinkedIn Local Address - We manage 65M locations for local office mailers, event-based marketing or face-to-face sales calls.

JOB DATA: Job Title - Standardized titles for ease of use and selection Company Name - The Contact's current employer Job Function - The Company Department associated with the job role Title Level - The Level in the Company associated with the job role Job Start Date - Identify people new to their role as a potential buyer

EMPLOYER DATA: Websites - Company Website, Root Domain, or Full Domain Addresses - Standardized Address, City, Region, Postal Code, and Country Phone - E164 phone with country code Social Profiles - LinkedIn, CrunchBase, Facebook, and Twitter

FIRMOGRAPHIC DATA: Industry - 420 classifications for categorizing the company’s main field of business Sector - 20 classifications for categorizing company industries 4 Digit SIC Code - 239 classifications and their definitions 6 Digit NAICS - 452 classifications and their definitions Revenue - Estimated revenue and bands from 1M to over 1B Employee Size - Exact employee count and bands Email Open Scores - Aggregated data at the domain level showing relationships between email opens and corporate domains. IP Address -Company level IP Addresses associated to Domains from a DNS lookup

CONSUMER DATA: Education - Alma Mater, Degree, Graduation Date Skills - Accumulated Skills associated with work experience
Interests - Known interests of contact Connections - Number of social connections. Followers - Number of social followers

Download our data dictionary: https://rampedup.io/our-data
V
Norfolk Cares Center Call Statistics
data.virginia.gov
data.norfolk.gov
csv, json, rdf, xsl
Updated Jun 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Norfolk (2025). Norfolk Cares Center Call Statistics [Dataset]. https://data.virginia.gov/dataset/norfolk-cares-center-call-statistics
Explore at:
rdf, xsl, json, csvAvailable download formats
Dataset updated
Jun 25, 2025
Dataset provided by
data.norfolk.gov
Authors
City of Norfolk
Area covered
Norfolk
Description
Norfolk Cares is the single point of contact for visitors, residents and businesses to request a City service, report a concern, seek information about Norfolk, or follow-up on a previous request. This dataset provides statistics about incoming calls to the Norfolk Cares Center. The dataset is grouped in 15-minute increments (showing only active call-taking hours) and includes statistics such as the number of incoming calls, abandoned calls, time to answer calls, call duration, and active all takers. This dataset will be updated daily.
A
Do Not Call (DNC) Reported Calls Data 2/18/19 - 2/21/19
data.amerigeoss.org
s.cnmilf.com
+1more
csv
Updated Aug 28, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States (2022). Do Not Call (DNC) Reported Calls Data 2/18/19 - 2/21/19 [Dataset]. https://data.amerigeoss.org/dataset/do-not-call-dnc-reported-calls-data-2-18-19-2-21-191
Explore at:
csvAvailable download formats
Dataset updated
Aug 28, 2022
Dataset provided by
United States
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Description
This data set includes information on Do Not Call and robocall complaints reported to the Federal Trade Commission. The data set contains information reported by consumers, including the telephone number originating the unwanted call, the date the complaint was created, the time the call was made, the consumer’s city and state locations reported, the subject of the call, and whether the call was a robocall. None of the information about the reported calls is verified.
T
Call Data
cos-data.seattle.gov
data.seattle.gov
+3more
application/rdfxml +5
Updated Jul 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Seattle (2025). Call Data [Dataset]. https://cos-data.seattle.gov/Public-Safety/Call-Data/33kz-ixgy
Explore at:
json, csv, application/rdfxml, xml, tsv, application/rssxmlAvailable download formats
Dataset updated
Jul 20, 2025
Dataset authored and provided by
City of Seattle
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Description
Please review this brief video for a better understanding of how these data are created: https://www.youtube.com/watch?v=lvTCjVHxpAU

This data represents police response activity. Each row is a record of a Call for Service (CfS) logged with the Seattle Police Department (SPD) Communications Center. Calls originated from the community and range from in progress or active emergencies to requests for problem solving. Additionally, officers will log calls from their observations of the field.

Previous versions of this data set have withheld approximately 40% of calls. This updated process will release more than 95% of all calls but we will no longer provide latitude and longitude specific location data. In an effort to safeguard the privacy of our community, calls will only be located to the “beat” level. Beats are the most granular unit of management used for patrol deployment. To learn more about patrol deployment, please visit: https://www.seattle.gov/police/about-us/about-policing/precinct-and-patrol-boundaries.

As with any data, certain conditions and qualifications apply:

1) These data are queried from the Data Analytics Platform (DAP), and updated incrementally on a daily basis. A full refresh will occur twice a year and is intended to reconcile minor changes.

2) This data set only contains records of police response. If a call is queued in the system but cleared before an officer can respond, it will not be included.

3) These data contain administrative call types. Use the “Initial” and “Final” call type to identify the calls you wish to include in your analysis.

We invite you to engage these data, ask questions and explore.
A
‘Do Not Call (DNC) Reported Calls Data 4/6/18 - 4/12/18’ analyzed by...
analyst-2.ai
Updated Apr 12, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2018). ‘Do Not Call (DNC) Reported Calls Data 4/6/18 - 4/12/18’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/data-gov-do-not-call-dnc-reported-calls-data-4-6-18-4-12-18-8d90/latest
Explore at:
Dataset updated
Apr 12, 2018
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Do Not Call (DNC) Reported Calls Data 4/6/18 - 4/12/18’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/aa701dbd-d3d9-4316-8e11-0e49c32b078a on 12 February 2022.

--- Dataset description provided by original source is as follows ---

This data set includes information on Do Not Call and robocall complaints reported to the Federal Trade Commission. The data set contains information reported by consumers, including the telephone number originating the unwanted call, the date the complaint was created, the time the call was made, the consumer’s city and state locations reported, the subject of the call, and whether the call was a robocall. None of the information about the reported calls is verified.

--- Original source retains full ownership of the source dataset ---
O
311 Service Calls
data.sanantonio.gov
csv, xlsx
Updated Jul 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
311- City Services (2025). 311 Service Calls [Dataset]. https://data.sanantonio.gov/dataset/service-calls
Explore at:
csv(7640293), csv(42759826), csv(125267698), xlsx(17346), csv(23852597), csv(5368359), csv(34726098), csv(14721574), csv(2114311), csv(717651)Available download formats
Dataset updated
Jul 27, 2025
Dataset authored and provided by
311- City Services
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
311 Customer Service connects citizens with specially-trained customer service representatives ready to assist and facilitate City services including animals, potholes, solid waste collection, property maintenance, downed trees, etc. See more Online Service Requests at www.sanantonio.gov/311. NOTE: Data represents the past 365 days of the last upload date.
F
Japanese Call Center Data for Telecom AI
futurebeeai.com
wav
Updated Aug 1, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Japanese Call Center Data for Telecom AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/telecom-call-center-conversation-japanese-japan
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
This Japanese Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Japanese-speaking telecom customers. Featuring over 40 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.
Speech Data
The dataset contains 40 hours of dual-channel call center recordings between native Japanese speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.
•Participant Diversity:
•
Speakers: 80 native Japanese speakers from our verified contributor pool.

•
Regions: Representing multiple provinces across Japan to ensure coverage of various accents and dialects.

•
Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.

•Recording Details:
•
Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.

•
Call Duration: Ranges from 5 to 15 minutes.

•
Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral ensuring broad scenario coverage for telecom AI development.
•Inbound Calls:
•Phone Number Porting
•Network Connectivity Issues
•Billing and Payments
•Technical Support
•Service Activation
•International Roaming Enquiry
•Refund Requests and Billing Adjustments
•Emergency Service Access, and others
•Outbound Calls:
•Welcome Calls & Onboarding
•Payment Reminders
•Customer Satisfaction Surveys
•Technical Updates
•Service Usage Reviews
•Network Complaint Status Calls, and more
This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, coughs)
•High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.
These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.
Metadata
Rich metadata is available for each participant and conversation:
•
Participant Metadata: ID, age, gender, accent, dialect, and location.
A
‘Do Not Call (DNC) Reported Calls Data 12/14/18 - 12/20/18’ analyzed by...
analyst-2.ai
Updated Dec 20, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2018). ‘Do Not Call (DNC) Reported Calls Data 12/14/18 - 12/20/18’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/data-gov-do-not-call-dnc-reported-calls-data-12-14-18-12-20-18-ba9b/6d47552b/?iid=001-739&v=presentation
Explore at:
Dataset updated
Dec 20, 2018
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Do Not Call (DNC) Reported Calls Data 12/14/18 - 12/20/18’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/d0fc1d53-5a49-4162-8436-836cae414b6f on 12 February 2022.

--- Dataset description provided by original source is as follows ---

This data set includes information on Do Not Call and robocall complaints reported to the Federal Trade Commission. The data set contains information reported by consumers, including the telephone number originating the unwanted call, the date the complaint was created, the time the call was made, the consumer’s city and state locations reported, the subject of the call, and whether the call was a robocall. None of the information about the reported calls is verified.

--- Original source retains full ownership of the source dataset ---
F
Punjabi Call Center Data for Healthcare AI
futurebeeai.com
wav
Updated Aug 1, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Punjabi Call Center Data for Healthcare AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/healthcare-call-center-conversation-punjabi-india
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
This Punjabi Call Center Speech Dataset for the Healthcare industry is purpose-built to accelerate the development of Punjabi speech recognition, spoken language understanding, and conversational AI systems. With 30 Hours of unscripted, real-world conversations, it delivers the linguistic and contextual depth needed to build high-performance ASR models for medical and wellness-related customer service.
Created by FutureBeeAI, this dataset empowers voice AI teams, NLP researchers, and data scientists to develop domain-specific models for hospitals, clinics, insurance providers, and telemedicine platforms.
Speech Data
The dataset features 30 Hours of dual-channel call center conversations between native Punjabi speakers. These recordings cover a variety of healthcare support topics, enabling the development of speech technologies that are contextually aware and linguistically rich.
•Participant Diversity:
•
Speakers: 60 verified native Punjabi speakers from our contributor community.

•
Regions: Diverse regions across Punjab to ensure broad dialectal representation.

•
Participant Profile: Age range of 18–70 with a gender mix of 60% male and 40% female.

•RecordingDetails:
•
Conversation Nature: Naturally flowing, unscripted conversations.

•
Call Duration: Each session ranges between 5 to 15 minutes.

•
Audio Format: WAV format, stereo, 16-bit depth at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clear conditions without background noise or echo.

Topic Diversity
The dataset spans inbound and outbound calls, capturing a broad range of healthcare-specific interactions and sentiment types (positive, neutral, negative).
•Inbound Calls:
•Appointment Scheduling
•New Patient Registration
•Surgical Consultation
•Dietary Advice and Consultations
•Insurance Coverage Inquiries
•Follow-up Treatment Requests, and more
•OutboundCalls:
•Appointment Reminders
•Preventive Care Campaigns
•Test Results & Lab Reports
•Health Risk Assessment Calls
•Vaccination Updates
•Wellness Subscription Outreach, and more
These real-world interactions help build speech models that understand healthcare domain nuances and user intent.
Transcription
Every audio file is accompanied by high-quality, manually created transcriptions in JSON format.
•Transcription Includes:
•Speaker-identified Dialogues
•Time-coded Segments
•Non-speech Annotations (e.g., silence, cough)
•High transcription accuracy with word error rate is below 5%, backed by dual-layer QA checks.
Metadata
Each conversation and speaker includes detailed metadata to support fine-tuned training and analysis.
•
Participant Metadata: ID, gender, age, region, accent, and dialect.

•
Conversation Metadata: Topic, sentiment, call type, sample rate, and technical specs.

Usage and Applications
This dataset can be used across a range of healthcare and voice AI use cases:
•
<b style="font-weight:
F
French Call Center Data for BFSI AI
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). French Call Center Data for BFSI AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/bfsi-call-center-conversation-french-france
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French
Dataset funded by
FutureBeeAI
Description
Introduction
This French Call Center Speech Dataset for the BFSI (Banking, Financial Services, and Insurance) sector is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for French-speaking customers. Featuring over 30 hours of real-world, unscripted audio, it offers authentic customer-agent interactions across a range of BFSI services to train robust and domain-aware ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI developers, financial technology teams, and NLP researchers to build high-accuracy, production-ready models across BFSI customer service scenarios.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native French speakers. Captured in realistic financial support settings, these conversations span diverse BFSI topics from loan enquiries and card disputes to insurance claims and investment options, providing deep contextual coverage for model training and evaluation.
•Participant Diversity:
•
Speakers: 60 native French speakers from our verified contributor pool.

•
Regions: Representing multiple provinces across France to ensure coverage of various accents and dialects.

•
Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.

•Recording Details:
•
Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.

•
Call Duration: Ranges from 5 to 15 minutes.

•
Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral, ensuring real-world BFSI voice coverage.
•Inbound Calls:
•Debit Card Block Request
•Transaction Disputes
•Loan Enquiries
•Credit Card Billing Issues
•Account Closure & Claims
•Policy Renewals & Cancellations
•Retirement & Tax Planning
•Investment Risk Queries, and more
•Outbound Calls:
•Loan & Credit Card Offers
•Customer Surveys
•EMI Reminders
•Policy Upgrades
•Insurance Follow-ups
•Investment Opportunity Calls
•Retirement Planning Reviews, and more
This variety ensures models trained on the dataset are equipped to handle complex financial dialogues with contextual accuracy.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•30 hours-coded Segments
•Non-speech Tags (e.g., pauses, background noise)
•High transcription accuracy with word error rate < 5% due to double-layered quality checks.
These transcriptions are production-ready, making financial domain model training faster and more accurate.
Metadata
Rich metadata is available for each participant and conversation:
•
Participant Metadata: ID, age, gender, accent, dialect, and
F
Thai Call Center Data for Retail & E-Commerce AI
futurebeeai.com
wav
Updated Aug 1, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Thai Call Center Data for Retail & E-Commerce AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/retail-call-center-conversation-thai-thailand
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
This Thai Call Center Speech Dataset for the Retail and E-commerce industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Thai speakers. Featuring over 30 hours of real-world, unscripted audio, it provides authentic human-to-human customer service conversations vital for training robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI developers, data scientists, and language model researchers to build high-accuracy, production-ready models across retail-focused use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Thai speakers. Captured in realistic scenarios, these conversations span diverse retail topics from product inquiries to order cancellations, providing a wide context range for model training and testing.
•Participant Diversity:
•
Speakers: 60 native Thai speakers from our verified contributor pool.

•
Regions: Representing multiple provinces across Thailand to ensure coverage of various accents and dialects.

•
Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.

•Recording Details:
•
Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.

•
Call Duration: Ranges from 5 to 15 minutes.

•
Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral, ensuring real-world scenario coverage.
•Inbound Calls:
•Product Inquiries
•Order Cancellations
•Refund & Exchange Requests
•Subscription Queries, and more
•Outbound Calls:
•Order Confirmations
•Upselling & Promotions
•Account Updates
•Loyalty Program Offers
•Customer Verifications, and others
Such variety enhances your model’s ability to generalize across retail-specific voice interactions.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•30 hours-coded Segments
•Non-speech Tags (e.g., pauses, cough)
•High transcription accuracy with word error rate < 5% due to double-layered quality checks.
These transcriptions are production-ready, making model training faster and more accurate.
Metadata
Rich metadata is available for each participant and conversation:
•
Participant Metadata: ID, age, gender, accent, dialect, and location.

•
Conversation Metadata: Topic, sentiment, call type, sample rate, and technical specs.

This granularity supports advanced analytics, dialect filtering, and fine-tuned model evaluation.
Usage and Applications
This dataset is ideal for a range of voice AI and NLP applications:
•
Automatic Speech Recognition (ASR): Fine-tune Thai speech-to-text systems.

•
Public customer service operations records
catalog.data.gov
Updated Jun 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DHS (2025). Public customer service operations records [Dataset]. https://catalog.data.gov/dataset/public-customer-service-operations-records-6f74b
Explore at:
Dataset updated
Jun 15, 2025
Dataset provided by
U.S. Department of Homeland Securityhttp://www.dhs.gov/
Description
Records from operating a customer call center or service center providing services to the public. Services may address a wide variety of topics such as understanding agency mission-specific functions or how to resolve technical difficulties with external-facing systems or programs. Includes:rn- incoming requests and responsesrn- trouble tickets and tracking logs rn- recordings of call center phone conversations with customers used for quality control and customer service trainingrn- system data, including customer ticket numbers and visit tracking rn- evaluations and feedback about customer servicesrn- information about customer services, such as “Frequently Asked Questions” (FAQs) and user guidesrn- reports generated from customer management datarn- complaints and commendation records; customer feedback and satisfaction surveys, including survey instruments, data, background materials, and reports.
h
glaive-function-calling-v2
huggingface.co
kaggle.com
Updated Aug 24, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Glaive AI (2023). glaive-function-calling-v2 [Dataset]. https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 24, 2023
Dataset authored and provided by
Glaive AI
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
glaiveai/glaive-function-calling-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
d
2.03 311 First-Call Resolution (summary)
catalog.data.gov
data.tempe.gov
+7more
Updated Jun 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tempe (2025). 2.03 311 First-Call Resolution (summary) [Dataset]. https://catalog.data.gov/dataset/2-03-311-first-call-resolution-summary-5048d
Explore at:
Dataset updated
Jun 21, 2025
Dataset provided by
City of Tempe
Description
The Customer Relations Center (CRC) or Tempe 311 is often the first and possibly only contact a resident has with the City. Our goal is to make each interaction as smooth and efficient as possible. To efficiently provide our residents an improved level of customer service, Tempe 311 strives to serve our residents by acting as the central connection to accessible information and government services. Our purpose is realized through our ability to resolve calls with a single point of contact. When we do this, we have met 311’s mission and provided effective customer service. Tempe 311 CRC strives to achieve Single Point of Contact (SPOC) resolution rate greater than or equal to 75% of incoming calls.This page provides data for the 311 First-Call Resolution Rate performance measure.The performance measure dashboard is available at 2.03 311 First-Call Resolution Rate.Additional InformationSource:Contact: Moncayo, KimContact E-Mail: Kim_Moncayo@tempe.govData Source Type: Accela CRM, Excel, Cisco Unified IntelligencePreparation Method: The data from every 311 call is entered into the city's Accela CRM database system. We use that information in conjunction with Cisco Unified Intelligence Center, a separate report is generated to pull out transferred and non 311 callsPublish Frequency: QuarterlyPublish Method: ManualData Dictionary
F
Malay Call Center Data for Telecom AI
futurebeeai.com
wav
Updated Aug 1, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Malay Call Center Data for Telecom AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/telecom-call-center-conversation-malay-malaysia
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
This Malay Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Malay-speaking telecom customers. Featuring over 30 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Malay speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.
•Participant Diversity:
•
Speakers: 60 native Malay speakers from our verified contributor pool.

•
Regions: Representing multiple provinces across Malaysia to ensure coverage of various accents and dialects.

•
Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.

•Recording Details:
•
Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.

•
Call Duration: Ranges from 5 to 15 minutes.

•
Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral ensuring broad scenario coverage for telecom AI development.
•Inbound Calls:
•Phone Number Porting
•Network Connectivity Issues
•Billing and Payments
•Technical Support
•Service Activation
•International Roaming Enquiry
•Refund Requests and Billing Adjustments
•Emergency Service Access, and others
•Outbound Calls:
•Welcome Calls & Onboarding
•Payment Reminders
•Customer Satisfaction Surveys
•Technical Updates
•Service Usage Reviews
•Network Complaint Status Calls, and more
This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, coughs)
•High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.
These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.
Metadata
Rich metadata is available for each participant and conversation:
•
Participant Metadata: ID, age, gender, accent, dialect, and location.
D
Call Type Suppression Mapping for Law Enforcement Dispatched Calls for...
data.sfgov.org
application/rdfxml +5
Updated May 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Call Type Suppression Mapping for Law Enforcement Dispatched Calls for Service [Dataset]. https://data.sfgov.org/dataset/Call-Type-Suppression-Mapping-for-Law-Enforcement-/s8ks-esac
Explore at:
application/rssxml, csv, xml, json, application/rdfxml, tsvAvailable download formats
Dataset updated
May 13, 2022
Description
PURPOSE Private dataset used to update the following datasets: Law Enforcement Dispatched Calls for Service: Real-Time Law Enforcement Dispatched Calls for Service: Closed

BACKGROUND Logic based on rules developed in conjunction with DEM (Michelle Geddes), POL (Jason Cunningham), and MTA and documented here.

UPDATE PROCESS To update this mapping, edit the Google Spreadsheet here, then download as a CSV and replace the data here in the portal.

NOTES 1. Rules are defined by matching the first characters of a call type (rule_type='prefix'), last characters of a call type (rule_type='suffix'), or the exact call type (rule_type='exact'). 2. Rules are applied in the following order: (1) prefix, (2) suffix, (3) exact. In case of conflict, each rule type supercedes the previous one. 3. Calls not captured by any rule will not have their geographic location suppressed in either dataset. 4. Take care to ensure that no 'prefix' rules conflict with each other, no 'suffix' rules conflict either. An example of a potential conflict: You add a rule stating that all calls beginning with 261B should only be suppressed in the real-time dataset. This would conflict with the existing rule that all 261 calls should be suppressed in both datasets. To resolve this conflict, you would need to specify the behavior for all calls that are exactly '261' as well as for each call beginning with 261A-261Z.
w
311 calls
data.wu.ac.at
csv, json, xml
Updated Jun 18, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
311, DoITT (2018). 311 calls [Dataset]. https://data.wu.ac.at/schema/bronx_lehman_cuny_edu/dDY5OS11dXNp
Explore at:
json, csv, xmlAvailable download formats
Dataset updated
Jun 18, 2018
Dataset provided by
311, DoITT
Description
All 311 Service Requests from 2010 to present. This information is automatically updated daily.

Click here to download data from 2011 - https://data.cityofnewyork.us/dataset/311-Service-Requests-From-2011/fpz8-jqf4

Click here to download data from 2012 - https://data.cityofnewyork.us/dataset/311-Service-Requests-From-2012/as38-8eb5

Click here to download data from 2013 - https://data.cityofnewyork.us/dataset/311-Service-Requests-From-2013/hybb-af8n

Click here to download data from 2014 - https://data.cityofnewyork.us/dataset/311-Service-Requests-From-2014/vtzg-7562

Click here to download data from 2015 - https://data.cityofnewyork.us/dataset/311-Service-Requests-From-2015/57g5-etyj
G
Study on fraud calls in Canada
open.canada.ca
ouvert.canada.ca
html
Updated Dec 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Canadian Radio-television and Telecommunications Commission (2020). Study on fraud calls in Canada [Dataset]. https://open.canada.ca/data/en/dataset/2432bf42-99a1-4c84-a897-08211d889cd6
Explore at:
htmlAvailable download formats
Dataset updated
Dec 9, 2020
Dataset provided by
Canadian Radio-television and Telecommunications Commission
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Area covered
Canada
Description
Briefing materials for the presentation to the Standing Committee on Industry, Science and Technology on the subject of Study on fraud calls in Canada

Facebook

Twitter

Click to copy link

Link copied

Cite

(2017). Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset - MAMPU [Dataset]. https://archive.data.gov.my/data/dataset/use-of-smartphones-for-phone-calls

Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset - MAMPU

Explore at:

Dataset updated

Jul 24, 2017

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Handphone Users Survey - Use of Smartphones for Phone Calls since 2012

Clear search

Close search

Google apps

Main menu

Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset -...

Call Center Metrics for the Health Service System

815 Million Global Contact Data - B2B / Email / Mobile Phone / LinkedIn URL...

Norfolk Cares Center Call Statistics

Do Not Call (DNC) Reported Calls Data 2/18/19 - 2/21/19

Call Data

‘Do Not Call (DNC) Reported Calls Data 4/6/18 - 4/12/18’ analyzed by...

311 Service Calls

Japanese Call Center Data for Telecom AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

‘Do Not Call (DNC) Reported Calls Data 12/14/18 - 12/20/18’ analyzed by...

Punjabi Call Center Data for Healthcare AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

Usage and Applications

French Call Center Data for BFSI AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

Thai Call Center Data for Retail & E-Commerce AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

Usage and Applications

Public customer service operations records

glaive-function-calling-v2

2.03 311 First-Call Resolution (summary)

Malay Call Center Data for Telecom AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

Call Type Suppression Mapping for Law Enforcement Dispatched Calls for...

311 calls

Study on fraud calls in Canada

Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset - MAMPUSee More Versions

Handphone Users Survey - Use of Smartphones for Phone Calls - Dataset - MAMPU