48 datasets found

Population of Montréal in Canada 2021, by official language spoken and...
statista.com
Updated Jan 23, 2025
Cite
Statista (2025). Population of Montréal in Canada 2021, by official language spoken and gender [Dataset]. https://www.statista.com/statistics/1339075/population-montreal-canada-official-language-spoken-gender/
Dataset updated
Jan 23, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2021
Area covered
Canada
Description
In 2021, French was the first language spoken by over 71 percent of the population of Montréal, Québec in Canada. 20.4 percent of the city's residents had English as their first language, 6.7 percent used both English and French as their primary language, and 1.6 percent of the population spoke another language. That same year, 46.4 percent of people living in the province of Québec could speak both English and French.
Rate of English–French bilingualism in Québec and Canada 1971-2021
statista.com
Updated Jul 9, 2025
Cite
Statista (2025). Rate of English–French bilingualism in Québec and Canada 1971-2021 [Dataset]. https://www.statista.com/statistics/1338881/rate-english-french-bilingualism-quebec-canada/
Dataset updated
Jul 9, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
Canada
Description
Over the past fifty years, the proportion of Quebecers speaking both English and French has increased steadily, from **** percent in 1971 to almost half the population (**** percent) in 2021. The rate of English-French bilingualism, on the other hand, has declined in the rest of the country: outside Quebec, just over ten percent of people were bilingual in English and French in 2001, compared to *** percent two decades later.
Population by knowledge of official languages and geography, 1951 to 2021
www150.statcan.gc.ca
open.canada.ca
Updated Mar 21, 2023
Cite
Government of Canada, Statistics Canada (2023). Population by knowledge of official languages and geography, 1951 to 2021 [Dataset]. http://doi.org/10.25318/1510000401-eng
Unique identifier
https://doi.org/10.25318/1510000401-eng
Dataset updated
Mar 21, 2023
Dataset provided by
Statistics Canada (https://statcan.gc.ca/en)
Area covered
Canada
Description
Data on the knowledge of official languages by the population of Canada and Canada outside Quebec, and of all provinces and territories, for Census years 1951 to 2021.
Type and level of French program attended and number of years of primary or...
open.canada.ca
data.urbandatacentre.ca
+2more
csv, html, xml
Updated Aug 2, 2024
Cite
Statistics Canada (2024). Type and level of French program attended and number of years of primary or secondary schooling in a regular French program in a French-language school by mother tongue: Canada outside of Quebec, provinces and territories, census divisions and census subdivisions [Dataset]. https://open.canada.ca/data/dataset/820bf55e-f91e-43fb-b819-97721b0b18a6
Available download formats: html, csv, xml
Dataset updated
Aug 2, 2024
Dataset provided by
Statistics Canada
License
Open Government Licence - Canada 2.0 (https://open.canada.ca/en/open-government-licence-canada)
License information was derived automatically
Area covered
Quebec, French, Canada
Description
Data on type and level of French program attended, number of years of primary or secondary schooling in a regular French program in a French-language school and mother tongue for the population outside of Quebec, in private households in Canada outside of Quebec, provinces and territories, census divisions and census subdivisions.
Languages in Canada 2022
statista.com
Cite
Statista, Languages in Canada 2022 [Dataset]. https://www.statista.com/statistics/271218/languages-in-canada/
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2022
Area covered
Canada
Description
The statistic reflects the distribution of languages in Canada in 2022. In 2022, 87.1 percent of the total population in Canada spoke English as their native tongue.
Population by first official language spoken and geography, 1971 to 2016,...
www150.statcan.gc.ca
open.canada.ca
+1more
Updated Mar 6, 2019
Cite
Government of Canada, Statistics Canada (2019). Population by first official language spoken and geography, 1971 to 2016, inactive [Dataset]. http://doi.org/10.25318/1510000501-eng
Unique identifier
https://doi.org/10.25318/1510000501-eng
Dataset updated
Mar 6, 2019
Dataset provided by
Government of Canada, Statistics Canada
Area covered
Canada
Description
Data on the first official language spoken of the population of Canada and Canada outside Quebec, and of all provinces and territories, for Census years 1971 to 2016.
Canadian French General Conversation Speech Dataset for ASR
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Canadian French General Conversation Speech Dataset for ASR [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/general-conversation-canadian-french
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French, Canada
Dataset funded by
FutureBeeAI
Description
Introduction
Welcome to the Canadian French General Conversation Speech Dataset — a rich, linguistically diverse corpus purpose-built to accelerate the development of French speech technologies. This dataset is designed to train and fine-tune ASR systems, spoken language understanding models, and generative voice AI tailored to real-world Canadian French communication.
Curated by FutureBeeAI, this 30-hour dataset offers unscripted, spontaneous two-speaker conversations across a wide array of real-life topics. It enables researchers, AI developers, and voice-first product teams to build robust, production-grade French speech models that understand and respond to authentic Canadian accents and dialects.
Speech Data
The dataset comprises 30 hours of high-quality audio, featuring natural, free-flowing dialogue between native speakers of Canadian French. These sessions range from informal daily talks to deeper, topic-specific discussions, ensuring variability and context richness for diverse use cases.
•Participant Diversity:
•Speakers: 60 verified native Canadian French speakers from FutureBeeAI’s contributor community.
•Regions: Representing various provinces of Canada to ensure dialectal diversity and demographic balance.
•Demographics: A balanced gender ratio (60% male, 40% female) with participant ages ranging from 18 to 70 years.
•Recording Details:
•Conversation Style: Unscripted, spontaneous peer-to-peer dialogues.
•Duration: Each conversation ranges from 15 to 60 minutes.
•Audio Format: Stereo WAV files, 16-bit depth, recorded at 16kHz sample rate.
•Environment: Quiet, echo-free settings with no background noise.
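The audio parameters listed above (stereo, 16-bit, 16 kHz) can be checked on a downloaded file before training. The following is a minimal sketch using Python's standard-library `wave` module. The function names and the in-memory test file are illustrative assumptions and are not part of the FutureBeeAI dataset or its tooling:

```python
import io
import wave


def describe_wav(source):
    """Return (channels, bit_depth, sample_rate_hz, duration_s) for a WAV file.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        channels = wav.getnchannels()
        bit_depth = wav.getsampwidth() * 8  # sample width is reported in bytes
        rate = wav.getframerate()
        duration = wav.getnframes() / rate
    return channels, bit_depth, rate, duration


def matches_listing(source, expected_rate=16000):
    """Check a file against the listed format: stereo, 16-bit, given sample rate."""
    channels, bit_depth, rate, _ = describe_wav(source)
    return channels == 2 and bit_depth == 16 and rate == expected_rate


def make_silent_wav(channels=2, sample_width=2, rate=16000, seconds=1):
    """Build a silent in-memory WAV file (hypothetical stand-in for a real recording)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(rate)
        wav.writeframes(b"\x00" * channels * sample_width * rate * seconds)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(describe_wav(make_silent_wav()))
    print(matches_listing(make_silent_wav()))
```

For a real recording, pass the path of the downloaded `.wav` file to `matches_listing` instead of the synthetic buffer.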

Topic Diversity
The dataset spans a wide variety of everyday and domain-relevant themes. This topic diversity ensures the resulting models are adaptable to broad speech contexts.
•Sample Topics Include:
•Family & Relationships
•Food & Recipes
•Education & Career
•Healthcare Discussions
•Social Issues
•Technology & Gadgets
•Travel & Local Culture
•Shopping & Marketplace Experiences, and many more.
Transcription
Each audio file is paired with a human-verified, verbatim transcription available in JSON format.
•Transcription Highlights:
•Speaker-segmented dialogues
•Time-coded utterances
•Non-speech elements (pauses, laughter, etc.)
•High transcription accuracy (average WER < 5%), achieved through a double QA pass
These transcriptions are production-ready, enabling seamless integration into ASR model pipelines or conversational AI workflows.
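The listing quotes a word error rate (WER) below 5% without describing how it is computed. For reference, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The following is a minimal sketch, assuming whitespace tokenization and exact word matching. The function name and the sample sentences are hypothetical and are not taken from the dataset:

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One dropped word out of four reference words.
    print(word_error_rate("bonjour comment ça va", "bonjour comment va"))
```

Under this definition, a WER below 5% corresponds to fewer than one word-level error per 20 reference words.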
Metadata
The dataset comes with granular metadata for both speakers and recordings:
•Speaker Metadata: Age, gender, accent, dialect, state/province, and participant ID.
•Recording Metadata: Topic, duration, audio format, device type, and sample rate.

Such metadata helps developers fine-tune model training and supports use-case-specific filtering or demographic analysis.
Usage and Applications
This dataset is a versatile resource for multiple French speech and language AI applications:
•ASR Development: Train accurate speech-to-text systems for Canadian French.
•Voice Assistants: Build smart assistants capable of understanding natural Canadian conversations.

Population by first official language spoken, immigrant status, period of...
www150.statcan.gc.ca
ouvert.canada.ca
+1more
Updated Jan 23, 2024
Cite
Government of Canada, Statistics Canada (2024). Population by first official language spoken, immigrant status, period of immigration and geography, 1971 to 2021 [Dataset]. http://doi.org/10.25318/1510003601-eng
Unique identifier
https://doi.org/10.25318/1510003601-eng
Dataset updated
Jan 23, 2024
Dataset provided by
Government of Canada (http://www.gg.ca/)
Statistics Canada (https://statcan.gc.ca/en)
Area covered
Canada
Description
First official language spoken by immigrant status and period of immigration for the population of Canada and Canada outside Quebec, and of all provinces and territories, for Census years 1971 to 2021.
Population of Montréal in Canada 2021, by knowledge of official languages...
statista.com
Updated Jan 23, 2025
Cite
Statista (2025). Population of Montréal in Canada 2021, by knowledge of official languages and gender [Dataset]. https://www.statista.com/statistics/1338899/population-montreal-canada-knowledge-official-languages-gender/
Dataset updated
Jan 23, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2021
Area covered
Canada
Description
In 2021, most of the population of the city of Montreal, located in the Canadian province of Quebec, could speak both English and French. In fact, approximately 1.23 million men and 1.68 million women were bilingual. Of those who spoke only one of the official languages, the majority (1.43 million people) spoke only French. In addition, more than 68,400 people did not know either language, with women outnumbering men.
English spoken at home by French spoken at home, Indigenous language spoken...
data.urbandatacentre.ca
Updated Oct 1, 2024
Cite
(2024). English spoken at home by French spoken at home, Indigenous language spoken at home, other non-official language spoken at home and mother tongue: Canada, provinces and territories, census divisions and census subdivisions - Catalogue - Canadian Urban Data Catalogue (CUDC) [Dataset]. https://data.urbandatacentre.ca/dataset/gov-canada-f64aba6f-7056-4cfe-8144-a350aa5f78eb
Dataset updated
Oct 1, 2024
License
Open Government Licence - Canada 2.0 (https://open.canada.ca/en/open-government-licence-canada)
License information was derived automatically
Area covered
French, Canada
Description
Data on English spoken at home by French spoken at home, Indigenous language spoken at home, other non-official language spoken at home, mother tongue and gender for the population excluding institutional residents for Canada, provinces and territories, census divisions and census subdivisions.
Canadian French Call Center Data for Telecom AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Canadian French Call Center Data for Telecom AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/telecom-call-center-conversation-french-canada
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French, Canada
Dataset funded by
FutureBeeAI
Description
Introduction
This Canadian French Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for French-speaking telecom customers. Featuring over 30 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Canadian French speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.
•Participant Diversity:
•Speakers: 60 native Canadian French speakers from our verified contributor pool.
•Regions: Representing multiple provinces across Canada to ensure coverage of various accents and dialects.
•Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.
•Call Duration: Ranges from 5 to 15 minutes.
•Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.
•Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes (positive, negative, and neutral), ensuring broad scenario coverage for telecom AI development.
•Inbound Calls:
•Phone Number Porting
•Network Connectivity Issues
•Billing and Payments
•Technical Support
•Service Activation
•International Roaming Enquiry
•Refund Requests and Billing Adjustments
•Emergency Service Access, and others
•Outbound Calls:
•Welcome Calls & Onboarding
•Payment Reminders
•Customer Satisfaction Surveys
•Technical Updates
•Service Usage Reviews
•Network Complaint Status Calls, and more
This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, coughs)
•High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.
These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.
Metadata
Rich metadata is available for each participant and conversation:
•Participant Metadata: ID, age, gender, accent, dialect, and location.

English Spoken at Home (7), French Spoken at Home (7), Aboriginal Language...
data.urbandatacentre.ca
Updated Oct 1, 2024
Cite
(2024). English Spoken at Home (7), French Spoken at Home (7), Aboriginal Language Spoken at Home (7), Immigrant Language Spoken at Home (7), Mother Tongue (10), Age (15A) and Sex (3) for the Population Excluding Institutional Residents of Canada, Provinces and Territories, Census Metropolitan Areas and Census Agglomerations, 2016 Census - 100% Data - Catalogue - Canadian Urban Data Catalogue (CUDC) [Dataset]. https://data.urbandatacentre.ca/dataset/gov-canada-66011e02-2782-4b4d-806d-87bcf5459cf1
Dataset updated
Oct 1, 2024
License
Open Government Licence - Canada 2.0 (https://open.canada.ca/en/open-government-licence-canada)
License information was derived automatically
Area covered
French, Canada
Description
This table is part of a series of tables that present a portrait of Canada based on the various census topics. The tables range in complexity and levels of geography. Content varies from a simple overview of the country to complex cross-tabulations; the tables may also cover several censuses.
English Spoken at Home (7), French Spoken at Home (7), Aboriginal Language...
open.canada.ca
html, xml
Updated Feb 23, 2022
Cite
Statistics Canada (2022). English Spoken at Home (7), French Spoken at Home (7), Aboriginal Language Spoken at Home (7), Immigrant Language Spoken at Home (7), Mother Tongue (10), Age (15A) and Sex (3) for the Population Excluding Institutional Residents of Canada, Provinces and Territories, Census Metropolitan Areas and Census Agglomerations, 2016 Census - 100% Data [Dataset]. https://open.canada.ca/data/en/dataset/66011e02-2782-4b4d-806d-87bcf5459cf1
Available download formats: xml, html
Dataset updated
Feb 23, 2022
Dataset provided by
Statistics Canada (https://statcan.gc.ca/en)
License
Open Government Licence - Canada 2.0 (https://open.canada.ca/en/open-government-licence-canada)
License information was derived automatically
Time period covered
May 10, 2016 - May 10, 2017
Area covered
French, Canada
Description
This table is part of a series of tables that present a portrait of Canada based on the various census topics. The tables range in complexity and levels of geography. Content varies from a simple overview of the country to complex cross-tabulations; the tables may also cover several censuses.
Statistics Canada, Population by Language Spoken at Home by Census Division,...
geocommons.com
Updated Jul 3, 2008
Cite
Brendan (2008). Statistics Canada, Population by Language Spoken at Home by Census Division, Alberta-Canada, 2006 [Dataset]. http://geocommons.com/search.html
Dataset updated
Jul 3, 2008
Dataset provided by
Brendan
Statistics Canada
Description
This dataset displays information regarding the language spoken most often at home. This data is available on the Census Division level, and is available from the 2006 Canadian Census. This data was obtained through: Statistics Canada. This data refers to the language spoken most often at home by the individual at the time of the census. Other languages spoken at home on a regular basis were also collected. Included are population figures for the following attributes: Total Population, English, French, Non-Official, English and French, English and Non-Official Language, French and Non-Official Language, and English French and Non-Official Speaking. This data is also broken down by Age Group.
Canadian French Call Center Data for BFSI AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Canadian French Call Center Data for BFSI AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/bfsi-call-center-conversation-french-canada
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French, Canada
Dataset funded by
FutureBeeAI
Description
Introduction
This Canadian French Call Center Speech Dataset for the BFSI (Banking, Financial Services, and Insurance) sector is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for French-speaking customers. Featuring over 30 hours of real-world, unscripted audio, it offers authentic customer-agent interactions across a range of BFSI services to train robust and domain-aware ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI developers, financial technology teams, and NLP researchers to build high-accuracy, production-ready models across BFSI customer service scenarios.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Canadian French speakers. Captured in realistic financial support settings, these conversations span diverse BFSI topics from loan enquiries and card disputes to insurance claims and investment options, providing deep contextual coverage for model training and evaluation.
•Participant Diversity:
•Speakers: 60 native Canadian French speakers from our verified contributor pool.
•Regions: Representing multiple provinces across Canada to ensure coverage of various accents and dialects.
•Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.
•Call Duration: Ranges from 5 to 15 minutes.
•Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.
•Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral, ensuring real-world BFSI voice coverage.
•Inbound Calls:
•Debit Card Block Request
•Transaction Disputes
•Loan Enquiries
•Credit Card Billing Issues
•Account Closure & Claims
•Policy Renewals & Cancellations
•Retirement & Tax Planning
•Investment Risk Queries, and more
•Outbound Calls:
•Loan & Credit Card Offers
•Customer Surveys
•EMI Reminders
•Policy Upgrades
•Insurance Follow-ups
•Investment Opportunity Calls
•Retirement Planning Reviews, and more
This variety ensures models trained on the dataset are equipped to handle complex financial dialogues with contextual accuracy.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, background noise)
•High transcription accuracy with word error rate < 5% due to double-layered quality checks.
These transcriptions are production-ready, making financial domain model training faster and more accurate.
Metadata
Rich metadata is available for each participant and conversation:
•Participant Metadata: ID, age, gender,
Population of Montréal in Canada 2021, by mother tongue
statista.com
Updated Jan 23, 2025
Cite
Statista (2025). Population of Montréal in Canada 2021, by mother tongue [Dataset]. https://www.statista.com/statistics/1339083/population-montreal-canada-mother-tongue/
Dataset updated
Jan 23, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2021
Area covered
Canada
Description
According to the Canadian government, approximately 2.54 million people residing in Montreal, in the province of Quebec, had French as their mother tongue in 2021. About 474,730 residents had English, the second official language, as their birth language. However, there were more people that year (522,255) whose mother tongue was an Indo-European language, such as German, Russian or Polish.
Canadian French Call Center Data for Realestate AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Canadian French Call Center Data for Realestate AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/realestate-call-center-conversation-french-canada
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French, Canada
Dataset funded by
FutureBeeAI
Description
Introduction
This Canadian French Call Center Speech Dataset for the Real Estate industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for French-speaking Real Estate customers. With over 30 hours of unscripted, real-world audio, this dataset captures authentic conversations between customers and real estate agents, ideal for building robust ASR models.
Curated by FutureBeeAI, this dataset equips voice AI developers, real estate tech platforms, and NLP researchers with the data needed to create high-accuracy, production-ready models for property-focused use cases.
Speech Data
The dataset features 30 hours of dual-channel call center recordings between native Canadian French speakers. Captured in realistic real estate consultation and support contexts, these conversations span a wide array of property-related topics, from inquiries to investment advice, offering deep domain coverage for AI model development.
•Participant Diversity:
•Speakers: 60 native Canadian French speakers from our verified contributor community.
•Regions: Representing different provinces across Canada to ensure accent and dialect variation.
•Participant Profile: Balanced gender mix (60% male, 40% female) and age range from 18 to 70.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted agent-customer discussions.
•Call Duration: Average 5–15 minutes per call.
•Audio Format: Stereo WAV, 16-bit, recorded at 8kHz and 16kHz.
•Recording Environment: Captured in noise-free and echo-free conditions.

Topic Diversity
This speech corpus includes both inbound and outbound calls, featuring positive, neutral, and negative outcomes across a wide range of real estate scenarios.
•Inbound Calls:
•Property Inquiries
•Rental Availability
•Renovation Consultation
•Property Features & Amenities
•Investment Property Evaluation
•Ownership History & Legal Info, and more
•Outbound Calls:
•New Listing Notifications
•Post-Purchase Follow-ups
•Property Recommendations
•Value Updates
•Customer Satisfaction Surveys, and others
Such domain-rich variety ensures model generalization across common real estate support conversations.
Transcription
All recordings are accompanied by precise, manually verified transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., background noise, pauses)
•High transcription accuracy with word error rate below 5% via dual-layer human review.
These transcriptions streamline ASR and NLP development for French real estate voice applications.
Metadata
Detailed metadata accompanies each participant and conversation:
•Participant Metadata: ID, age, gender, location, accent, and dialect.
•Conversation Metadata: Topic, call type, sentiment, sample rate, and technical details.

This enables smart filtering, dialect-focused model training, and structured dataset exploration.
Usage and Applications
This dataset is ideal for voice AI and NLP systems built for the real estate sector:
Canadian French Call Center Data for Retail & E-Commerce AI
futurebeeai.com
wav
Updated Aug 1, 2022
Cite
FutureBee AI (2022). Canadian French Call Center Data for Retail & E-Commerce AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/retail-call-center-conversation-french-canada
Available download formats: wav
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
French, Canada
Dataset funded by
FutureBeeAI
Description
Introduction
This Canadian French Call Center Speech Dataset for the Retail and E-commerce industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for French speakers. Featuring over 30 hours of real-world, unscripted audio, it provides authentic human-to-human customer service conversations vital for training robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI developers, data scientists, and language model researchers to build high-accuracy, production-ready models across retail-focused use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Canadian French speakers. Captured in realistic scenarios, these conversations span diverse retail topics from product inquiries to order cancellations, providing a wide context range for model training and testing.
•Participant Diversity:
•Speakers: 60 native Canadian French speakers from our verified contributor pool.
•Regions: Representing multiple provinces across Canada to ensure coverage of various accents and dialects.
•Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.
•Recording Details:
•Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.
•Call Duration: Ranges from 5 to 15 minutes.
•Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.
•Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral, ensuring real-world scenario coverage.
•Inbound Calls:
•Product Inquiries
•Order Cancellations
•Refund & Exchange Requests
•Subscription Queries, and more
•Outbound Calls:
•Order Confirmations
•Upselling & Promotions
•Account Updates
•Loyalty Program Offers
•Customer Verifications, and others
Such variety enhances your model’s ability to generalize across retail-specific voice interactions.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, cough)
•High transcription accuracy with word error rate < 5% due to double-layered quality checks.
These transcriptions are production-ready, making model training faster and more accurate.
Metadata
Rich metadata is available for each participant and conversation:
•Participant Metadata: ID, age, gender, accent, dialect, and location.
•Conversation Metadata: Topic, sentiment, call type, sample rate, and technical specs.

This granularity supports advanced analytics, dialect filtering, and fine-tuned model evaluation.
Usage and Applications
This dataset is ideal for a range of voice AI and NLP applications:
•Automatic Speech Recognition (ASR): Fine-tune French speech-to-text systems.

Selected Demographic, Cultural, Educational, Labour Force and Income...
datasets.ai
open.canada.ca
55
Updated Aug 27, 2024
Cite
Statistics Canada | Statistique Canada (2024). Selected Demographic, Cultural, Educational, Labour Force and Income Characteristics (725), First Official Language Spoken (4) and Sex (3) for Population Having English, French or English and French as First Official Language Spoken, for Canada, Provinces, Territories, Census Divisions and Census Subdivisions, 2001 Census - 20% Sample Data [Dataset]. https://datasets.ai/datasets/3f8f670e-a143-4880-897a-d849afe7f8f2
Explore at:
55 available download formats
Dataset updated
Aug 27, 2024
Dataset provided by
Statistics Canada (https://statcan.gc.ca/en)
Authors
Statistics Canada | Statistique Canada
Area covered
French, Canada
Description
This table is part of a series of tables that present a portrait of Canada based on the various census topics. The tables range in complexity and levels of geography. Content varies from a simple overview of the country to complex cross-tabulations; the tables may also cover several censuses.
Knowledge of Language of Aboriginal Identity Population, Canada, Provinces...
data.urbandatacentre.ca
Updated Oct 1, 2024
Cite
(2024). Knowledge of Language of Aboriginal Identity Population, Canada, Provinces and Territories - Catalogue - Canadian Urban Data Catalogue (CUDC) [Dataset]. https://data.urbandatacentre.ca/dataset/ab-knowledge-of-language-of-aboriginal-identity-population-canada-provinces-and-territories
Explore at:
Dataset updated
Oct 1, 2024
Area covered
Canada
Description
This Alberta Official Statistic compares the knowledge of languages among the Aboriginal Identity population in provinces and territories, based on self-assessment of the ability to converse in the language. Based on the 2011 National Household Survey (NHS), English is the most common language known by the Aboriginal Identity Population across Canada. In most provinces, nearly 100% of the Aboriginal Identity population can converse in English. The lowest proportion of English-speaking Aboriginal people is in Quebec, where the majority speak French. The highest proportion of Aboriginal people who speak Aboriginal languages was in Nunavut at 88.6%, followed by Quebec (32.4%) and the Northwest Territories (32.1%). In Alberta, more Aboriginal people are able to speak Aboriginal languages (15.1%) than are able to speak French or other (non-Aboriginal) languages. The proportion of Alberta Aboriginal people able to speak Aboriginal languages was sixth highest among provinces and territories.



