Facebook
TwitterThe statistic reflects the distribution of languages in Canada in 2022. In 2022, 87.1 percent of the total population in Canada spoke English as their native tongue.
Facebook
TwitterIn 2021, French was the first language spoken by over 71 percent of the population of Montréal, Québec in Canada. 20.4 percent of the city's residents had English as their first language, 6.7 percent used both English and French as their primary language, and 1.6 percent of the population spoke another language. That same year, 46.4 percent of people living in the province of Québec could speak both English and French.
Facebook
TwitterMain language of instruction at the post-secondary level, by level of study, by age group and gender, Canada, Quebec, Canada outside Quebec, by select provinces and regions, among adults in the official language minority population, 2022.
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This service shows the predominant mother tongue in each census subdivision based on English, French or non-official language. The data is from the data table Mother Tongue (10), Age (27) and Sex (3) for the Population of Canada, Provinces and Territories, Census Divisions and Census Subdivisions, 2016 Census - 100% Data, Statistics Canada Catalogue no. 98-400-X2016046. Mother tongue refers to the first language learned at home in childhood and still understood by the person at the time the data was collected. If the person no longer understands the first language learned, the mother tongue is the second language learned. For a person who learned two languages at the same time in early childhood, the mother tongue is the language this person spoke most often at home before starting school. The person has two mother tongues only if the two languages were used equally often and are still understood by the person. For a child who has not yet learned to speak, the mother tongue is the language spoken most often to this child at home. The child has two mother tongues only if both languages are spoken equally often so that the child learns both languages at the same time. For additional information refer to the 2016 Census Dictionary for 'Mother tongue'. To have a cartographic representation of the ecumene with this socio-economic indicator, it is recommended to add as the first layer, the “NRCan - 2016 population ecumene by census subdivision” web service, accessible in the data resources section below.
Facebook
TwitterThis service shows the predominant mother tongue in each census subdivision based on English, French or non-official language. The data is from the data table Mother Tongue (10), Age (27) and Sex (3) for the Population of Canada, Provinces and Territories, Census Divisions and Census Subdivisions, 2016 Census - 100% Data, Statistics Canada Catalogue no. 98-400-X2016046. Mother tongue refers to the first language learned at home in childhood and still understood by the person at the time the data was collected. If the person no longer understands the first language learned, the mother tongue is the second language learned. For a person who learned two languages at the same time in early childhood, the mother tongue is the language this person spoke most often at home before starting school. The person has two mother tongues only if the two languages were used equally often and are still understood by the person. For a child who has not yet learned to speak, the mother tongue is the language spoken most often to this child at home. The child has two mother tongues only if both languages are spoken equally often so that the child learns both languages at the same time.
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Data about the number of English-language students enrolled in French as a Second Language (FSL) classes. Information is broken down by elementary and secondary panels for each district school board. Includes: * board number * board name * FSL - core elementary enrolment * FSL - core secondary enrolment * FSL - core total enrolment * FSL - extended elementary enrolment * FSL - extended secondary enrolment * FSL - extended total enrolment * FSL - immersion elementary enrolment * FSL - immersion secondary enrolment * FSL - immersion total enrolment * FSL - total elementary enrolment * FSL - total secondary enrolment * total FSL enrolment FSL data is reported by schools to the Ontario School Information System (OnSIS), October Submissions. The following English-language school types are included: * public * catholic To protect privacy, numbers are suppressed in categories with less than 10 students. Suppressed totals or cells that could be used to derive data in suppressed cell are depicted with "SP". Note: * Starting 2018-2019, enrolment numbers have been rounded to the nearest five. * Where sum/totals are required, actual totals are calculated and then rounded to the nearest 5. As such, rounded numbers may not add up to the reported rounded totals. ## Related * College enrolment * College enrolments - 1996 to 2011 * University enrolment * Enrolment by grade in secondary schools * School enrolment by gender * Second language course enrolment * Course enrolment in secondary schools * Enrolment by grade in elementary schools *[FSL]: French as a Second Language
Facebook
TwitterThe footnotes in the table are represented in brackets.Footnotes: 1 For the 2011 National Household Survey (NHS) estimates, the global non-response rate (GNR) is used as an indicator of data quality. This indicator combines complete non-response (household) and partial non-response (question) into a single rate. The value of the GNR is presented to users. A smaller GNR indicates a lower risk of non-response bias and as a result, lower risk of inaccuracy. The threshold used for estimates' suppression is a GNR of 50% or more. For more information, please refer to the National Household Survey User Guide, 2011. 2 Language groups are defined as follows: 'English' includes respondents who reported English only or English and one non-official language; 'French' includes respondents who reported French only or French and one non-official language; 'English and French' includes respondents who reported English and French, with or without one non-official language. 'Total' category includes all groups mentioned as well as respondents who reported a non-official language as their only mother tongue. 3 The median age is an age 'x', such that exactly one half of the population is older than 'x' and the other half is younger than 'x'. 4 Marital status: Refers to the marital status of the person, taking into account his/her common-law status. Persons who are married or living common law may be of opposite sex or of the same sex. The classification is as follows: Married (and not separated): A person who is married and has not separated or obtained a divorce, and whose spouse is living. Common-law: A person who is living with another person as a couple but who is not legally married to that person. Separated: A person who is married but who no longer lives with his/her spouse (for any reason other than illness, work or school) and who has not obtained a divorce. Persons living common law are not included in this category. Divorced: A person who has obtained a legal divorce and who has not remarried. Persons living common law are not included in this category. Widowed: A person who has lost his/her spouse through death and who has not remarried. Persons living common law are not included in this category. Single (never legally married): A person who has never married or a person whose marriage has been annulled and who has not remarried. Persons living common law are not included in this category. 5 Refers to the ability to conduct a conversation in English only, in French only, in both English and French, or in neither English nor French. 6 Selected Aboriginal languages: The languages shown were selected based on the Aboriginal languages spoken most often reported as single responses in Canada in the 2011 National Household Survey. 7 Selected non-Aboriginal languages: The languages shown were selected based on the non-Aboriginal most often spoken at home (other than English or French) most often reported as single responses in Canada in the 2011 National Household Survey. 8 Other languages: This is a subtotal of all languages collected by the National Household Survey that are not displayed separately here. 9 Refers to languages, other than English or French, in which the respondent can conduct a conversation. The category 'Non-official languages spoken' represents the sum of single language responses and multiple language responses received in the NHS. Hence, this total is greater than the total population. 10 Cree languages include the following categories: Cree not otherwise specified (which refers to those who reported 'Cree'), Swampy Cree, Plains Cree, Woods Cree, and a category labelled 'Cree not included elsewhere' (which includes Moose Cree, Northern East Cree and Southern East Cree). 11 This is a subtotal of all Aboriginal languages collected on May 10, 2011 that are not displayed separately here. 12 This is a subtotal of all non-Aboriginal languages, other than English or French, collected on May 10, 2011 that are not displayed separately here. 13 Refers to the status of a person with regard to the place of residence on the reference day, May 10, 2011, in relation to the place of residence on the same date one year earlier. Persons who have not moved are referred to as non-movers and persons who have moved from one residence to another are referred to as movers. Movers include non-migrants and migrants. Non-migrants are persons who did move but remained in the same city, town, township, village or Indian reserve. Migrants include internal migrants who moved to a different city, town, township, village or Indian reserve within Canada. External migrants include persons who lived outside Canada at the earlier reference date. 14 Refers to the status of a person with regard to the place of residence on the reference day, May 10, 2011, in relation to the place of residence on the same date five years earlier. Persons who have not moved are referred to as non-movers and persons who have moved from one residence to another are referred to as movers. Movers include non-migrants and migrants. Non-migrants are persons who did move but remained in the same city, town, township, village or Indian reserve. Migrants include internal migrants who moved to a different city, town, township, village or Indian reserve within Canada. External migrants include persons who lived outside Canada at the earlier reference date. 15 Citizenship refers to the legal citizenship status of a person. Citizenship can be by birth or naturalization. A person may have more than one citizenship. A person may be stateless, that is, they may have no citizenship. 16 Includes persons who are stateless. 17 The places of birth selected are the most frequently reported by immigrants at the Canada level. 18 Non-immigrant refers to a person who is a Canadian citizen by birth. 19 Immigrant refers to a person who is or has ever been a landed immigrant/permanent resident. This person has been granted the right to live in Canada permanently by immigration authorities. Some immigrants have resided in Canada for a number of years, while others have arrived recently. Some immigrants are Canadian citizens, while others are not. Most immigrants are born outside Canada, but a small number are born in Canada. In the 2011 National Household Survey, 'Immigrants' includes immigrants who landed in Canada prior to May 10, 2011. 20 The official name of United Kingdom is United Kingdom of Great Britain and Northern Ireland. United Kingdom includes Scotland, Wales, England and Northern Ireland (excludes Isle of Man, the Channel Islands and British Overseas Territories). 21 China excludes Hong Kong Special Administrative Region and Macao Special Administrative Region. 22 The official name of Viet Nam is Socialist Republic of Viet Nam. 23 The official name of Iran is Islamic Republic of Iran. 24 The official name of South Korea is Republic of Korea. 25 The category 'Oceania and other' includes places of birth in Oceania and responses not included elsewhere, such as 'born at sea.' 26 The category 'Other places of birth' includes other places of birth in Oceania and responses not included elsewhere, such as 'born at sea.' 27 Non-permanent resident refers to a person from another country who has a work or study permit, or who is a refugee claimant, and any non-Canadian-born family member living in Canada with them. 28 Recent immigrants are immigrants who landed in Canada between January 1, 2006 and May 10, 2011. Immigrant refers to a person who is or has ever been a landed immigrant/permanent resident. This person has been granted the right to live in Canada permanently by immigration authorities. Some immigrants have resided in Canada for a number of years, while others have arrived recently. Some immigrants are Canadian citizens, while others are not. Most immigrants are born outside Canada, but a small number are born in Canada. The places of birth selected are the most frequently reported by recent immigrants at the Canada level. 29 The official name of Venezuela is Bolivarian Republic of Venezuela. 30 The official name of Moldova is Republic of Moldova. 31 The official name of United Kingdom is United Kingdom of Great Britain and Northern Ireland. United Kingdom includes Scotland, Wales, England and Northern Ireland (excludes Isle of Man, the Channel Islands and British Overseas Territories). 32 China excludes Hong Kong Special Administrative Region and Macao Special Administrative Region. 33 The official name of Iran is Islamic Republic of Iran. 34 The official name of South Korea is Republic of Korea. 35 The official name of Viet Nam is Socialist Republic of Viet Nam. 36 The official name of Syria is Syrian Arab Republic. 37 The category 'Oceania and other' includes places of birth in Oceania and responses not included elsewhere, such as 'born at sea.' 38 Period of immigration refers to the period in which the immigrant first obtained his or her landed immigrant/permanent resident status. A landed immigrant/permanent resident refers to a person who has been granted the right to live permanently in Canada by immigration authorities. 39 Non-immigrant refers to a person who is a Canadian citizen by birth. 40 Immigrant refers to a person who is or has ever been a landed immigrant/permanent resident. This person has been granted the right to live in Canada permanently by immigration authorities. Some immigrants have resided in Canada for a number of years, while others have arrived recently. Some immigrants are Canadian citizens, while others are not. Most immigrants are born outside Canada, but a small number are born in Canada. In the 2011 National Household Survey, 'Immigrants' includes immigrants who landed in Canada prior to May 10, 2011. 41 Includes immigrants who landed in Canada prior to May 10, 2011. 42 Includes immigrants who landed in Canada
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
The SPADE project aims to develop and apply user-friendly software for large-scale speech analysis of existing public and private English speech datasets, in order to understand more about English speech over space and time. To date, we have worked with 42 shared corpora comprising dialects from across the British Isles (England, Wales, Scotland, Ireland) and North America (US, Canada), with an effective time span of over 100 years. We make available here a link to our OSF repository (see below) which has acoustic measures datasets for sibilants and durations and static formants for vowels, for 39 corpora (~2200 hours of speech analysed from ~8600 speakers), with information about dataset generation. In addition, at the OSF site, we provide Praat TextGrids created by SPADE for some corpora. Reading passage text is provided when the measures are based on reading only. Datasets are in their raw form and will require cleaning (e.g. outlier removal) before analysis. In addition, we used whitelisting to anonymise measures datasets generated from non-public, restricted corpora.
Obtaining a data visualization of a text search within seconds via generic, large-scale search algorithms, such as Google n-gram viewer, is available to anyone. By contrast, speech research is only now entering its own 'big data' revolution. Historically, linguistic research has tended to carry out fine-grained analysis of a few aspects of speech from one or a few languages or dialects. The current scale of speech research studies has shaped our understanding of spoken language and the kinds of questions that we ask. Today, massive digital collections of transcribed speech are available from many different languages, gathered for many different purposes: from oral histories, to large datasets for training speech recognition systems, to legal and political interactions. Sophisticated speech processing tools exist to analyze these data, but require substantial technical skill. Given this confluence of data and tools, linguists have a new opportunity to answer fundamental questions about the nature and development of spoken language.
Our project seeks to establish the key tools to enable large-scale speech research to become as powerful and pervasive as large-scale text mining. It is based on a partnership of three teams based in Scotland, Canada and the US. Together we exploit methods from computing science and put them to work with tools and methods from speech science, linguistics and digital humanities, to discover how much the sounds of English across the Atlantic vary over space and time.
We have developed innovative and user-friendly software which exploits the availability of existing speech data and speech processing tools to facilitate large-scale integrated speech corpus analysis across many datasets together. The gains of such an approach are substantial: linguists will be able to scale up answers to existing research questions from one to many varieties of a language, and ask new and different questions about spoken language within and across social, regional, and cultural, contexts. Computational linguistics, speech technology, forensic and clinical linguistics researchers, who engage with variability in spoken language, will also benefit directly from our software. This project also opens up vast potential for those who already use digital scholarship for spoken language collections in the humanities and social sciences more broadly, e.g. literary scholars, sociologists, anthropologists, historians, political scientists. The possibility of ethically non-invasive inspection of speech and texts will allow analysts to uncover far more than is possible through textual analysis alone.
Our project has developed and applied our new software to a global language, English, using existing public and private spoken datasets of Old World (British Isles) and New World (North American) English, across an effective time span of more than 100 years, spanning the entire 20th century. Much of what we know about spoken English comes from influential studies on a few specific aspects of speech from one or two dialects. This vast literature has established important research questions which has been investigated for the first time on a much larger scale, through standardized data across many different varieties of English.
Our large-scale study complements current-scale studies, by enabling us to consider stability and change in English across the 20th century on an unparalleled scale. The global nature of English means that our findings will be interesting and relevant to a large international non-academic audience; they have been made accessible through an innovative and dynamic visualization of linguistic variation via an interactive sound mapping website. In addition to new insights into spoken English, this project also lays the crucial groundwork for large-scale speech studies across many datasets from different languages, of different formats and structures.
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the Canadian French Scripted Monologue Speech Dataset for the Retail & E-commerce domain. This dataset is built to accelerate the development of French language speech technologies especially for use in retail-focused automatic speech recognition (ASR), natural language processing (NLP), voicebots, and conversational AI applications.
This training dataset includes 6,000+ high-quality scripted audio recordings in Canadian French, created to reflect real-world scenarios in the Retail & E-commerce sector. These prompts are tailored to improve the accuracy and robustness of customer-facing speech technologies.
This dataset includes a comprehensive set of retail-specific topics to ensure wide linguistic coverage for AI training:
To increase training utility, prompts include contextual data such as:
These additions help your models learn to recognize structured and unstructured retail-related speech.
Every audio file is paired with a verbatim transcription, ensuring consistency and alignment for model training.
Detailed metadata is included to support filtering, analysis, and model evaluation:
Facebook
TwitterYouth not in education, employment or training by visible minority, selected sociodemographic characteristics and the census year: Canada, geographical regions of Canada, provinces and territories and census metropolitan areas with parts (1)Frequency: OccasionalTable: 98-10-0648-01Release date: 2024-03-26Geography: Canada, Geographical region of Canada, Province or territory, Census metropolitan area, Census metropolitan area partUniverse: Persons in private households in occupied private dwellings, 2021 and 2016 censuses — 25% Sample dataVariable List: Visible minority (15), Gender (3a), Age (6), First official language spoken (5), Immigrant and generation status (7), Census year (2), Youth not in employment, education or training (1)List of abbreviations and acronyms found within various Census products.(https://www12.statcan.gc.ca/census-recensement/2021/ref/symb-ab-acr-eng.cfm)Footnotes:1 Historical comparison of geographic areas The boundaries and names of census geographies can change from one census to the next. In order to facilitate data comparisons between censuses, previous census data have been adjusted to reflect as closely as possible the 2021 boundaries of these areas. The methodology used for this adjustment involved spatially linking blocks of previous censuses (concordance to the 1996 Census used the 1996 enumeration areas to the 2021 boundaries). A previous census block was linked to the 2021 area within which its representative point fell. A limited number of interactive linkages were completed to further enhance the adjustment in certain areas. For some census geographies, it was not possible to reflect the 2021 boundaries. The 2021 boundaries may not be reflected as there was no previous census block to assign to the 2021 area. As well previous census data for some 2021 areas may not be available due to the fact that the concordance did not produce an accurate representation of the 2021 area.2 Gender Gender refers to an individual's personal and social identity as a man, woman or non-binary person (a person who is not exclusively a man or a woman). Gender includes the following concepts: gender identity, which refers to the gender that a person feels internally and individually; gender expression, which refers to the way a person presents their gender, regardless of their gender identity, through body language, aesthetic choices or accessories (e.g., clothes, hairstyle and makeup), which may have traditionally been associated with a specific gender. A person's gender may differ from their sex at birth, and from what is indicated on their current identification or legal documents such as their birth certificate, passport or driver's licence. A person's gender may change over time. Some people may not identify with a specific gender.3 Given that the non-binary population is small, data aggregation to a two-category gender variable is sometimes necessary to protect the confidentiality of responses provided. In these cases, individuals in the category “non-binary persons” are distributed into the other two gender categories and are denoted by the “+” symbol. The sex variable in census years prior to 2021 and the two-category gender variable in the 2021 Census are included together. Although sex and gender refer to two different concepts, the introduction of gender is not expected to have a significant impact on data analysis and historical comparability, given the small size of the transgender and non-binary populations. For additional information on changes of concepts over time, please consult the Age, Sex at Birth and Gender Reference Guide.4 Age' refers to the age of a person (or subject) of interest at last birthday (or relative to a specified, well-defined reference date).5 First official language spoken refers to the first official language (English or French) spoken by the person.6 Immigrant status refers to whether the person is a non-immigrant, an immigrant or a non-permanent resident. Period of immigration refers to the period in which the immigrant first obtained landed immigrant or permanent resident status. For more information on immigration variables, including information on their classifications, the questions from which they are derived, data quality and their comparability with other sources of data, please refer to the Place of Birth, Generation Status, Citizenship and Immigration Reference Guide, Census of Population, 2021.7 Generation status refers to whether or not the person or the person's parents were born in Canada.8 "Visible minority refers to whether a person is a visible minority or not, as defined by the Employment Equity Act. The Employment Equity Act defines visible minorities as persons other than Aboriginal peoples who are non-Caucasian in race or non-white in colour." The visible minority population consists mainly of the following groups: South Asian, Chinese, Black, Filipino, Arab, Latin American, Southeast Asian, West Asian, Korean, and Japanese.9 For more information on language variables, including information on their classifications, the questions from which they are derived, data quality and their comparability with other sources of data, please refer to the Languages Reference Guide, Census of Population, 2021.10 Non-immigrants' includes persons who are Canadian citizens by birth.11 Immigrants' includes persons who are, or who have ever been, landed immigrants or permanent residents. Such persons have been granted the right to live in Canada permanently by immigration authorities. Immigrants who have obtained Canadian citizenship by naturalization are included in this category. In the 2021 Census of Population, 'Immigrants' includes immigrants who were admitted to Canada on or prior to May 11, 2021.12 Non-permanent residents' includes persons from another country with a usual place of residence in Canada and who have a work or study permit or who have claimed refugee status (asylum claimants). Family members living with work or study permit holders are also included, unless these family members are already Canadian citizens, landed immigrants or permanent residents.13 First generation' includes persons who were born outside Canada. For the most part, these are people who are now, or once were, immigrants to Canada.14 Second generation' includes persons who were born in Canada and had at least one parent born outside Canada. For the most part, these are the children of immigrants.15 "Refers to the proportion of youth aged 15 to 29 who were not in employment during the census reference week (in 2021, the reference week is May 2 to May 8) and who had not attended any accredited educational institution or program in the eight months preceding the census day (for example, in 2021 this period is between September 2020 and 11 May 2021). The Labor Force Survey (LFS) is the main data source for calculating national estimates of the youth not in employment, education, or training indicator, commonly known as NEET." This indicator is calculated using data from the first quarter or the average of the first three months of the calendar year which excludes summer employment. This LFS-based indicator is published on an annual basis and is used for international comparisons. The NEET indicator has regularly published by the Organization for Economic Cooperation and Development (OECD) since the late 1990s. However the census and other data sources such as social surveys like the Canadian Community Health Survey serve a different purpose. These data sources provide more specialized data that allowed deeper analysis of specific sociodemographic characteristics and conditions for a given population group which is a rich complement to understand the context and the factors behind the NEET estimates provided by the LFS. Although the Census of the Canadian population and the Labor Force Survey (LFS) measure similar concepts linked to labour market activities there are several fundamental differences between the characteristics of the two concept that measure the population of youth not in employment education or training. The most important of these differences is that in the LFS the reference period for school attendance and the reference period for employment are the same whereas in the Census they are different. Other differences between the census and the LFS include the length of the reference period the number of questions and their content the sample size the enumeration method and the coverage. For more information about the comparability of labour force status data from the Census of Population versus that of the LFS please consult the Appendix 2.11 from the Dictionary Census of Population 2021. which excludes summer employment. This LFS-based indicator is published on an annual basis and is used for international comparisons. The NEET indicator has regularly published by the Organization for Economic Cooperation and Development (OECD) since the late 1990s. However the census and other data sources such as social surveys like the Canadian Community Health Survey serve a different purpose. These data sources provide more specialized data that allowed deeper analysis of specific sociodemographic characteristics and conditions for a given population group which is a rich complement to understand the context and the factors behind the NEET estimates provided by the LFS. Although the Census of the Canadian population and the Labor Force Survey (LFS) measure similar concepts linked to labour market activities there are several fundamental differences between the characteristics of the two concept that measure the population of youth not in employment education or training. The most important of these differences is that in the LFS the reference period for school attendance and the reference period for employment are the same whereas in the Census they are different. Other differences between the census and the LFS include the length of the reference
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterThe statistic reflects the distribution of languages in Canada in 2022. In 2022, 87.1 percent of the total population in Canada spoke English as their native tongue.