Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Historical chart and dataset showing French Polynesia literacy rate by year from N/A to N/A.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Historical chart and dataset showing St. Martin (French part) literacy rate by year from N/A to N/A.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about books. It has 1 row and is filtered where the book is Reading bande dessinee : critical approaches to French-language comic strip. It features 7 columns including author, publication date, language, and book publisher.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The iRead4Skills Dataset 2: annotated corpora by level of complexity for FR, PT and SP is a collection of texts categorized by complexity level and annotated for complexity features, presented in xlsx format. These corpora were compiled, classified and annotated under the scope of the project iRead4Skills – Intelligent Reading Improvement System for Fundamental and Transversal Skills Development, funded by the European Commission (grant number: 1010094837). The project aims to enhance reading skills within the adult population by creating an intelligent system that assesses text complexity and recommends suitable reading materials to adults with low literacy skills, contributing to reducing skills gaps and facilitating access to information and culture (https://iread4skills.com/). This dataset is the result of specifically devised classification and annotation tasks, in which selected texts were organized and distributed to trainers in Adult Learning (AL) and Vocational Education Training (VET) Centres, as well as to adult students in AL and VET centres. This task was conducted via the Qualtrics platform. The Dataset 2: annotated corpora by level of complexity for FR, PT and SP is derived from the iRead4Skills Dataset 1: corpora by level of complexity for FR, PT and SP ( https://doi.org/10.5281/zenodo.10055909), which comprises written texts of various genres and complexity levels. From this collection, a subset of texts was selected for classification and annotation. This classification and annotation task aimed to provide additional data and test sets for the complexity analysis systems for the three languages of the project: French, Portuguese, and Spanish. The texts in each of the language corpora were selected taking into account the diversity of topics/domains, genres, and the reading preferences of the target audience of the iRead4Skills project. This percentage amounted to the total of 462 texts per language, which were divided by level of complexity, resulting in the following distribution: · 140 Very Easy texts · 140 Easy texts · 140 Plain texts · 42 More Complex texts. Trainers were asked to classify the texts according to the complexity levels of the project, here informally defined as:
They were also asked to annotate the parts of the texts considered complex according to various type of features, at word-level and at sentence-level (e.g., word order, sentence composition, etc.), according to following categories: Lexical/word-related features - unknown word - word too technical/specialized or archaic - complex derived word - points to a previous reference that is not obvious - word (other) Syntactic/sentence-level features - unusual word order - too much embedded secondary information - too many connectors in the same sentence - sentence (other) - other (please specify) The sets were divided in three parts in Qualtrics and, in each part, the texts are shown randomly to the annotator. Students were asked to confirm that they could read without difficulty texts adequate to their literacy level. Each set contained texts from a given level, plus one text of the level immediately above. They were also asked to annotate words or sequences of words in the text that they did not understand, according to the following categories: - difficult word - difficult part of the text The complete results and datasets are in TSV/Excel format, in pairs of two files, with one file concerning the results from the classification (trainers)/validation (students) task and one file concerning the results from the annotation task. The complete datasets will be available under creative CC BY-NC-ND 4.0 |
The iRead4Skills Dataset 1: corpora by level of complexity for FR, PT and SP is a collection of written texts of several genres and levels of complexity, in txt format, compiled under the scope of the project iReadSkills – Intelligent Reading Improvement System for Fundamental and Transversal Skills Development. The project, funded by the European Commission (grant number: 1010094837) aims to improve reading skills in the adult population by creating an intelligent system that assesses text complexity and suggests appropriate reading materials to adults with low literacy skills, contributing to reducing skills gaps and to provide access to information and culture (https://iread4skills.com/).
The compilation of this first dataset was based on the complexity levels established as relevant for the project (Very Easy (approx. A1), Easy (approx. A2) and Plain (approx. B1) and on the expected needs of learners and trainers. For some genres, there are also texts of a more complex level. The data will provide the basis for the training and test sets for the complexity analysis systems for the three languages of the project: French, Portuguese, and Spanish. The dataset will be further enhanced, validated, and annotated by end-users, originating forthcoming versions and a second, derived, dataset.
The resource is composed of three sub corpora: French, Portuguese and Spanish.
Each of the sub corpora considers different complexity levels and covers texts from the following communication domains:
01_personal communication
02_institutional/professional communication
03_social media
04_commercial communication/dissemination
05_non-fiction book
06_fiction book
07_didactic book
08_academic/school
09_political communication/dissemination
10_legal documentation
11_religious texts/dissemination
French corpus:
Number of texts: 2 199
Number of tokens: 530 298
Spanish corpus:
Number of texts: 2 533
Number of tokens: 960 644
Portuguese corpus:
Number of texts: 2 933
Number of tokens: 946 131
More information about the corpus constitution and text samples is available at https://iread4skills.com/tools-resources/ D3.3 Data set 1: corpora by level of complexity FR, PT and SP.
The USAID-funded project, Let’s Learn to Read and Write, rendered in Haitian Creole as “An n aprann li ak ekri (Ann ALE), is a four-year early grade reading and writing (EGR/W) project that is based on evidence generated under the two-year USAID applied research activity Tout Timoun Ap Li (ToTAL) or, in English, “All Children Reading.” Ann ALE supports an expanded effort to achieve a more coordinated, effective and sustainable EGR/W program for an estimated 68,600 children in grades 1-4 in 550 schools in three corridors (Cul-de-Sac, St. Marc and Nord) in Haiti. The project will be co-led by the Haiti’s Ministry of Education, the Ministère de l'Éducation Nationale et de la Formation Professionnelle (MENFP). To accomplish its goals in Haiti’s multi-faceted educational system, Ann ALE will vigorously promote a collaborative approach to build successful partnerships able to overcome longstanding barriers to positive change. For this reason, Ann ALE will help to strengthen the MENFP’s capacity to engage a wide range of public and non-public EGR/W stakeholders to improve reading results in Haitian Creole and French. All interventions will be co-designed and co-implemented with the MENFP to ensure their institutionalization and sustainability.
A French Native Reading Comprehension dataset of questions and answers on a set of Wikipedia articles that consists of 25,000+ samples for the 1.0 version and 60,000+ samples for the 1.1 version.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
France FR: SPI: Pillar 5 Data Infrastructure Score: Scale 0-100 data was reported at 100.000 NA in 2019. This stayed constant from the previous number of 100.000 NA for 2018. France FR: SPI: Pillar 5 Data Infrastructure Score: Scale 0-100 data is updated yearly, averaging 100.000 NA from Dec 2016 to 2019, with 4 observations. The data reached an all-time high of 100.000 NA in 2019 and a record low of 100.000 NA in 2019. France FR: SPI: Pillar 5 Data Infrastructure Score: Scale 0-100 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s France – Table FR.World Bank.WDI: Policy and Institutions. The data infrastructure pillar overall score measures the hard and soft infrastructure segments, itemizing essential cross cutting requirements for an effective statistical system. The segments are: (i) legislation and governance covering the existence of laws and a functioning institutional framework for the statistical system; (ii) standards and methods addressing compliance with recognized frameworks and concepts; (iii) skills including level of skills within the statistical system and among users (statistical literacy); (iv) partnerships reflecting the need for the statistical system to be inclusive and coherent; and (v) finance mobilized both domestically and from donors.; ; Statistical Performance Indicators, The World Bank (https://datacatalog.worldbank.org/dataset/statistical-performance-indicators); Weighted average;
https://www.ontario.ca/page/open-government-licence-ontariohttps://www.ontario.ca/page/open-government-licence-ontario
Data about the number of students enrolled in second language courses. Information is specific to publicly funded secondary schools at the provincial level. Second language courses include:
Data includes:
Note: Students enrolled in a course more than once are counted each time they enroll.
Source: Course enrolment data is reported by schools to the Ontario School Information System. The following secondary schools are included:
To protect privacy, numbers are suppressed in categories with less than 10 students.
Note:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT: This study investigated the reading and writing teaching practices in two pre-school classes (five-year-old children): one belonging to the public schools in Recife/PE (Brazil) and another in Paris (France). It was sought to analyze more specifically the work focused on the appropriation of alphabetic writing from the reading and writing activities developed in the classroom. The methodological procedures were interviews, observations of the daily school routine, and writing words activities with students. The analysis of the collected data showed that the teachers organized their pedagogical work in order to emphasize the use of texts and readings that were already part of the children’s universe, and also developed games and activities that led the students to think about the principles of alphabetic writing. At the end of the school year most students of both classes showed understanding of the relationship between writing and the sound of words.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual reading and language arts proficiency from 2011 to 2022 for French Elementary School vs. Colorado and School District No. 3 In The County Of El Paso And State Of
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Literacy courses in the City of BrusselsLearning to speak, write, read and calculate in French/Dutch to adults who have never been to school or who have not obtained any school certificate, in Belgium or abroad, and/or who have not mastered skills equivalent to basic education in any language.More info : https://social.brussels/category/615https://social.brussels/category/616
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Historical chart and dataset showing French Polynesia literacy rate by year from N/A to N/A.