12 datasets found
  1. Data from: Distance Learning: Russian Experience

    • figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bogdan Ershov; Tatyana Chekmenyova (2023). Distance Learning: Russian Experience [Dataset]. http://doi.org/10.6084/m9.figshare.22637380.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Authors
    Bogdan Ershov; Tatyana Chekmenyova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The article discusses innovative approaches to education on the example of the introduction of distance learning in Russia, the main forms of its organization, the necessary means, areas of application, advantages, and disadvantages. The authors note that distance learning is becoming more in demand, has many advantages, and therefore, will develop. For its development, the modern education system in Russia has all the possibilities, both technically and intellectually. But we must not forget about the shortcomings of distance learning, which must be eliminated in the process of its development and improvement. Particular attention is paid to the legal regulation of distance education.

  2. d

    Touchscreen training data for great-tailed grackles

    • dataone.org
    • knb.ecoinformatics.org
    • +1more
    Updated Jan 25, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benjamin Seitz; Kelsey McCune; Maggie MacPherson; Luisa Bergeron; Aaron Blaisdell; Corina Logan (2021). Touchscreen training data for great-tailed grackles [Dataset]. http://doi.org/10.5063/028PXJ
    Explore at:
    Dataset updated
    Jan 25, 2021
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    Benjamin Seitz; Kelsey McCune; Maggie MacPherson; Luisa Bergeron; Aaron Blaisdell; Corina Logan
    Time period covered
    Apr 11, 2018 - May 8, 2020
    Area covered
    Variables measured
    ID, Bird, Date, Batch, Trial, Choice, Passed, Session, Duration, TSFacing, and 10 more
    Description

    Operant chambers are small enclosures used to test animal behavior and cognition. While traditionally reliant on simple technologies for presenting stimuli (e.g., lights and sounds) and recording responses made to basic manipulanda (e.g., levers and buttons), an increasing number of researchers are beginning to use Touchscreen-equipped Operant Chambers (TOCs). These TOCs have obvious advantages, namely by allowing researchers to present a near infinite number of stimuli as well as increased flexibility in the types of responses that can be made and recorded. Here, we trained wild-caught adult and juvenile great-tailed grackles (Quiscalus mexicanus) to complete experiments using a TOC. We have learned much from these efforts, and outline the advantages and disadvantages of these two approaches. We report data from our training sessions and discuss important modifications we made to facilitate animal engagement and participation in various tasks. Finally, we provide a "training guide" for creating experiments using PsychoPy, a free and open-source software that we have found to be incredibly useful during these endeavors. This article, therefore, should serve as a useful resource to those interested in switching to or maintaining a TOC, or who similarly wish to use a TOC to test the cognitive abilities of non-model species or wild-caught individuals.

  3. Millennium Cohort Study: Linked Health Administrative Data (Scottish Medical...

    • beta.ukdataservice.ac.uk
    • datacatalogue.cessda.eu
    Updated 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCL Institute Of Education University College London (2025). Millennium Cohort Study: Linked Health Administrative Data (Scottish Medical Records), Child Health Reviews, 2000-2015: Secure Access [Dataset]. http://doi.org/10.5255/ukda-sn-8709-1
    Explore at:
    Dataset updated
    2025
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    datacite
    Authors
    UCL Institute Of Education University College London
    Area covered
    Scotland
    Description

    Background:
    The Millennium Cohort Study (MCS) is a large-scale, multi-purpose longitudinal dataset providing information about babies born at the beginning of the 21st century, their progress through life, and the families who are bringing them up, for the four countries of the United Kingdom. The original objectives of the first MCS survey, as laid down in the proposal to the Economic and Social Research Council (ESRC) in March 2000, were:

    • to chart the initial conditions of social, economic and health advantages and disadvantages facing children born at the start of the 21st century, capturing information that the research community of the future will require
    • to provide a basis for comparing patterns of development with the preceding cohorts (the National Child Development Study, held at the UK Data Archive under GN 33004, and the 1970 Birth Cohort Study, held under GN 33229)
    • to collect information on previously neglected topics, such as fathers' involvement in children's care and development
    • to focus on parents as the most immediate elements of the children's 'background', charting their experience as mothers and fathers of newborn babies in the year 2000, recording how they (and any other children in the family) adapted to the newcomer, and what their aspirations for her/his future may be
    • to emphasise intergenerational links including those back to the parents' own childhood
    • to investigate the wider social ecology of the family, including social networks, civic engagement and community facilities and services, splicing in geo-coded data when available
    Additional objectives subsequently included for MCS were:
    • to provide control cases for the national evaluation of Sure Start (a government programme intended to alleviate child poverty and social exclusion)
    • to provide samples of adequate size to analyse and compare the smaller countries of the United Kingdom, and include disadvantaged areas of England

    Further information about the MCS can be found on the Centre for Longitudinal Studies web pages.

    The content of MCS studies, including questions, topics and variables can be explored via the CLOSER Discovery website.

    The first sweep (MCS1) interviewed both mothers and (where resident) fathers (or father-figures) of infants included in the sample when the babies were nine months old, and the second sweep (MCS2) was carried out with the same respondents when the children were three years of age. The third sweep (MCS3) was conducted in 2006, when the children were aged five years old, the fourth sweep (MCS4) in 2008, when they were seven years old, the fifth sweep (MCS5) in 2012-2013, when they were eleven years old, the sixth sweep (MCS6) in 2015, when they were fourteen years old, and the seventh sweep (MCS7) in 2018, when they were seventeen years old.

    End User Licence versions of MCS studies:
    The End User Licence (EUL) versions of MCS1, MCS2, MCS3, MCS4, MCS5, MCS6 and MCS7 are held under UK Data Archive SNs 4683, 5350, 5795, 6411, 7464, 8156 and 8682 respectively. The longitudinal family file is held under SN 8172.

    Sub-sample studies:
    Some studies based on sub-samples of MCS have also been conducted, including a study of MCS respondent mothers who had received assisted fertility treatment, conducted in 2003 (see EUL SN 5559). Also, birth registration and maternity hospital episodes for the MCS respondents are held as a separate dataset (see EUL SN 5614).

    Release of Sweeps 1 to 4 to Long Format (Summer 2020)
    To support longitudinal research and make it easier to compare data from different time points, all data from across all sweeps is now in a consistent format. The update affects the data from sweeps 1 to 4 (from 9 months to 7 years), which are updated from the old/wide to a new/long format to match the format of data of sweeps 5 and 6 (age 11 and 14 sweeps). The old/wide formatted datasets contained one row per family with multiple variables for different respondents. The new/long formatted datasets contain one row per respondent (per parent or per cohort member) for each MCS family. Additional updates have been made to all sweeps to harmonise variable labels and enhance anonymisation.

    How to access genetic and/or bio-medical sample data from a range of longitudinal surveys:
    For information on how to access biomedical data from MCS that are not held at the UKDS, see the CLS Genetic data and biological samples webpage.

    Secure Access datasets:
    Secure Access versions of the MCS have more restrictive access conditions than versions available under the standard End User Licence or Special Licence (see 'Access data' tab above).

    Secure Access versions of the MCS include:
    • detailed sensitive variables not available under EUL. These have been grouped thematically and are held under SN 8753 (socio-economic, accommodation and occupational data), SN 8754 (self-reported health, behaviour and fertility), SN 8755 (demographics, language and religion) and SN 8756 (exact participation dates). These files replace previously available studies held under SNs 8456 and 8622-8627
    • detailed geographical identifier files which are grouped by sweep held under SN 7758 (MCS1), SN 7759 (MCS2), SN 7760 (MCS3), SN 7761 (MCS4), SN 7762 (MCS5 2001 Census Boundaries), SN 7763 (MCS5 2011 Census Boundaries), SN 8231 (MCS6 2001 Census Boundaries), SN 8232 (MCS6 2011 Census Boundaries), SN 8757 (MCS7), SN 8758 (MCS7 2001 Census Boundaries) and SN 8759 (MCS7 2011 Census Boundaries). These files replace previously available files grouped by geography SN 7049 (Ward level), SN 7050 (Lower Super Output Area level), and SN 7051 (Output Area level)
    • linked education administrative datasets for Key Stages 1, 2, 4 and 5 held under SN 8481 (England). This replaces previously available datasets for Key Stage 1 (SN 6862) and Key Stage 2 (SN 7712)
    • linked education administrative datasets for Key Stage 1 held under SN 7414 (Scotland)
    • linked education administrative dataset for Key Stages 1, 2, 3 and 4 under SN 9085 (Wales)
    • linked NHS Patient Episode Database for Wales (PEDW) for MCS1 – MCS5 held under SN 8302
    • linked Scottish Medical Records data held under SNs 8709, 8710, 8711, 8712, 8713 and 8714;
    • Banded Distances to English Grammar Schools for MCS5 held under SN 8394
    • linked Health Administrative Datasets (Hospital Episode Statistics) for England for years 2000-2019 held under SN 9030
    • linked Health Administrative Datasets (SAIL) for Wales held under SN 9310
    • linked Hospital of Birth data held under SN 5724.
    The linked education administrative datasets held under SNs 8481,7414 and 9085 may be ordered alongside the MCS detailed geographical identifier files only if sufficient justification is provided in the application. Users are also only allowed access to either 2001 or 2011 of Geographical Identifiers Census Boundaries studies. So for MCS5 either SN 7762 (2001 Census Boundaries) or SN 7763 (2011 Census Boundaries), for the MCS6 users are only allowed either SN 8231 (2001 Census Boundaries) or SN 8232 (2011 Census Boundaries); and the same applies for MCS7 so either SN 8758 (2001 Census Boundaries) or SN 8759 (2011 Census Boundaries).

    Researchers applying for access to the Secure Access MCS datasets should indicate on their ESRC Accredited Researcher application form the EUL dataset(s) that they also wish to access (selected from the MCS Series Access web page).

    The Millennium Cohort Study: Linked Health Administrative Data (Scottish Medical Records), Child Health Reviews, 2000-2015: Secure Access includes data files from the NHS Digital Hospital Episode Statistics database for those cohort members who provided consent to health data linkage in the Age 50 sweep, and had ever lived in Scotland. The Scottish Medical Records database contains information about all hospital admissions in Scotland. This study concerns the Child Health Reviews (CHR) from first visit to school reviews.

    Other datasets are available from the Scottish Medical Records database, these include:

    • Prescribing Information System (PIS) held under SN 8710
    • Scottish Immunisation and Recall System (SIRS) held under SN 8711
    • Scottish Birth Records (SMR11) held under <a

  4. f

    Summary of GPT-4 TR review.

    • plos.figshare.com
    xls
    Updated Jan 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jack Gallifant; Amelia Fiske; Yulia A. Levites Strekalova; Juan S. Osorio-Valencia; Rachael Parke; Rogers Mwavu; Nicole Martinez; Judy Wawira Gichoya; Marzyeh Ghassemi; Dina Demner-Fushman; Liam G. McCoy; Leo Anthony Celi; Robin Pierce (2024). Summary of GPT-4 TR review. [Dataset]. http://doi.org/10.1371/journal.pdig.0000417.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 18, 2024
    Dataset provided by
    PLOS Digital Health
    Authors
    Jack Gallifant; Amelia Fiske; Yulia A. Levites Strekalova; Juan S. Osorio-Valencia; Rachael Parke; Rogers Mwavu; Nicole Martinez; Judy Wawira Gichoya; Marzyeh Ghassemi; Dina Demner-Fushman; Liam G. McCoy; Leo Anthony Celi; Robin Pierce
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The study provides a comprehensive review of OpenAI’s Generative Pre-trained Transformer 4 (GPT-4) technical report, with an emphasis on applications in high-risk settings like healthcare. A diverse team, including experts in artificial intelligence (AI), natural language processing, public health, law, policy, social science, healthcare research, and bioethics, analyzed the report against established peer review guidelines. The GPT-4 report shows a significant commitment to transparent AI research, particularly in creating a systems card for risk assessment and mitigation. However, it reveals limitations such as restricted access to training data, inadequate confidence and uncertainty estimations, and concerns over privacy and intellectual property rights. Key strengths identified include the considerable time and economic investment in transparent AI research and the creation of a comprehensive systems card. On the other hand, the lack of clarity in training processes and data raises concerns about encoded biases and interests in GPT-4. The report also lacks confidence and uncertainty estimations, crucial in high-risk areas like healthcare, and fails to address potential privacy and intellectual property issues. Furthermore, this study emphasizes the need for diverse, global involvement in developing and evaluating large language models (LLMs) to ensure broad societal benefits and mitigate risks. The paper presents recommendations such as improving data transparency, developing accountability frameworks, establishing confidence standards for LLM outputs in high-risk settings, and enhancing industry research review processes. It concludes that while GPT-4’s report is a step towards open discussions on LLMs, more extensive interdisciplinary reviews are essential for addressing bias, harm, and risk concerns, especially in high-risk domains. The review aims to expand the understanding of LLMs in general and highlights the need for new reflection forms on how LLMs are reviewed, the data required for effective evaluation, and addressing critical issues like bias and risk.

  5. c

    Learning Outcomes and Teacher Effectiveness for Children Facing Multiple...

    • datacatalogue.cessda.eu
    • beta.ukdataservice.ac.uk
    Updated Mar 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rose, P; Malik, R; De, A (2025). Learning Outcomes and Teacher Effectiveness for Children Facing Multiple Disadvantages in India and Pakistan, 2015-2018 [Dataset]. http://doi.org/10.5255/UKDA-SN-855294
    Explore at:
    Dataset updated
    Mar 26, 2025
    Dataset provided by
    University of Cambridge
    CORD India
    IDEAS Pakistan
    Authors
    Rose, P; Malik, R; De, A
    Time period covered
    Jul 1, 2015 - Dec 31, 2018
    Area covered
    Pakistan
    Variables measured
    Household, Other
    Description

    The quantitative data from the TEACh project allows us to identify the characteristics of children not learning, and factors associated with disparities in educational outcomes in India and Pakistan. Information was collected from households and schools, at the beginning of the school year (in April). Assessments were made in school again at the end of the school year to identify what learning gains have been made, and the role of teacher and other factors (such as related parental support) in these gains for children with different characteristics.

    The first stage of the quantitative data collection required us to identify the children whose learning we want to assess. Cross-sectional data was collected from households to enumerate key household and individual characteristics. This included information such as household size and socio-economic status, as well as individual information on all of the children within the household (irrespective of their schooling participation). The household survey provides the first step towards quantifying whether children with different characteristics are in school. For those in school, it identifies the type of school they are attending (whether a mainstream or special school, and whether run by government, private sector, or NGOs). We also assessed learning of children aged 8-12 (approximately equivalent to grades 3-5) in the selected households.

    The second stage was to identify primary schools within the vicinity that are accessed by a majority of the children in the sample community or village. Children in grades 3-5 were tested both at the beginning and end of the school year in order to identify learning gains, using the same instruments as used in the households. These classes contained some children from the sampled households which allows us to link them back to the household information that has been gathered. Some basic household level information was also collected from all sampled children in the school (such as parental education and household size) to ensure this information is available for all children.

    Questionnaires were also administered to teachers to identify their background and other characteristics commonly associated with teacher effectiveness. Existing instruments such asSchoolTELLS in India and Pakistan, were adapted to draft the teacher surveys. The teacher instruments were designed to capture the extent to which teachers are aware of, and respond to, children’s diverse learning needs, their perceptions and attitudes towards these children, and the extent to which they feel prepared to teach children of different abilities, including related to training and other forms of support that they receive. As with SchoolTELLS, teachers were also asked to mark student tests to identify teachers’ content knowledge of subjects they are teaching.

    Governments across the world recognize the importance of providing an education to all children within an inclusive education system. Yet, despite great progress in getting more children into school over the past decade, children from disadvantaged backgrounds are likely to experience poor quality of education limiting chances of fulfilling their learning potential. Children who face multiple disadvantages related to disability, poverty, gender, caste, religion or where they live, are amongst those least likely to be learning.

    The project aims to identify strategies to raise learning outcomes for all children, regardless of their background. It is widely recognized that teachers are central to a child's educational experience. Yet, in low income countries, disadvantaged learners often face poor quality teaching: many teachers are recruited without having a basic subject knowledge themselves, receive inadequate training with limited attention to strategies to support children from diverse backgrounds, and weak incentives and poor teacher governance can lead to low motivation and high levels of teacher absenteeism. The research will, therefore identify which aspects of teaching are most important for improving all children's learning, and so inform governments on the strategies needed to support children who face multiple disadvantages.

    The research will be conducted in India and Pakistan, countries characteristic of other poor countries in terms of wide learning inequalities. India shows some advances in identifying strategies to tackle disadvantage, while Pakistan is similar to many other low income countries in not yet having such strategies.

    Recognising that limited information is available on learning levels of children facing different forms of disadvantages who are not in school, the research will assess children both in the household and in schools. The focus of these tests will be on achievement of foundation skills of reading, writing, reasoning and numeracy that children are expected to acquire in primary school. This will be followed up with a test a year later in order to identify what...

  6. f

    Table_1_Practical Lessons on 12-Lead ECG Classification: Meta-Analysis of...

    • figshare.com
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shenda Hong; Wenrui Zhang; Chenxi Sun; Yuxi Zhou; Hongyan Li (2023). Table_1_Practical Lessons on 12-Lead ECG Classification: Meta-Analysis of Methods From PhysioNet/Computing in Cardiology Challenge 2020.XLSX [Dataset]. http://doi.org/10.3389/fphys.2021.811661.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Frontiers
    Authors
    Shenda Hong; Wenrui Zhang; Chenxi Sun; Yuxi Zhou; Hongyan Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cardiovascular diseases (CVDs) are one of the most fatal disease groups worldwide. Electrocardiogram (ECG) is a widely used tool for automatically detecting cardiac abnormalities, thereby helping to control and manage CVDs. To encourage more multidisciplinary researches, PhysioNet/Computing in Cardiology Challenge 2020 (Challenge 2020) provided a public platform involving multi-center databases and automatic evaluations for ECG classification tasks. As a result, 41 teams successfully submitted their solutions and were qualified for rankings. Although Challenge 2020 was a success, there has been no in-depth methodological meta-analysis of these solutions, making it difficult for researchers to benefit from the solutions and results. In this study, we aim to systematically review the 41 solutions in terms of data processing, feature engineering, model architecture, and training strategy. For each perspective, we visualize and statistically analyze the effectiveness of the common techniques, and discuss the methodological advantages and disadvantages. Finally, we summarize five practical lessons based on the aforementioned analysis: (1) Data augmentation should be employed and adapted to specific scenarios; (2) Combining different features can improve performance; (3) A hybrid design of different types of deep neural networks (DNNs) is better than using a single type; (4) The use of end-to-end architectures should depend on the task being solved; (5) Multiple models are better than one. We expect that our meta-analysis will help accelerate the research related to ECG classification based on machine-learning models.

  7. Millennium Cohort Study: Age 17, Sweep 7, 2018

    • beta.ukdataservice.ac.uk
    • datacatalogue.cessda.eu
    Updated 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Institute Of Education University Of London (2024). Millennium Cohort Study: Age 17, Sweep 7, 2018 [Dataset]. http://doi.org/10.5255/ukda-sn-8682-2
    Explore at:
    Dataset updated
    2024
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    datacite
    Authors
    Institute Of Education University Of London
    Description

    Background:
    The Millennium Cohort Study (MCS) is a large-scale, multi-purpose longitudinal dataset providing information about babies born at the beginning of the 21st century, their progress through life, and the families who are bringing them up, for the four countries of the United Kingdom. The original objectives of the first MCS survey, as laid down in the proposal to the Economic and Social Research Council (ESRC) in March 2000, were:

    • to chart the initial conditions of social, economic and health advantages and disadvantages facing children born at the start of the 21st century, capturing information that the research community of the future will require
    • to provide a basis for comparing patterns of development with the preceding cohorts (the National Child Development Study, held at the UK Data Archive under GN 33004, and the 1970 Birth Cohort Study, held under GN 33229)
    • to collect information on previously neglected topics, such as fathers' involvement in children's care and development
    • to focus on parents as the most immediate elements of the children's 'background', charting their experience as mothers and fathers of newborn babies in the year 2000, recording how they (and any other children in the family) adapted to the newcomer, and what their aspirations for her/his future may be
    • to emphasise intergenerational links including those back to the parents' own childhood
    • to investigate the wider social ecology of the family, including social networks, civic engagement and community facilities and services, splicing in geo-coded data when available
    Additional objectives subsequently included for MCS were:
    • to provide control cases for the national evaluation of Sure Start (a government programme intended to alleviate child poverty and social exclusion)
    • to provide samples of adequate size to analyse and compare the smaller countries of the United Kingdom, and include disadvantaged areas of England

    Further information about the MCS can be found on the Centre for Longitudinal Studies web pages.

    The content of MCS studies, including questions, topics and variables can be explored via the CLOSER Discovery website.

    The first sweep (MCS1) interviewed both mothers and (where resident) fathers (or father-figures) of infants included in the sample when the babies were nine months old, and the second sweep (MCS2) was carried out with the same respondents when the children were three years of age. The third sweep (MCS3) was conducted in 2006, when the children were aged five years old, the fourth sweep (MCS4) in 2008, when they were seven years old, the fifth sweep (MCS5) in 2012-2013, when they were eleven years old, the sixth sweep (MCS6) in 2015, when they were fourteen years old, and the seventh sweep (MCS7) in 2018, when they were seventeen years old.

    End User Licence versions of MCS studies:
    The End User Licence (EUL) versions of MCS1, MCS2, MCS3, MCS4, MCS5, MCS6 and MCS7 are held under UK Data Archive SNs 4683, 5350, 5795, 6411, 7464, 8156 and 8682 respectively. The longitudinal family file is held under SN 8172.

    Sub-sample studies:
    Some studies based on sub-samples of MCS have also been conducted, including a study of MCS respondent mothers who had received assisted fertility treatment, conducted in 2003 (see EUL SN 5559). Also, birth registration and maternity hospital episodes for the MCS respondents are held as a separate dataset (see EUL SN 5614).

    Release of Sweeps 1 to 4 to Long Format (Summer 2020)
    To support longitudinal research and make it easier to compare data from different time points, all data from across all sweeps is now in a consistent format. The update affects the data from sweeps 1 to 4 (from 9 months to 7 years), which are updated from the old/wide to a new/long format to match the format of data of sweeps 5 and 6 (age 11 and 14 sweeps). The old/wide formatted datasets contained one row per family with multiple variables for different respondents. The new/long formatted datasets contain one row per respondent (per parent or per cohort member) for each MCS family. Additional updates have been made to all sweeps to harmonise variable labels and enhance anonymisation.

    How to access genetic and/or bio-medical sample data from a range of longitudinal surveys:
    For information on how to access biomedical data from MCS that are not held at the UKDS, see the CLS Genetic data and biological samples webpage.

    Secure Access datasets:
    Secure Access versions of the MCS have more restrictive access conditions than versions available under the standard End User Licence or Special Licence (see 'Access data' tab above).

    Secure Access versions of the MCS include:
    • detailed sensitive variables not available under EUL. These have been grouped thematically and are held under SN 8753 (socio-economic, accommodation and occupational data), SN 8754 (self-reported health, behaviour and fertility), SN 8755 (demographics, language and religion) and SN 8756 (exact participation dates). These files replace previously available studies held under SNs 8456 and 8622-8627
    • detailed geographical identifier files which are grouped by sweep held under SN 7758 (MCS1), SN 7759 (MCS2), SN 7760 (MCS3), SN 7761 (MCS4), SN 7762 (MCS5 2001 Census Boundaries), SN 7763 (MCS5 2011 Census Boundaries), SN 8231 (MCS6 2001 Census Boundaries), SN 8232 (MCS6 2011 Census Boundaries), SN 8757 (MCS7), SN 8758 (MCS7 2001 Census Boundaries) and SN 8759 (MCS7 2011 Census Boundaries). These files replace previously available files grouped by geography SN 7049 (Ward level), SN 7050 (Lower Super Output Area level), and SN 7051 (Output Area level)
    • linked education administrative datasets for Key Stages 1, 2, 4 and 5 held under SN 8481 (England). This replaces previously available datasets for Key Stage 1 (SN 6862) and Key Stage 2 (SN 7712)
    • linked education administrative datasets for Key Stage 1 held under SN 7414 (Scotland)
    • linked education administrative dataset for Key Stages 1, 2, 3 and 4 under SN 9085 (Wales)
    • linked NHS Patient Episode Database for Wales (PEDW) for MCS1 – MCS5 held under SN 8302
    • linked Scottish Medical Records data held under SNs 8709, 8710, 8711, 8712, 8713 and 8714;
    • Banded Distances to English Grammar Schools for MCS5 held under SN 8394
    • linked Health Administrative Datasets (Hospital Episode Statistics) for England for years 2000-2019 held under SN 9030
    • linked Hospital of Birth data held under SN 5724.
    The linked education administrative datasets held under SNs 8481,7414 and 9085 may be ordered alongside the MCS detailed geographical identifier files only if sufficient justification is provided in the application. Users are also only allowed access to either 2001 or 2011 of Geographical Identifiers Census Boundaries studies. So for MCS5 either SN 7762 (2001 Census Boundaries) or SN 7763 (2011 Census Boundaries), for the MCS6 users are only allowed either SN 8231 (2001 Census Boundaries) or SN 8232 (2011 Census Boundaries); and the same applies for MCS7 so either SN 8758 (2001 Census Boundaries) or SN 8759 (2011 Census Boundaries).

    Researchers applying for access to the Secure Access MCS datasets should indicate on their ESRC Accredited Researcher application form the EUL dataset(s) that they also wish to access (selected from the MCS Series Access web page).

    The seventh sweep of the Millennium Cohort Study (MCS7) was carried out when the cohort members were 17 years old. As 17 is a key transitional age, the sweep purposefully focused on engaging with the cohort members themselves (in addition to their parents). MCS7 marks an important transitional time in the cohort members' lives, where educational and occupational paths can diverge significantly. It is also an important age in data collection terms since it may be the last sweep at which parents are interviewed and it is an age when direct engagement with the cohort members themselves rather than their families is crucial to the long term viability of the study. To reflect this, face-to-face interviews with the cohort members have been conducted for the first time. Cohort members were also asked to do a range of other activities including filling in a self-completion questionnaire on the interviewer's tablet, completing a cognitive assessment (number activity) and having their height, weight and body fat measurements taken. In addition, they were asked to complete a short online questionnaire after the visit.

    Parents were still interviewed at MCS7. Resident parents were asked to complete a household interview and a short online questionnaire, and one parent was asked to complete a Strengths and Difficulties Questionnaire (SDQ) about the cohort member. Cohort members who were either unable or unwilling to complete the main survey were asked to complete a short follow up questionnaire online after the fieldwork finished. This contained some key questions and was designed to boost response and maintain engagement.

    For the second edition (March 2021), two new

  8. f

    Data from: Data labeling.

    • plos.figshare.com
    xls
    Updated Jan 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saad Hammood Mohammed; Mandeep S. Jit Singh; Abdulmajeed Al-Jumaily; Mohammad Tariqul Islam; Md. Shabiul Islam; Abdulmajeed M. Alenezi; Mohamed S. Soliman (2025). Data labeling. [Dataset]. http://doi.org/10.1371/journal.pone.0316536.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 27, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Saad Hammood Mohammed; Mandeep S. Jit Singh; Abdulmajeed Al-Jumaily; Mohammad Tariqul Islam; Md. Shabiul Islam; Abdulmajeed M. Alenezi; Mohamed S. Soliman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Modernizing power systems into smart grids has introduced numerous benefits, including enhanced efficiency, reliability, and integration of renewable energy sources. However, this advancement has also increased vulnerability to cyber threats, particularly False Data Injection Attacks (FDIAs). Traditional Intrusion Detection Systems (IDS) often fall short in identifying sophisticated FDIAs due to their reliance on predefined rules and signatures. This paper addresses this gap by proposing a novel IDS that utilizes hybrid feature selection and deep learning classifiers to detect FDIAs in smart grids. The main objective is to enhance the accuracy and robustness of IDS in smart grids. The proposed methodology combines Particle Swarm Optimization (PSO) and Grey Wolf Optimization (GWO) for hybrid feature selection, ensuring the selection of the most relevant features for detecting FDIAs. Additionally, the IDS employs a hybrid deep learning classifier that integrates Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks to capture the smart grid data’s spatial and temporal features. The dataset used for evaluation, the Industrial Control System (ICS) Cyber Attack Dataset (Power System Dataset) consists of various FDIA scenarios simulated in a smart grid environment. Experimental results demonstrate that the proposed IDS framework significantly outperforms traditional methods. The hybrid feature selection effectively reduces the dimensionality of the dataset, improving computational efficiency and detection performance. The hybrid deep learning classifier performs better in key metrics, including accuracy, recall, precision, and F-measure. Precisely, the proposed approach attains higher accuracy by accurately identifying true positives and minimizing false negatives, ensuring the reliable operation of smart grids. Recall is enhanced by capturing critical features relevant to all attack types, while precision is improved by reducing false positives, leading to fewer unnecessary interventions. The F-measure balances recall and precision, indicating a robust and reliable detection system. This study presents a practical dual-hybrid IDS framework for detecting FDIAs in smart grids, addressing the limitations of existing IDS techniques. Future research should focus on integrating real-world smart grid data for validation, developing adaptive learning mechanisms, exploring other bio-inspired optimization algorithms, and addressing real-time processing and scalability challenges in large-scale deployments.

  9. Millennium Cohort Study: Linked Education Administrative Datasets (KS1-KS4),...

    • beta.ukdataservice.ac.uk
    • datacatalogue.cessda.eu
    Updated 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Institute Of Education University College London (2024). Millennium Cohort Study: Linked Education Administrative Datasets (KS1-KS4), Wales: Secure Access [Dataset]. http://doi.org/10.5255/ukda-sn-9085-1
    Explore at:
    Dataset updated
    2024
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    datacite
    Authors
    Institute Of Education University College London
    Area covered
    Wales
    Description

    Background:
    The Millennium Cohort Study (MCS) is a large-scale, multi-purpose longitudinal dataset providing information about babies born at the beginning of the 21st century, their progress through life, and the families who are bringing them up, for the four countries of the United Kingdom. The original objectives of the first MCS survey, as laid down in the proposal to the Economic and Social Research Council (ESRC) in March 2000, were:

    • to chart the initial conditions of social, economic and health advantages and disadvantages facing children born at the start of the 21st century, capturing information that the research community of the future will require
    • to provide a basis for comparing patterns of development with the preceding cohorts (the National Child Development Study, held at the UK Data Archive under GN 33004, and the 1970 Birth Cohort Study, held under GN 33229)
    • to collect information on previously neglected topics, such as fathers' involvement in children's care and development
    • to focus on parents as the most immediate elements of the children's 'background', charting their experience as mothers and fathers of newborn babies in the year 2000, recording how they (and any other children in the family) adapted to the newcomer, and what their aspirations for her/his future may be
    • to emphasise intergenerational links including those back to the parents' own childhood
    • to investigate the wider social ecology of the family, including social networks, civic engagement and community facilities and services, splicing in geo-coded data when available
    Additional objectives subsequently included for MCS were:
    • to provide control cases for the national evaluation of Sure Start (a government programme intended to alleviate child poverty and social exclusion)
    • to provide samples of adequate size to analyse and compare the smaller countries of the United Kingdom, and include disadvantaged areas of England

    Further information about the MCS can be found on the Centre for Longitudinal Studies web pages.

    The content of MCS studies, including questions, topics and variables can be explored via the CLOSER Discovery website.

    The first sweep (MCS1) interviewed both mothers and (where resident) fathers (or father-figures) of infants included in the sample when the babies were nine months old, and the second sweep (MCS2) was carried out with the same respondents when the children were three years of age. The third sweep (MCS3) was conducted in 2006, when the children were aged five years old, the fourth sweep (MCS4) in 2008, when they were seven years old, the fifth sweep (MCS5) in 2012-2013, when they were eleven years old, the sixth sweep (MCS6) in 2015, when they were fourteen years old, and the seventh sweep (MCS7) in 2018, when they were seventeen years old.

    End User Licence versions of MCS studies:
    The End User Licence (EUL) versions of MCS1, MCS2, MCS3, MCS4, MCS5, MCS6 and MCS7 are held under UK Data Archive SNs 4683, 5350, 5795, 6411, 7464, 8156 and 8682 respectively. The longitudinal family file is held under SN 8172.

    Sub-sample studies:
    Some studies based on sub-samples of MCS have also been conducted, including a study of MCS respondent mothers who had received assisted fertility treatment, conducted in 2003 (see EUL SN 5559). Also, birth registration and maternity hospital episodes for the MCS respondents are held as a separate dataset (see EUL SN 5614).

    Release of Sweeps 1 to 4 to Long Format (Summer 2020)
    To support longitudinal research and make it easier to compare data from different time points, all data from across all sweeps is now in a consistent format. The update affects the data from sweeps 1 to 4 (from 9 months to 7 years), which are updated from the old/wide to a new/long format to match the format of data of sweeps 5 and 6 (age 11 and 14 sweeps). The old/wide formatted datasets contained one row per family with multiple variables for different respondents. The new/long formatted datasets contain one row per respondent (per parent or per cohort member) for each MCS family. Additional updates have been made to all sweeps to harmonise variable labels and enhance anonymisation.

    How to access genetic and/or bio-medical sample data from a range of longitudinal surveys:
    For information on how to access biomedical data from MCS that are not held at the UKDS, see the CLS Genetic data and biological samples webpage.

    Secure Access datasets:
    Secure Access versions of the MCS have more restrictive access conditions than versions available under the standard End User Licence or Special Licence (see 'Access data' tab above).

    Secure Access versions of the MCS include:
    • detailed sensitive variables not available under EUL. These have been grouped thematically and are held under SN 8753 (socio-economic, accommodation and occupational data), SN 8754 (self-reported health, behaviour and fertility), SN 8755 (demographics, language and religion) and SN 8756 (exact participation dates). These files replace previously available studies held under SNs 8456 and 8622-8627
    • detailed geographical identifier files which are grouped by sweep held under SN 7758 (MCS1), SN 7759 (MCS2), SN 7760 (MCS3), SN 7761 (MCS4), SN 7762 (MCS5 2001 Census Boundaries), SN 7763 (MCS5 2011 Census Boundaries), SN 8231 (MCS6 2001 Census Boundaries), SN 8232 (MCS6 2011 Census Boundaries), SN 8757 (MCS7), SN 8758 (MCS7 2001 Census Boundaries) and SN 8759 (MCS7 2011 Census Boundaries). These files replace previously available files grouped by geography SN 7049 (Ward level), SN 7050 (Lower Super Output Area level), and SN 7051 (Output Area level)
    • linked education administrative datasets for Key Stages 1, 2, 4 and 5 held under SN 8481 (England). This replaces previously available datasets for Key Stage 1 (SN 6862) and Key Stage 2 (SN 7712)
    • linked education administrative datasets for Key Stage 1 held under SN 7414 (Scotland)
    • linked education administrative dataset for Key Stages 1, 2, 3 and 4 under SN 9085 (Wales)
    • linked NHS Patient Episode Database for Wales (PEDW) for MCS1 – MCS5 held under SN 8302
    • linked Scottish Medical Records data held under SNs 8709, 8710, 8711, 8712, 8713 and 8714;
    • Banded Distances to English Grammar Schools for MCS5 held under SN 8394
    • linked Health Administrative Datasets (Hospital Episode Statistics) for England for years 2000-2019 held under SN 9030
    • linked Health Administrative Datasets (SAIL) for Wales held under SN 9310
    • linked Hospital of Birth data held under SN 5724.
    The linked education administrative datasets held under SNs 8481,7414 and 9085 may be ordered alongside the MCS detailed geographical identifier files only if sufficient justification is provided in the application. Users are also only allowed access to either 2001 or 2011 of Geographical Identifiers Census Boundaries studies. So for MCS5 either SN 7762 (2001 Census Boundaries) or SN 7763 (2011 Census Boundaries), for the MCS6 users are only allowed either SN 8231 (2001 Census Boundaries) or SN 8232 (2011 Census Boundaries); and the same applies for MCS7 so either SN 8758 (2001 Census Boundaries) or SN 8759 (2011 Census Boundaries).

    Researchers applying for access to the Secure Access MCS datasets should indicate on their ESRC Accredited Researcher application form the EUL dataset(s) that they also wish to access (selected from the MCS Series Access web page).

    Millennium Cohort Study: Linked Education Administrative Datasets (KS1-KS4), Wales: Secure Access

    These datasets include education administrative records for Wales up to age 16 to survey data for cohort members in the MCS. The main aim of this data linkage exercise is to enhance the research potential of the study, by combining administrative education records with the rich information collected in the surveys.

    Datasets include anonymised Local Education Authorities (LEA) to allow comparison of results across LEA. The data were obtained only for children whose parents/carers gave consent to data linkage, and who were successfully matched.

  10. f

    GWO parameters.

    • plos.figshare.com
    xls
    Updated Jan 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saad Hammood Mohammed; Mandeep S. Jit Singh; Abdulmajeed Al-Jumaily; Mohammad Tariqul Islam; Md. Shabiul Islam; Abdulmajeed M. Alenezi; Mohamed S. Soliman (2025). GWO parameters. [Dataset]. http://doi.org/10.1371/journal.pone.0316536.t009
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jan 27, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Saad Hammood Mohammed; Mandeep S. Jit Singh; Abdulmajeed Al-Jumaily; Mohammad Tariqul Islam; Md. Shabiul Islam; Abdulmajeed M. Alenezi; Mohamed S. Soliman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Modernizing power systems into smart grids has introduced numerous benefits, including enhanced efficiency, reliability, and integration of renewable energy sources. However, this advancement has also increased vulnerability to cyber threats, particularly False Data Injection Attacks (FDIAs). Traditional Intrusion Detection Systems (IDS) often fall short in identifying sophisticated FDIAs due to their reliance on predefined rules and signatures. This paper addresses this gap by proposing a novel IDS that utilizes hybrid feature selection and deep learning classifiers to detect FDIAs in smart grids. The main objective is to enhance the accuracy and robustness of IDS in smart grids. The proposed methodology combines Particle Swarm Optimization (PSO) and Grey Wolf Optimization (GWO) for hybrid feature selection, ensuring the selection of the most relevant features for detecting FDIAs. Additionally, the IDS employs a hybrid deep learning classifier that integrates Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks to capture the smart grid data’s spatial and temporal features. The dataset used for evaluation, the Industrial Control System (ICS) Cyber Attack Dataset (Power System Dataset) consists of various FDIA scenarios simulated in a smart grid environment. Experimental results demonstrate that the proposed IDS framework significantly outperforms traditional methods. The hybrid feature selection effectively reduces the dimensionality of the dataset, improving computational efficiency and detection performance. The hybrid deep learning classifier performs better in key metrics, including accuracy, recall, precision, and F-measure. Precisely, the proposed approach attains higher accuracy by accurately identifying true positives and minimizing false negatives, ensuring the reliable operation of smart grids. Recall is enhanced by capturing critical features relevant to all attack types, while precision is improved by reducing false positives, leading to fewer unnecessary interventions. The F-measure balances recall and precision, indicating a robust and reliable detection system. This study presents a practical dual-hybrid IDS framework for detecting FDIAs in smart grids, addressing the limitations of existing IDS techniques. Future research should focus on integrating real-world smart grid data for validation, developing adaptive learning mechanisms, exploring other bio-inspired optimization algorithms, and addressing real-time processing and scalability challenges in large-scale deployments.

  11. f

    Statistics of ‘diffrate’.

    • figshare.com
    xls
    Updated Mar 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuancheng Si; Saralees Nadarajah; Zongxin Zhang; Chunmin Xu (2024). Statistics of ‘diffrate’. [Dataset]. http://doi.org/10.1371/journal.pone.0299164.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Mar 13, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Yuancheng Si; Saralees Nadarajah; Zongxin Zhang; Chunmin Xu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the dynamic landscape of financial markets, accurate forecasting of stock indices remains a pivotal yet challenging task, essential for investors and policymakers alike. This study is motivated by the need to enhance the precision of predicting the Shanghai Composite Index’s opening price spread, a critical measure reflecting market volatility and investor sentiment. Traditional time series models like ARIMA have shown limitations in capturing the complex, nonlinear patterns inherent in stock price movements, prompting the exploration of advanced methodologies. The aim of this research is to bridge the gap in forecasting accuracy by developing a hybrid model that integrates the strengths of ARIMA with deep learning techniques, specifically Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks. This novel approach leverages the ARIMA model’s proficiency in linear trend analysis and the deep learning models’ capability in modeling nonlinear dependencies, aiming to provide a comprehensive tool for market prediction. Utilizing a comprehensive dataset covering the period from December 20, 1990, to June 2, 2023, the study develops and assesses the efficacy of ARIMA, LSTM, GRU, ARIMA-LSTM, and ARIMA-GRU models in forecasting the Shanghai Composite Index’s opening price spread. The evaluation of these models is based on key statistical metrics, including Mean Squared Error (MSE) and Mean Absolute Error (MAE), to gauge their predictive accuracy. The findings indicate that the hybrid models, ARIMA-LSTM and ARIMA-GRU, perform better in forecasting the opening price spread of the Shanghai Composite Index than their standalone counterparts. This outcome suggests that combining traditional statistical methods with advanced deep learning algorithms can enhance stock market prediction. The research contributes to the field by providing evidence of the potential benefits of integrating different modeling approaches for financial forecasting, offering insights that could inform investment strategies and financial decision-making.

  12. f

    Data from: S1 Dataset -

    • figshare.com
    xlsx
    Updated Feb 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fanghua Liu; Yanchao Yang; Feng (Robin) Wang; Wangze Li (2025). S1 Dataset - [Dataset]. http://doi.org/10.1371/journal.pone.0314879.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Fanghua Liu; Yanchao Yang; Feng (Robin) Wang; Wangze Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The importance of English debate in fostering critical thinking and the role of self-efficacy in enhancing confidence and performance in this domain are widely acknowledged. However, a significant gap exists in the literature regarding the measurement of self-efficacy specifically within English debate. This research seeks to fill this gap by developing and validating an English Debate Self-Efficacy Scale (EDSS). Using a sample of 1,259 participants from an independent college in Hebei Province, China, the study divided participants into two groups: 613 for exploratory factor analysis (EFA) and 646 for confirmatory factor analysis (CFA), with convenience sampling as the chosen methodology. EFA revealed three core dimensions of debate-related self-efficacy: Language proficiency (Cronbach’s Alpha = .894), Debating skills (Cronbach’s Alpha = .861), and Team collaboration (Cronbach’s Alpha = .831). Subsequent CFA validation with an independent sample confirmed the scale’s structure, demonstrating strong structural, convergent, and discriminant validity. Additionally, significant correlations between the English Debate Self-Efficacy Scale and the English Proficiency Self-Efficacy Scale supported the scale’s criterion validity. These findings underscore the scale’s potential as a reliable tool for assessing self-efficacy in English debate contexts, offering valuable insights for research, teaching, and training in educational settings. Limitations related to sample representativeness and research design were also discussed, providing a foundation for future studies to expand upon. In conclusion, the English Debate Self-Efficacy Scale (EDSS) is a reliable and valid instrument for measuring self-efficacy in the context of English debate.

  13. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bogdan Ershov; Tatyana Chekmenyova (2023). Distance Learning: Russian Experience [Dataset]. http://doi.org/10.6084/m9.figshare.22637380.v1
Organization logo

Data from: Distance Learning: Russian Experience

Related Article
Explore at:
pdfAvailable download formats
Dataset updated
May 31, 2023
Dataset provided by
figshare
Authors
Bogdan Ershov; Tatyana Chekmenyova
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The article discusses innovative approaches to education on the example of the introduction of distance learning in Russia, the main forms of its organization, the necessary means, areas of application, advantages, and disadvantages. The authors note that distance learning is becoming more in demand, has many advantages, and therefore, will develop. For its development, the modern education system in Russia has all the possibilities, both technically and intellectually. But we must not forget about the shortcomings of distance learning, which must be eliminated in the process of its development and improvement. Particular attention is paid to the legal regulation of distance education.

Search
Clear search
Close search
Google apps
Main menu