Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The information refers to NI domiciled students enrolled at higher education institutions in the UK. The dataset is collected annually and is based on enrolments in higher education institutions in the UK on 1st December each year. The dataset is collected by the Higher Education Statistics Agency from higher education institutions throughout the UK and provided to the Department for Employment and Learning, Northern Ireland, for analysis. For 2013/14, NI Domiciled enrolments and qualifications at Open University are available. In previous years, these figures were included in NI students studying in England, as the administrative centre of the Open University is located in England. The specification of the HESA Standard Registration Population has changed for 2007/08 enrolments onwards. Writing up and sabbatical students are now excluded from this population where they were previously included in published enrolment data and therefore 2007/08 data onwards cannot be directly compared to previous years.
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset shows the location of Higher Education (HE) and Further Education (FE) institutes in the Great Britain. This should cover Universities and Colleges. Many institutes have more than one campus and where possible this is refelcted in the data so a University may have more than one entry. Postcodes have also been included for instities where possible. This data was collected from various sources connected with HEFE in the UK including JISC and EDINA. This represents the fullest list that the author could compile from various sources. If you spot a missing institution, please contact the author and they will add it to the dataset. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2011-02-01 and migrated to Edinburgh DataShare on 2017-02-21.
DOI Abstract copyright UK Data Service and data collection copyright owner.The USR consists of records of undergraduate students on courses of one academic year or more; postgraduate students on courses of one academic year or more; academic and related staff holding regular salaried appointments, and finance data for all UK universities. The Finance dataset contains details of income and expenditure for all of the UK universities. These data are contained in a series of files for each year. For detailed information on structure and content of these files users should refer to the documentation that accompanies this dataset. Also included in the Finance dataset is the Student Load data. Student Load is, in the USR context, a reallocation of student-head count numbers, by apportioning them as a percentage to the departmental cost centres where they are taught, thus enabling student load, staff and financial data to be brought together. Main Topics: Finance: income and expenditure; university; cost centre. Student load: undergraduate, postgraduate (taught course or research); cost centre. No information recorded Annual returns from each university.
Abstract copyright UK Data Service and data collection copyright owner.The USR consists of records of undergraduate students on courses of one academic year or more; postgraduate students on courses of one academic year or more; academic and related staff holding regular salaried appointments, and finance data for all UK universities. No information recorded Annual returns from each university.
This dataset presents a cluster analysis of UK universities based on four synthetic environments: social, cultural, physical and economic. These were developed based on variables that represented an educational ecosystem of well-being. The cluster analysis was initially linked to the LSYPE-Secure dataset using the UKPRNs (i.e. higher education institutional number) and hence the cluster analysis used data from around 2009-2012 to represent Wave 6 and Wave 7 of the LSYPE-Secure dataset. The cluster analysis was based on using a variety of variables available from HESA and the Office for Students (OfS) to represent these environments, for example: Social: had demographics of students and staff including ethnicity and sex Cultural: had data on research and teaching scores Economic: had data on student: staff ratio and expenditure Physical: had data related to the built and natural environment including residential sites, blue and green spacesEarlier last year (April 2018), the UK Office for Students (OfS) noted that students from underrepresented groups such as black and minority ethnic (BME) students and those from disadvantaged backgrounds were less likely to succeed at university. Coupled with this, research has shown that students from these groups are also more likely to have poorer mental health and wellbeing. However, there is substantial social and political pressure on universities to act to improve student mental health. For example, the Telegraph ran the headline "Do British universities have a suicide problem?" Thus, in June 2018, the Hon. Sam Gyimah, the then UK universities minister, informed university vice-chancellors that student mental health and wellbeing has to be one of their top priorities. Universities are investing substantive sums in activities to tackle student mental health but doing so with no evidence base to guide strategic policy and practice. These activities may potentially be ineffective, financially wasteful, and possibly, counter-productive. Therefore, we need a better evidence base which this project intends to fulfil. Currently, there is a lack of evidence and understanding about which groups of young people going to universities may have poorer life outcomes (such as education, employment, and mental health and well-being) as a result of their mental health and wellbeing during their adolescent years. These life outcomes and their mental health and wellbeing, however, are important for understanding the context of the complex social identities of the young people, such as the intersections between their gender, ethnicity, sexuality, religion and socio-economic status. Otherwise, these young people may feel misunderstood or judged. Most of the large body of quantitative research on life outcomes tend to focus on one social characteristic/identity of the student, such as the young person's gender or ethnicity or socio-economic status, but not the combination of all of these, i.e. the intersectionalities. Primarily, the reason for this has been the lack of sufficient data. This research draws on data from the Longitudinal Study of Young People in England (LSYPE), which tracked over 15,000 adolescents' education and health over 7 years between 2004-2010 (from when they were 13-19 years old), and the Next Steps Survey, which collected data from the same individuals in 2015 when they were 25 years and in the job market. This dataset also had an ethnic boost, which thus allows for the exploratory analysis of intersectionalities. Currently, there are a number of interventions being implemented to improve the university environment. However, there is a lack of evidence on how the university environment (such as their its size, amount of academic support available, availability of sports activities, students' sense of belonging, etc.) can affect the young person'students' mental health and wellbeing life outcomes. This evidence can be determined through by using the LSYPE data supplemented and by university environment data supplemented from the National Student Survey (NSS) and the Higher Education Statistics Agency (HESA). Thus this research uses an intersectional approach to investigate the extent to which the life outcomes of young persons who go to university are affected by their social inequality groupings and mental health and well-being during adolescence. Additionally, this research also aims to determine the characteristics of university environments that can improve the life outcomes of these young people depending on their social and mental health/wellbeing background. We use secondary data analysis of mainly HESA and OfS variables and created derived variables.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is a compilation of processed data on citation and references for research papers including their author, institution and open access info for a selected sample of academics analysed using Microsoft Academic Graph (MAG) data and CORE. The data for this dataset was collected during December 2019 to January 2020.Six countries (Austria, Brazil, Germany, India, Portugal, United Kingdom and United States) were the focus of the six questions which make up this dataset. There is one csv file per country and per question (36 files in total). More details about the creation of this dataset are available on the public ON-MERRIT D3.1 deliverable report.The dataset is a combination of two different data sources, one part is a dataset created on analysing promotion policies across the target countries, while the second part is a set of data points available to understand the publishing behaviour. To facilitate the analysis the dataset is organised in the following seven folders:PRTThe dataset with the file name "PRT_policies.csv" contains the related information as this was extracted from promotion, review and tenure (PRT) policies. Q1: What % of papers coming from a university are Open Access?- Dataset Name format: oa_status_countryname_papers.csv- Dataset Contents: Open Access (OA) status of all papers of all the universities listed in Times Higher Education World University Rankings (THEWUR) for the given country. A paper is marked OA if there is at least an OA link available. OA links are collected using the CORE Discovery API.- Important considerations about this dataset: - Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. - The service we used to recognise if a paper is OA, CORE Discovery, does not contain entries for all paperids in MAG. This implies that some of the records in the dataset extracted will not have either a true or false value for the _is_OA_ field. - Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q2: How are papers, published by the selected universities, distributed across the three scientific disciplines of our choice?- Dataset Name format: fsid_countryname_papers.csv- Dataset Contents: For the given country, all papers for all the universities listed in THEWUR with the information of fieldofstudy they belong to.- Important considerations about this dataset: * MAG can associate a paper to multiple fieldofstudyid. If a paper belongs to more than one of our fieldofstudyid, separate records were created for the paper with each of those _fieldofstudyid_s.- MAG assigns fieldofstudyid to every paper with a score. We preserve only those records whose score is more than 0.5 for any fieldofstudyid it belongs to.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Q3: What is the gender distribution in authorship of papers published by the universities?- Dataset Name format: author_gender_countryname_papers.csv- Dataset Contents: All papers with their author names for all the universities listed in THEWUR.- Important considerations about this dataset :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- An external script was executed to determine the gender of the authors. The script is available here.Q4: Distribution of staff seniority (= number of years from their first publication until the last publication) in the given university.- Dataset Name format: author_ids_countryname_papers.csv- Dataset Contents: For a given country, all papers for authors with their publication year for all the universities listed in THEWUR.- Important considerations about this work :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- Calculating staff seniority can be achieved in various ways. The most straightforward option is to calculate it as _academic_age = MAX(year) - MIN(year) _for each authorid.Q5: Citation counts (incoming) for OA vs Non-OA papers published by the university.- Dataset Name format: cc_oa_countryname_papers.csv- Dataset Contents: OA status and OA links for all papers of all the universities listed in THEWUR and for each of those papers, count of incoming citations available in MAG.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to.- Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q6: Count of OA vs Non-OA references (outgoing) for all papers published by universities.- Dataset Name format: rc_oa_countryname_-papers.csv- Dataset Contents: Counts of all OA and unknown papers referenced by all papers published by all the universities listed in THEWUR.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers being referenced.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Additional files:- _fieldsofstudy_mag_.csv: this file contains a dump of fieldsofstudy table of MAG mapping each of the ids to their actual field of study name.
Abstract copyright UK Data Service and data collection copyright owner.The USR consists of records of undergraduate students on courses of one academic year or more; postgraduate students on courses of one academic year or more; academic and related staff holding regular salaried appointments, and finance data for all UK universities. Main Topics: (i) Personal information: date of birth; sex; marital status; country/county of domicile; country of birth; whether home or overseas student for fee purposes; occupation of parent or guardian. (ii) Academic history: last full-time school attended; other full-time/part-time post secondary educational institution attended; GCE `A' level or Scottish Certificate of Education higher grade results; other entrance qualifications; course for which admitted. (iii) Annual information: university; subject of course; normal duration of course; type of course; year of course; date of enrolment; method of study (full-time, part-time, sandwich, etc.); qualification aimed for; source of fees; accommodation (hall. lodgings, home, etc.). (iv) Leavers' details: qualification obtained; class of degree; date of leaving; reason for leaving; first destination. No information recorded Annual returns from each university.
Percentage of students, who entered an A Level or equivalent qualification, going to, or remaining in, an education destination or employment. The percentage of students progressing to further learning in a school, Further Education or Sixth Form College, Apprenticeship, work based learning provider or Higher Education Institution. To be included in the Measure, young people have to show sustained participation in an education destination in all of the first two terms of the year after they completed KS4 or took A level or equivalent qualifications. The first two terms is defined as October to March. The statistics are published as "Experimental Statistics" and do not display the National Statistics Logo. They are still being evaluated and remain subject to further testing to determine their reliability and ability to meet customer needs. The figures should be treated with caution as this is the first year for which such data have been produced. As improvements are made to the methodology, data quality will be assessed to establish whether the statistics meet the quality standards for National Statistics. “x“ means the data has been suppressed as the school or college has fewer than 6 students in a particular denominator, or small numbers for the numerator (1’s and 2’s). Results are not shown because of the risk of an individual student being identified. All totals have been rounded to the nearest 10. Zeros are shown as zeros. See Technical Note from Department of Education for further detail. https://www.gov.uk/government/collections/statistics-destinations
The project uses a unique dataset collected from UK higher education institutions comprised of individual-level data on undergraduate students from the UK and EU (i.e. those potentially eligible for bursaries), including the bursary they are awarded each year, academic outcomes, prior attainment and other demographic information. Collection consists of data from 10 English universities on bursary awards, student characteristics, and student outcomes over the period 2006-2011. The aim is to identify the impact of bursaries on the academic outcomes of students by exploiting variation in bursary rules across institutions. This will be achieved by comparing students with similar characteristics but receiving different levels of bursary due to the institution they are attending. To account for underlying differences across universities we will exploit changes in bursary eligibility rules within a university over time. The findings should be useful for universities and policy makers when considering the role of bursaries in improving student outcomes. Higher education bursaries and performance: annual test scores, drop out and degree outcomes Despite some £300m per year being spent on higher education bursaries in the UK, there remains no empirical research that examines the effectiveness of this element of financial aid as a means to improve student outcomes whilst at university. The aim of this project is to investigate the impact of bursaries on students’ academic outcomes – including annual test results, completion rates and degree classification. All universities in England where contacted, requesting individual level data on undergraduates, on the following: Student-level data for all UK/EU full-time undergraduate students (i.e. only those eligible for bursaries), with, for each undergraduate: • Their year of entry (from 2006 onwards - or any previous years, if available) • Their A level grades (or other qualifications such as BTEC, HND etc) on entry (with subject of study, if possible) • The subject of degree student studying • Amount of bursary awarded each year, including zeros (i.e. a report for every student, whether they got a bursary or not) • Their annual examination/module scores (by subject if possible) • Their final degree classification • Whether dropped out, and year of drop out • Basic demographics such as age at point of entry, gender, ethnicity, SES, parental income and any other demographic information available • Outline of means-testing rules for bursary awards (as detailed as possible - i.e. parents income>£40k = no bursary; parents income>£10k & <£15k=£2000 bursary etc) each year. 22 universities provided data. 17 are useable.
Understanding Society (the UK Household Longitudinal Study), which began in 2009, is conducted by the Institute for Social and Economic Research (ISER) at the University of Essex, and the survey research organisations Verian Group (formerly Kantar Public) and NatCen. It builds on and incorporates, the British Household Panel Survey (BHPS), which began in 1991.
The Understanding Society: Calendar Year Dataset, 2022: Special Licence Access, is designed for analysts to conduct cross-sectional analysis for the 2022 calendar year. The Calendar Year datasets combine data collected in a specific year from across multiple waves and these are released as separate calendar year studies, with appropriate analysis weights, starting with the 2020 Calendar Year dataset. Each subsequent year, an additional yearly study is released.
The Calendar Year data is designed to enable timely cross-sectional analysis of individuals and households in a calendar year. Such analysis can however, only involve variables that are collected in every wave (excluding rotating content which is only collected in some of the waves). Due to overlapping fieldwork the data files combine data collected in the three waves that make up a calendar year. Analysis cannot be restricted to data collected in one wave during a calendar year, as this subset will not be representative of the population. Further details and guidance on this study can be found in the document 9334_main_survey_calendar_year_user_guide_2022.
These calendar year datasets should be used for cross-sectional analysis only. For those interested in longitudinal analyses using Understanding Society please access the main survey datasets: End User Licence version or Special Licence version.
Understanding Society: the UK Household Longitudinal Study, started in 2009 with a general population sample (GPS) of UK residents living in private households of around 26,000 households and an ethnic minority boost sample (EMBS) of 4,000 households. All members of these responding households and their descendants became part of the core sample who were eligible to be interviewed every year. Anyone who joined these households after this initial wave, were also interviewed as long as they lived with these core sample members to provide the household context. At each annual interview, some basic demographic information was collected about every household member, information about the household is collected from one household member, all 16+ year old household members are eligible for adult interviews, 10-15 year old household members are eligible for youth interviews, and some information is collected about 0-9 year olds from their parents or guardians. Since 1991 until 2008/9 a similar survey, the British Household Panel Survey (BHPS), was fielded. The surviving members of this survey sample were incorporated into Understanding Society in 2010. In 2015, an immigrant and ethnic minority boost sample (IEMBS) of around 2,500 households was added. In 2022 a GPS boost sample (GPS2) of around 5,700 households was added. To know more about the sample design, following rules, interview modes, incentives, consent, questionnaire content please see the study overview and user guide.
Co-funders
In addition to the Economic and Social Research Council, co-funders for the study included the Department of Work and Pensions, the Department for Education, the Department for Transport, the Department of Culture, Media and Sport, the Department for Community and Local Government, the Department of Health, the Scottish Government, the Welsh Assembly Government, the Northern Ireland Executive, the Department of Environment and Rural Affairs, and the Food Standards Agency.
End User Licence and Special Licence versions:
There are two versions of the Calendar Year 2022 data. One is available under the standard End User Licence (EUL) agreement (SN 9333), and the other is a Special Licence (SL) version (SN 9334). The SL version contains month and year of birth variables instead of just age, more detailed country and occupation coding for a number of variables and various income variables have not been top-coded (see 9334_eul_vs_sl_variable_differences for more details). Users are advised to first obtain the standard EUL version of the data to see if they are sufficient for their research requirements. The SL data have more restrictive access conditions; prospective users of the SL version will need to complete an extra application form and demonstrate to the data owners exactly why they need access to the additional variables in order to get permission to use that version. The main longitudinal versions of the Understanding Society study may be found under SNs 6614 (EUL) and 6931 (SL).
Low- and Medium-level geographical identifiers produced for the mainstage longitudinal dataset can be used with this Calendar Year 2022 dataset, subject to SL access conditions. See the User Guide for further details.
Suitable data analysis software
These data are provided by the depositor in Stata format. Users are strongly advised to analyse them in Stata. Transfer to other formats may result in unforeseen issues. Stata SE or MP software is needed to analyse the larger files, which contain about 1,800 variables.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Full resource found at: https://sparcopen.org/our-work/big-deal-knowledge-base
Sourcing: Pricing Data: Individual entries are linked to third party resources within the database; non-linked entries come from Freedom of Information requests (courtesy of Ted Bergstrom and Paul Courant). FTE Data: UK Higher Education Statistics Agency for UK FTE (HE student enrollment FTE + HE staff); DOE IPEDS for US FTE (“Full-time equivalent fall enrollment” + “Total FTE staff”); Universities Canada and COPPUL for Canadian FTE (student data only). Institutional Categories: Carnegie Classification of Institutions of Higher Education.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This dataset provides Census 2021 estimates that classify schoolchildren and full-time students aged 5 years and over in England and Wales by student accommodation and by age. The estimates are as at Census Day, 21 March 2021.
Estimates for single year of age between ages 90 and 100+ are less reliable than other ages. Estimation and adjustment at these ages was based on the age range 90+ rather than five-year age bands. Read more about this quality notice.
Area type
Census 2021 statistics are published for a number of different geographies. These can be large, for example the whole of England, or small, for example an output area (OA), the lowest level of geography for which statistics are produced.
For higher levels of geography, more detailed statistics can be produced. When a lower level of geography is used, such as output areas (which have a minimum of 100 persons), the statistics produced have less detail. This is to protect the confidentiality of people and ensure that individuals or their characteristics cannot be identified.
Coverage
Census 2021 statistics are published for the whole of England and Wales. Data are also available in these geographic types:
Student accommodation type
Combines the living situation of students and school children in full-time education, whether they are living:
It also includes whether these households contain one or multiple families.
This variable is comparable with the student accommodation variable but splits the communal establishment type into “university” and “other” categories.
Age
A person’s age on Census Day, 21 March 2021 in England and Wales. Infants aged under 1 year are classified as 0 years of age.
At the end of 2019 the Greater London Authority (GLA) commissioned the Social Market Foundation (SMF) to conduct research focusing on how the outcomes of graduates who have studied in London and those from London vary, by a range of different characteristics. This research uses a range of methods to gain insight into the outcomes of graduates who were domiciled in London prior to university and those who studied at a London institution. In particular, the SMF undertook a literature review of academic, government and policy papers on degree outcomes and the factors that interact with these; conducted descriptive analysis of data provided by the Higher Education Statistics Agency (HESA); and ran a series of logit regression models to look further into how different characteristics influence graduate outcomes when controlling for other variables. The data includes young first degree students studying at a Higher Education Institution within London and students domiciled in London prior to university who study outside of the capital. The data includes four cohorts from the academic years 2010/11 to 2013/14.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This dataset provides Census 2022 estimates for distance travelled to place of study of people aged 4 and over studying by age (in 4 categories) in Scotland.
A person's age on Census Day, 20 March 2022. Infants aged under 1 year are classified as 0 years of age.
The distance between a person’s home address and their main place of work or study (Grouped).
Address of place of work or study is used (along with home address) to explore the relationship between where people live and where they work or study. Used in conjunction with information from the method of travel question, the data helps to identify commuter patterns and routes and provide a reliable indicator for the demands placed on public and private transport.
It is used to inform the balance of housing and jobs in particular areas and assess the need for services such as new schools. Information on where people live and work is used by government departments to define “Travel to Work Areas” - these are approximations of self-contained labour markets and are the smallest areas for which unemployment rates are published. Collecting information on both work and study address enables a more accurate count of daytime populations to be obtained, which is particularly useful for areas accommodating universities and businesses. It also allows the differences in travel patterns between these groups to be compared.
Details of classification can be found here
The quality assurance report can be found here
This dataset compares existing research data policies at UK higher education institutions. It consists of 83 cases. Polices were compared on a range of variables. Variables included policy length in words, whether the policy offers definitions, length of their definition of "data", defines institutional support, requires data management plans, states scope of staff and student coverage, specifies ownership of research outputs, details where external funder rights take precedent, guides on what data and documentation is required to be retained, how long it needs to be retained, reinforces where research ethics prevent open data, finalises where data can be accessed, speaks about open data requirements, includes a statement on funding the costs of Research Data Management, and specifies a review period for the policy. Data also includes the institution's year of foundation and a categorical variable grouping institutions by year of foundation allowing comparison across cohort groups of universities. A further two variables allow for identification of research based universities. Data on total research funding and research council for the year 2014/2015 was added, along with the number of research staff eligible for the 2014 UK Research Excellence Framework (REF). Also included is the institution's Grade Point Average based on its REF score using a Times Higher Education (THES) calculated score. Data collection was based on a list of UK Higher Education Institutions with data policies. This list was provided by the Digital Curation Centre. I also conducted a google search for UK university data policies to discover additional institutions that had adopted Research Data Management requirements. The data does not include 'Roadmaps' to EPSRC compliance.
https://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
DECOVID, a multi-centre research consortium, was founded in March 2020 by two United Kingdom (UK) National Health Service (NHS) Foundation Trusts (comprising three acute care hospitals) and three research institutes/universities: University Hospitals Birmingham (UHB), University College London Hospitals (UCLH), University of Birmingham, University College London and The Alan Turing Institute. The original aim of DECOVID was to share harmonised electronic health record (EHR) data from UCLH and UHB to enable researchers affiliated with the DECOVID consortium to answer clinical questions to support the COVID-19 response. The DECOVID database has now been placed within the infrastructure of PIONEER, a Health Data Research (HDR) UK funded data hub that contains data from acute care providers, to make the DECOVID database accessible to external researchers not affiliated with the DECOVID consortium.
This highly granular dataset contains 256,804 spells and 165,414 hospitalised patients. The data includes demographics, serial physiological measurements, laboratory test results, medications, procedures, drugs, mortality and readmission.
Geography: UHB is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & > 120 ITU bed capacity. UCLH provides first-class acute and specialist services in six hospitals in central London, seeing more than 1 million outpatient and 100,000 admissions per year. Both UHB and UCLH have fully electronic health records. Data has been harmonised using the OMOP data model. Data set availability: Data access is available via the PIONEER Hub for projects which will benefit the public or patients. This can be by developing a new understanding of disease, by providing insights into how to improve care, or by developing new models, tools, treatments, or care processes. Data access can be provided to NHS, academic, commercial, policy and third sector organisations. Applications from SMEs are welcome. There is a single data access process, with public oversight provided by our public review committee, the Data Trust Committee. Contact pioneer@uhb.nhs.uk or visit www.pioneerdatahub.co.uk for more details.
Available supplementary data: Matched controls; ambulance and community data. Unstructured data (images). We can provide the dataset in other common data models and can build synthetic data to meet bespoke requirements.
Available supplementary support: Analytics, model build, validation & refinement; A.I. support. Data partner support for ETL (extract, transform & load) processes. Bespoke and “off the shelf” Trusted Research Environment (TRE) build and run. Consultancy with clinical, patient & end-user and purchaser access/ support. Support for regulatory requirements. Cohort discovery. Data-driven trials and “fast screen” services to assess population size.
Young people who were in Year 11 in the 2020-2021 academic year were drawn as a clustered and stratified random sample from the National Pupil Database held by the DfE, as well as from a separate sample of independent schools from DfE's Get Information about Schools database. The parents/guardians of the sampled young people were also invited to take part in COSMO. Data from parents/guardians complement the data collected from young people.
Further information about the study may be found on the COVID Social Mobility and Opportunities Study (COSMO) webpage.
COSMO Wave 2, 2022-2023
All young people who took part in Wave 1 (see SN 9000) were invited to the second Wave of the study, along with their parents (whether or not they took part in Wave 1).
Data collection in Wave 2 was carried out between October 2022 and April 2023 where young people and parents/guardians were first invited to a web survey. In addition to online reminders, some non-respondents were followed up via face-to-face visits or telephone calls over the winter and throughout spring. Online ‘mop-up’ fieldwork was also carried out to invite all non-respondents into the survey one last time before the end of fieldwork.
Latest edition information:
For the second edition (April 2024), a standalone dataset from the Keeping in Touch (KIT) exercise carried out after the completion of Wave 2, late 2023 have been deposited. This entailed a very short questionnaire for updating contact details and brief updates on young people's lives. A longitudinal parents dataset has also been deposited, to help data users find core background information from parents who took part in either Wave 1 or Wave 2 in one place. Finally, the young people's dataset has been updated (version 1.1) with additional codes added from some open-ended questions. The COSMO Wave 1 Data User Guide Version 1.1 explains these updates in detail. A technical report and accompanying appendices has also been deposited.
Further information about the study may be found on the COSMO website.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 23-Jun-2023 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman Institute.
The dataset forms part of three outputs from a project titled ‘Fostering cultures of open qualitative research’ which ran from January 2023 to June 2023:
· Fostering cultures of open qualitative research: Dataset 1 – Survey Responses · Fostering cultures of open qualitative research: Dataset 2 – Interview Transcripts · Fostering cultures of open qualitative research: Dataset 3 – Coding Book
The project was funded with £13,913.85 Research England monies held internally by the University of Sheffield - as part of their ‘Enhancing Research Cultures’ scheme 2022-2023.
The dataset aligns with ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee (ref: 051118) on 23-Jan-2021.This includes due concern for participant anonymity and data management.
ORDA has full permission to store this dataset and to make it open access for public re-use on the basis that no commercial gain will be made form reuse. It has been deposited under a CC-BY-NC license.
This dataset comprises one spreadsheet with N=91 anonymised survey responses .xslx format. It includes all responses to the project survey which used Google Forms between 06-Feb-2023 and 30-May-2023. The spreadsheet can be opened with Microsoft Excel, Google Sheet, or open-source equivalents.
The survey responses include a random sample of researchers worldwide undertaking qualitative, mixed-methods, or multi-modal research.
The recruitment of respondents was initially purposive, aiming to gather responses from qualitative researchers at research-intensive (targetted Russell Group) Universities. This involved speculative emails and a call for participant on the University of Sheffield ‘Qualitative Open Research Network’ mailing list. As result, the responses include a snowball sample of scholars from elsewhere.
The spreadsheet has two tabs/sheets: one labelled ‘SurveyResponses’ contains the anonymised and tidied set of survey responses; the other, labelled ‘VariableMapping’, sets out each field/column in the ‘SurveyResponses’ tab/sheet against the original survey questions and responses it relates to.
The survey responses tab/sheet includes a field/column labelled ‘RespondentID’ (using randomly generated 16-digit alphanumeric keys) which can be used to connect survey responses to interview participants in the accompanying ‘Fostering cultures of open qualitative research: Dataset 2 – Interview transcripts’ files.
A set of survey questions gathering eligibility criteria detail and consent are not listed with in this dataset, as below. All responses provide in the dataset gained a ‘Yes’ response to all the below questions (with the exception of one question, marked with an asterisk (*) below):
· I am aged 18 or over · I have read the information and consent statement and above. · I understand how to ask questions and/or raise a query or concern about the survey. · I agree to take part in the research and for my responses to be part of an open access dataset. These will be anonymised unless I specifically ask to be named. · I understand that my participation does not create a legally binding agreement or employment relationship with the University of Sheffield · I understand that I can withdraw from the research at any time. · I assign the copyright I hold in materials generated as part of this project to The University of Sheffield. · * I am happy to be contacted after the survey to take part in an interview.
The project was undertaken by two staff: Co-investigator: Dr. Itzel San Roman Pineda ORCiD ID: 0000-0002-3785-8057 i.sanromanpineda@sheffield.ac.uk
Postdoctoral Research Assistant Principal Investigator (corresponding dataset author): Dr. Matthew Hanchard ORCiD ID: 0000-0003-2460-8638 m.s.hanchard@sheffield.ac.uk Research Associate iHuman Institute, Social Research Institutes, Faculty of Social Science
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data relates to careers advice and guidance to year 11 and 12 pupils in Northern Ireland and their university aspirations. Data is presented for year 11 and 12 pupils on their confidence making career decisions, the support they require to achieve their career goals, their awareness of the all-age Careers Service, their knowledge of how to contact a Careers Adviser outside school and also their university aspirations. The data is derived from the Young Persons’ Behaviour and Attitudes Survey, carried out between September 2019 and February 2020.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 23-Jun-2023 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman Institute. The dataset forms part of three outputs from a project titled ‘Fostering cultures of open qualitative research’ which ran from January 2023 to June 2023:
· Fostering cultures of open qualitative research: Dataset 1 – Survey Responses · Fostering cultures of open qualitative research: Dataset 2 – Interview Transcripts · Fostering cultures of open qualitative research: Dataset 3 – Coding Book
The project was funded with £13,913.85 of Research England monies held internally by the University of Sheffield - as part of their ‘Enhancing Research Cultures’ scheme 2022-2023.
The dataset aligns with ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee (ref: 051118) on 23-Jan-2021. This includes due concern for participant anonymity and data management.
ORDA has full permission to store this dataset and to make it open access for public re-use on the basis that no commercial gain will be made form reuse. It has been deposited under a CC-BY-NC license. Overall, this dataset comprises:
· 15 x Interview transcripts - in .docx file format which can be opened with Microsoft Word, Google Doc, or an open-source equivalent.
All participants have read and approved their transcripts and have had an opportunity to retract details should they wish to do so.
Participants chose whether to be pseudonymised or named directly. The pseudonym can be used to identify individual participant responses in the qualitative coding held within the ‘Fostering cultures of open qualitative research: Dataset 3 – Coding Book’ files.
For recruitment, 14 x participants we selected based on their responses to the project survey., whilst one participant was recruited based on specific expertise.
· 1 x Participant sheet – in .csv format which may by opened with Microsoft Excel, Google Sheet, or an open-source equivalent.
The provides socio-demographic detail on each participant alongside their main field of research and career stage. It includes a RespondentID field/column which can be used to connect interview participants with their responses to the survey questions in the accompanying ‘Fostering cultures of open qualitative research: Dataset 1 – Survey Responses’ files.
The project was undertaken by two staff:
Co-investigator: Dr. Itzel San Roman Pineda ORCiD ID: 0000-0002-3785-8057 i.sanromanpineda@sheffield.ac.uk Postdoctoral Research Assistant Labelled as ‘Researcher 1’ throughout the dataset
Principal Investigator (corresponding dataset author): Dr. Matthew Hanchard ORCiD ID: 0000-0003-2460-8638 m.s.hanchard@sheffield.ac.uk Research Associate iHuman Institute, Social Research Institutes, Faculty of Social Science Labelled as ‘Researcher 2’ throughout the dataset
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The information refers to NI domiciled students enrolled at higher education institutions in the UK. The dataset is collected annually and is based on enrolments in higher education institutions in the UK on 1st December each year. The dataset is collected by the Higher Education Statistics Agency from higher education institutions throughout the UK and provided to the Department for Employment and Learning, Northern Ireland, for analysis. For 2013/14, NI Domiciled enrolments and qualifications at Open University are available. In previous years, these figures were included in NI students studying in England, as the administrative centre of the Open University is located in England. The specification of the HESA Standard Registration Population has changed for 2007/08 enrolments onwards. Writing up and sabbatical students are now excluded from this population where they were previously included in published enrolment data and therefore 2007/08 data onwards cannot be directly compared to previous years.