ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset shows the location of Higher Education (HE) and Further Education (FE) institutes in the Great Britain. This should cover Universities and Colleges. Many institutes have more than one campus and where possible this is refelcted in the data so a University may have more than one entry. Postcodes have also been included for instities where possible. This data was collected from various sources connected with HEFE in the UK including JISC and EDINA. This represents the fullest list that the author could compile from various sources. If you spot a missing institution, please contact the author and they will add it to the dataset. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2011-02-01 and migrated to Edinburgh DataShare on 2017-02-21.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is a compilation of processed data on citation and references for research papers including their author, institution and open access info for a selected sample of academics analysed using Microsoft Academic Graph (MAG) data and CORE. The data for this dataset was collected during December 2019 to January 2020.Six countries (Austria, Brazil, Germany, India, Portugal, United Kingdom and United States) were the focus of the six questions which make up this dataset. There is one csv file per country and per question (36 files in total). More details about the creation of this dataset are available on the public ON-MERRIT D3.1 deliverable report.The dataset is a combination of two different data sources, one part is a dataset created on analysing promotion policies across the target countries, while the second part is a set of data points available to understand the publishing behaviour. To facilitate the analysis the dataset is organised in the following seven folders:PRTThe dataset with the file name "PRT_policies.csv" contains the related information as this was extracted from promotion, review and tenure (PRT) policies. Q1: What % of papers coming from a university are Open Access?- Dataset Name format: oa_status_countryname_papers.csv- Dataset Contents: Open Access (OA) status of all papers of all the universities listed in Times Higher Education World University Rankings (THEWUR) for the given country. A paper is marked OA if there is at least an OA link available. OA links are collected using the CORE Discovery API.- Important considerations about this dataset: - Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. - The service we used to recognise if a paper is OA, CORE Discovery, does not contain entries for all paperids in MAG. This implies that some of the records in the dataset extracted will not have either a true or false value for the _is_OA_ field. - Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q2: How are papers, published by the selected universities, distributed across the three scientific disciplines of our choice?- Dataset Name format: fsid_countryname_papers.csv- Dataset Contents: For the given country, all papers for all the universities listed in THEWUR with the information of fieldofstudy they belong to.- Important considerations about this dataset: * MAG can associate a paper to multiple fieldofstudyid. If a paper belongs to more than one of our fieldofstudyid, separate records were created for the paper with each of those _fieldofstudyid_s.- MAG assigns fieldofstudyid to every paper with a score. We preserve only those records whose score is more than 0.5 for any fieldofstudyid it belongs to.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Q3: What is the gender distribution in authorship of papers published by the universities?- Dataset Name format: author_gender_countryname_papers.csv- Dataset Contents: All papers with their author names for all the universities listed in THEWUR.- Important considerations about this dataset :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- An external script was executed to determine the gender of the authors. The script is available here.Q4: Distribution of staff seniority (= number of years from their first publication until the last publication) in the given university.- Dataset Name format: author_ids_countryname_papers.csv- Dataset Contents: For a given country, all papers for authors with their publication year for all the universities listed in THEWUR.- Important considerations about this work :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- Calculating staff seniority can be achieved in various ways. The most straightforward option is to calculate it as _academic_age = MAX(year) - MIN(year) _for each authorid.Q5: Citation counts (incoming) for OA vs Non-OA papers published by the university.- Dataset Name format: cc_oa_countryname_papers.csv- Dataset Contents: OA status and OA links for all papers of all the universities listed in THEWUR and for each of those papers, count of incoming citations available in MAG.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to.- Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q6: Count of OA vs Non-OA references (outgoing) for all papers published by universities.- Dataset Name format: rc_oa_countryname_-papers.csv- Dataset Contents: Counts of all OA and unknown papers referenced by all papers published by all the universities listed in THEWUR.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers being referenced.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Additional files:- _fieldsofstudy_mag_.csv: this file contains a dump of fieldsofstudy table of MAG mapping each of the ids to their actual field of study name.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Presents statistics on student support paid to students in the form of loans and grants or to their University/College in the form of tution fees. The students are English domiciles studying anywhere in the UK or EU students studying in England. Source agency: Student Loans Company Designation: National Statistics Language: English Alternative title: Student Support for Higher Education in England Data and Resources 2012/13 (Final) and 2013/14 (Provisional)HTML 2012/13 (Final) and 2013/14 (Provisional)
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The information refers to NI domiciled students enrolled at higher education institutions in the UK. The dataset is collected annually and is based on enrolments in higher education institutions in the UK on 1st December each year. The dataset is collected by the Higher Education Statistics Agency from higher education institutions throughout the UK and provided to the Department for Employment and Learning, Northern Ireland, for analysis. For 2013/14, NI Domiciled enrolments and qualifications at Open University are available. In previous years, these figures were included in NI students studying in England, as the administrative centre of the Open University is located in England. The specification of the HESA Standard Registration Population has changed for 2007/08 enrolments onwards. Writing up and sabbatical students are now excluded from this population where they were previously included in published enrolment data and therefore 2007/08 data onwards cannot be directly compared to previous years.
This dataset compares existing research data policies at UK higher education institutions. It consists of 83 cases. Polices were compared on a range of variables. Variables included policy length in words, whether the policy offers definitions, length of their definition of "data", defines institutional support, requires data management plans, states scope of staff and student coverage, specifies ownership of research outputs, details where external funder rights take precedent, guides on what data and documentation is required to be retained, how long it needs to be retained, reinforces where research ethics prevent open data, finalises where data can be accessed, speaks about open data requirements, includes a statement on funding the costs of Research Data Management, and specifies a review period for the policy. Data also includes the institution's year of foundation and a categorical variable grouping institutions by year of foundation allowing comparison across cohort groups of universities. A further two variables allow for identification of research based universities. Data on total research funding and research council for the year 2014/2015 was added, along with the number of research staff eligible for the 2014 UK Research Excellence Framework (REF). Also included is the institution's Grade Point Average based on its REF score using a Times Higher Education (THES) calculated score. Data collection was based on a list of UK Higher Education Institutions with data policies. This list was provided by the Digital Curation Centre. I also conducted a google search for UK university data policies to discover additional institutions that had adopted Research Data Management requirements. The data does not include 'Roadmaps' to EPSRC compliance.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The information refers to NI domiciled students gaining higher education qualifications from UK higher education institutions. The dataset is collected annually and is based on students obtaining a qualification at UK higher education institutions. The dataset is collected by the Higher Education Statistics Agency from higher education institutions throughout the UK and provided to the Department for Employment and Learning, Northern Ireland, for analysis. For 2013/14, NI Domiciled enrolments and qualifications at Open University are available. In previous years, these figures were included in NI students studying in England, as the administrative centre of the Open University is located in England.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We recruited 835 faculty members from 40 universities in the United Kingdom (UK) via our networks within UK STEM departments. Participants were drawn from various STEM departments, including biological science (18%), computer science (7%), engineering (28%) mathematical science (16%), and physics (13%). Respondents completed an online survey in which details about their employment were collected at the beginning and additional demographic information was collected at the end. The middle section of the survey contained measures of: identity and career perceptions; staying in academia; collaborative working style, received opportunities; workplace diversity and inclusion and affective workplace climate; experience of harassment; and assessment of a workshop intervention.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 14-Dec-2023 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman Institute. The dataset forms part of the outputs from a project titled ‘Fostering cultures of open qualitative research’ which ran from January 2023 to June 2023, and was funded with £13,913.85 of Research England monies held internally by the University of Sheffield as part of their ‘Enhancing Research Cultures’ scheme 2022-2023. The dataset aligns with ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee (ref: 051118) on 23-Jan-2023. This includes due concern for participant anonymity and data management. ORDA has full permission to store this dataset and to make it open access for public re-use on the basis that no commercial gain will be made from reuse. It has been deposited under a CC-BY-NC license. Overall, this dataset comprises: 1 x Workshop transcript - in .docx file format which can be opened with Microsoft Word, Google Doc, or an open-source equivalent. The workshop took place on 18-Jul-2023 at the Wave Building, University of Sheffield. All five attendees have read and approved a portion of transcripts containing their own discussion. All workshop attendees have had an opportunity to retract details should they wish to do so. All workshop attendees have chosen whether to be pseudonymised or named directly. The pseudonym or real name can be used to identify individual participant responses in the qualitative coding held within accompanying dataset from the same project - Survey Responses: Hanchard M and San Roman Pineda I (2023) Fostering cultures of open qualitative research: Dataset 1 – Survey Responses. The University of Sheffield. DOI: 10.15131/shef.data.23567250.v1. Interviews: Hanchard M and San Roman Pineda I (2023) Fostering cultures of open qualitative research: Dataset 2 – Interview Transcripts. The University of Sheffield. DOI: 10.15131/shef.data.23567223.v2. As a limitation, the audio recording of the workshop session that this transcript is based upon is missing a section (due to a recording error) and may contain errors/inaccuracies (due to poor audio conditions within the workshop room). Every effort has been taken to correct these, including participants themselves reviewing their discussion/quotes, but the transcript may still contain minor inaccuracies, typos, and/or other errors in the text - as is noted on the transcript itself. The project was undertaken by two staff: Co-investigator: Dr. Itzel San Roman Pineda (Postdoctoral Research Assistant) ORCiD ID: 0000-0002-3785-8057 i.sanromanpineda@sheffield.ac.uk Labelled as ‘Researcher 1’ throughout all project datasets. Principal Investigator (corresponding dataset author): Dr. Matthew Hanchard (Research Associate) ORCiD ID: 0000-0003-2460-8638 m.s.hanchard@sheffield.ac.uk iHuman Institute, Social Research Institutes, Faculty of Social Science Labelled as ‘Researcher 2’ throughout all project datasets.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This Statistical First Release (SFR) is the annual first release of HESA Destinations of Leavers from Higher Education (DLHE) data. In previous years it has been titled Destinations of Leavers from Higher Education in the United Kingdom. This release focuses on all publicly funded UK HE providers and the University of Buckingham. It also includes, for 2016/17, data for leavers from HE level courses at further education (FE) colleges in Wales (of which there were 355 leavers in the DLHE target population). All are fully subscribed members of HESA.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 23-Jun-2023 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman Institute. The dataset forms part of three outputs from a project titled ‘Fostering cultures of open qualitative research’ which ran from January 2023 to June 2023:
· Fostering cultures of open qualitative research: Dataset 1 – Survey Responses · Fostering cultures of open qualitative research: Dataset 2 – Interview Transcripts · Fostering cultures of open qualitative research: Dataset 3 – Coding Book
The project was funded with £13,913.85 of Research England monies held internally by the University of Sheffield - as part of their ‘Enhancing Research Cultures’ scheme 2022-2023.
The dataset aligns with ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee (ref: 051118) on 23-Jan-2021. This includes due concern for participant anonymity and data management.
ORDA has full permission to store this dataset and to make it open access for public re-use on the basis that no commercial gain will be made form reuse. It has been deposited under a CC-BY-NC license. Overall, this dataset comprises:
· 15 x Interview transcripts - in .docx file format which can be opened with Microsoft Word, Google Doc, or an open-source equivalent.
All participants have read and approved their transcripts and have had an opportunity to retract details should they wish to do so.
Participants chose whether to be pseudonymised or named directly. The pseudonym can be used to identify individual participant responses in the qualitative coding held within the ‘Fostering cultures of open qualitative research: Dataset 3 – Coding Book’ files.
For recruitment, 14 x participants we selected based on their responses to the project survey., whilst one participant was recruited based on specific expertise.
· 1 x Participant sheet – in .csv format which may by opened with Microsoft Excel, Google Sheet, or an open-source equivalent.
The provides socio-demographic detail on each participant alongside their main field of research and career stage. It includes a RespondentID field/column which can be used to connect interview participants with their responses to the survey questions in the accompanying ‘Fostering cultures of open qualitative research: Dataset 1 – Survey Responses’ files.
The project was undertaken by two staff:
Co-investigator: Dr. Itzel San Roman Pineda ORCiD ID: 0000-0002-3785-8057 i.sanromanpineda@sheffield.ac.uk Postdoctoral Research Assistant Labelled as ‘Researcher 1’ throughout the dataset
Principal Investigator (corresponding dataset author): Dr. Matthew Hanchard ORCiD ID: 0000-0003-2460-8638 m.s.hanchard@sheffield.ac.uk Research Associate iHuman Institute, Social Research Institutes, Faculty of Social Science Labelled as ‘Researcher 2’ throughout the dataset
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Presents statistics on student support paid to students in the form of loans and grants or to their University/College in the form of tution fees. The students are Northern Ireland domiciles studying anywhere in the UK, ROI or EU students studying in Northern Ireland. Source agency: Student Loans Company Designation: Official Statistics not designated as National Statistics Language: English Alternative title: Student Support for Higher Education in Northern Ireland
This dataset presents a cluster analysis of UK universities based on four synthetic environments: social, cultural, physical and economic. These were developed based on variables that represented an educational ecosystem of well-being. The cluster analysis was initially linked to the LSYPE-Secure dataset using the UKPRNs (i.e. higher education institutional number) and hence the cluster analysis used data from around 2009-2012 to represent Wave 6 and Wave 7 of the LSYPE-Secure dataset. The cluster analysis was based on using a variety of variables available from HESA and the Office for Students (OfS) to represent these environments, for example: Social: had demographics of students and staff including ethnicity and sex Cultural: had data on research and teaching scores Economic: had data on student: staff ratio and expenditure Physical: had data related to the built and natural environment including residential sites, blue and green spacesEarlier last year (April 2018), the UK Office for Students (OfS) noted that students from underrepresented groups such as black and minority ethnic (BME) students and those from disadvantaged backgrounds were less likely to succeed at university. Coupled with this, research has shown that students from these groups are also more likely to have poorer mental health and wellbeing. However, there is substantial social and political pressure on universities to act to improve student mental health. For example, the Telegraph ran the headline "Do British universities have a suicide problem?" Thus, in June 2018, the Hon. Sam Gyimah, the then UK universities minister, informed university vice-chancellors that student mental health and wellbeing has to be one of their top priorities. Universities are investing substantive sums in activities to tackle student mental health but doing so with no evidence base to guide strategic policy and practice. These activities may potentially be ineffective, financially wasteful, and possibly, counter-productive. Therefore, we need a better evidence base which this project intends to fulfil. Currently, there is a lack of evidence and understanding about which groups of young people going to universities may have poorer life outcomes (such as education, employment, and mental health and well-being) as a result of their mental health and wellbeing during their adolescent years. These life outcomes and their mental health and wellbeing, however, are important for understanding the context of the complex social identities of the young people, such as the intersections between their gender, ethnicity, sexuality, religion and socio-economic status. Otherwise, these young people may feel misunderstood or judged. Most of the large body of quantitative research on life outcomes tend to focus on one social characteristic/identity of the student, such as the young person's gender or ethnicity or socio-economic status, but not the combination of all of these, i.e. the intersectionalities. Primarily, the reason for this has been the lack of sufficient data. This research draws on data from the Longitudinal Study of Young People in England (LSYPE), which tracked over 15,000 adolescents' education and health over 7 years between 2004-2010 (from when they were 13-19 years old), and the Next Steps Survey, which collected data from the same individuals in 2015 when they were 25 years and in the job market. This dataset also had an ethnic boost, which thus allows for the exploratory analysis of intersectionalities. Currently, there are a number of interventions being implemented to improve the university environment. However, there is a lack of evidence on how the university environment (such as their its size, amount of academic support available, availability of sports activities, students' sense of belonging, etc.) can affect the young person'students' mental health and wellbeing life outcomes. This evidence can be determined through by using the LSYPE data supplemented and by university environment data supplemented from the National Student Survey (NSS) and the Higher Education Statistics Agency (HESA). Thus this research uses an intersectional approach to investigate the extent to which the life outcomes of young persons who go to university are affected by their social inequality groupings and mental health and well-being during adolescence. Additionally, this research also aims to determine the characteristics of university environments that can improve the life outcomes of these young people depending on their social and mental health/wellbeing background. We use secondary data analysis of mainly HESA and OfS variables and created derived variables.
Background:
The Millennium Cohort Study (MCS) is a large-scale, multi-purpose longitudinal dataset providing information about babies born at the beginning of the 21st century, their progress through life, and the families who are bringing them up, for the four countries of the United Kingdom. The original objectives of the first MCS survey, as laid down in the proposal to the Economic and Social Research Council (ESRC) in March 2000, were:
Further information about the MCS can be found on the Centre for Longitudinal Studies web pages.
The content of MCS studies, including questions, topics and variables can be explored via the CLOSER Discovery website.
The seventh sweep of the Millennium Cohort Study (MCS7) was carried out when the cohort members were 17 years old. As 17 is a key transitional age, the sweep purposefully focused on engaging with the cohort members themselves (in addition to their parents). MCS7 marks an important transitional time in the cohort members' lives, where educational and occupational paths can diverge significantly. It is also an important age in data collection terms since it may be the last sweep at which parents are interviewed and it is an age when direct engagement with the cohort members themselves rather than their families is crucial to the long term viability of the study. To reflect this, face-to-face interviews with the cohort members have been conducted for the first time. Cohort members were also asked to do a range of other activities including filling in a self-completion questionnaire on the interviewer's tablet, completing a cognitive assessment (number activity) and having their height, weight and body fat measurements taken. In addition, they were asked to complete a short online questionnaire after the visit.
Parents were still interviewed at MCS7. Resident parents were asked to complete a household interview and a short online questionnaire, and one parent was asked to complete a Strengths and Difficulties Questionnaire (SDQ) about the cohort member. Cohort members who were either unable or unwilling to complete the main survey were asked to complete a short follow up questionnaire online after the fieldwork finished. This contained some key questions and was designed to boost response and maintain engagement.
For the second edition (March 2021), two new data files have been added (mcs7_cm_qualifications and mcs7_parent_derived), and five existing data files have been updated (mcs7_cm_derived, mcs7_cm_interview, mcs7_hhgrid, mcs7_parent_cm_interview, cs7_parent_interview). In addition the User Guide, the Derived Variables User Guide and the Longitudinal Data Dictionary have all been updated.
https://www.insight.hdrhub.org/https://www.insight.hdrhub.org/
Background Glaucoma is a worldwide leading cause of irreversible sight loss. Worldwide, an estimated 60 million people have glaucoma. Glaucoma is a condition of increased intraocular pressure in the eye. Because it may be asymptomatic until a relatively late stage, diagnosis is frequently delayed. There are four general categories of glaucoma: primary open-angle and angle-closure, and secondary open and angle-closure glaucoma.
The UHB glaucoma dataset is a longitudinal dataset consisting of routinely collected clinical metadata from patients receiving treatment for glaucoma at UHB, from 2007 to the present.
This dataset encompasses all patients at UHB who have received a diagnosis of primary or secondary glaucoma or ocular hypertension. Clinical metadata includes demographic information, visual acuities, central corneal thickness, intraocular pressure, optic nerve head findings, and mean deviation of the Humphrey visual fields.
This dataset is continuously updating, however, as of 1st October 2021, it consisted of 5065 people This is a large single centre database from patients with glaucoma and covers more than a decade of follow-up for these patients.
Geography The Queen Elizabeth Hospital is one of the largest single-site hospitals in the United Kingdom, with 1,215 inpatient beds. Queen Elizabeth Hospital is part of one of the largest teaching trusts in England (University Hospitals Birmingham). Set within the West Midlands and it has a catchment population of circa 5.9million. The region includes a diverse ethnic, and socio-economic mix, with a higher than UK average of minority ethnic groups. It has a large number of elderly residents but is the youngest population in the UK. There are particularly high rates of diabetes, physical inactivity, obesity, and smoking.
Data source: Ophthalmology department at Queen Elizabeth Hospital, University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This data relates to careers advice and guidance to year 11 and 12 pupils in Northern Ireland and their university aspirations. Data is presented for year 11 and 12 pupils on their confidence making career decisions, the support they require to achieve their career goals, their awareness of the all-age Careers Service, their knowledge of how to contact a Careers Adviser outside school and also their university aspirations. The data is derived from the Young Persons’ Behaviour and Attitudes Survey, carried out between September 2019 and February 2020.
https://vocab.nerc.ac.uk/collection/L08/current/LI/https://vocab.nerc.ac.uk/collection/L08/current/LI/
The Changing Arctic Ocean (CAO) oceanographic dataset comprises data collected in the Arctic Ocean, including the Barents Sea and Fram Strait, as part of the Changing Arctic Ocean programme. The data were collected over multiple research cruises starting in June 2017. The majority of these cruises were conducted during the Arctic summer on board the RRS James Clark Ross, with further winter cruises completed in collaboration with the Nansen Legacy project on board the RV Helmer Hanssen. Shipboard data collection included the deployment of conductivity-temperature-depth (CTD) packages, ocean seagliders, mulitcorers, grabs, nets, trawls, and a shelf underwater camera system. The CAO programme aims to understand the changes in Arctic marine ecosystem in a quantifiable way, enabling computer models to help predict the consequences of these changes on, for example; surface ocean productivity; species distributions; food webs; and ecosystems, and the services they provide (ecosystem services). It was initially a Natural Environment Research Council (NERC) funded programme comprising four projects: Arctic PRIZE (Arctic productivity in the seasonal ice zone), led by Finlo Cottier (Scottish Association for Marine Science - SAMS); ARISE (Can we detect changes in Arctic ecosystems?), led by Claire Mahaffey (University of Liverpool); ChAOS (The Changing Arctic Ocean Seafloor), led by Christian Maerz (University of Leeds) and DIAPOD (Mechanistic understanding of the role of diatoms in the success of the Arctic Calanus complex and implications for a warmer Arctic), led by David Pond (University of Stirling). Additional projects were added to the programme in July 2018 through funding provided by NERC and the German Federal Ministry of Education and Research (BMBF). The majority of data are held by the British Oceanographic Data Centre (BODC) but a proportion of the data, primarily biological, are stored at the British Antarctic Survey Polar Data Centre (polardatacentre@bas.ac.uk) and any BMBF funded data are held by Pangaea (https://www.pangaea.de/).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset containing the images and labels for the Language data used in the CVPR NAS workshop Unseen-data challenge under the codename "LaMelo"The Language dataset is a constructed dataset using words from aspell dictionaries. The intention of this dataset is to require machine learning models to not only perform image classification but also linguistic analysis to figure out which letter frequency is associated with each language. For each Language image we selected four six-letter words using the standard latin alphabet and removed any words with letters that used diacritics (such as ́e or ̈u) or included ‘y’ or ‘z’.We encode these words on a graph with one axis representing the index of the 24 character long string (the four words joined together) and the other representing the letter (going A-X).The data is in a channels-first format with a shape of (n, 1, 24, 24) where n is the number of samples in the corresponding set (50,000 for training, 10,000 for validation, and 10,000 for testing).There are ten classes in the dataset, with 7,000 examples of each, distributed evenly between the three subsets.The ten classes and corresponding numerical label are as follows:English: 0,Dutch: 1,German: 2,Spanish: 3,French: 4,Portuguese: 5,Swahili: 6,Zulu: 7,Finnish: 8,Swedish: 9
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
RESPOND project produced a high level of empirical material in 11 countries (Sweden, the UK, Germany, Italy, Poland, Austria, Greece, Bulgaria, Turkey, Iraq, and Lebanon) where the research is conducted between the period 2017-2020. The country teams gathered macro (policies), meso (implementation/stakeholders) and micro (individuals/asylum seekers and refuges) level data related to the thematic fields formulated in four work packages: borders, protection regimes, reception, and integration. An important contribution of this research has been its micro/individual focus which enabled the research teams to capture and understand the migration experiences of asylum seekers and refugees and their responses to the policies and obstacles that they have encountered.
Country teams conducted in total 539 interviews with refugees and asylum seekers, and more than 210 interviews with stakeholders (state and non-state actors) working in the field of migration. Additionally, the project has conducted a survey study in Sweden and Turkey (n=700 in each country), covering similar topics.
This dataset is only about the micro part of the Respond research, and reflects data derived out of 539 interviews conducted with asylum seekers and refugees in 11 countries and here presented in a quantitative form. The whole dataset is structured along the work package topics: Border, Protection, Reception and Integration.
This dataset is prepared as part of Work Package D4.4 (Dataset on Reception) the Horizon 2020 RESPOND project as a joint effort of the below listed project partners.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Presents statistics on student support paid to students in the form of loans and grants or to their University/College in the form of tution fees. The students are Welsh domiciles studying anywhere in the UK or EU students studying in Wales. Source agency: Student Loans Company Designation: Official Statistics not designated as National Statistics Language: English Alternative title: Student Support for Higher Education in Wales Data and Resources Academic Year 2012/13 (Final) and 2013/14 (Provisional)HTML Academic Year 2012/13 (Final) and 2013/14 (Provisional)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Open University (OU) dataset is an open database containing student demographic and click-stream interaction with the virtual learning platform. The available data are structured in different CSV files. You can find more information about the original dataset at the following link: https://analyse.kmi.open.ac.uk/open_dataset.
We extracted a subset of the original dataset that focuses on student information. 25,819 records were collected referring to a specific student, course and semester. Each record is described by the following 20 attributes: code_module, code_presentation, gender, highest_education, imd_band, age_band, num_of_prev_attempts, studies_credits, disability, resource, homepage, forum, glossary, outcontent, subpage, url, outcollaborate, quiz, AvgScore, count.
Two target classes were considered, namely Fail and Pass, combining the original four classes (Fail and Withdrawn and Pass and Distinction, respectively). The final_result attribute contains the target values.
All features have been converted to numbers for automatic processing.
Below is the mapping used to convert categorical values to numeric:
code_module: 'AAA'=0, 'BBB'=1, 'CCC'=2, 'DDD'=3, 'EEE'=4, 'FFF'=5, 'GGG'=6
code_presentation: '2013B'=0, '2013J'=1, '2014B'=2, '2014J'=3
gender: 'F'=0, 'M'=1
highest_education: 'No_Formal_quals'=0, 'Post_Graduate_Qualification'=1, 'HE_Qualification'=2, 'Lower_Than_A_Level'=3, 'A_level_or_Equivalent'=4
IMBD_band: 'unknown'=0, 'between_0_and_10_percent'=1, 'between_10_and_20_percent'=2, 'between_20_and_30_percent'=3, 'between_30_and_40_percent'=4, 'between_40_and_50_percent'=5, 'between_50_and_60_percent'=6, 'between_60_and_70_percent'=7, 'between_70_and_80_percent'=8, 'between_80_and_90_percent'=9, 'between_90_and_100_percent'=10
age_band: 'between_0_and_35'=0, 'between_35_and_55'=1, 'higher_than_55'=2
disability: 'N'=0, 'Y'=1
student's outcome: 'Fail'=0, 'Pass'=1
For more detailed information, please refer to:
Casalino G., Castellano G., Vessio G. (2021) Exploiting Time in Adaptive Learning from Educational Data. In: Agrati L.S. et al. (eds) Bridges and Mediation in Higher Distance Education. HELMeTO 2020. Communications in Computer and Information Science, vol 1344. Springer, Cham. https://doi.org/10.1007/978-3-030-67435-9_1
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset shows the location of Higher Education (HE) and Further Education (FE) institutes in the Great Britain. This should cover Universities and Colleges. Many institutes have more than one campus and where possible this is refelcted in the data so a University may have more than one entry. Postcodes have also been included for instities where possible. This data was collected from various sources connected with HEFE in the UK including JISC and EDINA. This represents the fullest list that the author could compile from various sources. If you spot a missing institution, please contact the author and they will add it to the dataset. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2011-02-01 and migrated to Edinburgh DataShare on 2017-02-21.