Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By Jonathan Ortiz [source]
This College Completion dataset provides an invaluable insight into the success and progress of college students in the United States. It contains graduation rates, race and other data to offer a comprehensive view of college completion in America. The data is sourced from two primary sources – the National Center for Education Statistics (NCES)’ Integrated Postsecondary Education System (IPEDS) and Voluntary System of Accountability’s Student Success and Progress rate.
At four-year institutions, the graduation figures come from IPEDS for first-time, full-time degree seeking students at the undergraduate level, who entered college six years earlier at four-year institutions or three years earlier at two-year institutions. Furthermore, colleges report how many students completed their program within 100 percent and 150 percent of normal time which corresponds with graduation within four years or six year respectively. Students reported as being of two or more races are included in totals but not shown separately
When analyzing race and ethnicity data NCES have classified student demographics since 2009 into seven categories; White non-Hispanic; Black non Hispanic; American Indian/ Alaskan native ; Asian/ Pacific Islander ; Unknown race or ethnicity ; Non resident with two new categorize Native Hawaiian or Other Pacific Islander combined with Asian plus students belonging to several races. Also worth noting is that different classifications for graduate data stemming from 2008 could be due to variations in time frame examined & groupings used by particular colleges – those who can’t be identified from National Student Clearinghouse records won’t be subjected to penalty by these locations .
When it comes down to efficiency measures parameters like “Awards per 100 Full Time Undergraduate Students which includes all undergraduate completions reported by a particular institution including associate degrees & certificates less than 4 year programme will assist us here while we also take into consideration measures like expenditure categories , Pell grant percentage , endowment values , average student aid amounts & full time faculty members contributing outstandingly towards instructional research / public service initiatives .
When trying to quantify outcomes back up Median Estimated SAT score metric helps us when it is derived either on 25th percentile basis / 75th percentile basis with all these factors further qualified by identifying required criteria meeting 90% threshold when incoming students are considered for relevance . Last but not least , Average Student Aid equalizes amount granted by institution dividing same over total sum received against what was allotted that particular year .
All this analysis gives an opportunity get a holistic overview about performance , potential deficits &
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset contains data on student success, graduation rates, race and gender demographics, an efficiency measure to compare colleges across states and more. It is a great source of information to help you better understand college completion and student success in the United States.
In this guide we’ll explain how to use the data so that you can find out the best colleges for students with certain characteristics or focus on your target completion rate. We’ll also provide some useful tips for getting the most out of this dataset when seeking guidance on which institutions offer the highest graduation rates or have a good reputation for success in terms of completing programs within normal timeframes.
Before getting into specifics about interpreting this dataset, it is important that you understand that each row represents information about a particular institution – such as its state affiliation, level (two-year vs four-year), control (public vs private), name and website. Each column contains various demographic information such as rate of awarding degrees compared to other institutions in its sector; race/ethnicity Makeup; full-time faculty percentage; median SAT score among first-time students; awards/grants comparison versus national average/state average - all applicable depending on institution location — and more!
When using this dataset, our suggestion is that you begin by forming a hypothesis or research question concerning student completion at a given school based upon observable characteristics like financ...
Facebook
TwitterThere were approximately 18.58 million college students in the U.S. in 2022, with around 13.49 million enrolled in public colleges and a further 5.09 million students enrolled in private colleges. The figures are projected to remain relatively constant over the next few years.
What is the most expensive college in the U.S.? The overall number of higher education institutions in the U.S. totals around 4,000, and California is the state with the most. One important factor that students – and their parents – must consider before choosing a college is cost. With annual expenses totaling almost 78,000 U.S. dollars, Harvey Mudd College in California was the most expensive college for the 2021-2022 academic year. There are three major costs of college: tuition, room, and board. The difference in on-campus and off-campus accommodation costs is often negligible, but they can change greatly depending on the college town.
The differences between public and private colleges Public colleges, also called state colleges, are mostly funded by state governments. Private colleges, on the other hand, are not funded by the government but by private donors and endowments. Typically, private institutions are much more expensive. Public colleges tend to offer different tuition fees for students based on whether they live in-state or out-of-state, while private colleges have the same tuition cost for every student.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Dataset Description
This dataset was created by collecting publicly available university and college information from the U.S. Department of Education’s College Scorecard. It focuses on higher education institutions across the United States as of 2025.
The dataset provides a comprehensive view of universities and colleges, including details on cost, graduation rates, and post-graduation earnings — helping students, researchers, and policymakers analyze educational outcomes and institutional performance nationwide.
The dataset contains 5546 records and includes the following key features:
University_Name: The official name of the college or university.
Location: The city and state where the institution is located.
Type: Indicates the nature of the institution — Public, Private Nonprofit, or For-Profit.
Place: Describes the campus setting, such as City, Suburban, or Rural.
Group: Represents the size or category of the institution (e.g., Small, Medium, Large).
Graduate_Rate: The percentage of students who successfully complete their degree programs.
Annual_Cost: The average yearly cost of attendance, including tuition and fees.
Earnings: The median annual earnings of graduates after completing their degree programs.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about universities in the United States. It has 333 rows. It features 3 columns: country, and total students.
Facebook
TwitterThe College Scorecard dataset is provided by the U.S. Department of Education and contains information on nearly every college and university in the United States. The dataset includes data on student loan repayment rates, graduation rates, affordability, earnings after graduation, and more. The goal of this dataset is to help students make informed decisions about their college choice by providing them with clear and concise information about each school's performance
This dataset can help understand the cost of attending college in the United States, as well as the average debt load for students. It can also be used to compare different schools in terms of their graduation rates and repayment rates
This data was originally collected by the US Department of Education and made available on their website. Thank you to the US Department of Education for making this data available!
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about universities in the United States. It has 333 rows. It features 5 columns: city, total students, foundation year, and graduate students.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Colleges and Universities feature class/shapefile is composed of all Post Secondary Education facilities as defined by the Integrated Post Secondary Education System (IPEDS, http://nces.ed.gov/ipeds/), National Center for Education Statistics (NCES, https://nces.ed.gov/), US Department of Education for the 2018-2019 school year. Included are Doctoral/Research Universities, Masters Colleges and Universities, Baccalaureate Colleges, Associates Colleges, Theological seminaries, Medical Schools and other health care professions, Schools of engineering and technology, business and management, art, music, design, Law schools, Teachers colleges, Tribal colleges, and other specialized institutions. Overall, this data layer covers all 50 states, as well as Puerto Rico and other assorted U.S. territories. This feature class contains all MEDS/MEDS+ as approved by the National Geospatial-Intelligence Agency (NGA) Homeland Security Infrastructure Program (HSIP) Team. Complete field and attribute information is available in the ”Entities and Attributes” metadata section. Geographical coverage is depicted in the thumbnail above and detailed in the "Place Keyword" section of the metadata. This feature class does not have a relationship class but is related to Supplemental Colleges. Colleges and Universities that are not included in the NCES IPEDS data are added to the Supplemental Colleges feature class when found. This release includes the addition of 175 new records, the removal of 468 no longer reported by NCES, and modifications to the spatial location and/or attribution of 6682 records.
Facebook
TwitterThis report provides 2014 National Surveys on Drug Use and Health (NSDUHs) estimates of perceptions of great risk of harm from substance use and perceptions of availability of substances among persons aged 18 to 22 by their full-time college student status. Additionally, the 2014 estimates for full-time college students are compared with estimates from 2002 to 2013 to examine changes in these measures over time.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the State College population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for State College. The dataset can be utilized to understand the population distribution of State College by age. For example, using this dataset, we can identify the largest age group in State College.
Key observations
The largest age group in State College, PA was for the group of age 20-24 years with a population of 15,950 (39.53%), according to the 2021 American Community Survey. At the same time, the smallest age group in State College, PA was the 80-84 years with a population of 316 (0.78%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for State College Population by Age. You can refer the same here
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the State College population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of State College across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.
Key observations
In 2023, the population of State College was 40,687, a 0.77% increase year-by-year from 2022. Previously, in 2022, State College population was 40,378, a decline of 1.54% compared to a population of 41,011 in 2021. Over the last 20 plus years, between 2000 and 2023, population of State College increased by 1,405. In this period, the peak population was 42,559 in the year 2016. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).
When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).
Data Coverage:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for State College Population by Year. You can refer the same here
Facebook
Twitterhttps://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de450955https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de450955
Abstract (en): The American College Catalog Study Database (CCS) contains academic data on 286 four-year colleges and universities in the United States. CCS is one of two databases produced by the Colleges and Universities 2000 project based at the University of California-Riverside. The CCS database comprises a sampled subset of institutions from the related Institutional Data Archive (IDA) on American Higher Education (ICPSR 34874). Coding for CCS was based on college catalogs obtained from College Source, Inc. The data are organized in a panel design, with measurements taken at five-year intervals: academic years 1975-76, 1980-81, 1985-86, 1990-91, 1995-96, 2000-01, 2005-06, and 2010-11. The database is based on information reported in each institution's college catalog, and includes data regarding changes in major academic units (schools and colleges), departments, interdisciplinary programs, and general education requirements. For schools and departments, changes in structure were coded, including new units, name changes, splits in units, units moved to new schools, reconstituted units, consolidated units, departments reduced to program status, and eliminated units. The American College Catalog Study Database (CCS) is intended to allow researchers to examine changes in the structure of institutionalized knowledge in four-year colleges and universities within the United States. For information on the study design, including detailed coding conventions, please see the Original P.I. Documentation section of the ICPSR Codebook. The data are not weighted. Dataset 1, Characteristics Variables, contains three weight variables (IDAWT, CCSWT, and CASEWEIGHT) which users may wish to apply during analysis. For additional information on weights, please see the Original P.I. Documentation section of the ICPSR Codebook. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Checked for undocumented or out-of-range codes.. Response Rates: Approximately 75 percent of IDA institutions are included in CCS. For additional information on response rates, please see the Original P.I. Documentation section of the ICPSR Codebook. Four-year not-for-profit colleges and universities in the United States. Smallest Geographic Unit: state CCS includes 286 institutions drawn from the IDA sample of 384 United States four-year colleges and universities. CCS contains every IDA institution for which a full set of catalogs could be located at the initiation of the project in 2000. CCS contains seven datasets that can be linked through an institutional identification number variable (PROJ_ID). Since the data are organized in a panel format, it is also necessary to use a second variable (YEAR) to link datasets. For a brief description of each CCS dataset, please see Appendix B within the Original P.I. Documentation section of the ICPSR Codebook.There are date discrepancies between the data and the Original P.I. Documentation. Study Time Periods and Collection Dates reflect dates that are present in the data. No additional information was provided.Please note that the related data collection featuring the Institutional Data Archive on American Higher Education, 1970-2011, will be available as ICPSR 34874. Additional information on the American College Catalog Study Database (CCS) and the Institutional Data Archive (IDA) database can be found on the Colleges and Universities 2000 Web site.
Facebook
TwitterReport by NYS High School of the number of SUNY First Time Undergraduates entering in the Term, who reported attending that High School. The total number of First Time Undergraduates at SUNY from each High School is provided. The total is sub-divided by the SUNY Sector of the Institution the student is attending; Doctoral, Comprehensive, Technology and Community College. The report also provides a total count by NYS County, which is an aggregate of all the high schools in that County.
Facebook
Twitterhttps://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
The dataset contains Academic-year- and Country-wise historically compiled data on the total number of International students enrolled for studying Undergraduate, Graduate, Non-Degree and Optional Practical Training (OPT) courses in the United States of America (USA).
Facebook
TwitterColleges and Universities This feature layer, utilizing data from the National Center for Education Statistics (NCES), displays colleges and universities in the U.S. and its territories. NCES uses the Integrated Postsecondary Education Data System (IPEDS) as the "primary source for information on U.S. colleges, universities, and technical and vocational institutions." According to NCES, this layer "contains directory information for every institution in the 2023-24 IPEDS universe. Includes name, address, city, state, zip code and various URL links to the institution"s home page, admissions, financial aid offices and the net price calculator. Identifies institutions as currently active, and institutions that participate in Title IV federal financial aid programs for which IPEDS is mandatory." University of the District of ColumbiaData currency: 2023Data source: IPEDS Complete Data FilesData modification: Removed fields with coded values and replaced with descriptionsFor more information: Integrated Postsecondary Education Data SystemSupport documentation: Data DictionaryFor feedback, please contact: ArcGIScomNationalMaps@esri.com U.S. Department of Education (ED) Per ED, The mission of the Department of Education (ED) is to promote student achievement and preparation for global competitiveness by fostering educational excellence and ensuring equal access for students of all ages.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This repository contains a dataset of higher education institutions in the United States of America. This dataset was compiled in response to a cybersecurity research of American higher education institutions' websites [1]. The data is being made publicly available to promote open science principles [2].
The data includes the following fields for each institution:
The dataset was obtained from the Higher Education Integrated Data System (IPEDS) website [3], which is administered by the National Center for Education Statistics (NCES). NCES serves as the primary federal entity for collecting and analyzing education-related data in the United States. The data was collected on February 2, 2023.
The initial list of institutions was derived from the IPEDS database using the following criteria: (1) US institutions only, (2) degree-granting institutions, primarily bachelor's or higher, and (3) industry classification, which includes: public 4 - year or above, private not-for-profit 4 years or more, private for-profit 4 years or more, public 2 years, private not-for-profit 2 years, private for-profit 2 years, public less than 2 years, private not-for-profit for-profit less than 2 years and private for-profit less than 2 years.
The following variables have been added to the list of institutions: Control of the institution, state abbreviation, degree-granting status, Status of the institution, and Institution's internet website address. This resulted in a report with 1,979 institutions.
The institution's status was labeled with the following values: A (Active), N (New), R (Restored), M (Closed in the current year), C (Combined with another institution), D (Deleted out of business), I (Inactive due to hurricane-related issues), O (Outside IPEDS scope), P (Potential new/add institution), Q (Potential institution reestablishment), W (Potential addition outside IPEDS scope), X ( Potential restoration outside the scope of IPEDS) and G (Perfect Children's Campus).
A filter was applied to the report to retain only institutions with an A, N, or R status, resulting in 1,978 institutions. Finally, a data cleaning process was applied, which involved removing the whitespace at the beginning and end of cell content and duplicate whitespace. The final data were compiled into the dataset included in this repository.
This data is available under the Creative Commons Zero (CC0) license and can be used for any purpose, including academic research purposes. We encourage the sharing of knowledge and the advancement of research in this field by adhering to open science principles [2].
If you use this data in your research, please cite the source and include a link to this repository. To properly attribute this data, please use the following DOI: 10.5281/zenodo.7614862
If you have any updates or corrections to the data, please feel free to open a pull request or contact us directly. Let's work together to keep this data accurate and up-to-date.
We would like to acknowledge the support of the Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), within the project "Cybers SeC IP" (NORTE-01-0145-FEDER-000044). This study was also developed as part of the Master in Cybersecurity Program at the Instituto Politécnico de Viana do Castelo, Portugal.
Facebook
TwitterThis study was designed to collect college student victimization data to satisfy four primary objectives: (1) to determine the prevalence and nature of campus crime, (2) to help the campus community more fully assess crime, perceived risk, fear of victimization, and security problems, (3) to aid in the development and evaluation of location-specific and campus-wide security policies and crime prevention measures, and (4) to make a contribution to the theoretical study of campus crime and security. Data for Part 1, Student-Level Data, and Part 2, Incident-Level Data, were collected from a random sample of college students in the United States using a structured telephone interview modeled after the redesigned National Crime Victimization Survey administered by the Bureau of Justice Statistics. Using stratified random sampling, over 3,000 college students from 12 schools were interviewed. Researchers collected detailed information about the incident and the victimization, and demographic characteristics of victims and nonvictims, as well as data on self-protection, fear of crime, perceptions of crime on campus, and campus security measures. For Part 3, School Data, the researchers surveyed campus officials at the sampled schools and gathered official data to supplement institution-level crime prevention information obtained from the students. Mail-back surveys were sent to directors of campus security or campus police at the 12 sampled schools, addressing various aspects of campus security, crime prevention programs, and crime prevention services available on the campuses. Additionally, mail-back surveys were sent to directors of campus planning, facilities management, or related offices at the same 12 schools to obtain information on the extent and type of planning and design actions taken by the campus for crime prevention. Part 3 also contains data on the characteristics of the 12 schools obtained from PETERSON'S GUIDE TO FOUR-YEAR COLLEGES (1994). Part 4, Census Data, is comprised of 1990 Census data describing the census tracts in which the 12 schools were located and all tracts adjacent to the schools. Demographic variables in Part 1 include year of birth, sex, race, marital status, current enrollment status, employment status, residency status, and parents' education. Victimization variables include whether the student had ever been a victim of theft, burglary, robbery, motor vehicle theft, assault, sexual assault, vandalism, or harassment. Students who had been victimized were also asked the number of times victimization incidents occurred, how often the police were called, and if they knew the perpetrator. All students were asked about measures of self-protection, fear of crime, perceptions of crime on campus, and campus security measures. For Part 2, questions were asked about the location of each incident, whether the offender had a weapon, a description of the offense and the victim's response, injuries incurred, characteristics of the offender, and whether the incident was reported to the police. For Part 3, respondents were asked about how general campus security needs were met, the nature and extent of crime prevention programs and services available at the school (including when the program or service was first implemented), and recent crime prevention activities. Campus planners were asked if specific types of campus security features (e.g., emergency telephone, territorial markers, perimeter barriers, key-card access, surveillance cameras, crime safety audits, design review for safety features, trimming shrubs and underbrush to reduce hiding places, etc.) were present during the 1993-1994 academic year and if yes, how many or how often. Additionally, data were collected on total full-time enrollment, type of institution, percent of undergraduate female students enrolled, percent of African-American students enrolled, acreage, total fraternities, total sororities, crime rate of city/county where the school was located, and the school's Carnegie classification. For Part 4, Census data were compiled on percent unemployed, percent having a high school degree or higher, percent of all persons below the poverty level, and percent of the population that was Black.
Facebook
TwitterThe primary research objective of this study was to examine the prevalence, nature, and reporting of various types of sexual assault experienced by university students in an effort to inform the development of targeted intervention strategies. In addition, the study had two service-oriented objectives: (1) to educate students about various types of sexual assault, how they can maximize their safety, and what they can do if they or someone they know has been victimized and (2) to provide students with information about the campus and community resources that are available should they need assistance or have any concerns or questions. The study involved a Web-based survey of random samples of undergraduate students at two large public universities, one located in the South (University 1) and one located in the Midwest (University 2). Researchers drew random samples of students aged 18-25 and enrolled at least three-quarters' time at each university to participate in the study. The survey was administered in the winter of 2005-2006, and a total of 5,446 undergraduate women and 1,375 undergraduate men participated for a grand total of 6,821 respondents. Sampled students were sent an initial recruitment e-mail that described the study, provided a unique study ID number, and included a hyperlink to the study Web site. During each of the following weeks, students who had not completed the survey were sent follow-up e-mails and a hard-copy letter encouraging them to participate. The survey was administered anonymously and was designed to be completed in an average of 15 minutes. Respondents were provided with a survey completion code that, when entered with their study ID number at a separate Web site, enabled them to obtain a $10 Amazon.com gift certificate. The survey was divided into six modules. The Background Information module included survey items on demographics, school classification (year of study, year of enrollment, transfer status), residential characteristics, academic performance, and school involvement. An Alcohol and Other Drug Use module generated a number of measures of alcohol and drug use, and related substance use behaviors. A Dating module included items on sexual orientation, dating, consensual sexual activity, and dating violence. The Experiences module was developed after extensive reviews of past surveys of sexual assault and generated information on physically forced sexual assault and incapacitated sexual assault. For both physically forced and incapacitated sexual assault, information was collected on completed and attempted assaults experienced before entering college and since entering college. For male respondents, a Behaviors module asking about the perpetration of the same types of sexual assault covered in the Experiences module was included. The final module of the survey covered attitudes about sexual assault and attitudes about the survey. The data file contains 747 variables.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is a compilation of processed data on citation and references for research papers including their author, institution and open access info for a selected sample of academics analysed using Microsoft Academic Graph (MAG) data and CORE. The data for this dataset was collected during December 2019 to January 2020.Six countries (Austria, Brazil, Germany, India, Portugal, United Kingdom and United States) were the focus of the six questions which make up this dataset. There is one csv file per country and per question (36 files in total). More details about the creation of this dataset are available on the public ON-MERRIT D3.1 deliverable report.The dataset is a combination of two different data sources, one part is a dataset created on analysing promotion policies across the target countries, while the second part is a set of data points available to understand the publishing behaviour. To facilitate the analysis the dataset is organised in the following seven folders:PRTThe dataset with the file name "PRT_policies.csv" contains the related information as this was extracted from promotion, review and tenure (PRT) policies. Q1: What % of papers coming from a university are Open Access?- Dataset Name format: oa_status_countryname_papers.csv- Dataset Contents: Open Access (OA) status of all papers of all the universities listed in Times Higher Education World University Rankings (THEWUR) for the given country. A paper is marked OA if there is at least an OA link available. OA links are collected using the CORE Discovery API.- Important considerations about this dataset: - Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. - The service we used to recognise if a paper is OA, CORE Discovery, does not contain entries for all paperids in MAG. This implies that some of the records in the dataset extracted will not have either a true or false value for the _is_OA_ field. - Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q2: How are papers, published by the selected universities, distributed across the three scientific disciplines of our choice?- Dataset Name format: fsid_countryname_papers.csv- Dataset Contents: For the given country, all papers for all the universities listed in THEWUR with the information of fieldofstudy they belong to.- Important considerations about this dataset: * MAG can associate a paper to multiple fieldofstudyid. If a paper belongs to more than one of our fieldofstudyid, separate records were created for the paper with each of those _fieldofstudyid_s.- MAG assigns fieldofstudyid to every paper with a score. We preserve only those records whose score is more than 0.5 for any fieldofstudyid it belongs to.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Q3: What is the gender distribution in authorship of papers published by the universities?- Dataset Name format: author_gender_countryname_papers.csv- Dataset Contents: All papers with their author names for all the universities listed in THEWUR.- Important considerations about this dataset :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- An external script was executed to determine the gender of the authors. The script is available here.Q4: Distribution of staff seniority (= number of years from their first publication until the last publication) in the given university.- Dataset Name format: author_ids_countryname_papers.csv- Dataset Contents: For a given country, all papers for authors with their publication year for all the universities listed in THEWUR.- Important considerations about this work :- When there are multiple collaborators(authors) for the same paper, this dataset makes sure that only the records for collaborators from within selected universities are preserved.- Calculating staff seniority can be achieved in various ways. The most straightforward option is to calculate it as _academic_age = MAX(year) - MIN(year) _for each authorid.Q5: Citation counts (incoming) for OA vs Non-OA papers published by the university.- Dataset Name format: cc_oa_countryname_papers.csv- Dataset Contents: OA status and OA links for all papers of all the universities listed in THEWUR and for each of those papers, count of incoming citations available in MAG.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to.- Only those records marked as true for _is_OA_ field can be said to be OA. Others with false or no value for is_OA field are unknown status (i.e. not necessarily closed access).Q6: Count of OA vs Non-OA references (outgoing) for all papers published by universities.- Dataset Name format: rc_oa_countryname_-papers.csv- Dataset Contents: Counts of all OA and unknown papers referenced by all papers published by all the universities listed in THEWUR.- Important considerations about this dataset :- CORE Discovery was used to establish the OA status of papers being referenced.- Papers with multiple authorship are preserved only once towards each of the distinct institutions their authors may belong to. Papers with authorship from multiple universities are counted once towards each of the universities concerned.Additional files:- _fieldsofstudy_mag_.csv: this file contains a dump of fieldsofstudy table of MAG mapping each of the ids to their actual field of study name.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the population of State College by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of State College across both sexes and to determine which sex constitutes the majority.
Key observations
There is a majority of male population, with 53.74% of total population being male. Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Scope of gender :
Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for State College Population by Gender. You can refer the same here
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains information about the relationship between the frequency of library visits by undergraduate residential students and their academic performance, measured by GPA, during the Fall 2023 and Spring 2024 semesters at a private University in the United States of America. The data comprises anonymized library gate entry records and GPA scores, filtered to include students aged 17 and older.Dataset Details:Total Records: 3,340Semester Coverage: Fall 2023 and Spring 2024Student Demographics: Undergraduate residential students aged 17 and olderData Fields:Student ID (Anonymized)AgeSemesterGPALibrary Visits (Total)Library Visits per WeekWeeks with Library Visits
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By Jonathan Ortiz [source]
This College Completion dataset provides an invaluable insight into the success and progress of college students in the United States. It contains graduation rates, race and other data to offer a comprehensive view of college completion in America. The data is sourced from two primary sources – the National Center for Education Statistics (NCES)’ Integrated Postsecondary Education System (IPEDS) and Voluntary System of Accountability’s Student Success and Progress rate.
At four-year institutions, the graduation figures come from IPEDS for first-time, full-time degree seeking students at the undergraduate level, who entered college six years earlier at four-year institutions or three years earlier at two-year institutions. Furthermore, colleges report how many students completed their program within 100 percent and 150 percent of normal time which corresponds with graduation within four years or six year respectively. Students reported as being of two or more races are included in totals but not shown separately
When analyzing race and ethnicity data NCES have classified student demographics since 2009 into seven categories; White non-Hispanic; Black non Hispanic; American Indian/ Alaskan native ; Asian/ Pacific Islander ; Unknown race or ethnicity ; Non resident with two new categorize Native Hawaiian or Other Pacific Islander combined with Asian plus students belonging to several races. Also worth noting is that different classifications for graduate data stemming from 2008 could be due to variations in time frame examined & groupings used by particular colleges – those who can’t be identified from National Student Clearinghouse records won’t be subjected to penalty by these locations .
When it comes down to efficiency measures parameters like “Awards per 100 Full Time Undergraduate Students which includes all undergraduate completions reported by a particular institution including associate degrees & certificates less than 4 year programme will assist us here while we also take into consideration measures like expenditure categories , Pell grant percentage , endowment values , average student aid amounts & full time faculty members contributing outstandingly towards instructional research / public service initiatives .
When trying to quantify outcomes back up Median Estimated SAT score metric helps us when it is derived either on 25th percentile basis / 75th percentile basis with all these factors further qualified by identifying required criteria meeting 90% threshold when incoming students are considered for relevance . Last but not least , Average Student Aid equalizes amount granted by institution dividing same over total sum received against what was allotted that particular year .
All this analysis gives an opportunity get a holistic overview about performance , potential deficits &
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset contains data on student success, graduation rates, race and gender demographics, an efficiency measure to compare colleges across states and more. It is a great source of information to help you better understand college completion and student success in the United States.
In this guide we’ll explain how to use the data so that you can find out the best colleges for students with certain characteristics or focus on your target completion rate. We’ll also provide some useful tips for getting the most out of this dataset when seeking guidance on which institutions offer the highest graduation rates or have a good reputation for success in terms of completing programs within normal timeframes.
Before getting into specifics about interpreting this dataset, it is important that you understand that each row represents information about a particular institution – such as its state affiliation, level (two-year vs four-year), control (public vs private), name and website. Each column contains various demographic information such as rate of awarding degrees compared to other institutions in its sector; race/ethnicity Makeup; full-time faculty percentage; median SAT score among first-time students; awards/grants comparison versus national average/state average - all applicable depending on institution location — and more!
When using this dataset, our suggestion is that you begin by forming a hypothesis or research question concerning student completion at a given school based upon observable characteristics like financ...