100+ datasets found

o
Harvard Electroencephalography Database
registry.opendata.aws
bdsp.io
Updated Jun 20, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Brain Data Science Platform (2023). Harvard Electroencephalography Database [Dataset]. https://registry.opendata.aws/bdsp-harvard-eeg/
Explore at:
Dataset updated
Jun 20, 2023
Dataset provided by
<a href="https://bdsp.io/">Brain Data Science Platform</a>
Description
The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University:Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH).
b
Harvard-Emory ECG Database
bdsp.io
registry.opendata.aws
Updated Jul 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zuzana Koscova; Valdery Moura Junior; Matthew Reyna; Shenda Hong; Aditya Gupta; Manohar Ghanta; Reza Sameni; Aaron Aguirre; Qiao Li; Sahar Zafar; Gari Clifford; M Brandon Westover (2025). Harvard-Emory ECG Database [Dataset]. http://doi.org/10.60508/rv6h-7d10
Explore at:
Unique identifier
https://doi.org/10.60508/rv6h-7d10
Dataset updated
Jul 28, 2025
Authors
Zuzana Koscova; Valdery Moura Junior; Matthew Reyna; Shenda Hong; Aditya Gupta; Manohar Ghanta; Reza Sameni; Aaron Aguirre; Qiao Li; Sahar Zafar; Gari Clifford; M Brandon Westover
License
https://github.com/bdsp-core/bdsp-license-and-duahttps://github.com/bdsp-core/bdsp-license-and-dua
Description
The Harvard-Emory ECG database (HEEDB) is a large collection of 12-lead electrocardiography (ECG) recordings, prepared through a collaboration between Harvard University and Emory University investigators.

In version 1.0 of the database, these ECGs from Massachusetts General Brigham hospital sites were provided without labels or metadata, to enable pre-training of ECG analysis models.

In version 2.0, metadata is included.

In version 3.0, Emory ECGs are included together with metadata, labels from the 12SL ECG analysis program (GE Healthcare ) and ICD-9/10 codes.

In version 4.0, typos were corrected in the data description.

HEEDB is published as part of the Human Sleep Project (HSP), funded by a grant (R01HL161253) from the National Heart Lung and Blood Institute (NHLBI) of the NIH to Massachusetts General Hospital, Emory University, Stanford University, Kaiser Permanente, Boston Children's Hospital, and Beth Israel Deaconess Medical Center.
H
FAVOR Essential Database
dataverse.harvard.edu
search.dataone.org
Updated Apr 12, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin (2022). FAVOR Essential Database [Dataset]. http://doi.org/10.7910/DVN/1VGTJI
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/1VGTJI
Dataset updated
Apr 12, 2022
Dataset provided by
Harvard Dataverse
Authors
Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Functional Annotation of Variants - Online Resource (FAVOR, https://favor.genohub.org) is a comprehensive whole-genome variant annotation database and a variant browser, providing hundreds of functional annotation scores from a variety of aspects of variant biological function. This FAVOR Essential Database is comprised of a collection of essential annotation scores for all possible SNVs (8,812,917,339) and observed indels (79,997,898) in Build GRCh38/hg38, including variant info, chromosome, position, reference allele, alternative allele, aPC-Conservation, aPC-Epigenetics, aPC-Epigenetics-Active, aPC-Epigenetics-Repressed, aPC-Epigenetics-Transcription, aPC-Local-Nucleotide-Diversity, aPC-Mappability, aPC-Mutation-Density, aPC-Protein-Function, aPC-Proximity-To-TSSTES, aPC-Transcription-Factor, CAGE promoter, CAGE, MetaSVM, rsID, FATHMM-XF, Gencode Comprehensive Category, Gencode Comprehensive Info, Gencode Comprehensive Exonic Category, Gencode Comprehensive Exonic Info, GeneHancer, LINSIGHT, CADD, rDHS. These annotation scores can be integrated into FAVORannotator (https://github.com/zhouhufeng/FAVORannotator) to create an annotated GDS (aGDS) file by storing the genotype data and their functional annotation data in an all-in-one file. The aGDS file can then facilitate a wide range of functionally-informed downstream analyses.
H
International Authority Database
dataverse.harvard.edu
Updated Jun 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Zürn; Alexandros Tokhi; Martin Binder (2021). International Authority Database [Dataset]. http://doi.org/10.7910/DVN/VA6RQV
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/VA6RQV
Dataset updated
Jun 21, 2021
Dataset provided by
Harvard Dataverse
Authors
Michael Zürn; Alexandros Tokhi; Martin Binder
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
1919 - 2013
Description
The International Authority Database (IAD) informs about the degree of authority of 34 International Organizations from 1919 to 2013. Our cross-sectional time-series data count 1694 observations and offer systematic information on the exercise of authority across seven policy functions: agenda setting, rule making, compliance monitoring, norm interpretation and dispute settlement, enforcement, knowledge generation, and institutional evaluation.
Harvard University Ratings and Reviews
kaggle.com
zip
Updated Mar 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kanchana1990 (2024). Harvard University Ratings and Reviews [Dataset]. https://www.kaggle.com/datasets/kanchana1990/harvard-university-ratings-and-reviews
Explore at:
zip(486485 bytes)Available download formats
Dataset updated
Mar 29, 2024
Authors
Kanchana1990
License
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Description
Dataset Description

Dataset Overview

The "Harvard University Ratings and Reviews" dataset presents a rich compilation of experiences from one of the most esteemed institutions globally. It uniquely encompasses a broad spectrum of perspectives, including in-depth academic evaluations and impressions from travelers intrigued by Harvard's historical and architectural significance. This dataset serves as a bridge, connecting the academic excellence of Harvard with the experiences of visitors who come to admire its iconic campus.

Data Science Applications

Sentiment Analysis: Understand the emotional tone behind the textual reviews, distinguishing between academic and traveler feedback.

Trend Analysis: Examine how perceptions of Harvard evolve over time, correlating them with significant events or changes within the university.

Text Mining and NLP: Utilize the rich review texts for topic modeling, keyword extraction, and building sentiment classifiers.

Comparative Analysis: Investigate differences in experiences based on the review platform (Mobile vs. Desktop) and the nature of the visit (academic vs. tourism).

Column Descriptors

published_date: Timestamp of when the review was posted, providing insight into temporal trends.

published_platform: The platform (Mobile/Desktop) used to publish the review, indicating user engagement preferences.

rating: Numerical rating given by the reviewer, reflecting their level of satisfaction.

type: Type of submission, with a focus on reviews in this dataset.

helpful_votes: Number of times a review was marked as helpful, suggesting its impact and relevance.

title: Brief headline of the review, encapsulating the main sentiment.

text: Detailed narrative of the reviewer's experience and feedback.

Ethically Mined Data

This dataset has been ethically curated, with careful consideration to exclude any personal identifiers. By focusing purely on the content of the reviews, it respects privacy while still offering valuable insights.

Acknowledgments

We extend our gratitude to TripAdvisor for providing a platform that captures such a diverse range of experiences and to Harvard University for being the subject of this intriguing dataset. Their contributions enrich our understanding of academic and visitor perceptions alike.

Image Acknowledgment

The dataset's thumbnail, featuring an iconic view of Harvard University, has been sourced from AdmissionSight. This image captures the essence of Harvard's sprawling campus, inviting further exploration through the reviews within this dataset.
H
Global High-Resolution Soil Profile Database for Crop Modeling Applications
dataverse.harvard.edu
dataone.org
+2more
Updated Jun 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2025). Global High-Resolution Soil Profile Database for Crop Modeling Applications [Dataset]. http://doi.org/10.7910/DVN/1PEEY0
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/1PEEY0
Dataset updated
Jun 18, 2025
Dataset provided by
Harvard Dataverse
License
https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.7/customlicense?persistentId=doi:10.7910/DVN/1PEEY0https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.7/customlicense?persistentId=doi:10.7910/DVN/1PEEY0
Dataset funded by
USAID Bureau of Food Security
CGIAR Research Program on Policies, Institutions, and Markets (PIM)
Description
One of the obstacles in applying advanced crop simulation models such as DSSAT at a grid-based platform is the lack of gridded soil input data at various resolutions. Recently, there has been many efforts in scientific communities to develop spatially continuous soil database across the globe. The most representative example is the SoilGrids 1km released by ISRIC in 2014. In addition recent AfSIS project put a lot of efforts to develop more accurate soil database in Africa at high spatial resolution. Taking advantage of those two available high resolution soil databases (SoilGrids 1km and ISRIC-AfSIS at 1km resolution), this project aims to develop a set of DSSAT compatible soil profiles on 5 arc-minute grid (which is HarvestChoice’s standard grid). Six soil properties (bulk density, organic carbon, percentage of clay and silt, soil pH and cation exchange capacity) available from the original SoilGrids 1km or ISRIC-AfSIS were directly used as DSSAT inputs. We applied a pedo-transfer function to derive some soil hydraulic properties (saturated hydraulic conductivity, soil water content at field capacity, wilting point and saturation) which are critical to simulate crop growth. For other required variables, HarvestChoice’s HC27 database are used as a reference. Final outputs are provided in *.SOL file format (DSSAT soil database) for each country at 5-min resolution. In addition, uncertainty maps for organic carbon and soil water content at wilting points at the top 15 cm soil layers were generated to provide brief idea about accuracy of the final products. The generated soil properties were evaluated by visualizing their global maps and by comparing them with IIASA-IFPRI cropland map and AfSIS-GYGA’s available water content maps.
Harvard Air Quality Data
redivis.com
application/jsonl +7
Updated Mar 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford Center for Population Health Sciences (2023). Harvard Air Quality Data [Dataset]. http://doi.org/10.57761/j4q9-aj68
Explore at:
application/jsonl, avro, csv, arrow, spss, parquet, stata, sasAvailable download formats
Unique identifier
https://doi.org/10.57761/j4q9-aj68
Dataset updated
Mar 9, 2023
Dataset provided by
Redivis Inc.
Authors
Stanford Center for Population Health Sciences
Description
Abstract

To provide annual PM2.5 component concentration data for the contiguous U.S. at resolutions of 50m in urban areas and 1km in non-urban areas for public health research to estimate effects on human health, and for other related research.

Methodology

The Annual Mean PM2.5 Components (EC, NH4, NO3, OC, SO4) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019, v1 data set contains annual predictions of the chemical concentrations at a hyper resolution (50m x 50m grid cells) in urban areas and at a high resolution (1km x 1km grid cells) in non-urban areas for the years 2000 to 2019. Particulate matter with an aerodynamic diameter less than 2.5 µm (PM2.5) increases mortality and morbidity. PM2.5 is composed of a mixture of chemical components that vary across space and time. Due to limited hyperlocal data availability, less is known about health risks of PM2.5 components, their U.S.-wide exposure disparities, or which species are driving the biggest intra-urban changes in PM2.5 mass. The national super-learned models were developed across the U.S. for hyperlocal estimation of annual mean elemental carbon, ammonium, nitrate, organic carbon, and sulfate concentrations across 3,535 urban areas at a 50m spatial resolution, and at a 1km resolution for non-urban areas from 2000 to 2019. Using Machine-Learning models (ML), combined with either a Generalized Additive Model (GAM) Ensemble Geographically-Weighted-Averaging (GAM-ENWA) or Super-Learning (SL) and approximately 82 billion predictions across 20 years, hyperlocal super-learned PM2.5 components are now available for further research. The overall R-squared values of 10-fold cross validated models ranged from 0.910 to 0.970 on the training sets for these components, while on the test sets the R-squared values ranged from 0.860 to 0.960. Remarkable spatiotemporal intra-urban and inter-urban variabilities were found in PM2.5 components. The Coordinate Reference System (CRS) for predictions is the World Geodetic System 1984 (WGS84) and the units for the PM2.5 Components are µg/m^3.

Usage

The data are provided in RDS tabular format, a file format native to the R programming language, but can also be opened by other languages such as Python.
d
Harvard Common Data Set
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office of Institutional Research (2023). Harvard Common Data Set [Dataset]. http://doi.org/10.7910/DVN/AOD2ZV
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/AOD2ZV
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Office of Institutional Research
Description
This represents Harvard's responses to the Common Data Initiative. The Common Data Set (CDS) initiative is a collaborative effort among data providers in the higher education community and publishers as represented by the College Board, Peterson's, and U.S. News & World Report. The combined goal of this collaboration is to improve the quality and accuracy of information provided to all involved in a student's transition into higher education, as well as to reduce the reporting burden on data providers. This goal is attained by the development of clear, standard data items and definitions in order to determine a specific cohort relevant to each item. Data items and definitions used by the U.S. Department of Education in its higher education surveys often serve as a guide in the continued development of the CDS. Common Data Set items undergo broad review by the CDS Advisory Board as well as by data providers representing secondary schools and two- and four-year colleges. Feedback from those who utilize the CDS also is considered throughout the annual review process.
Minimal data schema for geoBoundaries files.
plos.figshare.com
xls
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs (2023). Minimal data schema for geoBoundaries files. [Dataset]. http://doi.org/10.1371/journal.pone.0231866.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0231866.t001
Dataset updated
May 30, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
All fields noted in this table must be collected and validated for inclusion in a release. *URLs provided as exemplars only; within the database, full paths to exact landing pages from which data was retrieved are included.
A summary of license types currently included in the geoBoundaries dataset.
plos.figshare.com
xls
Updated Jun 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs (2023). A summary of license types currently included in the geoBoundaries dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0231866.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0231866.t003
Dataset updated
Jun 6, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explicit detail on the license for every boundary is provided in the metadata.
d
Harvard Dataverse Optional Feature Use Data
search.dataone.org
dataverse.harvard.edu
Updated Nov 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Boyd, Ceilyn (2023). Harvard Dataverse Optional Feature Use Data [Dataset]. http://doi.org/10.7910/DVN/9STGWE
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/9STGWE
Dataset updated
Nov 23, 2023
Dataset provided by
Harvard Dataverse
Authors
Boyd, Ceilyn
Time period covered
Oct 28, 2019 - Oct 29, 2019
Description
This dataset contains data, documentation, and code files associated with studies performed on snapshots of the contents of Harvard Dataverse taken on 28 and 29 October 2019.
H
United Nations General Assembly Ideal Points
dataverse.harvard.edu
Updated Jul 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Erik Voeten (2025). United Nations General Assembly Ideal Points [Dataset]. http://doi.org/10.7910/DVN/LEJUQZ
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/LEJUQZ
Dataset updated
Jul 31, 2025
Dataset provided by
Harvard Dataverse
Authors
Erik Voeten
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
1946 - 2024
Area covered
World
Description
This dataset contains ideal point estimates based on voting behavior in the United Nations General Assembly. This is the first version where the ideal point estimates are based on years rather than UNGA sessions. The reasons for this are that researchers in practice virtually always use years as the basis of analysis and that UNGA sessions increasingly spill over into the following year and are held in special emergency sessions on issues such as the Gaza and Ukraine. There are two types of ideal point estimates: • idealpointfp: ideal point estimates based only on votes on the final passage of resolutions (including failed votes). These are now available from 1946-2024 in IdealPointsJuly2025.tab. These estimates are updated most frequently as the raw data can readily be found online. • Idealpointall: ideal point estimates based on all votes, including votes on paragraphs, motions, and amendments. These votes are based on more data and thus should be more precise. One word of caution is that in some years this means that there are very large numbers of votes on a specific issue, such as the war in Gaza. The correlation between these ideal points is .9846 but there could of course still be some important differences. The data also includes idealpointlegacy, which is based on sessions (all votes). The correlation with idealpointall is .9877. Aside from the 2024 final passage votes, the raw UN voting data are from the UNGA-DM Database: https://unvotes.unige.ch/ Citation: Fjelstul, Joshua, Simon Hug, and Christopher Kilby. "Decision-making in the United Nations General Assembly: A comprehensive database of resolution-related decisions." The Review of International Organizations (2025): 1-18. The ideal point estimates are based on the methodology described in: Citation: Bailey, Michael A., Anton Strezhnev, and Erik Voeten. 2017. Estimating dynamic state preferences from united nations voting data. Journal of Conflict Resolution 61 (2): 430-56.
H
Political Party Database Round 2 v4 (first public version)
dataverse.harvard.edu
search.dataone.org
Updated Mar 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Susan Scarrow; Paul D. Webb; Thomas Poguntke (2022). Political Party Database Round 2 v4 (first public version) [Dataset]. http://doi.org/10.7910/DVN/0JVUM8
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/0JVUM8
Dataset updated
Mar 7, 2022
Dataset provided by
Harvard Dataverse
Authors
Susan Scarrow; Paul D. Webb; Thomas Poguntke
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The Political Party Database (PPDB) is an online public database that is a central source for key information about political party organization, party resources, leadership selection, and partisan political participation in many representative democracies. The files contain the data in SPSS, STATA, and CSV formats. The dataset also includes a PDF with the text responses for the appropriate variables. The PPDB Round 2 dataset complements the Round 1a_1b Dataset. Round 2 data covers 51 countries, reflecting the state of 288 parties in the years 2017-2020.
d
Middle East Mass Movements Database
search.dataone.org
datasetcatalog.nlm.nih.gov
+1more
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kosack, Stephen; Smith, Evann (2023). Middle East Mass Movements Database [Dataset]. http://doi.org/10.7910/DVN/VKECUK
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/VKECUK
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Kosack, Stephen; Smith, Evann
Area covered
Middle East
Description
The Middle East Mass Movements Database, a part of the larger Mass Movements Project, contains basic characteristics of all mass movements in the region for each year that they mobilize at least 1,000 participants in costly action for a least a month in pursuit of a common political goal. The data are the result of a lengthy coding process in which two researchers independently explore each known mobilization with all available secondary sources and, if they determine that it meets the thresholds, separately code its observable characteristics; any coding disagreements are resolved by moderated debate until the researchers reach consensus. The data cover 16 variables on movement characteristics, including mobilizing identities, organization, and action, for the 19 countries of the Middle East and North Africa from 1900-2012.
Harvard Tuition
kaggle.com
zip
Updated Nov 11, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard University (2016). Harvard Tuition [Dataset]. https://www.kaggle.com/harvard-university/harvard-tuition
Explore at:
zip(3232 bytes)Available download formats
Dataset updated
Nov 11, 2016
Dataset authored and provided by
Harvard University
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Harvard tuition data since 1985, for both the undergraduate College and the graduate and professional schools.

The Data

This dataset consists of two files: tuition_graduate.csv and undergraduate_package.csv, which contain the tuition and fees data for the graduate schools and undergraduate College, respectively.

tuition_graduate.csv contains the following fields:

academic.year: the academic year, between 1985 and 2017

school: the name of the graduate or professional school; one of GSAS, Business (MBA), Design, Divinity, Education, Government, Law, Medical/Dental, Public Health (1-Year MPH)

cost: the cost of tuition at a given school in a given year

undergraduate_package.csv contains the following fields:

academic.year: the academic year, between 1985 and 2017

component: the component of undergraduate fees; one of Tuition,*Health Services Fee*,*Student Services Fee*,*Room*,*Board*,*Total*

cost: the cost of the component; or, if the component is Total, the sum of the costs of the other components in that year

Acknowledgements

All of the data in this dataset comes from The Harvard Open Data Dataverse. Specific citations are as follows:

for the graduate tuition data:
Harvard Financial Aid Office, 2015, "Harvard graduate school tuition", doi:10.7910/DVN/LV0YSQ, Harvard Dataverse, V1

for the undergraduate tuition and fees data:
Harvard Financial Aid, 2015, "Harvard College Tuition", doi:10.7910/DVN/MSS2BE, Harvard Dataverse, V1 [UNF:6:FyXNny+KBTgLX+DzewzEfg==]
h
cold-cases
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Library Innovation Lab, cold-cases [Dataset]. https://huggingface.co/datasets/harvard-lil/cold-cases
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset authored and provided by
Harvard Library Innovation Lab
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Collaborative Open Legal Data (COLD) - Cases

COLD Cases is a dataset of 8.3 million United States legal decisions with text and metadata, formatted as compressed parquet files. If you'd like to view a sample of the dataset formatted as JSON Lines, you can view one here This dataset exists to support the open legal movement exemplified by projects like Pile of Law and LegalBench. A key input to legal understanding projects is caselaw -- the published, precedential decisions of… See the full description on the dataset page: https://huggingface.co/datasets/harvard-lil/cold-cases.
d
Electromyography Analysis of Human Activities - DataBase 2 (EMAHA-DB2)
search.dataone.org
dataverse.harvard.edu
+1more
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kusuru, Durgesh; Turlapaty, Anish Chand; Thakur, Mainak (2023). Electromyography Analysis of Human Activities - DataBase 2 (EMAHA-DB2) [Dataset]. http://doi.org/10.7910/DVN/MBG5UY
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/MBG5UY
Dataset updated
Nov 8, 2023
Dataset provided by
Harvard Dataverse
Authors
Kusuru, Durgesh; Turlapaty, Anish Chand; Thakur, Mainak
Description
We present a sEMG signal database corresponding to the Indian population named “ElectroMyography Analysis of Human Activities - DataBase -2 (EMAHA-DB2).” This data set consists of two different weight training activities which involve isotonic and isometric contractions. Weight training activities are effective for improving muscle strength, overall health, and regaining limb functionality for people undergoing rehabilitation post stroke-related episodes. The EMG signals acquired during weight training can be used for muscle recruitment analysis. For example, during a specific movement, it can determine the set of recruited muscles and their order of recruitment. The institutional ethics committee of Indian Institute of Information Technology Sri City (No. IIITS/EC/2022/01) approved the proposed data collection protocol developed in accordance with the declaration of Helsinki and the “National Ethical Guidelines for Biomedical and Health Research involving human participants" of India. Nine healthy male subjects with no history of upper limb pathology participated in the sEMG data collection process. The average age is 21 years. Before the first session of activities, each of the participants gave written informed consent and the data collection process is completely non-invasive. At the beginning of each session, the participant's hands are cleaned with an alcohol based wet wipe. The total duration of each session is up-to one hour per subject depending on adaptability. Each of the hand muscle activity is recorded with a 2-channel Noraxon Ultium wireless sEMG sensor setup. Two self-adhesive Ag/AgCL dual electrodes were placed at Biceps Brachi(BB) and Flexor carpi ulnaris (FCU) muscle locations. During an activity, the subject is in a standing position and the weight is placed on a table at a convenient height for pickup. Each activity has three phases: rest followed by action and release. Each activity is repeated nine times. In order to avoid muscle fatigue, subjects rest for two minutes between different activities.
Chicago and harvard artwork
kaggle.com
zip
Updated Jul 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UmbraVenus (2021). Chicago and harvard artwork [Dataset]. https://www.kaggle.com/umbravenus/chicago-and-harvard-artwork
Explore at:
zip(2787243 bytes)Available download formats
Dataset updated
Jul 21, 2021
Authors
UmbraVenus
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Area covered
Chicago
Description
Dataset

This dataset was created by UmbraVenus

Released under Database: Open Database, Contents: Database Contents

Contents
e
Data from: Harvard Forest Herbarium Database from 1908 to Present
portal.edirepository.org
search.dataone.org
csv
Updated Dec 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jerry Jenkins; Glenn Motzkin (2023). Harvard Forest Herbarium Database from 1908 to Present [Dataset]. http://doi.org/10.6073/pasta/72233c38003a4a41112a37a9a1c3e6d8
Explore at:
csv(464318 byte)Available download formats
Unique identifier
https://doi.org/10.6073/pasta/72233c38003a4a41112a37a9a1c3e6d8
Dataset updated
Dec 5, 2023
Dataset provided by
EDI
Authors
Jerry Jenkins; Glenn Motzkin
License
https://spdx.org/licenses/CC0-1.0https://spdx.org/licenses/CC0-1.0
Time period covered
1908 - 2009
Area covered

Variables measured
day, town, year, genus, month, state, tract, family, folder, source, and 7 more
Description
As a part of the broader Harvard Forest Flora project (see data set HF116), we prepared a database of all specimens located in the Harvard Forest herbarium.
N
Harvard, NE Age Group Population Dataset: A complete breakdown of Harvard...
neilsberg.com
csv, json
Updated Sep 16, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2023). Harvard, NE Age Group Population Dataset: A complete breakdown of Harvard age demographics from 0 to 85 years, distributed across 18 age groups [Dataset]. https://www.neilsberg.com/research/datasets/706ee446-3d85-11ee-9abe-0aa64bf2eeb2/
Explore at:
json, csvAvailable download formats
Dataset updated
Sep 16, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Nebraska, Harvard
Variables measured
Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Harvard population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Harvard. The dataset can be utilized to understand the population distribution of Harvard by age. For example, using this dataset, we can identify the largest age group in Harvard.

Key observations

The largest age group in Harvard, NE was for the group of age 15-19 years with a population of 106 (9.53%), according to the 2021 American Community Survey. At the same time, the smallest age group in Harvard, NE was the 85+ years with a population of 10 (0.90%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group in consideration

Population: The population for the specific age group in the Harvard is shown in this column.

% of Total Population: This column displays the population of each age group as a proportion of Harvard total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Harvard Population by Age. You can refer the same here

Facebook

Twitter

Click to copy link

Link copied

Cite

Brain Data Science Platform (2023). Harvard Electroencephalography Database [Dataset]. https://registry.opendata.aws/bdsp-harvard-eeg/

Harvard Electroencephalography Database

Explore at:

Dataset updated

Jun 20, 2023

Dataset provided by

<a href="https://bdsp.io/">Brain Data Science Platform</a>

Description

The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University:Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH).

Clear search

Close search

Google apps

Main menu

Harvard Electroencephalography Database

Harvard-Emory ECG Database

FAVOR Essential Database

International Authority Database

Harvard University Ratings and Reviews

Dataset Description

Dataset Overview

Data Science Applications

Column Descriptors

Ethically Mined Data

Acknowledgments

Image Acknowledgment

Global High-Resolution Soil Profile Database for Crop Modeling Applications

Harvard Air Quality Data

Abstract

Methodology

Usage

Harvard Common Data Set

Minimal data schema for geoBoundaries files.

A summary of license types currently included in the geoBoundaries dataset.

Harvard Dataverse Optional Feature Use Data

United Nations General Assembly Ideal Points

Political Party Database Round 2 v4 (first public version)

Middle East Mass Movements Database

Harvard Tuition

The Data

Acknowledgements

cold-cases

Electromyography Analysis of Human Activities - DataBase 2 (EMAHA-DB2)

Chicago and harvard artwork

Dataset

Contents

Data from: Harvard Forest Herbarium Database from 1908 to Present

Harvard, NE Age Group Population Dataset: A complete breakdown of Harvard...

About this dataset

Content

Inspiration

Recommended for further research

Harvard Electroencephalography DatabaseSee More Versions

Harvard Electroencephalography Database