100+ datasets found
  1. o

    Harvard Electroencephalography Database

    • registry.opendata.aws
    • bdsp.io
    Updated Jun 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brain Data Science Platform (2023). Harvard Electroencephalography Database [Dataset]. https://registry.opendata.aws/bdsp-harvard-eeg/
    Explore at:
    Dataset updated
    Jun 20, 2023
    Dataset provided by
    <a href="https://bdsp.io/">Brain Data Science Platform</a>
    Description

    The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University:Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH).

  2. b

    Harvard-Emory ECG Database

    • bdsp.io
    • registry.opendata.aws
    Updated Jul 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zuzana Koscova; Valdery Moura Junior; Matthew Reyna; Shenda Hong; Aditya Gupta; Manohar Ghanta; Reza Sameni; Aaron Aguirre; Qiao Li; Sahar Zafar; Gari Clifford; M Brandon Westover (2025). Harvard-Emory ECG Database [Dataset]. http://doi.org/10.60508/rv6h-7d10
    Explore at:
    Dataset updated
    Jul 28, 2025
    Authors
    Zuzana Koscova; Valdery Moura Junior; Matthew Reyna; Shenda Hong; Aditya Gupta; Manohar Ghanta; Reza Sameni; Aaron Aguirre; Qiao Li; Sahar Zafar; Gari Clifford; M Brandon Westover
    License

    https://github.com/bdsp-core/bdsp-license-and-duahttps://github.com/bdsp-core/bdsp-license-and-dua

    Description

    The Harvard-Emory ECG database (HEEDB) is a large collection of 12-lead electrocardiography (ECG) recordings, prepared through a collaboration between Harvard University and Emory University investigators.

    In version 1.0 of the database, these ECGs from Massachusetts General Brigham hospital sites were provided without labels or metadata, to enable pre-training of ECG analysis models.

    In version 2.0, metadata is included.

    In version 3.0, Emory ECGs are included together with metadata, labels from the 12SL ECG analysis program (GE Healthcare ) and ICD-9/10 codes.

    In version 4.0, typos were corrected in the data description.

    HEEDB is published as part of the Human Sleep Project (HSP), funded by a grant (R01HL161253) from the National Heart Lung and Blood Institute (NHLBI) of the NIH to Massachusetts General Hospital, Emory University, Stanford University, Kaiser Permanente, Boston Children's Hospital, and Beth Israel Deaconess Medical Center.

  3. H

    FAVOR Essential Database

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Apr 12, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin (2022). FAVOR Essential Database [Dataset]. http://doi.org/10.7910/DVN/1VGTJI
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 12, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Hufeng Zhou; Theodore Arapoglou; Xihao Li; Zilin Li; Xihong Lin
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Functional Annotation of Variants - Online Resource (FAVOR, https://favor.genohub.org) is a comprehensive whole-genome variant annotation database and a variant browser, providing hundreds of functional annotation scores from a variety of aspects of variant biological function. This FAVOR Essential Database is comprised of a collection of essential annotation scores for all possible SNVs (8,812,917,339) and observed indels (79,997,898) in Build GRCh38/hg38, including variant info, chromosome, position, reference allele, alternative allele, aPC-Conservation, aPC-Epigenetics, aPC-Epigenetics-Active, aPC-Epigenetics-Repressed, aPC-Epigenetics-Transcription, aPC-Local-Nucleotide-Diversity, aPC-Mappability, aPC-Mutation-Density, aPC-Protein-Function, aPC-Proximity-To-TSSTES, aPC-Transcription-Factor, CAGE promoter, CAGE, MetaSVM, rsID, FATHMM-XF, Gencode Comprehensive Category, Gencode Comprehensive Info, Gencode Comprehensive Exonic Category, Gencode Comprehensive Exonic Info, GeneHancer, LINSIGHT, CADD, rDHS. These annotation scores can be integrated into FAVORannotator (https://github.com/zhouhufeng/FAVORannotator) to create an annotated GDS (aGDS) file by storing the genotype data and their functional annotation data in an all-in-one file. The aGDS file can then facilitate a wide range of functionally-informed downstream analyses.

  4. H

    International Authority Database

    • dataverse.harvard.edu
    Updated Jun 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Zürn; Alexandros Tokhi; Martin Binder (2021). International Authority Database [Dataset]. http://doi.org/10.7910/DVN/VA6RQV
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 21, 2021
    Dataset provided by
    Harvard Dataverse
    Authors
    Michael Zürn; Alexandros Tokhi; Martin Binder
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    1919 - 2013
    Description

    The International Authority Database (IAD) informs about the degree of authority of 34 International Organizations from 1919 to 2013. Our cross-sectional time-series data count 1694 observations and offer systematic information on the exercise of authority across seven policy functions: agenda setting, rule making, compliance monitoring, norm interpretation and dispute settlement, enforcement, knowledge generation, and institutional evaluation.

  5. Harvard University Ratings and Reviews

    • kaggle.com
    zip
    Updated Mar 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kanchana1990 (2024). Harvard University Ratings and Reviews [Dataset]. https://www.kaggle.com/datasets/kanchana1990/harvard-university-ratings-and-reviews
    Explore at:
    zip(486485 bytes)Available download formats
    Dataset updated
    Mar 29, 2024
    Authors
    Kanchana1990
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Dataset Description

    Dataset Overview

    The "Harvard University Ratings and Reviews" dataset presents a rich compilation of experiences from one of the most esteemed institutions globally. It uniquely encompasses a broad spectrum of perspectives, including in-depth academic evaluations and impressions from travelers intrigued by Harvard's historical and architectural significance. This dataset serves as a bridge, connecting the academic excellence of Harvard with the experiences of visitors who come to admire its iconic campus.

    Data Science Applications

    • Sentiment Analysis: Understand the emotional tone behind the textual reviews, distinguishing between academic and traveler feedback.
    • Trend Analysis: Examine how perceptions of Harvard evolve over time, correlating them with significant events or changes within the university.
    • Text Mining and NLP: Utilize the rich review texts for topic modeling, keyword extraction, and building sentiment classifiers.
    • Comparative Analysis: Investigate differences in experiences based on the review platform (Mobile vs. Desktop) and the nature of the visit (academic vs. tourism).

    Column Descriptors

    • published_date: Timestamp of when the review was posted, providing insight into temporal trends.
    • published_platform: The platform (Mobile/Desktop) used to publish the review, indicating user engagement preferences.
    • rating: Numerical rating given by the reviewer, reflecting their level of satisfaction.
    • type: Type of submission, with a focus on reviews in this dataset.
    • helpful_votes: Number of times a review was marked as helpful, suggesting its impact and relevance.
    • title: Brief headline of the review, encapsulating the main sentiment.
    • text: Detailed narrative of the reviewer's experience and feedback.

    Ethically Mined Data

    This dataset has been ethically curated, with careful consideration to exclude any personal identifiers. By focusing purely on the content of the reviews, it respects privacy while still offering valuable insights.

    Acknowledgments

    We extend our gratitude to TripAdvisor for providing a platform that captures such a diverse range of experiences and to Harvard University for being the subject of this intriguing dataset. Their contributions enrich our understanding of academic and visitor perceptions alike.

    Image Acknowledgment

    The dataset's thumbnail, featuring an iconic view of Harvard University, has been sourced from AdmissionSight. This image captures the essence of Harvard's sprawling campus, inviting further exploration through the reviews within this dataset.

  6. H

    Global High-Resolution Soil Profile Database for Crop Modeling Applications

    • dataverse.harvard.edu
    • dataone.org
    • +2more
    Updated Jun 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2025). Global High-Resolution Soil Profile Database for Crop Modeling Applications [Dataset]. http://doi.org/10.7910/DVN/1PEEY0
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 18, 2025
    Dataset provided by
    Harvard Dataverse
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.7/customlicense?persistentId=doi:10.7910/DVN/1PEEY0https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.7/customlicense?persistentId=doi:10.7910/DVN/1PEEY0

    Dataset funded by
    USAID Bureau of Food Security
    CGIAR Research Program on Policies, Institutions, and Markets (PIM)
    Description

    One of the obstacles in applying advanced crop simulation models such as DSSAT at a grid-based platform is the lack of gridded soil input data at various resolutions. Recently, there has been many efforts in scientific communities to develop spatially continuous soil database across the globe. The most representative example is the SoilGrids 1km released by ISRIC in 2014. In addition recent AfSIS project put a lot of efforts to develop more accurate soil database in Africa at high spatial resolution. Taking advantage of those two available high resolution soil databases (SoilGrids 1km and ISRIC-AfSIS at 1km resolution), this project aims to develop a set of DSSAT compatible soil profiles on 5 arc-minute grid (which is HarvestChoice’s standard grid). Six soil properties (bulk density, organic carbon, percentage of clay and silt, soil pH and cation exchange capacity) available from the original SoilGrids 1km or ISRIC-AfSIS were directly used as DSSAT inputs. We applied a pedo-transfer function to derive some soil hydraulic properties (saturated hydraulic conductivity, soil water content at field capacity, wilting point and saturation) which are critical to simulate crop growth. For other required variables, HarvestChoice’s HC27 database are used as a reference. Final outputs are provided in *.SOL file format (DSSAT soil database) for each country at 5-min resolution. In addition, uncertainty maps for organic carbon and soil water content at wilting points at the top 15 cm soil layers were generated to provide brief idea about accuracy of the final products. The generated soil properties were evaluated by visualizing their global maps and by comparing them with IIASA-IFPRI cropland map and AfSIS-GYGA’s available water content maps.

  7. Harvard Air Quality Data

    • redivis.com
    application/jsonl +7
    Updated Mar 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2023). Harvard Air Quality Data [Dataset]. http://doi.org/10.57761/j4q9-aj68
    Explore at:
    application/jsonl, avro, csv, arrow, spss, parquet, stata, sasAvailable download formats
    Dataset updated
    Mar 9, 2023
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford Center for Population Health Sciences
    Description

    Abstract

    To provide annual PM2.5 component concentration data for the contiguous U.S. at resolutions of 50m in urban areas and 1km in non-urban areas for public health research to estimate effects on human health, and for other related research.

    Methodology

    The Annual Mean PM2.5 Components (EC, NH4, NO3, OC, SO4) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019, v1 data set contains annual predictions of the chemical concentrations at a hyper resolution (50m x 50m grid cells) in urban areas and at a high resolution (1km x 1km grid cells) in non-urban areas for the years 2000 to 2019. Particulate matter with an aerodynamic diameter less than 2.5 µm (PM2.5) increases mortality and morbidity. PM2.5 is composed of a mixture of chemical components that vary across space and time. Due to limited hyperlocal data availability, less is known about health risks of PM2.5 components, their U.S.-wide exposure disparities, or which species are driving the biggest intra-urban changes in PM2.5 mass. The national super-learned models were developed across the U.S. for hyperlocal estimation of annual mean elemental carbon, ammonium, nitrate, organic carbon, and sulfate concentrations across 3,535 urban areas at a 50m spatial resolution, and at a 1km resolution for non-urban areas from 2000 to 2019. Using Machine-Learning models (ML), combined with either a Generalized Additive Model (GAM) Ensemble Geographically-Weighted-Averaging (GAM-ENWA) or Super-Learning (SL) and approximately 82 billion predictions across 20 years, hyperlocal super-learned PM2.5 components are now available for further research. The overall R-squared values of 10-fold cross validated models ranged from 0.910 to 0.970 on the training sets for these components, while on the test sets the R-squared values ranged from 0.860 to 0.960. Remarkable spatiotemporal intra-urban and inter-urban variabilities were found in PM2.5 components. The Coordinate Reference System (CRS) for predictions is the World Geodetic System 1984 (WGS84) and the units for the PM2.5 Components are µg/m^3.

    Usage

    The data are provided in RDS tabular format, a file format native to the R programming language, but can also be opened by other languages such as Python.

  8. d

    Harvard Common Data Set

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Institutional Research (2023). Harvard Common Data Set [Dataset]. http://doi.org/10.7910/DVN/AOD2ZV
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Office of Institutional Research
    Description

    This represents Harvard's responses to the Common Data Initiative. The Common Data Set (CDS) initiative is a collaborative effort among data providers in the higher education community and publishers as represented by the College Board, Peterson's, and U.S. News & World Report. The combined goal of this collaboration is to improve the quality and accuracy of information provided to all involved in a student's transition into higher education, as well as to reduce the reporting burden on data providers. This goal is attained by the development of clear, standard data items and definitions in order to determine a specific cohort relevant to each item. Data items and definitions used by the U.S. Department of Education in its higher education surveys often serve as a guide in the continued development of the CDS. Common Data Set items undergo broad review by the CDS Advisory Board as well as by data providers representing secondary schools and two- and four-year colleges. Feedback from those who utilize the CDS also is considered throughout the annual review process.

  9. Minimal data schema for geoBoundaries files.

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs (2023). Minimal data schema for geoBoundaries files. [Dataset]. http://doi.org/10.1371/journal.pone.0231866.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All fields noted in this table must be collected and validated for inclusion in a release. *URLs provided as exemplars only; within the database, full paths to exact landing pages from which data was retrieved are included.

  10. A summary of license types currently included in the geoBoundaries dataset.

    • plos.figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs (2023). A summary of license types currently included in the geoBoundaries dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0231866.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Daniel Runfola; Austin Anderson; Heather Baier; Matt Crittenden; Elizabeth Dowker; Sydney Fuhrig; Seth Goodman; Grace Grimsley; Rachel Layko; Graham Melville; Maddy Mulder; Rachel Oberman; Joshua Panganiban; Andrew Peck; Leigh Seitz; Sylvia Shea; Hannah Slevin; Rebecca Youngerman; Lauren Hobbs
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explicit detail on the license for every boundary is provided in the metadata.

  11. d

    Harvard Dataverse Optional Feature Use Data

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Boyd, Ceilyn (2023). Harvard Dataverse Optional Feature Use Data [Dataset]. http://doi.org/10.7910/DVN/9STGWE
    Explore at:
    Dataset updated
    Nov 23, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Boyd, Ceilyn
    Time period covered
    Oct 28, 2019 - Oct 29, 2019
    Description

    This dataset contains data, documentation, and code files associated with studies performed on snapshots of the contents of Harvard Dataverse taken on 28 and 29 October 2019.

  12. H

    United Nations General Assembly Ideal Points

    • dataverse.harvard.edu
    Updated Jul 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erik Voeten (2025). United Nations General Assembly Ideal Points [Dataset]. http://doi.org/10.7910/DVN/LEJUQZ
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Erik Voeten
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    1946 - 2024
    Area covered
    World
    Description

    This dataset contains ideal point estimates based on voting behavior in the United Nations General Assembly. This is the first version where the ideal point estimates are based on years rather than UNGA sessions. The reasons for this are that researchers in practice virtually always use years as the basis of analysis and that UNGA sessions increasingly spill over into the following year and are held in special emergency sessions on issues such as the Gaza and Ukraine. There are two types of ideal point estimates: • idealpointfp: ideal point estimates based only on votes on the final passage of resolutions (including failed votes). These are now available from 1946-2024 in IdealPointsJuly2025.tab. These estimates are updated most frequently as the raw data can readily be found online. • Idealpointall: ideal point estimates based on all votes, including votes on paragraphs, motions, and amendments. These votes are based on more data and thus should be more precise. One word of caution is that in some years this means that there are very large numbers of votes on a specific issue, such as the war in Gaza. The correlation between these ideal points is .9846 but there could of course still be some important differences. The data also includes idealpointlegacy, which is based on sessions (all votes). The correlation with idealpointall is .9877. Aside from the 2024 final passage votes, the raw UN voting data are from the UNGA-DM Database: https://unvotes.unige.ch/ Citation: Fjelstul, Joshua, Simon Hug, and Christopher Kilby. "Decision-making in the United Nations General Assembly: A comprehensive database of resolution-related decisions." The Review of International Organizations (2025): 1-18. The ideal point estimates are based on the methodology described in: Citation: Bailey, Michael A., Anton Strezhnev, and Erik Voeten. 2017. Estimating dynamic state preferences from united nations voting data. Journal of Conflict Resolution 61 (2): 430-56.

  13. H

    Political Party Database Round 2 v4 (first public version)

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Mar 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Susan Scarrow; Paul D. Webb; Thomas Poguntke (2022). Political Party Database Round 2 v4 (first public version) [Dataset]. http://doi.org/10.7910/DVN/0JVUM8
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 7, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Susan Scarrow; Paul D. Webb; Thomas Poguntke
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The Political Party Database (PPDB) is an online public database that is a central source for key information about political party organization, party resources, leadership selection, and partisan political participation in many representative democracies. The files contain the data in SPSS, STATA, and CSV formats. The dataset also includes a PDF with the text responses for the appropriate variables. The PPDB Round 2 dataset complements the Round 1a_1b Dataset. Round 2 data covers 51 countries, reflecting the state of 288 parties in the years 2017-2020.

  14. d

    Middle East Mass Movements Database

    • search.dataone.org
    • datasetcatalog.nlm.nih.gov
    • +1more
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kosack, Stephen; Smith, Evann (2023). Middle East Mass Movements Database [Dataset]. http://doi.org/10.7910/DVN/VKECUK
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Kosack, Stephen; Smith, Evann
    Area covered
    Middle East
    Description

    The Middle East Mass Movements Database, a part of the larger Mass Movements Project, contains basic characteristics of all mass movements in the region for each year that they mobilize at least 1,000 participants in costly action for a least a month in pursuit of a common political goal. The data are the result of a lengthy coding process in which two researchers independently explore each known mobilization with all available secondary sources and, if they determine that it meets the thresholds, separately code its observable characteristics; any coding disagreements are resolved by moderated debate until the researchers reach consensus. The data cover 16 variables on movement characteristics, including mobilizing identities, organization, and action, for the 19 countries of the Middle East and North Africa from 1900-2012.

  15. Harvard Tuition

    • kaggle.com
    zip
    Updated Nov 11, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard University (2016). Harvard Tuition [Dataset]. https://www.kaggle.com/harvard-university/harvard-tuition
    Explore at:
    zip(3232 bytes)Available download formats
    Dataset updated
    Nov 11, 2016
    Dataset authored and provided by
    Harvard University
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Harvard tuition data since 1985, for both the undergraduate College and the graduate and professional schools.

    The Data

    This dataset consists of two files: tuition_graduate.csv and undergraduate_package.csv, which contain the tuition and fees data for the graduate schools and undergraduate College, respectively.

    tuition_graduate.csv contains the following fields:

    • academic.year: the academic year, between 1985 and 2017
    • school: the name of the graduate or professional school; one of GSAS, Business (MBA), Design, Divinity, Education, Government, Law, Medical/Dental, Public Health (1-Year MPH)
    • cost: the cost of tuition at a given school in a given year

    undergraduate_package.csv contains the following fields:

    • academic.year: the academic year, between 1985 and 2017
    • component: the component of undergraduate fees; one of Tuition,*Health Services Fee*,*Student Services Fee*,*Room*,*Board*,*Total*
    • cost: the cost of the component; or, if the component is Total, the sum of the costs of the other components in that year

    Acknowledgements

    All of the data in this dataset comes from The Harvard Open Data Dataverse. Specific citations are as follows:

    for the graduate tuition data:
    Harvard Financial Aid Office, 2015, "Harvard graduate school tuition", doi:10.7910/DVN/LV0YSQ, Harvard Dataverse, V1

    for the undergraduate tuition and fees data:
    Harvard Financial Aid, 2015, "Harvard College Tuition", doi:10.7910/DVN/MSS2BE, Harvard Dataverse, V1 [UNF:6:FyXNny+KBTgLX+DzewzEfg==]

  16. h

    cold-cases

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Library Innovation Lab, cold-cases [Dataset]. https://huggingface.co/datasets/harvard-lil/cold-cases
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    Harvard Library Innovation Lab
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Collaborative Open Legal Data (COLD) - Cases

    COLD Cases is a dataset of 8.3 million United States legal decisions with text and metadata, formatted as compressed parquet files. If you'd like to view a sample of the dataset formatted as JSON Lines, you can view one here This dataset exists to support the open legal movement exemplified by projects like Pile of Law and LegalBench. A key input to legal understanding projects is caselaw -- the published, precedential decisions of… See the full description on the dataset page: https://huggingface.co/datasets/harvard-lil/cold-cases.

  17. d

    Electromyography Analysis of Human Activities - DataBase 2 (EMAHA-DB2)

    • search.dataone.org
    • dataverse.harvard.edu
    • +1more
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kusuru, Durgesh; Turlapaty, Anish Chand; Thakur, Mainak (2023). Electromyography Analysis of Human Activities - DataBase 2 (EMAHA-DB2) [Dataset]. http://doi.org/10.7910/DVN/MBG5UY
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Kusuru, Durgesh; Turlapaty, Anish Chand; Thakur, Mainak
    Description

    We present a sEMG signal database corresponding to the Indian population named “ElectroMyography Analysis of Human Activities - DataBase -2 (EMAHA-DB2).” This data set consists of two different weight training activities which involve isotonic and isometric contractions. Weight training activities are effective for improving muscle strength, overall health, and regaining limb functionality for people undergoing rehabilitation post stroke-related episodes. The EMG signals acquired during weight training can be used for muscle recruitment analysis. For example, during a specific movement, it can determine the set of recruited muscles and their order of recruitment. The institutional ethics committee of Indian Institute of Information Technology Sri City (No. IIITS/EC/2022/01) approved the proposed data collection protocol developed in accordance with the declaration of Helsinki and the “National Ethical Guidelines for Biomedical and Health Research involving human participants" of India. Nine healthy male subjects with no history of upper limb pathology participated in the sEMG data collection process. The average age is 21 years. Before the first session of activities, each of the participants gave written informed consent and the data collection process is completely non-invasive. At the beginning of each session, the participant's hands are cleaned with an alcohol based wet wipe. The total duration of each session is up-to one hour per subject depending on adaptability. Each of the hand muscle activity is recorded with a 2-channel Noraxon Ultium wireless sEMG sensor setup. Two self-adhesive Ag/AgCL dual electrodes were placed at Biceps Brachi(BB) and Flexor carpi ulnaris (FCU) muscle locations. During an activity, the subject is in a standing position and the weight is placed on a table at a convenient height for pickup. Each activity has three phases: rest followed by action and release. Each activity is repeated nine times. In order to avoid muscle fatigue, subjects rest for two minutes between different activities.

  18. Chicago and harvard artwork

    • kaggle.com
    zip
    Updated Jul 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UmbraVenus (2021). Chicago and harvard artwork [Dataset]. https://www.kaggle.com/umbravenus/chicago-and-harvard-artwork
    Explore at:
    zip(2787243 bytes)Available download formats
    Dataset updated
    Jul 21, 2021
    Authors
    UmbraVenus
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Area covered
    Chicago
    Description

    Dataset

    This dataset was created by UmbraVenus

    Released under Database: Open Database, Contents: Database Contents

    Contents

  19. e

    Data from: Harvard Forest Herbarium Database from 1908 to Present

    • portal.edirepository.org
    • search.dataone.org
    csv
    Updated Dec 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jerry Jenkins; Glenn Motzkin (2023). Harvard Forest Herbarium Database from 1908 to Present [Dataset]. http://doi.org/10.6073/pasta/72233c38003a4a41112a37a9a1c3e6d8
    Explore at:
    csv(464318 byte)Available download formats
    Dataset updated
    Dec 5, 2023
    Dataset provided by
    EDI
    Authors
    Jerry Jenkins; Glenn Motzkin
    License

    https://spdx.org/licenses/CC0-1.0https://spdx.org/licenses/CC0-1.0

    Time period covered
    1908 - 2009
    Area covered
    Variables measured
    day, town, year, genus, month, state, tract, family, folder, source, and 7 more
    Description

    As a part of the broader Harvard Forest Flora project (see data set HF116), we prepared a database of all specimens located in the Harvard Forest herbarium.

  20. N

    Harvard, NE Age Group Population Dataset: A complete breakdown of Harvard...

    • neilsberg.com
    csv, json
    Updated Sep 16, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2023). Harvard, NE Age Group Population Dataset: A complete breakdown of Harvard age demographics from 0 to 85 years, distributed across 18 age groups [Dataset]. https://www.neilsberg.com/research/datasets/706ee446-3d85-11ee-9abe-0aa64bf2eeb2/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Sep 16, 2023
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Nebraska, Harvard
    Variables measured
    Population Under 5 Years, Population over 85 years, Population Between 5 and 9 years, Population Between 10 and 14 years, Population Between 15 and 19 years, Population Between 20 and 24 years, Population Between 25 and 29 years, Population Between 30 and 34 years, Population Between 35 and 39 years, Population Between 40 and 44 years, and 9 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the Harvard population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Harvard. The dataset can be utilized to understand the population distribution of Harvard by age. For example, using this dataset, we can identify the largest age group in Harvard.

    Key observations

    The largest age group in Harvard, NE was for the group of age 15-19 years with a population of 106 (9.53%), according to the 2021 American Community Survey. At the same time, the smallest age group in Harvard, NE was the 85+ years with a population of 10 (0.90%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Variables / Data Columns

    • Age Group: This column displays the age group in consideration
    • Population: The population for the specific age group in the Harvard is shown in this column.
    • % of Total Population: This column displays the population of each age group as a proportion of Harvard total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Harvard Population by Age. You can refer the same here

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Brain Data Science Platform (2023). Harvard Electroencephalography Database [Dataset]. https://registry.opendata.aws/bdsp-harvard-eeg/

Harvard Electroencephalography Database

Explore at:
Dataset updated
Jun 20, 2023
Dataset provided by
<a href="https://bdsp.io/">Brain Data Science Platform</a>
Description

The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University:Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH).

Search
Clear search
Close search
Google apps
Main menu