https://github.com/bdsp-core/bdsp-license-and-duahttps://github.com/bdsp-core/bdsp-license-and-dua
The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University: Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH). The EEG data includes three types:
rEEG: "routine EEGs" recorded in the outpatient setting.
EMU: recordings obtained in the inpatient setting, within the Epilepsy Monitoring Unit (EMU).
ICU/LTM: recordings obtained from acutely and critically ill patients within the intensive care unit (ICU).
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Cross-national research on the causes and consequences of income inequality has been hindered by the limitations of the existing inequality datasets: greater coverage across countries and over time has been available from these sources only at the cost of significantly reduced comparability across observations. The goal of the Standardized World Income Inequality Database (SWIID) is to meet the needs of those engaged in broadly cross-national research by maximizing the comparability of income inequality data while maintaining the widest possible coverage across countries and over time. The SWIID’s income inequality estimates are based on thousands of reported Gini indices from hundreds of published sources, including the OECD Income Distribution Database, the Socio-Economic Database for Latin America and the Caribbean generated by CEDLAS and the World Bank, Eurostat, the World Bank’s PovcalNet, the UN Economic Commission for Latin America and the Caribbean, national statistical offices around the world, and academic studies while minimizing reliance on problematic assumptions by using as much information as possible from proximate years within the same country. The data collected and harmonized by the Luxembourg Income Study is employed as the standard. The SWIID currently incorporates comparable Gini indices of disposable and market income inequality for 199 countries for as many years as possible from 1960 to the present; it also includes information on absolute and relative redistribution.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains all digitized specimen records stewarded by the Harvard University Herbaria. The Harvard University Herbaria, with over 5 million specimens, is the world’s largest University Herbaria. Included in the Herbaria are what were once six separate herbarium collections: * Herbarium of the Arnold Arboretum (A) * Economic Herbarium of Oakes Ames (ECON) * Oakes Ames Orchid Herbarium (AMES) * Farlow Herbarium (FH) * Gray Herbarium (GH) * New England Botanical Club Herbarium (NEBC). DarwinCore data follows the AppleCore guidance https://code.google.com/p/applecore/.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Functional Annotation of Variants - Online Resource (FAVOR, https://favor.genohub.org) is a comprehensive whole-genome variant annotation database and a variant browser, providing hundreds of functional annotation scores from a variety of aspects of variant biological function. This FAVOR Essential Database is comprised of a collection of essential annotation scores for all possible SNVs (8,812,917,339) and observed indels (79,997,898) in Build GRCh38/hg38, including variant info, chromosome, position, reference allele, alternative allele, aPC-Conservation, aPC-Epigenetics, aPC-Epigenetics-Active, aPC-Epigenetics-Repressed, aPC-Epigenetics-Transcription, aPC-Local-Nucleotide-Diversity, aPC-Mappability, aPC-Mutation-Density, aPC-Protein-Function, aPC-Proximity-To-TSSTES, aPC-Transcription-Factor, CAGE promoter, CAGE, MetaSVM, rsID, FATHMM-XF, Gencode Comprehensive Category, Gencode Comprehensive Info, Gencode Comprehensive Exonic Category, Gencode Comprehensive Exonic Info, GeneHancer, LINSIGHT, CADD, rDHS. These annotation scores can be integrated into FAVORannotator (https://github.com/zhouhufeng/FAVORannotator) to create an annotated GDS (aGDS) file by storing the genotype data and their functional annotation data in an all-in-one file. The aGDS file can then facilitate a wide range of functionally-informed downstream analyses.
As a part of the broader Harvard Forest Flora project (see data set HF116), we prepared a database of all specimens located in the Harvard Forest herbarium.
We conducted a floristic inventory of Harvard Forest, in order to: (1) document the current vascular flora of Harvard Forest; (2) evaluate the extent to which the flora has changed over the past century.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Using a Bayesian supervised learning approach, we identify individual inventors from the U.S. utility patent database, from 1975 to the present. An interface to calculate and illustrate patent co-authorship networks and social network measures is also provided. The network representation does not require bounding the social network beforehand. We provide descriptive statistics of individual and collaborative vari ables and illustrate examples of networks for an individual, an organization, a technology, and a region. The paper provides an overview of the technical algorithms and pointers to the data, code, and documentation, with the hope of further open development by the research community. Go here for theNBER pdpass file -- https://sites.google.com/site/patentdataproject/Home/downloads. It's old and hasn't been updated
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Harvard population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Harvard. The dataset can be utilized to understand the population distribution of Harvard by age. For example, using this dataset, we can identify the largest age group in Harvard.
Key observations
The largest age group in Harvard, IL was for the group of age 10-14 years with a population of 1,169 (12.33%), according to the 2021 American Community Survey. At the same time, the smallest age group in Harvard, IL was the 85+ years with a population of 46 (0.49%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Harvard Population by Age. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
American Stories offers high-quality structured data from historical newspapers suitable for pre-training large language models to enhance the understanding of historical English and world knowledge. It can also be integrated into external databases of retrieval-augmented language models, enabling broader access to historical information, including interpretations of political events and intricate details about people's ancestors. Additionally, the structured article texts facilitate the application of transformer-based methods for popular tasks like detecting reproduced content, significantly improving accuracy compared to traditional OCR methods. American Stories serves as a substantial and valuable dataset for advancing multimodal layout analysis models and other multimodal applications.
The EcoTrends project was established in 2004 by Dr. Debra Peters (Jornada Basin LTER, USDA-ARS Jornada Experimental Range) and Dr. Ariel Lugo (Luquillo LTER, USDA-FS Luquillo Experimental Forest) to support the collection and analysis of long-term ecological datasets. The project is a large synthesis effort focused on improving the accessibility and use of long-term data. At present, there are ~50 state and federally funded research sites that are participating and contributing to the EcoTrends project, including all 26 Long-Term Ecological Research (LTER) sites and sites funded by the USDA Agriculture Research Service (ARS), USDA Forest Service, US Department of Energy, US Geological Survey (USGS) and numerous universities.
Data from the EcoTrends project are available through an exploratory web portal (http://www.ecotrends.info). This web portal enables the continuation of data compilation and accessibility by users through an interactive web application. Ongoing data compilation is updated through both manual and automatic processing as part of the LTER Provenance Aware Synthesis Tracking Architecture (PASTA). The web portal is a collaboration between the Jornada LTER and the LTER Network Office.
The following dataset from Harvard Forest (HFR) contains hurricanes (number) measurements in number units and were aggregated to a yearly timescale.
https://www.icpsr.umich.edu/web/ICPSR/studies/38341/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/38341/terms
This catalog record includes detailed variable-level descriptions, enabling data discovery and comparison. The data are not archived at ICPSR. Users should consult the data owners (via the Roper Center for Public Opinion Research) directly for details on obtaining the data. This collection includes variable-level metadata of Health Care Priorities, a survey by Harvard School of Public Health/Robert Wood Johnson Foundation conducted by ICR Survey Research Group. Topics covered in this survey include: Important health issues Laws The data and documentation files for this survey are available through the Roper Center for Public Opinion Research [Roper #31092259]. Frequencies and summary statistics for the 97 variables from this survey are available through the ICPSR social science variable database and can be accessed from the Variables tab.
This dataset provides information about the number of properties, residents, and average property values for Harris Lane cross streets in Harvard, MA.
Electoral accountability rests on voters re-electing high-performing and removing low-performing incumbents. However, voters in many developing contexts are poorly informed about incumbent performance, particularly of local politicians. This dissertation asks: how do voters in low-information environments hold local governments to account for their performance in office? I seek to explain when Mexican voters obtain performance information pertaining to their municipal incumbents, and ultimately how it impacts their beliefs and voting behavior. I argue that voters are able and willing to sanction local governments upon receiving incumbent performance indicators. However, electoral accountability requires incentives for voters and media outlets to respectively acquire and supply politically-relevant news. Information in the news just before elections, when these incentives align, thus strongly influences electoral accountability. I test these propositions by examining in detail voter responses to two key issues in Mexican politics---malfeasance in office and violent crime.
The EcoTrends project was established in 2004 by Dr. Debra Peters (Jornada Basin LTER, USDA-ARS Jornada Experimental Range) and Dr. Ariel Lugo (Luquillo LTER, USDA-FS Luquillo Experimental Forest) to support the collection and analysis of long-term ecological datasets. The project is a large synthesis effort focused on improving the accessibility and use of long-term data. At present, there are ~50 state and federally funded research sites that are participating and contributing to the EcoTrends project, including all 26 Long-Term Ecological Research (LTER) sites and sites funded by the USDA Agriculture Research Service (ARS), USDA Forest Service, US Department of Energy, US Geological Survey (USGS) and numerous universities.
Data from the EcoTrends project are available through an exploratory web portal (http://www.ecotrends.info). This web portal enables the continuation of data compilation and accessibility by users through an interactive web application. Ongoing data compilation is updated through both manual and automatic processing as part of the LTER Provenance Aware Synthesis Tracking Architecture (PASTA). The web portal is a collaboration between the Jornada LTER and the LTER Network Office.
The following dataset from Harvard Forest (HFR) contains population employed in service (percent of total) measurements in percent units and were aggregated to a yearly timescale.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This database is a disambiguated version of the official release of the campaign contributions database published by the Federal Election Commission (FEC). This database contains every monetary contribution over $200 by an individual to a registered political action committee (PAC) for a federal US election, from 1979 to present (updated periodically with new data). The present database is disambiguated, meaning that we have inferred the identities of the individuals represented in the data, and assigned unique "identity" identifiers to the records associated with each inferred individual.
This dataset provides information about the number of properties, residents, and average property values for Page Street cross streets in Harvard, IL.
The EcoTrends project was established in 2004 by Dr. Debra Peters (Jornada Basin LTER, USDA-ARS Jornada Experimental Range) and Dr. Ariel Lugo (Luquillo LTER, USDA-FS Luquillo Experimental Forest) to support the collection and analysis of long-term ecological datasets. The project is a large synthesis effort focused on improving the accessibility and use of long-term data. At present, there are ~50 state and federally funded research sites that are participating and contributing to the EcoTrends project, including all 26 Long-Term Ecological Research (LTER) sites and sites funded by the USDA Agriculture Research Service (ARS), USDA Forest Service, US Department of Energy, US Geological Survey (USGS) and numerous universities.
Data from the EcoTrends project are available through an exploratory web portal (http://www.ecotrends.info). This web portal enables the continuation of data compilation and accessibility by users through an interactive web application. Ongoing data compilation is updated through both manual and automatic processing as part of the LTER Provenance Aware Synthesis Tracking Architecture (PASTA). The web portal is a collaboration between the Jornada LTER and the LTER Network Office.
The following dataset from Harvard Forest (HFR) contains percent urban population measurements in percent units and were aggregated to a yearly timescale.
The GIS data maintained by HPPM includes information on buildings and grounds related to Harvard University. Our "standard" base layers are available to Harvard affiliates and their service providers (for example, architects) working on Harvard projects in AutoCAD DWG, ESRI SHP or File Geodatabase format. Additional datasets are sometimes available by special arrangement. http://home.hppm.harvard.edu/pages/gis-data-layers
The EcoTrends project was established in 2004 by Dr. Debra Peters (Jornada Basin LTER, USDA-ARS Jornada Experimental Range) and Dr. Ariel Lugo (Luquillo LTER, USDA-FS Luquillo Experimental Forest) to support the collection and analysis of long-term ecological datasets. The project is a large synthesis effort focused on improving the accessibility and use of long-term data. At present, there are ~50 state and federally funded research sites that are participating and contributing to the EcoTrends project, including all 26 Long-Term Ecological Research (LTER) sites and sites funded by the USDA Agriculture Research Service (ARS), USDA Forest Service, US Department of Energy, US Geological Survey (USGS) and numerous universities.
Data from the EcoTrends project are available through an exploratory web portal (http://www.ecotrends.info). This web portal enables the continuation of data compilation and accessibility by users through an interactive web application. Ongoing data compilation is updated through both manual and automatic processing as part of the LTER Provenance Aware Synthesis Tracking Architecture (PASTA). The web portal is a collaboration between the Jornada LTER and the LTER Network Office.
The following dataset from Harvard Forest (HFR) contains population (urban) measurements in number units and were aggregated to a yearly timescale.
The EcoTrends project was established in 2004 by Dr. Debra Peters (Jornada Basin LTER, USDA-ARS Jornada Experimental Range) and Dr. Ariel Lugo (Luquillo LTER, USDA-FS Luquillo Experimental Forest) to support the collection and analysis of long-term ecological datasets. The project is a large synthesis effort focused on improving the accessibility and use of long-term data. At present, there are ~50 state and federally funded research sites that are participating and contributing to the EcoTrends project, including all 26 Long-Term Ecological Research (LTER) sites and sites funded by the USDA Agriculture Research Service (ARS), USDA Forest Service, US Department of Energy, US Geological Survey (USGS) and numerous universities.
Data from the EcoTrends project are available through an exploratory web portal (http://www.ecotrends.info). This web portal enables the continuation of data compilation and accessibility by users through an interactive web application. Ongoing data compilation is updated through both manual and automatic processing as part of the LTER Provenance Aware Synthesis Tracking Architecture (PASTA). The web portal is a collaboration between the Jornada LTER and the LTER Network Office.
The following dataset from Harvard Forest (HFR) contains population employed in service (percent of total) measurements in percent units and were aggregated to a yearly timescale.
https://github.com/bdsp-core/bdsp-license-and-duahttps://github.com/bdsp-core/bdsp-license-and-dua
The Harvard EEG Database will encompass data gathered from four hospitals affiliated with Harvard University: Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), Beth Israel Deaconess Medical Center (BIDMC), and Boston Children's Hospital (BCH). The EEG data includes three types:
rEEG: "routine EEGs" recorded in the outpatient setting.
EMU: recordings obtained in the inpatient setting, within the Epilepsy Monitoring Unit (EMU).
ICU/LTM: recordings obtained from acutely and critically ill patients within the intensive care unit (ICU).