Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The Alzheimer's Disease Multiclass Dataset contains approximately 44,000 MRI images categorized into four distinct classes based on the severity of Alzheimer's disease. This dataset is intended for use in machine learning model training and testing. All images are skull-stripped and clean of non-brain tissue.
Dataset Structure The dataset is organized into the following four directories, each representing a different class of disease severity: NonDemented: Contains 12,800 MRI images of subjects with no signs of dementia. VeryMildDemented: Contains 11,200 MRI images of subjects with very mild symptoms of dementia. MildDemented: Contains 10,000 MRI images of subjects with mild dementia. ModerateDemented: Contains 10,000 MRI images of subjects with moderate dementia.
Image Details Total Number of Images: 44,000 Image Format: MRI scans as .JPG files Image Usage: Suitable for training and testing machine learning models focused on classifying Alzheimer's disease stages.
Disease Severity Classification The dataset follows a severity ranking system for Alzheimer's disease: NonDemented: No dementia. Very Mild Demented: Early signs of dementia, very mild symptoms. Mild Demented: Clear signs of dementia, but still mild. Moderate Demented: More pronounced symptoms of dementia, moderate severity.
This dataset is an augmented and upsampled version of the dataset below: https://www.kaggle.com/datasets/uraninjo/augmented-alzheimer-mri-dataset-v2
This dataset was upsampled as the original dataset had a large class imbalance.
Facebook
Twitterdvs/90sclub-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Jules Dataset is a dataset for instance segmentation tasks - it contains Pallet annotations for 2,140 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Dataset comprises 5,000+ images of grocery shelves captured in various grocery stores and supermarkets under different lighting conditions. It is designed for research in object detection and product recognition, providing valuable insights into the retail industry for enhancing computer vision applications.
By utilizing this dataset, users can improve their understanding of deep learning methods and develop more effective vision applications tailored to the retail sector. - Get the data
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22059654%2F7e72fe74d53eeb40dc28e6a315bcf49b%2FFrame%20184%20(1).png?generation=1734837963888145&alt=media" alt="">
Each image is accompanied by an XML-annotation indicating the labeled types of product for each image in the dataset. Each image has an attribute of the product(boolean): facing, flipped, occluded.
Researchers can leverage this dataset to advance their work in object detection and product recognition, ultimately contributing to the development of smarter grocery delivery systems and enhanced shopping experiences for consumers. It includes a diverse range of shelf images that reflect real-world grocery market environments, making it an invaluable resource for researchers and developers focused on image classification and computer vision tasks.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Journey9ni/VLM-3R-DATA dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset presents the median household incomes over the past decade across various racial categories identified by the U.S. Census Bureau in Suwannee County. It portrays the median household income of the head of household across racial categories (excluding ethnicity) as identified by the Census Bureau. It also showcases the annual income trends, between 2013 and 2023, providing insights into the economic shifts within diverse racial communities.The dataset can be utilized to gain insights into income disparities and variations across racial categories, aiding in data analysis and decision-making..
Key observations
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
Racial categories include:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Suwannee County median household income by race. You can refer the same here
Facebook
TwitterThe dataset represents a compilation of user interaction data generated by users who participated in the project's pilot activities in Patras, Greece. Data was generated by users in the SMARTBUY app and includes information about users, stores, product categories, professions, and events.
The dataset comprises the following data: - users: user account data for the Patras pilot users - occupation: all possible occupations that the pilot users could choose from - stores: stores which participated in the Patras pilot - sel_products_cat: products uploaded to the SMARTBUY platform by retailers - events: geo-stamped and time-stamped descriptions of a user interaction event (for instance, "user_id 67 rated product_id 722 with rating 4 at location x1 at datetime y1", or "user_id 91 denoted product_id 78 as favorite at location x2 at datetime y2") - event_types: all possible event types captured by the SMARTBUY platform ('Product searches', 'Product views', 'Featured product', 'Products near you views', 'Product photos browsed', 'Product ratings', 'Clicks on Read More button to read product reviews', 'Clicks on Open map button', 'Clicks on Send this info by email button', 'Products denoted as Favorite')
Privacy-sensitive information such as user names, retailer owner names and store names and keywords searched are anonymized.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the population of Merced by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Merced across both sexes and to determine which sex constitutes the majority.
Key observations
There is a slight majority of female population, with 50.64% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.
Scope of gender :
Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Merced Population by Race & Ethnicity. You can refer the same here
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The Synthetic Employee Attrition Dataset is a simulated dataset designed for the analysis and prediction of employee attrition. It contains detailed information about various aspects of an employee's profile, including demographics, job-related features, and personal circumstances.
The dataset comprises 74,498 samples, split into training and testing sets to facilitate model development and evaluation. Each record includes a unique Employee ID and features that influence employee attrition. The goal is to understand the factors contributing to attrition and develop predictive models to identify at-risk employees.
This dataset is ideal for HR analytics, machine learning model development, and demonstrating advanced data analysis techniques. It provides a comprehensive and realistic view of the factors affecting employee retention, making it a valuable resource for researchers and practitioners in the field of human resources and organizational development.
FEATURES:
Employee ID: A unique identifier assigned to each employee. Age: The age of the employee, ranging from 18 to 60 years. Gender: The gender of the employee Years at Company: The number of years the employee has been working at the company. Monthly Income: The monthly salary of the employee, in dollars. Job Role: The department or role the employee works in, encoded into categories such as Finance, Healthcare, Technology, Education, and Media. Work-Life Balance: The employee's perceived balance between work and personal life, (Poor, Below Average, Good, Excellent) Job Satisfaction: The employee's satisfaction with their job: (Very Low, Low, Medium, High) Performance Rating: The employee's performance rating: (Low, Below Average, Average, High) Number of Promotions: The total number of promotions the employee has received. Distance from Home: The distance between the employee's home and workplace, in miles. Education Level: The highest education level attained by the employee: (High School, Associate Degree, Bachelor’s Degree, Master’s Degree, PhD) Marital Status: The marital status of the employee: (Divorced, Married, Single) Job Level: The job level of the employee: (Entry, Mid, Senior) Company Size: The size of the company the employee works for: (Small,Medium,Large) Company Tenure: The total number of years the employee has been working in the industry. Remote Work: Whether the employee works remotely: (Yes or No) Leadership Opportunities: Whether the employee has leadership opportunities: (Yes or No) Innovation Opportunities: Whether the employee has opportunities for innovation: (Yes or No) Company Reputation: The employee's perception of the company's reputation: (Very Poor, Poor,Good, Excellent) Employee Recognition: The level of recognition the employee receives:(Very Low, Low, Medium, High)
Attrition: Whether the employee has left the company, encoded as 0 (stayed) and 1 (Left).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Detect Mix is a dataset for object detection tasks - it contains Crack annotations for 1,448 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterIn diesem Datensatz sind alle ( 15) Klimastationen des gewählten Bundeslandes abgelegt. Je Station 10 Realisierungen. 150 Dateien mit je 4'241'439 Byte. Datensatz ist zip-gepackt.
Daten: (ASCII) Datasatz Kürzel : WR2010_EH5_1_A1B_MV_KL Datasatz Name : UBA-WETTREG ECHAM5/OM 20C + A1B Lauf 1 1961-2100 für das gewählte Bundesland, Klimastationen
Dateistruktur Klimastation: (Kopfzeilen) Stationsname Breite Länge Höhe Typ ta.mo.jahr TX TM TN RR RF PP DD SD NN FF
Stationslist: Stationsliste_MV_KL.txt Stationsnummer, Stationsname, Bundeslandkürzel, Breite, Länge, Stationshöhe,Typ
Es gibt keine Jahre mit Schalttag. Die Ausfallkennung ist -999.0
This data set is a pool of all ( 15) climate stations of the selected Federal State, specified in the entry_name. 10 realizations per station . 150 files with 4'292'439 Byte. Dataset is zip-compressed.
Data: (ASCII) Dataset acronym: WR2010_EH5_1_A1B_MV_KL Dataset name: UBA-WETTREG ECHAM5/OM 20C + A1B Run 1 realization 1961-2100 for the selected Federal State - climate stations
File structure climate stations: (header) station name Latitude Longitude height type ta.mo.jahr TX TM TN RR RF PP DD SD NN FF
Station list: Stationsliste_MV_KL.txt station number, name of station, Abbreviation of federal state, latitude, longitude, height over sea level,type
There are no leap years. Missing values are indicated with -999.0
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Hydrometeorological time series and catchment attributes from the CABra dataset. The manuscript of "CABra: a novel large-sample dataset for Brazilian catchments" is under review in Hydrology and Earth System Sciences (HESS) journal.
Here we present the Catchments Attributes for Brazil (CABra), which is a large-sample dataset for Brazilian catchments that includes long-term data (30 years) for 735 catchments in eight main catchment attribute classes (climate, streamflow, groundwater, geology, soil, topography, land-use and land-cover, and hydrologic disturbance). We have collected and synthesized data from multiple sources (ground stations, remote sensing, and gridded datasets). To prepare the dataset, we delineated all the catchments using the Multi-Error-Removed Improved-Terrain Digital Elevation Model and the coordinates of the streamflow stations provided by the Brazilian Water Agency (ANA), where only the stations with 30 years (1980-2010) of data and less than 10% of missing records were included. Catchment areas range from 9 to 4,800,000 km² and the mean daily streamflow varies from 0.02 to 9 mm day-1. Several signatures and indices were calculated based on the climate and streamflow data. Additionally, our dataset includes boundary shapefiles, geographic coordinates, and drainage areas for each catchment, aside from more than 100 attributes within the attribute classes.
Data can also be accessed at: thecabradataset.shinyapps.io/CABra
* This version includes water demand in CABra catchments for 2020 and 2040 (projection).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Algorithmic trading space is buzzing with new strategies. Companies have spent billions in infrastructure and R&D to be able to jump ahead of the competition and beat the market. Finding value in stocks is an art that very few mastered. Can a computer do that?
This dataset contains 200+ financial indicators that are commonly found in the 10-K filings each publicly traded company releases yearly, for a period of US stocks for 2018.
## Target Variables The dataset includes two class labels: 1. PRICE VAR [%]: Lists the percent price variation for 2018 2. class: Binary classification for each stock where: - 1: Identifies stocks that an hypothetical trader should BUY - 0: Identifies stocks that an hypothetical trader should NOT BUY
Facebook
TwitterThe Allen Brain Observatory – Visual Coding is a large-scale, standardized survey of physiological activity across the mouse visual cortex, hippocampus, and thalamus. It includes datasets collected with both two-photon imaging and Neuropixels probes, two complementary techniques for measuring the activity of neurons in vivo. The two-photon imaging dataset features visually evoked calcium responses from GCaMP6-expressing neurons in a range of cortical layers, visual areas, and Cre lines. The Neuropixels dataset features spiking activity from distributed cortical and subcortical brain regions, collected under analogous conditions to the two-photon imaging experiments. We hope that experimentalists and modelers will use these comprehensive, open datasets as a testbed for theories of visual information processing.
Facebook
Twittersome abs. Visit https://dataone.org/datasets/urn%3Auuid%3Ae6f97c09-945e-40b7-8a0f-8551618ea78c for complete metadata about this dataset.
Facebook
TwitterThis data has been superseded by a newer version of the dataset. Please refer to NOAA's Climate Divisional Database for more information. The U.S. Climate Divisional Dataset provides data access to current U.S. temperature, precipitation and drought indeces. Divisional indices included are: Precipitation Index, Palmer Drought Severity Index, Palmer Hydrological Drought Index, Modified Palmer Drought Severity Index, Temperature, Palmer Z Index, Cooling Degree Days, Heating Degree Days, 1-Month Standardized Precipitation Index (SPI), 2-Month (SPI), 3-Month (SPI), 6-Month (SPI),12-Month (SPI) and the 24-Month (SPI). All of these Indices, except for the SPI, are available for Regional, State and National views as well. There are 344 climate divisions in the CONUS. For each climate division, monthly station temperature and precipitation values are computed from the daily observations. The divisional values are weighted by area to compute statewide values and the statewide values are weighted by area to compute regional values. The indices were computed using daily station data from 1895 to present.
Facebook
TwitterThis database, compiled by Matthews and Fung (1987), provides information on the distribution and environmental characteristics of natural wetlands. The database was developed to evaluate the role of wetlands in the annual emission of methane from terrestrial sources. The original data consists of five global 1-degree latitude by 1-degree longitude arrays. This subset, for the study area of the Large Scale Biosphere-Atmosphere Experiment in Amazonia (LBA) in South America, retains all five arrays at the 1-degree resolution but only for the area of interest (i.e., longitude 85 deg to 30 deg W, latitude 25 deg S to 10 deg N). The arrays are (1) wetland data source, (2) wetland type, (3) fractional inundation, (4) vegetation type, and (5) soil type. The data subsets are in both ASCII GRID and binary image file formats.The data base is the result of the integration of three independent digital sources: (1) vegetation classified according to the United Nations Educational Scientific and Cultural Organization (UNESCO) system (Matthews, 1983), (2) soil properties from the Food and Agriculture Organization (FAO) soil maps (Zobler, 1986), and (3) fractional inundation in each 1-degree cell compiled from a global map survey of Operational Navigation Charts (ONC). With vegetation, soil, and inundation characteristics of each wetland site identified, the data base has been used for a coherent and systematic estimate of methane emissions from wetlands and for an analysis of the causes for uncertainties in the emission estimate.The complete global data base is available from NASA/GISS [http://www.giss.nasa.gov] and NCAR data set ds765.5 [http://www.ncar.ucar.edu]; the global vegetation types data are available from ORNL DAAC [http://www.daac.ornl.gov].
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Added new dataset OpenDataLog. The dataset stores detailed information regarding issues with the open data portal, new or changes to datasets on the portal as well as other information related to the City's Open Data Portal
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Gopalatius/bitcoin-historical-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset constitutes a comprehensive inventory of 119 excavated sites from the territory of southern Bilād al-Shām attesting the occurrence of the Islamic Cream Ware (ICW). It provides information on the typological variety, dating, and general contexts of appearance of this pottery class. In addition, it is supplemented by bibliographical references to all sites included in the database. In general, the following dataset can appear useful for scholars working on the various subjects related to Early Islamic pottery and settlement of southern Bilād al-Shām.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The Alzheimer's Disease Multiclass Dataset contains approximately 44,000 MRI images categorized into four distinct classes based on the severity of Alzheimer's disease. This dataset is intended for use in machine learning model training and testing. All images are skull-stripped and clean of non-brain tissue.
Dataset Structure The dataset is organized into the following four directories, each representing a different class of disease severity: NonDemented: Contains 12,800 MRI images of subjects with no signs of dementia. VeryMildDemented: Contains 11,200 MRI images of subjects with very mild symptoms of dementia. MildDemented: Contains 10,000 MRI images of subjects with mild dementia. ModerateDemented: Contains 10,000 MRI images of subjects with moderate dementia.
Image Details Total Number of Images: 44,000 Image Format: MRI scans as .JPG files Image Usage: Suitable for training and testing machine learning models focused on classifying Alzheimer's disease stages.
Disease Severity Classification The dataset follows a severity ranking system for Alzheimer's disease: NonDemented: No dementia. Very Mild Demented: Early signs of dementia, very mild symptoms. Mild Demented: Clear signs of dementia, but still mild. Moderate Demented: More pronounced symptoms of dementia, moderate severity.
This dataset is an augmented and upsampled version of the dataset below: https://www.kaggle.com/datasets/uraninjo/augmented-alzheimer-mri-dataset-v2
This dataset was upsampled as the original dataset had a large class imbalance.