100+ datasets found
  1. Alzheimer's Disease Multiclass Images Dataset

    • kaggle.com
    zip
    Updated Jun 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aryan Singhal (2024). Alzheimer's Disease Multiclass Images Dataset [Dataset]. https://www.kaggle.com/datasets/aryansinghal10/alzheimers-multiclass-dataset-equal-and-augmented
    Explore at:
    zip(417170579 bytes)Available download formats
    Dataset updated
    Jun 26, 2024
    Authors
    Aryan Singhal
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Alzheimer's Disease Multiclass Dataset contains approximately 44,000 MRI images categorized into four distinct classes based on the severity of Alzheimer's disease. This dataset is intended for use in machine learning model training and testing. All images are skull-stripped and clean of non-brain tissue.

    Dataset Structure The dataset is organized into the following four directories, each representing a different class of disease severity: NonDemented: Contains 12,800 MRI images of subjects with no signs of dementia. VeryMildDemented: Contains 11,200 MRI images of subjects with very mild symptoms of dementia. MildDemented: Contains 10,000 MRI images of subjects with mild dementia. ModerateDemented: Contains 10,000 MRI images of subjects with moderate dementia.

    Image Details Total Number of Images: 44,000 Image Format: MRI scans as .JPG files Image Usage: Suitable for training and testing machine learning models focused on classifying Alzheimer's disease stages.

    Disease Severity Classification The dataset follows a severity ranking system for Alzheimer's disease: NonDemented: No dementia. Very Mild Demented: Early signs of dementia, very mild symptoms. Mild Demented: Clear signs of dementia, but still mild. Moderate Demented: More pronounced symptoms of dementia, moderate severity.

    This dataset is an augmented and upsampled version of the dataset below: https://www.kaggle.com/datasets/uraninjo/augmented-alzheimer-mri-dataset-v2

    This dataset was upsampled as the original dataset had a large class imbalance.

  2. h

    VLM-3R-DATA

    • huggingface.co
    Updated Jun 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JIAN ZHANG (2025). VLM-3R-DATA [Dataset]. https://huggingface.co/datasets/Journey9ni/VLM-3R-DATA
    Explore at:
    Dataset updated
    Jun 10, 2025
    Authors
    JIAN ZHANG
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Journey9ni/VLM-3R-DATA dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. Grocery Shelves Dataset

    • kaggle.com
    zip
    Updated Jun 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). Grocery Shelves Dataset [Dataset]. https://www.kaggle.com/datasets/unidpro/grocery-shelves
    Explore at:
    zip(36611803 bytes)Available download formats
    Dataset updated
    Jun 16, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Grocery Shelves Dataset

    Dataset comprises 5,000+ images of grocery shelves captured in various grocery stores and supermarkets under different lighting conditions. It is designed for research in object detection and product recognition, providing valuable insights into the retail industry for enhancing computer vision applications.

    By utilizing this dataset, users can improve their understanding of deep learning methods and develop more effective vision applications tailored to the retail sector. - Get the data

    Example of the data

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22059654%2F7e72fe74d53eeb40dc28e6a315bcf49b%2FFrame%20184%20(1).png?generation=1734837963888145&alt=media" alt=""> Each image is accompanied by an XML-annotation indicating the labeled types of product for each image in the dataset. Each image has an attribute of the product(boolean): facing, flipped, occluded.

    💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

    Researchers can leverage this dataset to advance their work in object detection and product recognition, ultimately contributing to the development of smarter grocery delivery systems and enhanced shopping experiences for consumers. It includes a diverse range of shelf images that reflect real-world grocery market environments, making it an invaluable resource for researchers and developers focused on image classification and computer vision tasks.

    🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects

  4. R

    Detect Mix Dataset

    • universe.roboflow.com
    zip
    Updated Oct 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thesis (2023). Detect Mix Dataset [Dataset]. https://universe.roboflow.com/thesis-vnyio/detect-mix/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 3, 2023
    Dataset authored and provided by
    Thesis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Crack Bounding Boxes
    Description

    Detect Mix

    ## Overview
    
    Detect Mix is a dataset for object detection tasks - it contains Crack annotations for 1,448 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  5. h

    bitcoin-historical-dataset

    • huggingface.co
    Updated Apr 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Naufal Faza (2024). bitcoin-historical-dataset [Dataset]. https://huggingface.co/datasets/Gopalatius/bitcoin-historical-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 9, 2024
    Authors
    Muhammad Naufal Faza
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Gopalatius/bitcoin-historical-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. N

    Merced, CA Population Breakdown by Gender Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Merced, CA Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b243e3c4-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    California, Merced
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Merced by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Merced across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a slight majority of female population, with 50.64% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Merced is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Merced total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Merced Population by Race & Ethnicity. You can refer the same here

  7. h

    selfrag_train_data

    • huggingface.co
    Updated Oct 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    selfrag (2023). selfrag_train_data [Dataset]. https://huggingface.co/datasets/selfrag/selfrag_train_data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 17, 2023
    Dataset authored and provided by
    selfrag
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This is a training data file for Self-RAG that generates outputs to diverse user queries as well as reflection tokens to call the retrieval system adaptively and criticize its own output and retrieved passages. Self-RAG is trained on our 150k diverse instruction-output pairs with interleaving passages and reflection tokens using the standard next-token prediction objective, enabling efficient and stable learning with fine-grained feedback. At inference, we leverage reflection tokens covering… See the full description on the dataset page: https://huggingface.co/datasets/selfrag/selfrag_train_data.

  8. N

    Rochester, IL Population Breakdown by Gender Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Rochester, IL Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b24feec9-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Rochester, Illinois
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Rochester by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Rochester across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a slight majority of female population, with 51.82% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Rochester is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Rochester total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Rochester Population by Race & Ethnicity. You can refer the same here

  9. r

    Data from: SMARTBUY dataset

    • researchdata.se
    • gimi9.com
    zip
    Updated Jan 29, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karl Andersson; Damianos Gavalas (2021). SMARTBUY dataset [Dataset]. http://doi.org/10.5878/cg82-h783
    Explore at:
    zip(181405)Available download formats
    Dataset updated
    Jan 29, 2021
    Dataset provided by
    Luleå University of Technology
    Authors
    Karl Andersson; Damianos Gavalas
    Time period covered
    Sep 1, 2018 - Dec 31, 2018
    Area covered
    Greece
    Description

    The dataset represents a compilation of user interaction data generated by users who participated in the project's pilot activities in Patras, Greece. Data was generated by users in the SMARTBUY app and includes information about users, stores, product categories, professions, and events.

    The dataset comprises the following data: - users: user account data for the Patras pilot users - occupation: all possible occupations that the pilot users could choose from - stores: stores which participated in the Patras pilot - sel_products_cat: products uploaded to the SMARTBUY platform by retailers - events: geo-stamped and time-stamped descriptions of a user interaction event (for instance, "user_id 67 rated product_id 722 with rating 4 at location x1 at datetime y1", or "user_id 91 denoted product_id 78 as favorite at location x2 at datetime y2") - event_types: all possible event types captured by the SMARTBUY platform ('Product searches', 'Product views', 'Featured product', 'Products near you views', 'Product photos browsed', 'Product ratings', 'Clicks on Read More button to read product reviews', 'Clicks on Open map button', 'Clicks on Send this info by email button', 'Products denoted as Favorite')

    Privacy-sensitive information such as user names, retailer owner names and store names and keywords searched are anonymized.

  10. Employee Attrition Classification Dataset

    • kaggle.com
    zip
    Updated Jun 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Umair Zia (2024). Employee Attrition Classification Dataset [Dataset]. https://www.kaggle.com/datasets/stealthtechnologies/employee-attrition-dataset
    Explore at:
    zip(1802815 bytes)Available download formats
    Dataset updated
    Jun 11, 2024
    Authors
    Umair Zia
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Synthetic Employee Attrition Dataset is a simulated dataset designed for the analysis and prediction of employee attrition. It contains detailed information about various aspects of an employee's profile, including demographics, job-related features, and personal circumstances.

    The dataset comprises 74,498 samples, split into training and testing sets to facilitate model development and evaluation. Each record includes a unique Employee ID and features that influence employee attrition. The goal is to understand the factors contributing to attrition and develop predictive models to identify at-risk employees.

    This dataset is ideal for HR analytics, machine learning model development, and demonstrating advanced data analysis techniques. It provides a comprehensive and realistic view of the factors affecting employee retention, making it a valuable resource for researchers and practitioners in the field of human resources and organizational development.

    FEATURES:

    Employee ID: A unique identifier assigned to each employee. Age: The age of the employee, ranging from 18 to 60 years. Gender: The gender of the employee Years at Company: The number of years the employee has been working at the company. Monthly Income: The monthly salary of the employee, in dollars. Job Role: The department or role the employee works in, encoded into categories such as Finance, Healthcare, Technology, Education, and Media. Work-Life Balance: The employee's perceived balance between work and personal life, (Poor, Below Average, Good, Excellent) Job Satisfaction: The employee's satisfaction with their job: (Very Low, Low, Medium, High) Performance Rating: The employee's performance rating: (Low, Below Average, Average, High) Number of Promotions: The total number of promotions the employee has received. Distance from Home: The distance between the employee's home and workplace, in miles. Education Level: The highest education level attained by the employee: (High School, Associate Degree, Bachelor’s Degree, Master’s Degree, PhD) Marital Status: The marital status of the employee: (Divorced, Married, Single) Job Level: The job level of the employee: (Entry, Mid, Senior) Company Size: The size of the company the employee works for: (Small,Medium,Large) Company Tenure: The total number of years the employee has been working in the industry. Remote Work: Whether the employee works remotely: (Yes or No) Leadership Opportunities: Whether the employee has leadership opportunities: (Yes or No) Innovation Opportunities: Whether the employee has opportunities for innovation: (Yes or No) Company Reputation: The employee's perception of the company's reputation: (Very Poor, Poor,Good, Excellent) Employee Recognition: The level of recognition the employee receives:(Very Low, Low, Medium, High)

    Attrition: Whether the employee has left the company, encoded as 0 (stayed) and 1 (Left).

  11. R

    Sketch Detection Dataset

    • universe.roboflow.com
    zip
    Updated May 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sketch (2022). Sketch Detection Dataset [Dataset]. https://universe.roboflow.com/sketch/sketch-detection/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 25, 2022
    Dataset authored and provided by
    sketch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Sketch Bounding Boxes
    Description

    Sketch Detection

    ## Overview
    
    Sketch Detection is a dataset for object detection tasks - it contains Sketch annotations for 1,499 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. W

    Dataset Group: Climate stations in Mecklenburg Vorpommern

    • wdc-climate.de
    Updated Nov 9, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kreienkamp, Frank; Enke, Wolfgang; Spekat, Arne (2010). Dataset Group: Climate stations in Mecklenburg Vorpommern [Dataset]. https://www.wdc-climate.de/ui/entry?acronym=WR2010_EH5_1_A1B_MV_KL
    Explore at:
    Dataset updated
    Nov 9, 2010
    Dataset provided by
    World Data Center for Climate (WDCC) at DKRZ
    Authors
    Kreienkamp, Frank; Enke, Wolfgang; Spekat, Arne
    Time period covered
    Jan 1, 1961 - Dec 31, 2100
    Area covered
    Description

    An english description is given below.

    In diesem Datensatz sind alle ( 15) Klimastationen des gewählten Bundeslandes abgelegt. Je Station 10 Realisierungen. 150 Dateien mit je 4'241'439 Byte. Datensatz ist zip-gepackt.

    Daten: (ASCII) Datasatz Kürzel : WR2010_EH5_1_A1B_MV_KL Datasatz Name : UBA-WETTREG ECHAM5/OM 20C + A1B Lauf 1 1961-2100 für das gewählte Bundesland, Klimastationen

    Dateistruktur Klimastation: (Kopfzeilen) Stationsname Breite Länge Höhe Typ ta.mo.jahr TX TM TN RR RF PP DD SD NN FF

    Stationslist: Stationsliste_MV_KL.txt Stationsnummer, Stationsname, Bundeslandkürzel, Breite, Länge, Stationshöhe,Typ

    Es gibt keine Jahre mit Schalttag. Die Ausfallkennung ist -999.0

    This data set is a pool of all ( 15) climate stations of the selected Federal State, specified in the entry_name. 10 realizations per station . 150 files with 4'292'439 Byte. Dataset is zip-compressed.

    Data: (ASCII) Dataset acronym: WR2010_EH5_1_A1B_MV_KL Dataset name: UBA-WETTREG ECHAM5/OM 20C + A1B Run 1 realization 1961-2100 for the selected Federal State - climate stations

    File structure climate stations: (header) station name Latitude Longitude height type ta.mo.jahr TX TM TN RR RF PP DD SD NN FF

    Station list: Stationsliste_MV_KL.txt station number, name of station, Abbreviation of federal state, latitude, longitude, height over sea level,type

    There are no leap years. Missing values are indicated with -999.0

  13. h

    Darija-Stories-Dataset

    • huggingface.co
    Updated Jul 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ali El Filali (2023). Darija-Stories-Dataset [Dataset]. https://huggingface.co/datasets/alielfilali01/Darija-Stories-Dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 2, 2023
    Authors
    Ali El Filali
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Dataset Card for "Darija-Stories-Dataset"

    Darija (Moroccan Arabic) Stories Dataset is a large-scale collection of stories written in Moroccan Arabic dialect (Darija).

      Dataset Description
    

    Darija (Moroccan Arabic) Stories Dataset contains a diverse range of stories that provide insights into Moroccan culture, traditions, and everyday life. The dataset consists of textual content from various chapters, including narratives, dialogues, and descriptions. Each story chapter is… See the full description on the dataset page: https://huggingface.co/datasets/alielfilali01/Darija-Stories-Dataset.

  14. Data from: CABra: a novel large-sample dataset for Brazilian catchments

    • zenodo.org
    • data.niaid.nih.gov
    pdf, txt, zip
    Updated Jul 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andre Almagro; Andre Almagro; Paulo Tarso Sanches Oliveira; Paulo Tarso Sanches Oliveira; Antonio Alves Meira Neto; Antonio Alves Meira Neto; Tirthankar Roy; Tirthankar Roy; Peter Troch; Peter Troch (2024). CABra: a novel large-sample dataset for Brazilian catchments [Dataset]. http://doi.org/10.5281/zenodo.7612350
    Explore at:
    txt, zip, pdfAvailable download formats
    Dataset updated
    Jul 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Andre Almagro; Andre Almagro; Paulo Tarso Sanches Oliveira; Paulo Tarso Sanches Oliveira; Antonio Alves Meira Neto; Antonio Alves Meira Neto; Tirthankar Roy; Tirthankar Roy; Peter Troch; Peter Troch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Hydrometeorological time series and catchment attributes from the CABra dataset. The manuscript of "CABra: a novel large-sample dataset for Brazilian catchments" is under review in Hydrology and Earth System Sciences (HESS) journal.

    Here we present the Catchments Attributes for Brazil (CABra), which is a large-sample dataset for Brazilian catchments that includes long-term data (30 years) for 735 catchments in eight main catchment attribute classes (climate, streamflow, groundwater, geology, soil, topography, land-use and land-cover, and hydrologic disturbance). We have collected and synthesized data from multiple sources (ground stations, remote sensing, and gridded datasets). To prepare the dataset, we delineated all the catchments using the Multi-Error-Removed Improved-Terrain Digital Elevation Model and the coordinates of the streamflow stations provided by the Brazilian Water Agency (ANA), where only the stations with 30 years (1980-2010) of data and less than 10% of missing records were included. Catchment areas range from 9 to 4,800,000 km² and the mean daily streamflow varies from 0.02 to 9 mm day-1. Several signatures and indices were calculated based on the climate and streamflow data. Additionally, our dataset includes boundary shapefiles, geographic coordinates, and drainage areas for each catchment, aside from more than 100 attributes within the attribute classes.

    Data can also be accessed at: thecabradataset.shinyapps.io/CABra

    * This version includes water demand in CABra catchments for 2020 and 2040 (projection).

  15. Statewide Death Profiles

    • data.chhs.ca.gov
    • data.ca.gov
    • +3more
    csv, zip
    Updated Apr 24, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2026). Statewide Death Profiles [Dataset]. https://data.chhs.ca.gov/dataset/statewide-death-profiles
    Explore at:
    csv(164006), csv(200270), csv(2026589), csv(337925), csv(5401561), csv(463460), csv(5099), csv(16301), csv(465029), csv(5181371), zipAvailable download formats
    Dataset updated
    Apr 24, 2026
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    This dataset contains counts of deaths for California as a whole based on information entered on death certificates. Final counts are derived from static data and include out-of-state deaths to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all deaths that occurred during the time period. Deaths involving injuries from external or environmental forces, such as accidents, homicide and suicide, often require additional investigation that tends to delay certification of the cause and manner of death. This can result in significant under-reporting of these deaths in provisional data.

    The final data tables include both deaths that occurred in California regardless of the place of residence (by occurrence) and deaths to California residents (by residence), whereas the provisional data table only includes deaths that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by age, gender, race-ethnicity, and death place type. Deaths due to all causes (ALL) and selected underlying cause of death categories are provided. See temporal coverage for more information on which combinations are available for which years.

    The cause of death categories are based solely on the underlying cause of death as coded by the International Classification of Diseases. The underlying cause of death is defined by the World Health Organization (WHO) as "the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury." It is a single value assigned to each death based on the details as entered on the death certificate. When more than one cause is listed, the order in which they are listed can affect which cause is coded as the underlying cause. This means that similar events could be coded with different underlying causes of death depending on variations in how they were entered. Consequently, while underlying cause of death provides a convenient comparison between cause of death categories, it may not capture the full impact of each cause of death as it does not always take into account all conditions contributing to the death.

  16. a

    Dataset Log

    • data-uvalibrary.opendata.arcgis.com
    • opendata.charlottesville.org
    • +2more
    Updated Oct 26, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Charlottesville (2017). Dataset Log [Dataset]. https://data-uvalibrary.opendata.arcgis.com/datasets/charlottesville::dataset-log
    Explore at:
    Dataset updated
    Oct 26, 2017
    Dataset authored and provided by
    City of Charlottesville
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Description

    Added new dataset OpenDataLog. The dataset stores detailed information regarding issues with the open data portal, new or changes to datasets on the portal as well as other information related to the City's Open Data Portal

  17. o

    US_Stocks_Financial_Indicators

    • openml.org
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicolas Carbone (2024). US_Stocks_Financial_Indicators [Dataset]. https://www.openml.org/d/46527
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 12, 2024
    Authors
    Nicolas Carbone
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    200+ Financial Indicators of US Stocks (2018)

    Context

    Algorithmic trading space is buzzing with new strategies. Companies have spent billions in infrastructure and R&D to be able to jump ahead of the competition and beat the market. Finding value in stocks is an art that very few mastered. Can a computer do that?

    Content

    This dataset contains 200+ financial indicators that are commonly found in the 10-K filings each publicly traded company releases yearly, for a period of US stocks for 2018.

    ## Target Variables The dataset includes two class labels: 1. PRICE VAR [%]: Lists the percent price variation for 2018 2. class: Binary classification for each stock where: - 1: Identifies stocks that an hypothetical trader should BUY - 0: Identifies stocks that an hypothetical trader should NOT BUY

    Important Notes

    • Some financial indicator values might be missing
    • Contains outliers with extreme values (possibly due to mistyping)
    • Price variations are calculated from the first trading day of 2018 to the last trading day of 2018
  18. Allen Brain Observatory - Visual Coding AWS Public Data Set

    • registry.opendata.aws
    Updated Jun 20, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allen Institute (2018). Allen Brain Observatory - Visual Coding AWS Public Data Set [Dataset]. https://registry.opendata.aws/allen-brain-observatory/
    Explore at:
    Dataset updated
    Jun 20, 2018
    Dataset provided by
    Allen Institute
    Description

    The Allen Brain Observatory – Visual Coding is a large-scale, standardized survey of physiological activity across the mouse visual cortex, hippocampus, and thalamus. It includes datasets collected with both two-photon imaging and Neuropixels probes, two complementary techniques for measuring the activity of neurons in vivo. The two-photon imaging dataset features visually evoked calcium responses from GCaMP6-expressing neurons in a range of cortical layers, visual areas, and Cre lines. The Neuropixels dataset features spiking activity from distributed cortical and subcortical brain regions, collected under analogous conditions to the two-photon imaging experiments. We hope that experimentalists and modelers will use these comprehensive, open datasets as a testbed for theories of visual information processing.

  19. d

    ANother test dataset

    • search.test.dataone.org
    Updated Jan 13, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rushiraj Nenuji (2026). ANother test dataset [Dataset]. https://search.test.dataone.org/view/urn%3Auuid%3Ae6f97c09-945e-40b7-8a0f-8551618ea78c
    Explore at:
    Dataset updated
    Jan 13, 2026
    Dataset provided by
    urn:node:mnTestKNB
    Authors
    Rushiraj Nenuji
    Time period covered
    Jan 1, 2025
    Area covered
    Description

    some abs. Visit https://dataone.org/datasets/urn%3Auuid%3Ae6f97c09-945e-40b7-8a0f-8551618ea78c for complete metadata about this dataset.

  20. Dataset S1

    • figshare.com
    xlsx
    Updated Mar 20, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weerawat Phuklia; Kaisone Padith; Christina M. Farris; Allen L. Richards; Koukeo Phommasone; Mayfong Mayxay; Mavuto Mukaka; Matthew T. Robinson; Paul N. Newton; Nicholas P.J. Day; Elizabeth A Ashley (2026). Dataset S1 [Dataset]. http://doi.org/10.6084/m9.figshare.31274773.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 20, 2026
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Weerawat Phuklia; Kaisone Padith; Christina M. Farris; Allen L. Richards; Koukeo Phommasone; Mayfong Mayxay; Mavuto Mukaka; Matthew T. Robinson; Paul N. Newton; Nicholas P.J. Day; Elizabeth A Ashley
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    For each isolate, MIC testing was performed in three independent experiments. The mean MIC value obtained from these independent biological replicates was used as the representative MIC for analysis.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aryan Singhal (2024). Alzheimer's Disease Multiclass Images Dataset [Dataset]. https://www.kaggle.com/datasets/aryansinghal10/alzheimers-multiclass-dataset-equal-and-augmented
Organization logo

Alzheimer's Disease Multiclass Images Dataset

Alzheimer's Disease dataset split into 4 classes

Explore at:
10 scholarly articles cite this dataset (View in Google Scholar)
zip(417170579 bytes)Available download formats
Dataset updated
Jun 26, 2024
Authors
Aryan Singhal
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

The Alzheimer's Disease Multiclass Dataset contains approximately 44,000 MRI images categorized into four distinct classes based on the severity of Alzheimer's disease. This dataset is intended for use in machine learning model training and testing. All images are skull-stripped and clean of non-brain tissue.

Dataset Structure The dataset is organized into the following four directories, each representing a different class of disease severity: NonDemented: Contains 12,800 MRI images of subjects with no signs of dementia. VeryMildDemented: Contains 11,200 MRI images of subjects with very mild symptoms of dementia. MildDemented: Contains 10,000 MRI images of subjects with mild dementia. ModerateDemented: Contains 10,000 MRI images of subjects with moderate dementia.

Image Details Total Number of Images: 44,000 Image Format: MRI scans as .JPG files Image Usage: Suitable for training and testing machine learning models focused on classifying Alzheimer's disease stages.

Disease Severity Classification The dataset follows a severity ranking system for Alzheimer's disease: NonDemented: No dementia. Very Mild Demented: Early signs of dementia, very mild symptoms. Mild Demented: Clear signs of dementia, but still mild. Moderate Demented: More pronounced symptoms of dementia, moderate severity.

This dataset is an augmented and upsampled version of the dataset below: https://www.kaggle.com/datasets/uraninjo/augmented-alzheimer-mri-dataset-v2

This dataset was upsampled as the original dataset had a large class imbalance.

Search
Clear search
Close search
Google apps
Main menu