100+ datasets found
  1. Nemotron-Content-Safety-Audio-Dataset

    • huggingface.co
    Updated Jan 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NVIDIA (2025). Nemotron-Content-Safety-Audio-Dataset [Dataset]. https://huggingface.co/datasets/nvidia/Nemotron-Content-Safety-Audio-Dataset
    Explore at:
    Dataset updated
    Jan 15, 2025
    Dataset provided by
    Nvidiahttp://nvidia.com/
    Authors
    NVIDIA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Nemotron Content Safety Audio Dataset

      Dataset Description
    

    The Nemotron Content Safety Audio Dataset is a multimodal extension of the Nemotron Content Safety Dataset V2 (Aegis 2.0), comprising 1,928 audio files generated from the test set prompts. This dataset enables multimodal AI safety research by providing spoken versions of adversarial and safety-critical prompts across 23 violation categories. LANGUAGE: All prompts are in English. However, the audio files were… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-Content-Safety-Audio-Dataset.

  2. h

    WorldSense

    • huggingface.co
    Updated Feb 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jack Hong (2025). WorldSense [Dataset]. https://huggingface.co/datasets/honglyhly/WorldSense
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 6, 2025
    Authors
    Jack Hong
    Description

    WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

    Jack Hong1, Shilin Yan1†, Jiayin Cai1, Xiaolong Jiang1, Yao Hu1, Weidi Xie2‡

    †Project Leader
    ‡Corresponding Author
    

    1Xiaohongshu Inc. 2Shanghai Jiao Tong University [🏠 Project Page] [📖 arXiv Paper] [🤗 Dataset] [🏆 Leaderboard]

      🔥 News
    

    2025.02.07 🌟 We release WorldSense, the first benchmark for real-world omnimodal understanding of MLLMs.

      👀 WorldSense Overview
    

    we… See the full description on the dataset page: https://huggingface.co/datasets/honglyhly/WorldSense.

  3. Eye Detection Dataset

    • kaggle.com
    zip
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Officier_Raccoon (2025). Eye Detection Dataset [Dataset]. https://www.kaggle.com/datasets/icebearogo/eye-detection-dataset
    Explore at:
    zip(18943517 bytes)Available download formats
    Dataset updated
    May 23, 2025
    Authors
    Officier_Raccoon
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    This dataset Contains images of human faces particularly focusing on the eye region. The dataset contains nearly 2000 well annotated images for training object detection models like RCNN, YOLO, etc for tracking and detecting the region of interest withing the eyeball. This dataset can be used for Building cataract detection models, eye movement tracking models and some more. Have fun using and building projects with this dataset!!

    --** NOTE **--- The annotation format contained in the dataset is : class_id, x_center, y_center, width, height. 'Standard for YOLOv8'

  4. Lending Club Loan - Pre-Processed Dataset

    • kaggle.com
    zip
    Updated Jul 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Santello (2022). Lending Club Loan - Pre-Processed Dataset [Dataset]. https://www.kaggle.com/datasets/gabrielsantello/lending-club-loan-preprocessed-dataset
    Explore at:
    zip(28819588 bytes)Available download formats
    Dataset updated
    Jul 6, 2022
    Authors
    Gabriel Santello
    Description

    A subset of the LendingClub DataSet obtained from Kaggle: https://www.kaggle.com/wordsforthewise/lending-club

    LendingClub is a US peer-to-peer lending company, headquartered in San Francisco, California. It was the first peer-to-peer lender to register its offerings as securities with the Securities and Exchange Commission (SEC), and to offer loan trading on a secondary market. LendingClub is the world's largest peer-to-peer lending platform.

  5. h

    openai-moderation-dataset

    • huggingface.co
    Updated Aug 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benjamin Anderson (2023). openai-moderation-dataset [Dataset]. https://huggingface.co/datasets/andersonbcdefg/openai-moderation-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 29, 2023
    Authors
    Benjamin Anderson
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    andersonbcdefg/openai-moderation-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Anabolic Steroids Dataset

    • kaggle.com
    zip
    Updated Dec 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kanchana1990 (2024). Anabolic Steroids Dataset [Dataset]. https://www.kaggle.com/datasets/kanchana1990/anabolic-steroids-dataset
    Explore at:
    zip(2487 bytes)Available download formats
    Dataset updated
    Dec 23, 2024
    Authors
    Kanchana1990
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Dataset Overview

    This dataset, titled "Anabolic Steroids", provides a meticulously curated compilation of nearly 50 steroids. It includes detailed information on their original names, common names, medicinal applications, abuse potential, side effects, historical context, and relative molecular mass (RMM). The dataset aims to serve as a resource for exploring the dual nature of anabolic steroids—both their therapeutic benefits and their misuse in sports and bodybuilding.

    Anabolic steroids are synthetic derivatives of testosterone that have been used for decades in medicine to treat conditions like anemia, muscle-wasting diseases, and hormone deficiencies. However, they are also widely abused for performance enhancement and aesthetic purposes. This dataset captures a comprehensive view of these compounds, making it valuable for researchers, educators, and data enthusiasts.

    Data Science Applications

    While this dataset is relatively small (approx 50 entries), it offers rich opportunities for exploratory analysis and domain-specific insights. Potential applications include:

    • Exploratory Data Analysis (EDA):

      • Analyze trends in medicinal vs. non-medicinal use.
      • Study correlations between molecular mass and reported side effects.
      • Visualize the historical development of anabolic steroids over time.
    • Domain-Specific Insights:

      • Examine the evolution of steroid formulations from the 1930s to the present.
      • Investigate patterns in therapeutic uses versus abuse potential.
    • Educational Use:

      • Serve as a teaching tool for understanding data cleaning, visualization, and analysis.
      • Provide insights into the pharmacological and chemical properties of anabolic steroids.

    Column Descriptors

    1. Original Name: The scientific or chemical name of the steroid compound (e.g., Testosterone).
    2. Common Name: The popular or brand name under which the steroid is marketed (e.g., Testoviron).
    3. Medicinal Use: Approved therapeutic applications of the steroid (e.g., treating anemia or hormone replacement therapy).
    4. Abused For: Non-medical uses often associated with performance enhancement or bodybuilding (e.g., bulking cycles, lean muscle retention).
    5. Side Effects: Documented adverse effects resulting from steroid use or abuse (e.g., liver toxicity, gynecomastia).
    6. History: A brief historical context about the steroid's development or usage (e.g., year introduced, medical approval status).
    7. Relative Molecular Mass (g/mol): The molar mass of the steroid compound, useful for chemical analysis.

    Ethically Mined Data

    This dataset has been ethically compiled from publicly available sources such as scientific journals, chemical databases, and educational websites. No proprietary or confidential information has been included. The data was aggregated to ensure accuracy and relevance while respecting intellectual property rights.

    Acknowledgements

    The following sources were instrumental in compiling this dataset: 1. PubChem Database – For verifying chemical properties and molecular mass values. 2. Wikipedia – For historical context and general information on anabolic steroids. 3. NIST Chemistry WebBook – For accurate molecular mass values and chemical details. 4. Scientific Journals – Referenced for medicinal uses, side effects documentation, and abuse patterns. 5. DALL·E 3 by OpenAI – Used to generate illustrative images related to anabolic steroids to complement dataset visualizations.

    Discouraging Steroid Usage and Highlighting Harms

    The misuse of anabolic steroids poses significant health risks and ethical concerns. While anabolic steroids have legitimate medical applications, their abuse for performance enhancement or aesthetic purposes can lead to severe physical and psychological side effects. Common adverse effects include liver damage, cardiovascular strain, hormonal imbalances, infertility, aggression, and mental health issues such as depression. Prolonged misuse can also result in irreversible damage to vital organs and an increased risk of life-threatening conditions like heart attacks or strokes. Beyond individual health risks, steroid abuse undermines the integrity of sports and creates unfair advantages in competitive environments. It is crucial to prioritize natural methods of achieving fitness goals and seek professional guidance for any medical conditions requiring treatment.

    Notes for Kaggle Users

    This dataset is not intended for machine learning due to its small size but serves as an excellent resource for exploratory data analysis (EDA), visualization projects, and domain-specific research into anabolic steroids' pharmacology and societal impact.

  7. Employee Attrition Classification Dataset

    • kaggle.com
    zip
    Updated Jun 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Umair Zia (2024). Employee Attrition Classification Dataset [Dataset]. https://www.kaggle.com/datasets/stealthtechnologies/employee-attrition-dataset
    Explore at:
    zip(1802815 bytes)Available download formats
    Dataset updated
    Jun 11, 2024
    Authors
    Umair Zia
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Synthetic Employee Attrition Dataset is a simulated dataset designed for the analysis and prediction of employee attrition. It contains detailed information about various aspects of an employee's profile, including demographics, job-related features, and personal circumstances.

    The dataset comprises 74,498 samples, split into training and testing sets to facilitate model development and evaluation. Each record includes a unique Employee ID and features that influence employee attrition. The goal is to understand the factors contributing to attrition and develop predictive models to identify at-risk employees.

    This dataset is ideal for HR analytics, machine learning model development, and demonstrating advanced data analysis techniques. It provides a comprehensive and realistic view of the factors affecting employee retention, making it a valuable resource for researchers and practitioners in the field of human resources and organizational development.

    FEATURES:

    Employee ID: A unique identifier assigned to each employee. Age: The age of the employee, ranging from 18 to 60 years. Gender: The gender of the employee Years at Company: The number of years the employee has been working at the company. Monthly Income: The monthly salary of the employee, in dollars. Job Role: The department or role the employee works in, encoded into categories such as Finance, Healthcare, Technology, Education, and Media. Work-Life Balance: The employee's perceived balance between work and personal life, (Poor, Below Average, Good, Excellent) Job Satisfaction: The employee's satisfaction with their job: (Very Low, Low, Medium, High) Performance Rating: The employee's performance rating: (Low, Below Average, Average, High) Number of Promotions: The total number of promotions the employee has received. Distance from Home: The distance between the employee's home and workplace, in miles. Education Level: The highest education level attained by the employee: (High School, Associate Degree, Bachelor’s Degree, Master’s Degree, PhD) Marital Status: The marital status of the employee: (Divorced, Married, Single) Job Level: The job level of the employee: (Entry, Mid, Senior) Company Size: The size of the company the employee works for: (Small,Medium,Large) Company Tenure: The total number of years the employee has been working in the industry. Remote Work: Whether the employee works remotely: (Yes or No) Leadership Opportunities: Whether the employee has leadership opportunities: (Yes or No) Innovation Opportunities: Whether the employee has opportunities for innovation: (Yes or No) Company Reputation: The employee's perception of the company's reputation: (Very Poor, Poor,Good, Excellent) Employee Recognition: The level of recognition the employee receives:(Very Low, Low, Medium, High)

    Attrition: Whether the employee has left the company, encoded as 0 (stayed) and 1 (Left).

  8. h

    geneva-generated-dataset

    • huggingface.co
    Updated Feb 10, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dat (2026). geneva-generated-dataset [Dataset]. https://huggingface.co/datasets/datht/geneva-generated-dataset
    Explore at:
    Dataset updated
    Feb 10, 2026
    Authors
    Dat
    Description

    datht/geneva-generated-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. o

    QSAR-DATASET-FOR-DRUG-TARGET-CHEMBL6184

    • openml.org
    Updated Jul 16, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Jeremy Besnard; Dr Ivan Olier; Dr Noureddin Sadawi; Dr Larisa Soldatova; Dr Crina Grosan; Prof Ross King; Dr Richard Bickerton; Prof Andrew Hopkins and Dr Willem van Hoorn (2016). QSAR-DATASET-FOR-DRUG-TARGET-CHEMBL6184 [Dataset]. https://www.openml.org/d/37680
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 16, 2016
    Authors
    Dr Jeremy Besnard; Dr Ivan Olier; Dr Noureddin Sadawi; Dr Larisa Soldatova; Dr Crina Grosan; Prof Ross King; Dr Richard Bickerton; Prof Andrew Hopkins and Dr Willem van Hoorn
    Description

    This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target ChEMBL_ID: CHEMBL6184 (TID: 102461), and it has 197 rows and 1248 features (not including molecule IDs and class feature: molecule_id and pXC50). The features represent Molecular Descriptors which were generated from SMILES strings. Missing value imputation was applied to this dataset (By choosing the Median).

  10. F

    LUMID: Large-scale Unlabled Medical Imaging Dataset for Unsupervised and...

    • frdr-dfdr.ca
    Updated Dec 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Osman, Islam; Gupta, Anubhav; Shehata, Mohamed S.; Braun, John W. (2024). LUMID: Large-scale Unlabled Medical Imaging Dataset for Unsupervised and Self-supervised Learning [Dataset]. http://doi.org/10.20383/103.01017
    Explore at:
    Dataset updated
    Dec 4, 2024
    Dataset provided by
    Federated Research Data Repository / dépôt fédéré de données de recherche
    Authors
    Osman, Islam; Gupta, Anubhav; Shehata, Mohamed S.; Braun, John W.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    LUMID is a large-scale, unlabeled collection of over 2 million medical images spanning multiple imaging modalities, including CT scans, X-rays, MRIs, and more. This dataset has been meticulously curated from publicly available medical imaging repositories, addressing the critical challenge of limited scale in existing public datasets and the inaccessibility of high-quality private datasets. The primary motivation behind creating this dataset is to empower the medical imaging community with a resource suited for developing and training advanced deep learning models. By enabling the use of unsupervised and self-supervised learning approaches, this dataset facilitates the learning of rich, transferable representations that can significantly enhance performance across various medical imaging tasks, including classification, segmentation, and anomaly detection.

    Key Features: 1) Diversity: Comprising images from multiple modalities and a wide range of medical imaging scenarios. 2) Scalability: A dataset of unprecedented size, providing a robust foundation for training deep neural networks. 3) Versatility: Specifically designed for unsupervised and self-supervised learning methods, fostering innovation in representation learning for medical imaging. 4) Open Access: Built entirely from public datasets, ensuring transparency and reproducibility.

    This dataset is intended to serve as a cornerstone for advancing research in medical AI, fostering the development of models capable of generalizing across diverse imaging types and clinical conditions.

  11. h

    arithmetic-priming-dataset

    • huggingface.co
    Updated Jul 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bocchese Giacomo (2023). arithmetic-priming-dataset [Dataset]. https://huggingface.co/datasets/BoccheseGiacomo/arithmetic-priming-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2023
    Authors
    Bocchese Giacomo
    Description

    BoccheseGiacomo/arithmetic-priming-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. N

    Income Distribution by Quintile: Mean Household Income in Benton township,...

    • neilsberg.com
    csv, json
    Updated Jan 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2024). Income Distribution by Quintile: Mean Household Income in Benton township, Columbia County, Pennsylvania [Dataset]. https://www.neilsberg.com/research/datasets/cd8aef22-b041-11ee-aaca-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Jan 11, 2024
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Columbia County, Benton Township, Pennsylvania
    Variables measured
    Income Level, Mean Household Income
    Measurement technique
    The data presented in this dataset is derived from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. It delineates income distributions across income quintiles (mentioned above) following an initial analysis and categorization. Subsequently, we adjusted these figures for inflation using the Consumer Price Index retroactive series via current methods (R-CPI-U-RS). For additional information about these estimations, please contact us via email at research@neilsberg.com
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset presents the mean household income for each of the five quintiles in Benton township, Columbia County, Pennsylvania, as reported by the U.S. Census Bureau. The dataset highlights the variation in mean household income across quintiles, offering valuable insights into income distribution and inequality.

    Key observations

    • Income disparities: The mean income of the lowest quintile (20% of households with the lowest income) is 19,940, while the mean income for the highest quintile (20% of households with the highest income) is 192,218. This indicates that the top earners earn 10 times compared to the lowest earners.
    • *Top 5%: * The mean household income for the wealthiest population (top 5%) is 257,928, which is 134.19% higher compared to the highest quintile, and 1293.52% higher compared to the lowest quintile.

    https://i.neilsberg.com/ch/benton-township-columbia-county-pa-mean-household-income-by-quintiles.jpeg" alt="Mean household income by quintiles in Benton township, Columbia County, Pennsylvania (in 2022 inflation-adjusted dollars))">

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

    Income Levels:

    • Lowest Quintile
    • Second Quintile
    • Third Quintile
    • Fourth Quintile
    • Highest Quintile
    • Top 5 Percent

    Variables / Data Columns

    • Income Level: This column showcases the income levels (As mentioned above).
    • Mean Household Income: Mean household income, in 2022 inflation-adjusted dollars for the specific income level.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Benton township median household income. You can refer the same here

  13. R

    Dataset First Dataset

    • universe.roboflow.com
    zip
    Updated May 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    1st (2025). Dataset First Dataset [Dataset]. https://universe.roboflow.com/1st-spusr/dataset-first
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    1st
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Dataset First Bounding Boxes
    Description

    Dataset First

    ## Overview
    
    Dataset First is a dataset for object detection tasks - it contains Dataset First annotations for 280 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  14. N

    Rochester, IL Population Breakdown by Gender Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Rochester, IL Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b24feec9-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Rochester, Illinois
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Rochester by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Rochester across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a slight majority of female population, with 51.82% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Rochester is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Rochester total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Rochester Population by Race & Ethnicity. You can refer the same here

  15. N

    Merced, CA Population Breakdown by Gender Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Merced, CA Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b243e3c4-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Merced, California
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Merced by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Merced across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a slight majority of female population, with 50.64% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Merced is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Merced total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Merced Population by Race & Ethnicity. You can refer the same here

  16. r

    Data from: SMARTBUY dataset

    • researchdata.se
    • resodate.org
    • +2more
    Updated Jan 29, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karl Andersson; Damianos Gavalas (2021). SMARTBUY dataset [Dataset]. http://doi.org/10.5878/cg82-h783
    Explore at:
    (181405)Available download formats
    Dataset updated
    Jan 29, 2021
    Dataset provided by
    Luleå University of Technology
    Authors
    Karl Andersson; Damianos Gavalas
    Time period covered
    Sep 1, 2018 - Dec 31, 2018
    Area covered
    Greece
    Description

    The dataset represents a compilation of user interaction data generated by users who participated in the project's pilot activities in Patras, Greece. Data was generated by users in the SMARTBUY app and includes information about users, stores, product categories, professions, and events.

    The dataset comprises the following data: - users: user account data for the Patras pilot users - occupation: all possible occupations that the pilot users could choose from - stores: stores which participated in the Patras pilot - sel_products_cat: products uploaded to the SMARTBUY platform by retailers - events: geo-stamped and time-stamped descriptions of a user interaction event (for instance, "user_id 67 rated product_id 722 with rating 4 at location x1 at datetime y1", or "user_id 91 denoted product_id 78 as favorite at location x2 at datetime y2") - event_types: all possible event types captured by the SMARTBUY platform ('Product searches', 'Product views', 'Featured product', 'Products near you views', 'Product photos browsed', 'Product ratings', 'Clicks on Read More button to read product reviews', 'Clicks on Open map button', 'Clicks on Send this info by email button', 'Products denoted as Favorite')

    Privacy-sensitive information such as user names, retailer owner names and store names and keywords searched are anonymized.

  17. R

    11 Original Dataset

    • universe.roboflow.com
    zip
    Updated May 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    caps (2025). 11 Original Dataset [Dataset]. https://universe.roboflow.com/caps-vmqdh/4-11-original/dataset/4
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 9, 2025
    Dataset authored and provided by
    caps
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Person Bounding Boxes
    Description

    11 Original

    ## Overview
    
    11 Original is a dataset for object detection tasks - it contains Person annotations for 225 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  18. N

    Noble, OK Population Breakdown by Gender Dataset: Male and Female Population...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Noble, OK Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b248175d-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Noble, Oklahoma
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Noble by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Noble across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a slight majority of male population, with 51.18% of total population being male. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Noble is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Noble total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Noble Population by Race & Ethnicity. You can refer the same here

  19. N

    Newville, PA Population Breakdown by Gender Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Newville, PA Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b247f4a0-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Newville, Pennsylvania
    Variables measured
    Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Newville by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Newville across both sexes and to determine which sex constitutes the majority.

    Key observations

    There is a majority of male population, with 53.66% of total population being male. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

    Variables / Data Columns

    • Gender: This column displays the Gender (Male / Female)
    • Population: The population of the gender in the Newville is shown in this column.
    • % of Total Population: This column displays the percentage distribution of each gender as a proportion of Newville total population. Please note that the sum of all percentages may not equal one due to rounding of values.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Newville Population by Race & Ethnicity. You can refer the same here

  20. R

    Windsor V1 Dataset

    • universe.roboflow.com
    zip
    Updated May 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    cocotoyolov1 (2023). Windsor V1 Dataset [Dataset]. https://universe.roboflow.com/cocotoyolov1/windsor-v1/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 8, 2023
    Dataset authored and provided by
    cocotoyolov1
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Road Defects Bounding Boxes
    Description

    Windsor V1

    ## Overview
    
    Windsor V1 is a dataset for object detection tasks - it contains Road Defects annotations for 211 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
NVIDIA (2025). Nemotron-Content-Safety-Audio-Dataset [Dataset]. https://huggingface.co/datasets/nvidia/Nemotron-Content-Safety-Audio-Dataset
Organization logo

Nemotron-Content-Safety-Audio-Dataset

Nemotron Content Safety Audio Dataset

nvidia/Nemotron-Content-Safety-Audio-Dataset

Explore at:
Dataset updated
Jan 15, 2025
Dataset provided by
Nvidiahttp://nvidia.com/
Authors
NVIDIA
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Nemotron Content Safety Audio Dataset

  Dataset Description

The Nemotron Content Safety Audio Dataset is a multimodal extension of the Nemotron Content Safety Dataset V2 (Aegis 2.0), comprising 1,928 audio files generated from the test set prompts. This dataset enables multimodal AI safety research by providing spoken versions of adversarial and safety-critical prompts across 23 violation categories. LANGUAGE: All prompts are in English. However, the audio files were… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-Content-Safety-Audio-Dataset.

Search
Clear search
Close search
Google apps
Main menu