Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Small Size is a dataset for object detection tasks - it contains Small Object annotations for 450 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A fictional dataset for exploratory data analysis (EDA) and to test simple prediction models.
This toy dataset features 150000 rows and 6 columns.
Note: All data is fictional. The data has been generated so that their distributions are convenient for statistical analysis.
Number: A simple index number for each row
City: The location of a person (Dallas, New York City, Los Angeles, Mountain View, Boston, Washington D.C., San Diego and Austin)
Gender: Gender of a person (Male or Female)
Age: The age of a person (Ranging from 25 to 65 years)
Income: Annual income of a person (Ranging from -674 to 177175)
Illness: Is the person Ill? (Yes or No)
Stock photo by Mika Baumeister on Unsplash.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
## Overview
Drone Data Small is a dataset for object detection tasks - it contains Drone Data Small annotations for 20,822 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [MIT license](https://creativecommons.org/licenses/MIT).
Facebook
TwitterNeighborhood Opportunity Fund, Small Project data includes all projects that have received a final grant payment. Small Projects may have grants up to $250,000 and must be located in an Eligible Commercial Corridor. The Neighborhood Opportunity Fund (“NOF”) receives funds from downtown development in order to support the growth and creation of inclusively vibrant commercial corridors in Chicago’s underserved neighborhoods. Business and property owners may apply for grant funding that will pay for the development or rehabilitation of real estate and projects that support new or expanding businesses or cultural assets.
Facebook
TwitterLikes and image data from the community art website Behance. This is a small, anonymized, version of a larger proprietary dataset.
Metadata includes
appreciates (likes)
timestamps
extracted image features
Basic Statistics:
Users: 63,497
Items: 178,788
Appreciates (likes): 1,000,000
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Small Data Subset is a dataset for object detection tasks - it contains Faces annotations for 215 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterAutoTrain Dataset for project: aymara-t5-small
Dataset Description
This dataset has been automatically processed by AutoTrain for project aymara-t5-small.
Languages
The BCP-47 code for the dataset's language is unk.
Dataset Structure
Data Instances
A sample from this dataset looks as follows: [ { "feat_Lang": "Spanish", "feat_langcode": "es", "feat_Source": "Janiw sartasipk\u00e4ti aka mayiw phuqasi\u00f1apkama, presidentex… See the full description on the dataset page: https://huggingface.co/datasets/alvations/autotrain-data-aymara-t5-small.
Facebook
Twitterhttps://www.licenses.ai/ai-licenseshttps://www.licenses.ai/ai-licenses
Tabular dataset for data analysis and machine learning practice. The dataset is about the market and is usable for Power BI practice and data science.
Facebook
TwitterThese datasets contain reviews from the Steam video game platform, and information about which games were bundled together.
Metadata includes
reviews
purchases, plays, recommends (likes)
product bundles
pricing information
Basic Statistics:
Reviews: 7,793,069
Users: 2,567,538
Items: 15,474
Bundles: 615
Facebook
TwitterDataset includes general information about every grants project funded by the NYDDPC from 2004 to present
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Small Data Set All 640 X 640 is a dataset for object detection tasks - it contains Objects DYn7 annotations for 848 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterThe Poverty Mapping Project: Poverty and Food Security Case Studies data set consists of small area estimates of poverty, inequality, food security and related measures for subnational administrative Units in Mexico, Ecuador, Kenya, Malawi, Bangladesh, Sri Lanka, Nigeria and Vietnam. These data come from country level cases studies that examine poverty and food security from a spatial analysis perspective. The data products include shapefiles (vector data) and tabular data sets (csv format). Additionally, a data catalog (xls format) containing detailed information and documentation is provided. This data set is produced by the Columbia University Center for International Earth Science Information Network (CIESIN) and Centro Internacional de Agricultura Tropical (CIAT). The data set was originally produced by CIAT, International Maize and Wheat Improvement Center (CIMMYT), International Livestock Research Institute (ILRI), International Food Policy Research Institute (IFPRI), International Rice Research Institute (IRRI), International Water Management Institute (IWMI), and International Institute for Tropical Agriculture (IITA).
Facebook
TwitterThe Poverty Mapping Project: Small Area Estimates of Poverty and Inequality data set consists of consumption-based poverty, inequality and related measures for subnational administrative Units in approximately twenty countries throughout Africa, Asia, Europe, North America, and South America. These measures are derived on a country-level basis from a combination of census and survey data using small area estimates techniques. The collection of data have been compiled, integrated and standardized from the original data providers into a unified spatially referenced and globally consistent data set. The data products include shapefiles (vector data), tabular data sets (csv format), and centroids (csv file with latitude and longitude of a geographic Unit and associated poverty estimates). Additionally, a data catalog (xls format) containing detailed information and documentation is provided. This data set is produced by the Columbia University Center for International Earth Science Information Network (CIESIN) in collaboration with a number of external data providers.
Facebook
TwitterThis metadata record describes model outputs and supporting model code for the Data-Driven Drought Prediction project of the Water Resources Mission Area Drought Program. The data listed here include outputs of multiple machine learning model types for predicting hydrological drought at select locations within the conterminous United States. The child items referenced below correspond to different models and spatial extents (Colorado River Basin region or conterminous United States). See the list below or metadata files in each sub-folder for more details. 1. Daily streamflow percentile predictions for the Colorado River Basin region — Outputs from long short-term memory (LSTM) deep learning models corresponding to selected stream gage locations.
Facebook
TwitterThis webinar, sponsored by the Children's Bureau's Division of State Systems, addresses the management of action plans as small projects. Though operating on a small scale, each action plan project requires management and attention. The seven templates and worksheets described in this webinar provide core principles and techniques for addressing essential elements of project management, while offering the time-savings of a customizable structure.
The speaker, Dr. William Brantley, is currently involved in building the data science and project management capabilities for the Office of Policy Management's Strategic Workforce Planning. He shares an easy-to-use, public domain tool he developed for use in managing small projects.
Metadata-only record linking to the original dataset. Open original dataset below.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Small Small is a dataset for object detection tasks - it contains Fashaion annotations for 700 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterSmall Business Improvement Fund data includes all projects that have received a final grant payment since January, 2001. SBIF uses TIF revenues to provide reimbursement grants to small commercial and industrial businesses and property owners to help fund permanent improvements to their buildings and improve the appearance of their neighborhoods.
Facebook
Twitterhttps://heidata.uni-heidelberg.de/api/datasets/:persistentId/versions/1.1/customlicense?persistentId=doi:10.11588/DATA/GV8NBLhttps://heidata.uni-heidelberg.de/api/datasets/:persistentId/versions/1.1/customlicense?persistentId=doi:10.11588/DATA/GV8NBL
The NATCOOP project set out to study how nature shapes the preferences and incentives of economic agents and how this in turn affects common-pool resource management. Imagine a group of fishermen targeting a species that requires a lot of teamwork to harvest. Do these fishers become more social over time compared to fishers that work in a more solitary manner? If so, does this have implications for how the fishery should be managed? To study this, the NATCOOP team travelled to Chile and Tanzania and collected data using surveys and economic experiments. These two very different countries have a large population of small-scale fishermen, and both host several distinct types of fisheries. Over the course of five field trips, the project team surveyed more than 2500 fishermen with each field trip contributing to the main research question by measuring fishermen’s preferences for cooperation and risk. Additionally, each fieldtrip aimed to answer another smaller research question that was either focused on risk taking or cooperation behavior in the fisheries. The data from both surveys and experiments are now publicly available and can be freely studied by other researchers, resource managers, or interested citizens. Overall, the NATCOOP dataset contains participants’ responses to a plethora of survey questions and their actions during incentivized economic experiments. It is available in both the .dta and .csv format, and its use is recommended with statistical software such as R or Stata. For those unaccustomed with statistical analysis, we included a video tutorial on how to use the data set in the open-source program R.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset simulates the financial records of a small-town coffee shop over a two-year period (Jan 2022 – Dec 2023).
It was designed for data science, bookkeeping, and analytics projects — including financial dashboards, revenue forecasting, and expense tracking.
The dataset contains 5 CSV files representing different business accounts:
1. checking_account_main.csv - Daily sales deposits (hot drinks, cold drinks, pastries, sandwiches) + operating expenses
2. checking_account_secondary.csv - Monthly transfers between accounts + payroll funding
3. credit_card_account.csv - Weekly credit card expenses (supplies, utilities, vendor charges) and payments
4. gusto_payroll.csv - Payroll data for 3 employees + 1 contractor
5. gusto_payroll_bc.csv - Payroll data for 3 full-time employees + 1 contractor + 1 seasonal employee, with actual tax breakdown for the province of British Columbia, Canada
checking_account_main.csvchecking_account_secondary.csvcredit_card_account.csvgusto_payroll.csvgusto_payroll_bc.csvThis file simulates bi-weekly payroll data for a small coffee shop in British Columbia, Canada, covering January 2022 – December 2023.
It reflects realistic Canadian payroll structure with federal and provincial tax breakdowns, CPP, EI, and additional factors.
Columns:
- date → Pay date (bi-weekly schedule)
- employee_id → Unique identifier for each employee
- employee_name → Owner, Barista 1, Barista 2, Manager, Contractor, plus a seasonal Barista (June–Aug 2022)
- role → Role within the coffee shop (Owner, Barista, Manager, Contractor)
- gross_pay → Total earnings before deductions (wages + tips + reimbursements)
- federal_tax → Federal income tax withheld
- provincial_tax → British Columbia income tax withheld
- cpp_employee → Employee CPP contribution
- ei_employee → Employee EI contribution
- other_deductions → Placeholder for possible deductions (e.g., garnishments, union dues)
- net_pay → Take-home pay after deductions
- tips → Declared tips (taxable, included in gross pay)
- travel_reimbursement → Non-taxable reimbursement for travel expenses (if applicable)
- cpp_employer → Employer portion of CPP contributions
- ei_employer → Employer portion of EI contributions
Notes:
- Payroll data is synthetic but modeled on Canadian payroll rules (2022–2023 rates).
- A seasonal barista employee is included (employed June 1 – Aug 31, 2022).
- Travel reimbursements are non-taxable and recorded separately.
- This file allows users to practice payroll accounting, deductions analysis, and tax reconciliation.
This dataset is released under the MIT License, free to use for research, learning, or commercial purposes.
⭐ If you use this dataset in your project or notebook, please credit and share your work, it helps the community!
📷 Photo Credits: freepik
Facebook
TwitterThis is the complete dataset for the 500 Cities project 2018 release. This dataset includes 2016, 2015 model-based small area estimates for 27 measures of chronic disease related to unhealthy behaviors (5), health outcomes (13), and use of preventive services (9). Data were provided by the Centers for Disease Control and Prevention (CDC), Division of Population Health, Epidemiology and Surveillance Branch. The project was funded by the Robert Wood Johnson Foundation (RWJF) in conjunction with the CDC Foundation. It represents a first-of-its kind effort to release information on a large scale for cities and for small areas within those cities. It includes estimates for the 500 largest US cities and approximately 28,000 census tracts within these cities. These estimates can be used to identify emerging health problems and to inform development and implementation of effective, targeted public health prevention activities. Because the small area model cannot detect effects due to local interventions, users are cautioned against using these estimates for program or policy evaluations. Data sources used to generate these measures include Behavioral Risk Factor Surveillance System (BRFSS) data (2016, 2015), Census Bureau 2010 census population data, and American Community Survey (ACS) 2012-2016, 2011-2015 estimates. Because some questions are only asked every other year in the BRFSS, there are 4 measures (high blood pressure, taking high blood pressure medication, high cholesterol, cholesterol screening) from the 2015 BRFSS that are the same in the 2018 release as the previous 2017 release. More information about the methodology can be found at www.cdc.gov/500cities.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Small Size is a dataset for object detection tasks - it contains Small Object annotations for 450 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).