100+ datasets found

A Dataset of Water Quality and Related Variables in U.S. Reservoirs
catalog.data.gov
s.cnmilf.com
+1more
Updated Jun 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2025). A Dataset of Water Quality and Related Variables in U.S. Reservoirs [Dataset]. https://catalog.data.gov/dataset/a-dataset-of-water-quality-and-related-variables-in-u-s-reservoirs
Explore at:
Dataset updated
Jun 13, 2025
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Area covered
United States
Description
This dataset presents a rich collection of physicochemical parameters from 147 reservoirs distributed across the conterminous U.S. One hundred and eight of the reservoirs were selected using a statistical survey design and can provide unbiased inferences to the condition of all U.S. reservoirs. These data could be of interest to local water management specialists or those assessing the ecological condition of reservoirs at the national scale. These data have been reviewed in accordance with U.S. Environmental Protection Agency policy and approved for publication. This dataset is not publicly accessible because: It is too large. It can be accessed through the following means: https://portal-s.edirepository.org/nis/mapbrowse?scope=edi&identifier=2033&revision=1. Format: This dataset presents water quality and related variables for 147 reservoirs distributed across the U.S. Water quality parameters were measured during the summers of 2016, 2018, and 2020 – 2023. Measurements include nutrient concentration, algae abundance, dissolved oxygen concentration, and water temperature, among many others. Dataset includes links to other national and global scale data sets that provide additional variables.
Global Retail Sales Data: Orders, Reviews & Trends
kaggle.com
zip
Updated Dec 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Adarsh Anil Kumar (2024). Global Retail Sales Data: Orders, Reviews & Trends [Dataset]. https://www.kaggle.com/datasets/adarsh0806/influencer-merchandise-sales
Explore at:
zip(125403 bytes)Available download formats
Dataset updated
Dec 10, 2024
Authors
Adarsh Anil Kumar
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The Global Retail Sales Data provided here is a self-generated synthetic dataset created using Random Sampling techniques provided by the Numpy Package. The dataset emulates information regarding merchandise sales through a retail website set up by a popular fictional influencer based in the US between the '23-'24 period. The influencer would sell clothing, ornaments and other products at variable rates through the retail website to all of their followers across the world. Imagine that the influencer executes high levels of promotions for the materials they sell, prompting more ratings and reviews from their followers, pushing more user engagement.

This dataset is placed to help with practicing Sentiment Analysis or/and Time Series Analysis of sales, etc. as they are very important topics for Data Analyst prospects. The column description is given as follows:

Order ID: Serves as an identifier for each order made.

Order Date: The date when the order was made.

Product ID: Serves as an identifier for the product that was ordered.

Product Category: Category of Product sold(Clothing, Ornaments, Other).

Buyer Gender: Genders of people that have ordered from the website (Male, Female).

Buyer Age: Ages of the buyers.

Order Location: The city where the order was made from.

International Shipping: Whether the product was shipped internationally or not. (Yes/No)

Sales Price: Price tag for the product.

Shipping Charges: Extra charges for international shipments.

Sales per Unit: Sales cost while including international shipping charges.

Quantity: Quantity of the product bought.

Total Sales: Total sales made through the purchase.

Rating: User rating given for the order.

Review: User review given for the order.
N
Gratis, OH Population Breakdown by Gender Dataset: Male and Female...
neilsberg.com
csv, json
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Gratis, OH Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b235d8fd-f25d-11ef-8c1b-3860777c1fe6/
Explore at:
json, csvAvailable download formats
Dataset updated
Feb 24, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Gratis
Variables measured
Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Gratis by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Gratis across both sexes and to determine which sex constitutes the majority.

Key observations

There is a slight majority of female population, with 50.0% of total population being female. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

Variables / Data Columns

Gender: This column displays the Gender (Male / Female)

Population: The population of the gender in the Gratis is shown in this column.

% of Total Population: This column displays the percentage distribution of each gender as a proportion of Gratis total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Gratis Population by Race & Ethnicity. You can refer the same here
Hydrographic and Impairment Statistics Database: THRB
catalog.data.gov
datasets.ai
Updated Nov 25, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Park Service (2025). Hydrographic and Impairment Statistics Database: THRB [Dataset]. https://catalog.data.gov/dataset/hydrographic-and-impairment-statistics-database-thrb
Explore at:
Dataset updated
Nov 25, 2025
Dataset provided by
National Park Servicehttp://www.nps.gov/
Description
Hydrographic and Impairment Statistics (HIS) is a National Park Service (NPS) Water Resources Division (WRD) project established to track certain goals created in response to the Government Performance and Results Act of 1993 (GPRA). One water resources management goal established by the Department of the Interior under GRPA requires NPS to track the percent of its managed surface waters that are meeting Clean Water Act (CWA) water quality standards. This goal requires an accurate inventory that spatially quantifies the surface water hydrography that each bureau manages and a procedure to determine and track which waterbodies are or are not meeting water quality standards as outlined by Section 303(d) of the CWA. This project helps meet this DOI GRPA goal by inventorying and monitoring in a geographic information system for the NPS: (1) CWA 303(d) quality impaired waters and causes; and (2) hydrographic statistics based on the United States Geological Survey (USGS) National Hydrography Dataset (NHD). Hydrographic and 303(d) impairment statistics were evaluated based on a combination of 1:24,000 (NHD) and finer scale data (frequently provided by state GIS layers).
North American Dataset
ncei.noaa.gov
data.cnra.ca.gov
+1more
Updated Oct 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Menne, Matthew J.; Williams, Claude N. Jr.; Korzeniewski, Bryant (2017). North American Dataset [Dataset]. http://doi.org/10.7289/v5348hn5
Explore at:
Unique identifier
https://doi.org/10.7289/v5348hn5
Dataset updated
Oct 2017
Dataset provided by
National Centers for Environmental Informationhttps://www.ncei.noaa.gov/
National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
Authors
Menne, Matthew J.; Williams, Claude N. Jr.; Korzeniewski, Bryant
Time period covered
Jan 1, 1850 - Present
Area covered

Description
The North American Dataset contains sets of Maximum, Minimum and Average Temperature data and Precipitation data that are either (1) raw (non-adjusted though flagged for possible quality issues), (2) adjusted due to time of observation bias (TOB) or (3) put through the Pairwise Homogenization Algorithm (PHA). These files contain North American stations and its data are measured in hundredths of degrees Celsius (without decimal place) for temperature and tenths of millimeters (without decimal place) for Precipitation. Each file includes the entire available Period of Record.
MHS Dashboard Children and Youth Demographic Datasets
data.ca.gov
data.chhs.ca.gov
+1more
csv, zip
Updated Nov 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
California Department of Health Care Services (2025). MHS Dashboard Children and Youth Demographic Datasets [Dataset]. https://data.ca.gov/dataset/mhs-dashboard-children-and-youth-demographic-datasets
Explore at:
csv, zipAvailable download formats
Dataset updated
Nov 7, 2025
Dataset authored and provided by
California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The following datasets are based on the children and youth (under age 21) beneficiary population and consist of aggregate Mental Health Service data derived from Medi-Cal claims, encounter, and eligibility systems. These datasets were developed in accordance with California Welfare and Institutions Code (WIC) § 14707.5 (added as part of Assembly Bill 470 on 10/7/17). Please contact BHData@dhcs.ca.gov for any questions or to request previous years’ versions of these datasets. Note: The Performance Dashboard AB 470 Report Application Excel tool development has been discontinued. Please see the Behavioral Health reporting data hub at https://behavioralhealth-data.dhcs.ca.gov/ for access to dashboards utilizing these datasets and other behavioral health data.
House Price Regression Dataset
kaggle.com
zip
Updated Sep 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
Explore at:
zip(27045 bytes)Available download formats
Dataset updated
Sep 6, 2024
Authors
Prokshitha Polemoni
Description
Home Value Insights: A Beginner's Regression Dataset

This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

Features:

Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.

Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.

Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.

Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.

Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.

Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.

Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.

House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

Potential Uses:

Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.

Versatility:

The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.
N
Dataset for Kiawah Island, SC Census Bureau Demographics and Population...
neilsberg.com
Updated Jul 24, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2024). Dataset for Kiawah Island, SC Census Bureau Demographics and Population Distribution Across Age // 2024 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b79be6a5-5460-11ee-804b-3860777c1fe6/
Explore at:
Dataset updated
Jul 24, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Kiawah Island, South Carolina
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Kiawah Island population by age. The dataset can be utilized to understand the age distribution and demographics of Kiawah Island.

Content

The dataset constitues the following three datasets

Kiawah Island, SC Age Group Population Dataset: A complete breakdown of Kiawah Island age demographics from 0 to 85 years, distributed across 18 age groups

Kiawah Island, SC Age Cohorts Dataset: Children, Working Adults, and Seniors in Kiawah Island - Population and Percentage Analysis

Kiawah Island, SC Population Pyramid Dataset: Age Groups, Male and Female Population, and Total Population for Demographics Analysis

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
Covid-19 variants survival data
kaggle.com
zip
Updated Jan 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Massock Batalong Maurice Blaise (2025). Covid-19 variants survival data [Dataset]. https://www.kaggle.com/datasets/lumierebatalong/covid-19-variants-survival-data
Explore at:
zip(216589 bytes)Available download formats
Dataset updated
Jan 2, 2025
Authors
Massock Batalong Maurice Blaise
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Overview:

This dataset provides a unique resource for researchers and data scientists interested in the global dynamics of the COVID-19 pandemic. It focuses on the impact of different SARS-CoV-2 variants and mutations on the duration of local epidemics. By combining variant information with epidemiological data, this dataset allows for a comprehensive analysis of factors influencing the trajectory of the pandemic.

Key Features:

Global Coverage: Includes data from multiple countries.

Variant-Specific Information: Detailed records for various SARS-CoV-2 variants.

Epidemic Duration: Data on the duration of local epidemics, accounting for right-censoring.

Epidemiological Variables: Includes mortality rates, a proxy for R0, transmission proxies, and other pertinent variables.

Geographical characteristics: Include a continent variable for exploring geographical patterns

Time varying variables: Include the number of waves and the number of variants in the different countries for more in-depth exploration.

Data Source: The data combines information from the Johns Hopkins University COVID-19 dataset (confirmed_cases.csv and deaths_cases.csv) and the covariants.org dataset (variants.csv). The dataset you see here is the combination of two datasets from Johns Hopkins University and covariants.org.

Questions to Inspire Users:

This dataset is designed for a diverse set of analytical questions. Here are some ideas to inspire the Kaggle community:

Survival Analysis:

How do different SARS-CoV-2 variants influence the duration of local epidemics?

Which factors (mortality, R0, etc.) are most strongly associated with shorter or longer epidemic durations?

Does the type of variant/mutation (mutation,S, Omicron, Delta, Other) have a significant impact on epidemic duration?

Is there a geographical pattern to the duration of epidemics?

Epidemiological Analysis:

How do local transmission rates (represented by our proxy of R0) affect the duration of an epidemic?

Do countries with higher mortality rates have different patterns of epidemic progression?

How can we predict the duration of an epidemic based on its initial characteristics?

How does the number of epidemic waves impact the duration of an epidemic?

Does the number of variants in a country affect the duration of an épidémie?

Data Science/Machine Learning:

Can we develop a machine learning model to predict the duration of an epidemic?

What features have the best predictive power ?

Can we identify clusters of variants/regions with similar epidemic patterns?

Are there interactions between variables that can explain the non-linearities that we have identified ?
I
Cline Center Coup d’État Project Dataset
databank.illinois.edu
Updated May 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Buddy Peyton; Joseph Bajjalieh; Dan Shalmon; Michael Martin; Emilio Soto (2025). Cline Center Coup d’État Project Dataset [Dataset]. http://doi.org/10.13012/B2IDB-9651987_V7
Explore at:
Unique identifier
https://doi.org/10.13012/B2IDB-9651987_V7
Dataset updated
May 11, 2025
Authors
Buddy Peyton; Joseph Bajjalieh; Dan Shalmon; Michael Martin; Emilio Soto
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Coups d'Ètat are important events in the life of a country. They constitute an important subset of irregular transfers of political power that can have significant and enduring consequences for national well-being. There are only a limited number of datasets available to study these events (Powell and Thyne 2011, Marshall and Marshall 2019). Seeking to facilitate research on post-WWII coups by compiling a more comprehensive list and categorization of these events, the Cline Center for Advanced Social Research (previously the Cline Center for Democracy) initiated the Coup d’État Project as part of its Societal Infrastructures and Development (SID) project. More specifically, this dataset identifies the outcomes of coup events (i.e., realized, unrealized, or conspiracy) the type of actor(s) who initiated the coup (i.e., military, rebels, etc.), as well as the fate of the deposed leader. Version 2.1.3 adds 19 additional coup events to the data set, corrects the date of a coup in Tunisia, and reclassifies an attempted coup in Brazil in December 2022 to a conspiracy. Version 2.1.2 added 6 additional coup events that occurred in 2022 and updated the coding of an attempted coup event in Kazakhstan in January 2022. Version 2.1.1 corrected a mistake in version 2.1.0, where the designation of “dissident coup” had been dropped in error for coup_id: 00201062021. Version 2.1.1 fixed this omission by marking the case as both a dissident coup and an auto-coup. Version 2.1.0 added 36 cases to the data set and removed two cases from the v2.0.0 data. This update also added actor coding for 46 coup events and added executive outcomes to 18 events from version 2.0.0. A few other changes were made to correct inconsistencies in the coup ID variable and the date of the event. Version 2.0.0 improved several aspects of the previous version (v1.0.0) and incorporated additional source material to include: • Reconciling missing event data • Removing events with irreconcilable event dates • Removing events with insufficient sourcing (each event needs at least two sources) • Removing events that were inaccurately coded as coup events • Removing variables that fell below the threshold of inter-coder reliability required by the project • Removing the spreadsheet ‘CoupInventory.xls’ because of inadequate attribution and citations in the event summaries • Extending the period covered from 1945-2005 to 1945-2019 • Adding events from Powell and Thyne’s Coup Data (Powell and Thyne, 2011)
Items in this Dataset 1. Cline Center Coup d'État Codebook v.2.1.3 Codebook.pdf - This 15-page document describes the Cline Center Coup d’État Project dataset. The first section of this codebook provides a summary of the different versions of the data. The second section provides a succinct definition of a coup d’état used by the Coup d'État Project and an overview of the categories used to differentiate the wide array of events that meet the project's definition. It also defines coup outcomes. The third section describes the methodology used to produce the data. Revised February 2024 2. Coup Data v2.1.3.csv - This CSV (Comma Separated Values) file contains all of the coup event data from the Cline Center Coup d’État Project. It contains 29 variables and 1000 observations. Revised February 2024 3. Source Document v2.1.3.pdf - This 325-page document provides the sources used for each of the coup events identified in this dataset. Please use the value in the coup_id variable to identify the sources used to identify that particular event. Revised February 2024 4. README.md - This file contains useful information for the user about the dataset. It is a text file written in markdown language. Revised February 2024
Citation Guidelines 1. To cite the codebook (or any other documentation associated with the Cline Center Coup d’État Project Dataset) please use the following citation: Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Scott Althaus. 2024. “Cline Center Coup d’État Project Dataset Codebook”. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.3. February 27. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V7 2. To cite data from the Cline Center Coup d’État Project Dataset please use the following citation (filling in the correct date of access): Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Emilio Soto. 2024. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.3. February 27. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V7
Microsoft Stock Data and Key Affiliated Companies
kaggle.com
zip
Updated Nov 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zongao Bian (2024). Microsoft Stock Data and Key Affiliated Companies [Dataset]. https://www.kaggle.com/datasets/zongaobian/microsoft-stock-data-and-key-affiliated-companies
Explore at:
zip(1453413 bytes)Available download formats
Dataset updated
Nov 3, 2024
Authors
Zongao Bian
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This dataset contains daily stock price data for Microsoft and several key companies that have significantly contributed to its growth and success. The dataset includes historical data from 1980 to 2024 for the following companies:

Microsoft (MSFT): The core company behind the dataset.

Intel (INTC): A vital partner in the PC revolution, providing processors for many Microsoft-powered devices.

IBM (IBM): Microsoft's early partnership with IBM, starting with MS-DOS, laid the foundation for Microsoft's dominance in operating systems.

Dell Technologies (DELL): Dell’s PCs pre-installed with Windows helped accelerate Microsoft’s growth in the consumer and enterprise markets.

Sony (SONY): A competitor in the gaming industry, Sony played a significant role in shaping Microsoft's strategy for its Xbox division.

Dataset Details:

Date Range: 1980-12-11 to 2024-10-31

Interval: Daily stock prices

Columns: Date, Open, High, Low, Close, Adjusted Close, Volume

This dataset is ideal for: - Financial analysis: Study stock price trends over time and compare performance across companies. - Time series forecasting: Predict future stock prices using historical data. - Market correlation analysis: Analyze the relationships between Microsoft and its key affiliated companies in different market conditions.

Feel free to use this dataset for your financial and stock market projects, analysis, or machine learning models!
m
Graphite//LFP synthetic V vs. Q dataset (>700,000 unique curves)
data.mendeley.com
narcis.nl
Updated Mar 12, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matthieu Dubarry (2021). Graphite//LFP synthetic V vs. Q dataset (>700,000 unique curves) [Dataset]. http://doi.org/10.17632/bs2j56pn7y.2
Explore at:
Unique identifier
https://doi.org/10.17632/bs2j56pn7y.2
Dataset updated
Mar 12, 2021
Authors
Matthieu Dubarry
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This training dataset was calculated using the mechanistic modeling approach. See “Big data training data for artificial intelligence-based Li-ion diagnosis and prognosis“ (Journal of Power Sources, Volume 479, 15 December 2020, 228806) and "Analysis of Synthetic Voltage vs. Capacity Datasets for Big Data Diagnosis and Prognosis" (Energies, under review) for more details

The V vs. Q dataset was compiled with a resolution of 0.01 for the triplets and C/25 charges. This accounts for more than 5,000 different paths. Each path was simulated with at most 0.85% increases for each The training dataset, therefore, contains more than 700,000 unique voltage vs. capacity curves.

4 Variables are included, see read me file for details and example how to use. Cell info: Contains information on the setup of the mechanistic model Qnorm: normalize capacity scale for all voltage curves pathinfo: index for simulated conditions for all voltage curves volt: voltage data. Each column corresponds to the voltage simulated under the conditions of the corresponding line in pathinfo.
a
Police Transparency - Calls for Service - All Data (Dataset)
safe-and-secure-communities-tempegov.hub.arcgis.com
data-academy.tempe.gov
+6more
Updated Mar 25, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tempe (2025). Police Transparency - Calls for Service - All Data (Dataset) [Dataset]. https://safe-and-secure-communities-tempegov.hub.arcgis.com/items/d2937ee4e83140559d94080237a6e84c
Explore at:
Dataset updated
Mar 25, 2025
Dataset authored and provided by
City of Tempe
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered

Description
The Calls for Service dataset includes police service requests for which patrol officers, traffic officers, bike officers and, on occasion, detectives will be dispatched to public safety response. It also includes self-initiated calls for service where an officer witnesses a violation or suspicious activity for which they would respond. This item represents a consolidated item of all records.Why the Datasets are Organized into Separate Layers In January of 2022, the Tempe Police Department completed a major transition in how crimes data is reported, moving from the FBI Uniform Crime Report program to the enhanced National-Incident Based Reporting System, or NIBRS. NIBRS is now the required reporting method for the FBI. The Uniform Crime Report (UCR) Program's traditional Summary Reporting System (SRS) was limited in comparison to NIBRS, which offers more detailed data collection that provides a deeper understanding of crime and its circumstances. NIBRS captures a wider range of details on crime incidents and can reflect separate offenses occurring during the same event, including information on victims, known offenders, relationships between victims and offenders, arrestees, and property involved in the crimes. With greater specificity in reporting offenses, NIBRS provides for more accurate and detailed crime-related information, and helps give context to specific crime issues while affording greater analytic capability of crime. Below is the link to Tempe-specific NIBRS reports. Use the drop-down filters to select Tempe PD, the year, and the type of report. Because of these differences, trends and numbers between the two systems should not be directly compared. That’s why we treat 2022 and later (NIBRS) separately from 2021 and earlier (UCR). To make the older data easier to browse, we grouped the data from 2021 and earlier into year ranges instead of showing it all at once. This helps with performance and loading speed due to the large count of records. For detailed guidance on interpreting calls for service data, as well as data scope and limitations, please refer to the User Guide.Data DictionaryAdditional InformationContact Email: PD_DataRequest@tempe.govContact Phone: N/ALink: N/AData Source: Versaterm Informix RMSData Source Type: Informix and/or SQL ServerPreparation Method: Automated processPublish Frequency: DailyPublish Method: Automatic
u
Amazon review data 2018
cseweb.ucsd.edu
nijianmo.github.io
+1more
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCSD CSE Research Project, Amazon review data 2018 [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2/
Explore at:
Dataset authored and provided by
UCSD CSE Research Project
Description
Context

This Dataset is an updated version of the Amazon review dataset released in 2014. As in the previous version, this dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). In addition, this version provides the following features:

More reviews:

The total number of reviews is 233.1 million (142.8 million in 2014).

New reviews:

Current data includes reviews in the range May 1996 - Oct 2018.

Metadata: - We have added transaction metadata for each review shown on the review page.

Added more detailed metadata of the product landing page.

Acknowledgements

If you publish articles based on this dataset, please cite the following paper:

Jianmo Ni, Jiacheng Li, Julian McAuley. Justifying recommendations using distantly-labeled reviews and fined-grained aspects. EMNLP, 2019.

Booking.com Datasets

brightdata.com

.json, .csv, .xlsx

Updated Nov 23, 2023

Facebook

Twitter

Click to copy link

Link copied

Cite

Bright Data (2023). Booking.com Datasets [Dataset]. https://brightdata.com/products/datasets/booking

Explore at:

.json, .csv, .xlsxAvailable download formats

Dataset updated

Nov 23, 2023

Dataset authored and provided by

Bright Data

License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered

Worldwide

Description

The Booking Hotel Listings Dataset provides a structured and in-depth view of accommodations worldwide, offering essential data for travel industry professionals, market analysts, and businesses. This dataset includes key details such as hotel names, locations, star ratings, pricing, availability, room configurations, amenities, guest reviews, sustainability features, and cancellation policies.

With this dataset, users can:

Analyze market trends to understand booking behaviors, pricing dynamics, and seasonal demand.
Enhance travel recommendations by identifying top-rated hotels based on reviews, location, and amenities.
Optimize pricing and revenue strategies by benchmarking property performance and availability patterns.
Assess guest satisfaction through sentiment analysis of ratings and reviews.
Evaluate sustainability efforts by examining eco-friendly features and certifications.

Designed for hospitality businesses, travel platforms, AI-powered recommendation engines, and pricing strategists, this dataset enables data-driven decision-making to improve customer experience and business performance.

Use Cases

Booking Hotel Listings in Greece
Gain insights into Greece’s diverse hospitality landscape, from luxury resorts in Santorini to boutique hotels in Athens. Analyze review scores, availability trends, and traveler preferences to refine booking strategies.

Booking Hotel Listings in Croatia
Explore hotel data across Croatia’s coastal and inland destinations, ideal for travel planners targeting visitors to Dubrovnik, Split, and Plitvice Lakes. This dataset includes review scores, pricing, and sustainability features.

Booking Hotel Listings with Review Scores Greater Than 9
A curated selection of high-rated hotels worldwide, ideal for luxury travel planners and market researchers focused on premium accommodations that consistently exceed guest expectations.

Booking Hotel Listings in France with More Than 1000 Reviews
Analyze well-established and highly reviewed hotels across France, ensuring reliable guest feedback for market insights and customer satisfaction benchmarking.

This dataset serves as an indispensable resource for travel analysts, hospitality businesses, and data-driven decision-makers, providing the intelligence needed to stay competitive in the ever-evolving travel industry.

Flickr Dataset
brightdata.com
.json, .csv, .xlsx
Updated May 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bright Data (2024). Flickr Dataset [Dataset]. https://brightdata.com/products/datasets/image/flickr
Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
May 9, 2024
Dataset authored and provided by
Bright Datahttps://brightdata.com/
License
https://brightdata.com/licensehttps://brightdata.com/license
Area covered
Worldwide
Description
We'll customize a Flickr dataset to align with your unique requirements, incorporating data on image categories, user comments, photographic trends, popular photographers, demographic insights, usage statistics, and other relevant metrics.

Leverage our Flickr datasets for various applications to strengthen strategic planning and market analysis. Examining these datasets enables organizations to understand visual content preferences and photography trends, facilitating refined image curation and marketing campaigns. Tailor your access to the complete dataset or specific subsets according to your business needs.

Popular use cases include optimizing image libraries based on user engagement, refining marketing strategies through targeted audience segmentation, and identifying and predicting trends to maintain a competitive edge in the visual content and digital media market.
d
Biodiversity by County - Distribution of Animals, Plants and Natural...
catalog.data.gov
datasets.ai
+2more
Updated Jul 12, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
State of New York (2025). Biodiversity by County - Distribution of Animals, Plants and Natural Communities [Dataset]. https://catalog.data.gov/dataset/biodiversity-by-county-distribution-of-animals-plants-and-natural-communities
Explore at:
Dataset updated
Jul 12, 2025
Dataset provided by
State of New York
Description
The NYS Department of Environmental Conservation (DEC) collects and maintains several datasets on the locations, distribution and status of species of plants and animals. Information on distribution by county from the following three databases was extracted and compiled into this dataset. First, the New York Natural Heritage Program biodiversity database: Rare animals, rare plants, and significant natural communities. Significant natural communities are rare or high-quality wetlands, forests, grasslands, ponds, streams, and other types of habitats. Next, the 2nd NYS Breeding Bird Atlas Project database: Birds documented as breeding during the atlas project from 2000-2005. And last, DEC’s NYS Reptile and Amphibian Database: Reptiles and amphibians; most records are from the NYS Amphibian & Reptile Atlas Project (Herp Atlas) from 1990-1999.
d
OpenFEMA Data Set Fields
catalog.data.gov
datasets.ai
+1more
Updated Jun 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FEMA/Mission Support/Off of Chf Information Officer (2025). OpenFEMA Data Set Fields [Dataset]. https://catalog.data.gov/dataset/openfema-data-set-fields
Explore at:
Dataset updated
Jun 7, 2025
Dataset provided by
FEMA/Mission Support/Off of Chf Information Officer
Description
Metadata for the OpenFEMA API data set fields. It contains descriptions, data types, and other attributes for each field.rnrnIf you have media inquiries about this dataset please email the FEMA News Desk FEMA-News-Desk@dhs.gov or call (202) 646-3272. For inquiries about FEMA's data and Open government program please contact the OpenFEMA team via email OpenFEMA@fema.dhs.gov.
C
Healthcare Payments Data Snapshot
data.chhs.ca.gov
data.ca.gov
+3more
csv, pdf, zip
Updated Nov 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Access and Information (2025). Healthcare Payments Data Snapshot [Dataset]. https://data.chhs.ca.gov/dataset/healthcare-payments-data-snapshot
Explore at:
zip, pdf(458278), csv(907195), csv(107962), csv(1023), pdf(218738), csv(769), pdf(245152), csv(4432152), csv(1003)Available download formats
Dataset updated
Nov 7, 2025
Dataset authored and provided by
Department of Health Care Access and Information
Description
This dataset contains data for the Healthcare Payments Data (HPD) Snapshot visualization. The Enrollment data file contains counts of claims and encounter data collected for California's statewide HPD Program. It includes counts of enrollment records, service records from medical and pharmacy claims, and the number of individuals represented across these records. Aggregate counts are grouped by payer type (Commercial, Medi-Cal, or Medicare), product type, and year. The Medical data file contains counts of medical procedures from medical claims and encounter data in HPD. Procedures are categorized using claim line procedure codes and grouped by year, type of setting (e.g., outpatient, laboratory, ambulance), and payer type. The Pharmacy data file contains counts of drug prescriptions from pharmacy claims and encounter data in HPD. Prescriptions are categorized by name and drug class using the reported National Drug Code (NDC) and grouped by year, payer type, and whether the drug dispensed is branded or a generic.
s
Pay and Display Meter Locations FCC - Dataset - data.smartdublin.ie
data.smartdublin.ie
Updated Mar 5, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Pay and Display Meter Locations FCC - Dataset - data.smartdublin.ie [Dataset]. https://data.smartdublin.ie/dataset/pay-and-display-meter-locations-fcc2
Explore at:
Dataset updated
Mar 5, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This data set contains mapping and table of the location of all Pay and Display Meters in Fingal County Councils pay and display system around the county of Fingal. We Currently have 146 Meters in use .e have Pay and Display in the following areas: Malahide, Skerries, Balbriggan, Swords, Rush and Clonsilla.Pay & Display operates from 8.00 am to 6.00 pm Monday to Saturday inclusive. However, this can vary for particular areas, check the signage at your location.Charges can vary for different areas for both on-street and car parks. However, the charge is usually €1.20 per hour or €3 per day in the long-term parking areas.Traffic Wardens ticket illegally parked vehicles in a Pay and Display area. It is your responsibility to ensure that a valid parking ticket/permit/disc is displayed and clearly visible on your vehicle.

Facebook

Twitter

Click to copy link

Link copied

Cite

U.S. EPA Office of Research and Development (ORD) (2025). A Dataset of Water Quality and Related Variables in U.S. Reservoirs [Dataset]. https://catalog.data.gov/dataset/a-dataset-of-water-quality-and-related-variables-in-u-s-reservoirs

A Dataset of Water Quality and Related Variables in U.S. Reservoirs

Explore at:

Dataset updated

Jun 13, 2025

Dataset provided by

United States Environmental Protection Agencyhttp://www.epa.gov/

Area covered

United States

Description

This dataset presents a rich collection of physicochemical parameters from 147 reservoirs distributed across the conterminous U.S. One hundred and eight of the reservoirs were selected using a statistical survey design and can provide unbiased inferences to the condition of all U.S. reservoirs. These data could be of interest to local water management specialists or those assessing the ecological condition of reservoirs at the national scale. These data have been reviewed in accordance with U.S. Environmental Protection Agency policy and approved for publication. This dataset is not publicly accessible because: It is too large. It can be accessed through the following means: https://portal-s.edirepository.org/nis/mapbrowse?scope=edi&identifier=2033&revision=1. Format: This dataset presents water quality and related variables for 147 reservoirs distributed across the U.S. Water quality parameters were measured during the summers of 2016, 2018, and 2020 – 2023. Measurements include nutrient concentration, algae abundance, dissolved oxygen concentration, and water temperature, among many others. Dataset includes links to other national and global scale data sets that provide additional variables.

Clear search

Close search

Google apps

Main menu

A Dataset of Water Quality and Related Variables in U.S. Reservoirs

Global Retail Sales Data: Orders, Reviews & Trends

Gratis, OH Population Breakdown by Gender Dataset: Male and Female...

About this dataset

Content

Inspiration

Recommended for further research

Hydrographic and Impairment Statistics Database: THRB

North American Dataset

MHS Dashboard Children and Youth Demographic Datasets

House Price Regression Dataset

Home Value Insights: A Beginner's Regression Dataset

Features:

Potential Uses:

Versatility:

Dataset for Kiawah Island, SC Census Bureau Demographics and Population...

About this dataset

Content

Inspiration

Covid-19 variants survival data

Overview:

Key Features:

Questions to Inspire Users:

Survival Analysis:

Epidemiological Analysis:

Data Science/Machine Learning:

Cline Center Coup d’État Project Dataset

Microsoft Stock Data and Key Affiliated Companies

Dataset Details:

Graphite//LFP synthetic V vs. Q dataset (>700,000 unique curves)

Police Transparency - Calls for Service - All Data (Dataset)

Amazon review data 2018

Context

Acknowledgements

Booking.com Datasets

Flickr Dataset

Biodiversity by County - Distribution of Animals, Plants and Natural...

OpenFEMA Data Set Fields

Healthcare Payments Data Snapshot

Pay and Display Meter Locations FCC - Dataset - data.smartdublin.ie

A Dataset of Water Quality and Related Variables in U.S. Reservoirs