https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Exploring Online Sales Data with Power BI !!
Another productive day diving into online sales dataset! Here’s a roundup of the insights I uncovered today:
Revenue by Category: Analyzed revenue distribution across different product categories to identify high-performing sectors.
Revenue by Sub-Category: Drilled down into sub-categories for a more granular view of revenue streams.
Revenue by Payment Mode: Examined revenue patterns based on payment methods to understand customer preferences.
Revenue by State: Mapped out revenue by state to pinpoint geographical strengths and opportunities.
Profit by Category: Evaluated profitability across product categories to assess which categories yield the highest profit margins.
Profit by Sub-Category: Explored profit levels at a sub-category level to identify the most profitable segments.
Profit by Payment Mode: Analyzed profit distribution across different payment methods.
Top 5 States by Revenue and Profit: Highlighted the top 5 states driving the most revenue and profit, offering insights into regional performance.
Sales Map by State: Visualized sales data on a map to provide a geographical perspective on sales distribution.
Total Quantity, Revenue, and Profit: Aggregated data to give an overview of total quantities sold, overall revenue, and total profit.
Filter by Category: Added a filter functionality to focus on specific categories and refine data analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
From 2016 to 2018, we surveyed the world’s largest natural history museum collections to begin mapping this globally distributed scientific infrastructure. The resulting dataset includes 73 institutions across the globe. It has:
Basic institution data for the 73 contributing institutions, including estimated total collection sizes, geographic locations (to the city) and latitude/longitude, and Research Organization Registry (ROR) identifiers where available.
Resourcing information, covering the numbers of research, collections and volunteer staff in each institution.
Indicators of the presence and size of collections within each institution broken down into a grid of 19 collection disciplines and 16 geographic regions.
Measures of the depth and breadth of individual researcher experience across the same disciplines and geographic regions.
This dataset contains the data (raw and processed) collected for the survey, and specifications for the schema used to store the data. It includes:
The global collections data may also be accessed at https://rebrand.ly/global-collections. This is a preliminary dashboard, constructed and published using Microsoft Power BI, that enables the exploration of the data through a set of visualisations and filters. The dashboard consists of three pages:
Institutional profile: Enables the selection of a specific institution and provides summary information on the institution and its location, staffing, total collection size, collection breakdown and researcher expertise.
Overall heatmap: Supports an interactive exploration of the global picture, including a heatmap of collection distribution across the discipline and geographic categories, and visualisations that demonstrate the relative breadth of collections across institutions and correlations between collection size and breadth. Various filters allow the focus to be refined to specific regions and collection sizes.
Browse: Provides some alternative methods of filtering and visualising the global dataset to look at patterns in the distribution and size of different types of collections across the global view.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This Kaggle dataset comes from an output dataset that powers my March Madness Data Analysis dashboard in Domo. - Click here to view this dashboard: Dashboard Link - Click here to view this dashboard features in a Domo blog post: Hoops, Data, and Madness: Unveiling the Ultimate NCAA Dashboard
This dataset offers one the most robust resource you will find to discover key insights through data science and data analytics using historical NCAA Division 1 men's basketball data. This data, sourced from KenPom, goes as far back as 2002 and is updated with the latest 2025 data. This dataset is meticulously structured to provide every piece of information that I could pull from this site as an open-source tool for analysis for March Madness.
Key features of the dataset include: - Historical Data: Provides all historical KenPom data from 2002 to 2025 from the Efficiency, Four Factors (Offense & Defense), Point Distribution, Height/Experience, and Misc. Team Stats endpoints from KenPom's website. Please note that the Height/Experience data only goes as far back as 2007, but every other source contains data from 2002 onward. - Data Granularity: This dataset features an individual line item for every NCAA Division 1 men's basketball team in every season that contains every KenPom metric that you can possibly think of. This dataset has the ability to serve as a single source of truth for your March Madness analysis and provide you with the granularity necessary to perform any type of analysis you can think of. - 2025 Tournament Insights: Contains all seed and region information for the 2025 NCAA March Madness tournament. Please note that I will continually update this dataset with the seed and region information for previous tournaments as I continue to work on this dataset.
These datasets were created by downloading the raw CSV files for each season for the various sections on KenPom's website (Efficiency, Offense, Defense, Point Distribution, Summary, Miscellaneous Team Stats, and Height). All of these raw files were uploaded to Domo and imported into a dataflow using Domo's Magic ETL. In these dataflows, all of the column headers for each of the previous seasons are standardized to the current 2025 naming structure so all of the historical data can be viewed under the exact same field names. All of these cleaned datasets are then appended together, and some additional clean up takes place before ultimately creating the intermediate (INT) datasets that are uploaded to this Kaggle dataset. Once all of the INT datasets were created, I joined all of the tables together on the team name and season so all of these different metrics can be viewed under one single view. From there, I joined an NCAAM Conference & ESPN Team Name Mapping table to add a conference field in its full length and respective acronyms they are known by as well as the team name that ESPN currently uses. Please note that this reference table is an aggregated view of all of the different conferences a team has been a part of since 2002 and the different team names that KenPom has used historically, so this mapping table is necessary to map all of the teams properly and differentiate the historical conferences from their current conferences. From there, I join a reference table that includes all of the current NCAAM coaches and their active coaching lengths because the active current coaching length typically correlates to a team's success in the March Madness tournament. I also join another reference table to include the historical post-season tournament teams in the March Madness, NIT, CBI, and CIT tournaments, and I join another reference table to differentiate the teams who were ranked in the top 12 in the AP Top 25 during week 6 of the respective NCAA season. After some additional data clean-up, all of this cleaned data exports into the "DEV _ March Madness" file that contains the consolidated view of all of this data.
This dataset provides users with the flexibility to export data for further analysis in platforms such as Domo, Power BI, Tableau, Excel, and more. This dataset is designed for users who wish to conduct their own analysis, develop predictive models, or simply gain a deeper understanding of the intricacies that result in the excitement that Division 1 men's college basketball provides every year in March. Whether you are using this dataset for academic research, personal interest, or professional interest, I hope this dataset serves as a foundational tool for exploring the vast landscape of college basketball's most riveting and anticipated event of its season.
Company Datasets for valuable business insights!
Discover new business prospects, identify investment opportunities, track competitor performance, and streamline your sales efforts with comprehensive Company Datasets.
These datasets are sourced from top industry providers, ensuring you have access to high-quality information:
We provide fresh and ready-to-use company data, eliminating the need for complex scraping and parsing. Our data includes crucial details such as:
You can choose your preferred data delivery method, including various storage options, delivery frequency, and input/output formats.
Receive datasets in CSV, JSON, and other formats, with storage options like AWS S3 and Google Cloud Storage. Opt for one-time, monthly, quarterly, or bi-annual data delivery.
With Oxylabs Datasets, you can count on:
Pricing Options:
Standard Datasets: choose from various ready-to-use datasets with standardized data schemas, priced from $1,000/month.
Custom Datasets: Tailor datasets from any public web domain to your unique business needs. Contact our sales team for custom pricing.
Experience a seamless journey with Oxylabs:
Unlock the power of data with Oxylabs' Company Datasets and supercharge your business insights today!
By City of Chicago [source]
This dataset is a compilation of reported crimes that have taken place in the City of Chicago over the past year, and provides an invaluable insight into the criminal activity occurring within our city. Featuring more than 65,000 records of data, it contains information on the date of each incident, its location (down to the block level), type of crime committed (determined by FBI Crime Classification Codes) and whether or not an arrest has been made in connection with each crime. As this dataset reveals detailed information on crime incidents which may lead to personal identification, addresses are masked beyond block level and specific locations are not disclosed.
For additional questions regarding this dataset, please do not hesitate to reach out to The Research & Development Division at 312.745.6071 or RandDchicagopolice.com who will be more than happy to help answer any inquiries you may have about our data findings! All visualized maps should be considered approximate however—it is prohibited for any attempts to derive specific addresses from them as accuracy cannot be guaranteed with regards to mechanical or human error when collecting this data over time. So come join us as we explore a year's worth of criminal activities throughout Chicago!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This guide will provide an overview on how to use this dataset to analyze patterns or draw conclusions about crime incidents in and around Chicago.
Secondly, become familiar with columns names which appear at top most row of your opened file which helps you understand what kind of data is stored at each column such as - CASE# - Unique identifier for the crime incident., DATE OF OCCURRENCE - Date when crime incident occurred , BLOCK - Block where event took place , LOCATION DESCRIPTION- Description of location where incident happened . Through these columns name you can easily recognize what kind of data exists within that record/row. That’s why it’s important to get familiar with them first before diving into raw datasets because they’ll help make exploring and understanding large sets easier later on when we go further into illustrating charts & graphs using programs such as Tableau & Power BI or even spreadsheets (Excel). After understanding column names its time to explore further by digging deeper into each record/row and apply filters if required e.g below $100 value will show only those rows having value less than 100 thus it will filter entire dataset according to your requirement. Lastly analyse collected datasets either Visually through plotting graphs with help tableau software OR By using Mathematical mathematical equations based on research questions such as finding out average values after applying sum/avg functions from respective cells etc
- Creating a visualization mapping tool to help visualize the types of crimes and their locations over time within Chicago.
- An analysis tool for city officials or police departments so they can understand correlations between crime type, geography, and other factors like weather changes or economic downturns in order to develop long-term plans for crime prevention.
- Developing an AI model that would be able to predict what areas may be more vulnerable for certain types of crimes or even predict crimes ahead of time based on the data from this dataset
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: crimes-one-year-prior-to-present-1.csv | Column name | Description | |:-------------------------|:------------------------------------------------------------------------------| | CASE# | Unique identifier for each crime incident (String) | | BLOCK | Block where the crime incident occurred (String) | | LOCATION DESCRIPTION | Description of where an incident took place (String) | | ARREST | Indicates if an arrest was made in connection with a crime incident (Boolean) | | DOMESTIC | Indicates if a reported incident is domestic related (Boolean) | | BEAT ...
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Diabetes Analytics Dashboard – Power BI 🩺📊 This practice dashboard is built for Data Analytics, Data Visualization, and Data Science learning. It provides meaningful insights into diabetes risk factors using interactive visuals and advanced analytics.
🔹 Key Metrics – Total patients, BMI, glucose, blood pressure, and insulin levels. 🔹 Diabetes Risk Segmentation – Categorized into High, Medium, and Low risk groups. 🔹 Trends & Distribution – Glucose vs. Age, BMI categories, and Blood Pressure analysis. 🔹 Correlation Analysis – Exploring the relationships between glucose, BMI, and diabetes risk. 🔹 Gauge & Pie Charts – Visualizing risk percentage, BMI distribution, and glucose levels. 🔹 Interactive Filters & Drilldowns – Allowing deeper exploration of specific patient groups. 🔹 Predictive Insights – Identifying potential risk patterns through visual analytics.
This project helps in understanding data-driven healthcare insights using Power BI. Thanks to Kaggle for the dataset!
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The October 2017 dataset of the IMMAP-UNHCR Protection Movement Initiative for South Syria. Explore the PMI Power BI dashboard: https://app.powerbi.com/view?r=eyJrIjoiMjYyYjE5NWUtZTdmYi00ZDZhLTg2N2UtMDg1MzUxMWIxZDA2IiwidCI6ImY2ZjcwZjFiLTJhMmQtNGYzMC04NTJhLTY0YjhjZTBjMTlkNyIsImMiOjF9
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
A date table for use with LG Inform data. This table converts the LG Inform periodIdentifier or periodLabel into a date (DD/MM/YYYY based on the period end date) for use by date/time intelligence aware applications such as Excel, Power BI or Tableau.
The ckanext-power-bi extension for CKAN integrates Power BI reports into CKAN resources. It introduces a new "Power BI" resource view, allowing users to embed and view Power BI reports directly within CKAN. The extension is designed to generate embed tokens with "View" permissions only, restricting interaction to viewing existing report bookmarks without edit capabilities. Key Features: Power BI Report Embedding: Enables embedding Power BI reports into CKAN resources, providing an interactive data visualization experience for CKAN users. View-Only Permissions: Generates embed tokens with "View" permissions, ensuring users can only view and interact with pre-existing report bookmarks and not modify the reports themselves. This means features such as editing are disabled and the experience is limited to viewing. Workspace ID Configuration: Requires the Power BI Workspace ID (Group ID) to correctly connect and display the desired reports. Optional Organization Name Configuration: Allows specifying the Azure organization (tenant) name, intended for possible future Power BI API enhancements (currently unused). i18n Support: Supports Power BI's Multiple-Language Reports feature, allowing the appropriate language to be displayed based on the user's CKAN locale. Provides configurations to facilitate the use of alternate i18n methods if internal translation is needed. MSI Authentication: Leverages ManagedIdentityCredential (MSI) to authenticate with Azure, simplifying authentication in Azure environments using system-assigned managed identities. Technical Integration: The extension integrates into CKAN by adding a new resource view type. It requires configuration settings in CKAN's config file (.ini) to specify the Power BI Workspace ID and optionally the organization name, as well as enabling the plugin in the ckan.plugins setting. It utilizes the Azure Identity library to handle authentication. Benefits & Impact: By integrating Power BI reports directly into CKAN, this extension enhances data accessibility and usability. Users can view and interact with data visualizations without leaving the CKAN environment, fostering a more seamless data exploration experience.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
A weighted matrix to convert fingertips GP practice code data into Birmingham and Solihull wards.The matrix uses the GP census data to allocate a percentage of their patients to each ward based on the citizens home address.How to useUsing a BI tool such as Power Query merge this with your GP Fingertips data based on the practice code.Expand the merged query to new rowsCalculate the new count value using following formula: Count * Weighted ValueCalculate the new denominator value using the following formula: Denominator * Weighted ValueGroup the Fingertips data by Indicator ID, Indicator name, Local authority. Ward code, Ward, Time period, Sex, Age with the aggregation of SUM on the New count and New denominator columnsCalculate the new value with the following formula: new count / new denominator *100Useful informationThe matrix is based on a recent GP census. It is therefore advisable to only work with the latest Fingertips data as historical information will be inaccurate.
This Power BI dashboard shows the COVID-19 vaccination rate by key demographics including age groups, race and ethnicity, and sex for Tempe zip codes.Data Source: Maricopa County GIS Open Data weekly count of COVID-19 vaccinations. The data were reformatted from the source data to accommodate dashboard configuration. The Maricopa County Department of Public Health (MCDPH) releases the COVID-19 vaccination data for each zip code and city in Maricopa County at ~12:00 PM weekly on Wednesdays via the Maricopa County GIS Open Data website (https://data-maricopa.opendata.arcgis.com/). More information about the data is available on the Maricopa County COVID-19 Vaccine Data page (https://www.maricopa.gov/5671/Public-Vaccine-Data#dashboard). The dashboard’s values are refreshed at 3:00 PM weekly on Wednesdays. The most recent date included on the dashboard is available by hovering over the last point on the right-hand side of each chart. Please note that the times when the Maricopa County Department of Public Health (MCDPH) releases weekly data for COVID-19 vaccines may vary. If data are not released by the time of the scheduled dashboard refresh, the values may appear on the dashboard with the next data release, which may be one or more days after the last scheduled release.Dates: Updated data shows publishing dates which represents values from the previous calendar week (Sunday through Saturday). For more details on data reporting, please see the Maricopa County COVID-19 data reporting notes at https://www.maricopa.gov/5460/Coronavirus-Disease-2019.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
📌**Context**
The Healthcare Workforce Mental Health Dataset is designed to explore workplace mental health challenges in the healthcare industry, an environment known for high stress and burnout rates.
This dataset enables users to analyze key trends related to:
đź’ Workplace Stressors: Examining the impact of heavy workloads, poor work environments, and emotional demands.
đź’ Mental Health Outcomes: Understanding how stress and burnout influence job satisfaction, absenteeism, and turnover intention.
đź’ Educational & Analytical Applications: A valuable resource for data analysts, students, and career changers looking to practice skills in data exploration and data visualization.
To help users gain deeper insights, this dataset is fully compatible with a Power BI Dashboard, available as part of a complete analytics bundle for enhanced visualization and reporting.
📌**Source**
This dataset was synthetically generated using the following methods:
đź’ Python & Data Science Techniques: Probabilistic modeling to simulate realistic data distributions. Industry-informed variable relationships based on healthcare workforce studies.
đź’ Guidance & Validation Using AI (ChatGPT): Assisted in refining dataset realism and logical mappings.
đź’ Industry Research & Reports: Based on insights from WHO, CDC, OSHA, and academic studies on workplace stress and mental health in healthcare settings.
📌**Inspiration**
This dataset was inspired by ongoing discussions in healthcare regarding burnout, mental health, and staff retention. The goal is to bridge the gap between raw data and actionable insights by providing a structured, analyst-friendly dataset.
For those who want a ready-to-use reporting solution, a Power BI Dashboard Template is available, designed for interactive data exploration, workforce insights, and stress factor analysis.
📌**Important Note** This dataset is synthetic and intended for educational purposes only. It is not real-world employee data and should not be used for actual decision-making or policy implementation.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This file contains the digital vector boundaries for Regions in England as at December 2024.
The boundaries available are: Generalised Clipped (BGC) - Generalised to 20m and clipped to the coastline (Mean High Water mark) and more generalised than the BFE boundaries.
Contains both Ordnance Survey and ONS Intellectual Property Rights.TopoJSON Shapefile for Power BIOn the Export tab you will find a file under the Alternative exports. This file is in TopoJSON format and is ready for use in compatible visualisation tools such as Power BI or Mapbox.
Use our https://app.powerbi.com/view?r=eyJrIjoiMDQ1MmRlMjEtMThlMy00MWIxLThmNTEtMzU4M2I5ODNmYTJlIiwidCI6ImJmMzQ2ODEwLTljN2QtNDNkZS1hODcyLTI0YTJlZjM5OTVhOCJ9" class="govuk-link">interactive dashboard to explore the data.
For queries please contact planning.statistics@communities.gov.uk.
<p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute"><abbr title="OpenDocument Spreadsheet" class="gem-c-attachment_abbr">ODS</abbr></span>, <span class="gem-c-attachment_attribute">250 KB</span></p>
<p class="gem-c-attachment_metadata">
This file is in an <a href="https://www.gov.uk/guidance/using-open-document-formats-odf-in-your-organisation" target="_self" class="govuk-link">OpenDocument</a> format
Local authority level statistics from table P124A are available in fully open and linkable data formats at http://opendatacommunities.org/def/concept/folders/themes/planning" class="govuk-link">Open Data Communities.
<p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute"><abbr title="OpenDocument Spreadsheet" class="gem-c-attachment_abbr">ODS</abbr></span>, <span class="gem-c-attachment_attribute">904 KB</span></p>
<p class="gem-c-attachment_metadata">
This file is in an <a href="https://www.gov.uk/guidance/using-open-document-formats-odf-in-your-organisation" target="_self" class="govuk-link">OpenDocument</a> format
This dataset contains the monthly historical data of the S&P 500 index from January 1901 to May 2025, collected from Investing.com. The S&P 500 is a stock market index that tracks the performance of 500 large companies listed on stock exchanges in the United States.
It is widely used as a benchmark for the U.S. equity market, representing over 80% of the total market capitalization. This dataset is suitable for:
Column | Description |
---|---|
Date | Monthly date in MM-DD-YY format (e.g., 01-01-24 = Jan 2024) |
Price | Closing price of the S&P 500 for the month |
Open | Opening price of the index for the month |
High | Highest price during the month |
Low | Lowest price during the month |
Change % | Percentage change from previous month’s close |
Data source: Investing.com
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is a publication on maternity activity in English NHS hospitals. This report examines data relating to delivery and birth episodes in 2022-23, and the booking appointments for these deliveries. This annual publication covers the financial year ending March 2023. Data is included from both the Hospital Episodes Statistics (HES) data warehouse and the Maternity Services Data Set (MSDS). HES contains records of all admissions, appointments and attendances for patients admitted to NHS hospitals in England. The HES data used in this publication are called 'delivery episodes'. The MSDS collects records of each stage of the maternity service care pathway in NHS-funded maternity services, and includes information not recorded in HES. The MSDS is a maturing, national-level dataset. In April 2019 the MSDS transitioned to a new version of the dataset. This version, MSDS v2.0, is an update that introduced a new structure and content - including clinical terminology, in order to meet current clinical practice and incorporate new requirements. It is designed to meet requirements that resulted from the National Maternity Review, which led to the publication of the Better Births report in February 2016. This is the fourth publication of data from MSDS v2.0 and data from 2019-20 onwards is not directly comparable to data from previous years. This publication shows the number of HES delivery episodes during the period, with a number of breakdowns including by method of onset of labour, delivery method and place of delivery. It also shows the number of MSDS deliveries recorded during the period, with breakdowns including the baby's first feed type, birthweight, place of birth, and breastfeeding activity; and the mothers' ethnicity and age at booking. There is also data available in a separate file on breastfeeding at 6 to 8 weeks. The count of Total Babies includes both live and still births, and previous changes to how Total Babies and Total Deliveries were calculated means that comparisons between 2019-20 MSDS data and later years should be made with care. Information on how all measures are constructed can be found in the HES Metadata and MSDS Metadata files provided below. In this publication we have also included an interactive Power BI dashboard to enable users to explore key NHS Maternity Statistics measures. The purpose of this publication is to inform and support strategic and policy-led processes for the benefit of patient care. This report will also be of interest to researchers, journalists and members of the public interested in NHS hospital activity in England. Any feedback on this publication or dashboard can be provided to enquiries@nhsdigital.nhs.uk, under the subject “NHS Maternity Statistics”.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
📝 Dataset Overview: This dataset represents real-world, enhanced transactional data from Timac Global Concept, one of Nigeria’s prominent players in fuel and petroleum distribution. It includes comprehensive sales records across multiple stations and product categories (AGO, PMS, Diesel, Lubricants, LPG), along with revenue and shift-based operational tracking.
The dataset is ideal for analysts, BI professionals, and data science students aiming to explore fuel economy trends, pricing dynamics, and operational analytics.
🔍 Dataset Features: Column Name Description Date Transaction date Station_Name Name of the fuel station AGO_Sales (L) Automotive Gas Oil sold in liters PMS_Sales (L) Premium Motor Spirit sold in liters Lubricant_Sales (L) Lubricant sales in liters Diesel_Sales (L) Diesel sold in liters LPG_Sales (kg) Liquefied Petroleum Gas sold in kilograms Total_Revenue (₦) Total revenue generated in Nigerian Naira AGO_Price Price per liter of AGO PMS_Price Price per liter of PMS Lubricant_Price Unit price of lubricants Diesel_Price Price per liter of diesel LPG_Price Price per kg of LPG Product_Category Fuel product type Shift Work shift (e.g., Morning, Night) Supervisor Supervisor in charge during shift Weekday Day of the week for each transaction
🎯 Use Cases: Build Power BI dashboards to track fuel sales trends and shifts
Perform revenue forecasting using time series models
Analyze price dynamics vs sales volume
Visualize station-wise performance and weekday sales patterns
Conduct operational audits per supervisor or shift
đź§° Best Tools for Analysis: Power BI, Tableau
Python (Pandas, Matplotlib, Plotly)
Excel for pivot tables and summaries
SQL for fuel category insights
👤 Created By: Fatolu Peter (Emperor Analytics) Data analyst focused on real-life data transformation in Nigeria’s petroleum, healthcare, and retail sectors. This is Project 11 in my growing portfolio of end-to-end analytics challenges.
✅ LinkedIn Post: ⛽ New Dataset Alert – Fuel Economy & Sales Data Now on Kaggle! 📊 Timac Fuel Distribution & Revenue Dataset (Nigeria – 500 Records) 🔗 Explore the data here
Looking to practice business analytics, revenue forecasting, or operational dashboards?
This dataset contains:
Daily sales of AGO, PMS, Diesel, LPG & Lubricants
Revenue breakdowns by station
Shift & supervisor tracking
Fuel prices across product categories
You can use this to: âś… Build Power BI sales dashboards âś… Create fuel trend visualizations âś… Analyze shift-level profitability âś… Forecast revenue using Python or Excel
Let’s put real Nigerian data to real analytical work. Tag me when you build with it—I’d love to celebrate your work!
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is a publication on maternity activity in English NHS hospitals. This report examines data relating to delivery and birth episodes in 2023-24, and the booking appointments for these deliveries. This annual publication covers the financial year ending March 2024. Data is included from both the Hospital Episodes Statistics (HES) data warehouse and the Maternity Services Data Set (MSDS). HES contains records of all admissions, appointments and attendances for patients admitted to NHS hospitals in England. The HES data used in this publication are called 'delivery episodes'. The MSDS collects records of each stage of the maternity service care pathway in NHS-funded maternity services, and includes information not recorded in HES. The MSDS is a maturing, national-level dataset. In April 2019, the MSDS transitioned to a new version of the dataset. This version, MSDS v2.0, is an update that introduced a new structure and content - including clinical terminology, in order to meet current clinical practice and incorporate new requirements. It is designed to meet requirements that resulted from the National Maternity Review, which led to the publication of the Better Births report in February 2016. This is the fifth publication of data from MSDS v2.0 and data from 2019-20 onwards is not directly comparable to data from previous years. This publication shows the number of HES delivery episodes during the period, with a number of breakdowns including by method of onset of labour, delivery method and place of delivery. It also shows the number of MSDS deliveries recorded during the period, with a breakdown for the mother's smoking status at the booking appointment by age group. It also provides counts of live born term babies with breakdowns for the general condition of newborns (via Apgar scores), skin-to-skin contact and baby's first feed type - all immediately after birth. There is also data available in a separate file on breastfeeding at 6 to 8 weeks. For the first time information on 'Smoking at Time of Delivery' has been presented using annual data from the MSDS. This includes national data broken down by maternal age, ethnicity and deprivation. From 2025/2026, MSDS will become the official source of 'Smoking at Time of Delivery' information and will replace the historic 'Smoking at Time of Delivery' data which is to become retired. We are currently undergoing dual collection and reporting on a quarterly basis for 2024/25 to help users compare information from the two sources. We are working with data submitters to help reconcile any discrepancies at a local level before any close down activities begin. A link to the dual reporting in the SATOD publication series can be found in the links below. Information on how all measures are constructed can be found in the HES Metadata and MSDS Metadata files provided below. In this publication we have also included an interactive Power BI dashboard to enable users to explore key NHS Maternity Statistics measures. The purpose of this publication is to inform and support strategic and policy-led processes for the benefit of patient care. This report will also be of interest to researchers, journalists and members of the public interested in NHS hospital activity in England. Any feedback on this publication or dashboard can be provided to enquiries@nhsdigital.nhs.uk, under the subject “NHS Maternity Statistics”.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset offers a comprehensive historical record of Netflix’s stock price movements, capturing the company’s financial journey from its early days to its position as a global streaming giant.
From its IPO in May 2002, Netflix (Ticker: NFLX) has transformed from a DVD rental service to a powerhouse in on-demand digital content. With its disruptive innovation, strategic shifts, and global expansion, Netflix has seen dramatic shifts in stock prices, reflecting not just market trends but also cultural impact. This dataset provides a window into that evolution.
Each row in this dataset represents daily trading activity on the stock market and includes the following columns:
The data is structured in CSV format and is clean, easy to use, and ready for immediate analysis.
Whether you're learning data science, building a financial model, or exploring machine learning in the real world, this dataset is a goldmine of insights. Netflix's market history includes:
This makes the dataset ideal for:
This dataset is designed for:
The dataset is derived from publicly available historical stock price data, such as Yahoo Finance, and has been cleaned and organized for educational and research purposes. It is continuously maintained to ensure accuracy.
Netflix’s rise is more than just a business story — it’s a data-driven journey. With this dataset, you can analyze the company’s stock behavior, train models to predict future trends, or simply visualize how tech reshapes the market.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset contains 10,000 synthetic records simulating the migratory behavior of various bird species across global regions. Each entry represents a single bird tagged with a tracking device and includes detailed information such as flight distance, speed, altitude, weather conditions, tagging information, and migration outcomes.
The data was entirely synthetically generated using randomized yet realistic values based on known ranges from ornithological studies. It is ideal for practicing data analysis and visualization techniques without privacy concerns or real-world data access restrictions. Because it’s artificial, the dataset can be freely used in education, portfolio projects, demo dashboards, machine learning pipelines, or business intelligence training.
With over 40 columns, this dataset supports a wide array of analysis types. Analysts can explore questions like “Do certain species migrate in larger flocks?”, “How does weather impact nesting success?”, or “What conditions lead to migration interruptions?”. Users can also perform geospatial mapping of start and end locations, cluster birds by behavior, or build time series models based on migration months and environmental factors.
For data visualization, tools like Power BI, Python (Matplotlib/Seaborn/Plotly), or Excel can be used to create insightful dashboards and interactive charts.
Join the Fabric Community DataViz Contest | May 2025: https://community.fabric.microsoft.com/t5/Power-BI-Community-Blog/%EF%B8%8F-Fabric-Community-DataViz-Contest-May-2025/ba-p/4668560
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Exploring Online Sales Data with Power BI !!
Another productive day diving into online sales dataset! Here’s a roundup of the insights I uncovered today:
Revenue by Category: Analyzed revenue distribution across different product categories to identify high-performing sectors.
Revenue by Sub-Category: Drilled down into sub-categories for a more granular view of revenue streams.
Revenue by Payment Mode: Examined revenue patterns based on payment methods to understand customer preferences.
Revenue by State: Mapped out revenue by state to pinpoint geographical strengths and opportunities.
Profit by Category: Evaluated profitability across product categories to assess which categories yield the highest profit margins.
Profit by Sub-Category: Explored profit levels at a sub-category level to identify the most profitable segments.
Profit by Payment Mode: Analyzed profit distribution across different payment methods.
Top 5 States by Revenue and Profit: Highlighted the top 5 states driving the most revenue and profit, offering insights into regional performance.
Sales Map by State: Visualized sales data on a map to provide a geographical perspective on sales distribution.
Total Quantity, Revenue, and Profit: Aggregated data to give an overview of total quantities sold, overall revenue, and total profit.
Filter by Category: Added a filter functionality to focus on specific categories and refine data analysis.