100+ datasets found
  1. Measuring quality of routine primary care data

    • data.niaid.nih.gov
    • datasetcatalog.nlm.nih.gov
    • +1 more
    zip
    Updated Mar 12, 2021
    Cite
    Olga Kostopoulou; Brendan Delaney (2021). Measuring quality of routine primary care data [Dataset]. http://doi.org/10.5061/dryad.dncjsxkzh
    Available download formats: zip
    Dataset updated
    Mar 12, 2021
    Dataset provided by
    Imperial College London
    Authors
    Olga Kostopoulou; Brendan Delaney
    License

    https://spdx.org/licenses/CC0-1.0.html

    Description

    Objective: Routine primary care data may be used for the derivation of clinical prediction rules and risk scores. We sought to measure the impact of a decision support system (DSS) on data completeness and freedom from bias.

    Materials and Methods: We used the clinical documentation of 34 UK General Practitioners who took part in a previous study evaluating the DSS. They consulted with 12 standardized patients. In addition to suggesting diagnoses, the DSS facilitates data coding. We compared the documentation from consultations with the electronic health record (EHR) (baseline consultations) vs. consultations with the EHR-integrated DSS (supported consultations). We measured the proportion of EHR data items related to the physician’s final diagnosis. We expected that in baseline consultations, physicians would document only or predominantly observations related to their diagnosis, while in supported consultations, they would also document other observations as a result of exploring more diagnoses and/or ease of coding.

    Results: Supported documentation contained significantly more codes (IRR=5.76 [4.31, 7.70] P<0.001) and less free text (IRR = 0.32 [0.27, 0.40] P<0.001) than baseline documentation. As expected, the proportion of diagnosis-related data was significantly lower (b=-0.08 [-0.11, -0.05] P<0.001) in the supported consultations, and this was the case for both codes and free text.

    Conclusions: We provide evidence that data entry in the EHR is incomplete and reflects physicians’ cognitive biases. This has serious implications for epidemiological research that uses routine data. A DSS that facilitates and motivates data entry during the consultation can improve routine documentation.

  2. Data quality scale applied to the assessment of each measure in the FGT.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Jul 1, 2021
    Cite
    Silver, Martha; Trumble, Robert; Recchia, Cheri A.; Stevens, Kara; Swasey, Jill H.; Parkes, Graeme; Iudicello, Suzanne (2021). Data quality scale applied to the assessment of each measure in the FGT. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000755084
    Dataset updated
    Jul 1, 2021
    Authors
    Silver, Martha; Trumble, Robert; Recchia, Cheri A.; Stevens, Kara; Swasey, Jill H.; Parkes, Graeme; Iudicello, Suzanne
    Description

    Data quality scale applied to the assessment of each measure in the FGT.

  3. The impact of routine data quality assessments on electronic medical record...

    • plos.figshare.com
    pdf
    Updated Jun 1, 2023
    Cite
    Veronica Muthee; Aaron F. Bochner; Allison Osterman; Nzisa Liku; Willis Akhwale; James Kwach; Mehta Prachi; Joyce Wamicwe; Jacob Odhiambo; Fredrick Onyango; Nancy Puttkammer (2023). The impact of routine data quality assessments on electronic medical record data quality in Kenya [Dataset]. http://doi.org/10.1371/journal.pone.0195362
    Available download formats: pdf
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS (http://plos.org/)
    Authors
    Veronica Muthee; Aaron F. Bochner; Allison Osterman; Nzisa Liku; Willis Akhwale; James Kwach; Mehta Prachi; Joyce Wamicwe; Jacob Odhiambo; Fredrick Onyango; Nancy Puttkammer
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Kenya
    Description

    Background: Routine Data Quality Assessments (RDQAs) were developed to measure and improve facility-level electronic medical record (EMR) data quality. We assessed if RDQAs were associated with improvements in data quality in KenyaEMR, an HIV care and treatment EMR used at 341 facilities in Kenya.

    Methods: RDQAs assess data quality by comparing information recorded in paper records to KenyaEMR. RDQAs are conducted during a one-day site visit, where approximately 100 records are randomly selected and 24 data elements are reviewed to assess data completeness and concordance. Results are immediately provided to facility staff and action plans are developed for data quality improvement. For facilities that had received more than one RDQA (baseline and follow-up), we used generalized estimating equation models to determine if data completeness or concordance improved from the baseline to the follow-up RDQAs.

    Results: 27 facilities received two RDQAs and were included in the analysis, with 2369 and 2355 records reviewed from baseline and follow-up RDQAs, respectively. The frequency of missing data in KenyaEMR declined from the baseline (31% missing) to the follow-up (13% missing) RDQAs. After adjusting for facility characteristics, records from follow-up RDQAs had 0.43-times the risk (95% CI: 0.32–0.58) of having at least one missing value among nine required data elements compared to records from baseline RDQAs. Using a scale with one point awarded for each of 20 data elements with concordant values in paper records and KenyaEMR, we found that data concordance improved from baseline (11.9/20) to follow-up (13.6/20) RDQAs, with the mean concordance score increasing by 1.79 (95% CI: 0.25–3.33).

    Conclusions: This manuscript demonstrates that RDQAs can be implemented on a large scale and used to identify EMR data quality problems. RDQAs were associated with meaningful improvements in data quality and could be adapted for implementation in other settings.
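    The two measures described above (missing required elements, and a one-point-per-element concordance score between paper and EMR) could be sketched roughly as follows. This is a minimal illustration only: the field names and sample values are hypothetical, and the study's actual data elements and scoring rules are not listed in this description.

```python
from typing import Mapping, Optional, Sequence

Record = Mapping[str, Optional[str]]


def has_missing(emr: Record, required: Sequence[str]) -> bool:
    """True if at least one required element is absent or empty in the EMR record."""
    return any(not emr.get(field) for field in required)


def concordance_score(paper: Record, emr: Record, elements: Sequence[str]) -> int:
    """One point per element recorded in both sources with identical values."""
    return sum(
        1
        for field in elements
        if paper.get(field) and paper.get(field) == emr.get(field)
    )


# Hypothetical record pair with three illustrative elements.
paper = {"sex": "F", "art_start_date": "2015-03-02", "who_stage": "2"}
emr = {"sex": "F", "art_start_date": "2015-03-20", "who_stage": None}
elements = ["sex", "art_start_date", "who_stage"]

print(has_missing(emr, elements))               # True: who_stage is empty
print(concordance_score(paper, emr, elements))  # 1: only sex agrees
```

    Averaging such scores per facility at baseline and at follow-up would give figures comparable in form to the 11.9/20 and 13.6/20 values quoted above, although the published estimates also adjust for facility characteristics.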

  4. Data Quality Scorecards Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 4, 2025
    Cite
    Growth Market Reports (2025). Data Quality Scorecards Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/data-quality-scorecards-market
    Available download formats: csv, pptx, pdf
    Dataset updated
    Oct 4, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Data Quality Scorecards Market Outlook



    According to our latest research, the global Data Quality Scorecards market size in 2024 stands at USD 1.42 billion, reflecting robust demand across diverse sectors. The market is projected to expand at a CAGR of 14.8% from 2025 to 2033, reaching an estimated USD 4.45 billion by the end of the forecast period. Key growth drivers include the escalating need for reliable data-driven decision-making, stringent regulatory compliance requirements, and the proliferation of digital transformation initiatives across enterprises of all sizes. Organizations are increasingly recognizing the significance of maintaining high data quality standards to fuel analytics, artificial intelligence, and business intelligence capabilities.
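    For readers who want to sanity-check growth figures like these, the standard compound annual growth rate formula can be sketched as below. The function names are illustrative, and treating 2024 to 2033 as nine compounding periods is an assumption, since the report does not state its base-year convention.

```python
def cagr(start_value: float, end_value: float, periods: int) -> float:
    """Compound annual growth rate implied by two values `periods` years apart."""
    if start_value <= 0 or periods <= 0:
        raise ValueError("start_value and periods must be positive")
    return (end_value / start_value) ** (1 / periods) - 1


def project(start_value: float, rate: float, periods: int) -> float:
    """Value reached after compounding `rate` for `periods` years."""
    return start_value * (1 + rate) ** periods


# Figures quoted above, in USD billions. Nine periods (2024 -> 2033) is an
# assumption; a different base year changes the implied rate.
implied = cagr(1.42, 4.45, periods=9)
print(f"implied CAGR: {implied:.1%}")
print(f"1.42 compounded at 14.8% for 9 years: {project(1.42, 0.148, 9):.2f}")
```

    Depending on the period convention used, the implied rate need not match the quoted CAGR exactly.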




    One of the primary growth factors for the Data Quality Scorecards market is the exponential rise in data volumes generated by organizations worldwide. The digital economy has led to a surge in data collection from various sources, including customer interactions, IoT devices, and transactional systems. This data explosion has heightened the complexity of managing and ensuring data accuracy, completeness, and consistency. As a result, businesses are investing in comprehensive data quality management solutions, such as scorecards, to monitor, measure, and improve the quality of their data assets. These tools provide actionable insights, enabling organizations to proactively address data quality issues and maintain data integrity across their operations. The growing reliance on advanced analytics and artificial intelligence further amplifies the demand for high-quality data, making data quality scorecards an indispensable component of modern data management strategies.




    Another significant growth driver is the increasing regulatory scrutiny and compliance requirements imposed on organizations, particularly in industries such as BFSI, healthcare, and government. Regulatory frameworks such as GDPR, HIPAA, and CCPA mandate stringent controls over data accuracy, privacy, and security. Non-compliance can result in severe financial penalties and reputational damage, compelling organizations to adopt robust data quality management practices. Data quality scorecards help organizations monitor compliance by providing real-time visibility into data quality metrics and highlighting areas that require remediation. This proactive approach to compliance not only mitigates regulatory risks but also enhances stakeholder trust and confidence in organizational data assets. The integration of data quality scorecards into enterprise data governance frameworks is becoming a best practice for organizations aiming to achieve continuous compliance and data excellence.




    The rapid adoption of cloud computing and digital transformation initiatives across industries is also fueling the growth of the Data Quality Scorecards market. As organizations migrate their data infrastructure to the cloud and embrace hybrid IT environments, the complexity of managing data quality across disparate systems increases. Cloud-based data quality scorecards offer scalability, flexibility, and ease of deployment, making them an attractive option for organizations seeking to modernize their data management practices. Moreover, the proliferation of self-service analytics and business intelligence tools has democratized data access, necessitating robust data quality monitoring to ensure that decision-makers are working with accurate and reliable information. The convergence of cloud, AI, and data quality management is expected to create new opportunities for innovation and value creation in the market.




    From a regional perspective, North America continues to dominate the Data Quality Scorecards market, driven by the presence of leading technology vendors, high adoption rates of advanced analytics, and stringent regulatory frameworks. However, the Asia Pacific region is expected to witness the fastest growth during the forecast period, fueled by rapid digitalization, increasing investments in IT infrastructure, and growing awareness of data quality management among enterprises. Europe also represents a significant market, characterized by strong regulatory compliance requirements and a mature data management ecosystem. Latin America and the Middle East & Africa are emerging markets, with increasing adoption of data quality solutions in sectors such as BFSI, healthcare, and government. The global market landscape is evolving rapidly, with regional

  5. 5.01 Quality of Business Services (summary)

    • catalog.data.gov
    • performance.tempe.gov
    • +11 more
    Updated Nov 15, 2025
    + more versions
    Cite
    City of Tempe (2025). 5.01 Quality of Business Services (summary) [Dataset]. https://catalog.data.gov/dataset/5-01-quality-of-business-services-summary-71fdc
    Dataset updated
    Nov 15, 2025
    Dataset provided by
    City of Tempe
    Description

    Biennial Business Survey data summary for Quality of Business Services survey results. The Business Survey question that relates to this dataset is: “Quality of services provided by City of Tempe.” Respondents are asked to rate their satisfaction level using a scale of 1 to 5, where 1 means "Very Dissatisfied" and 5 means "Very Satisfied".

    This page provides data for the Quality of Business Services performance measure. The performance measure dashboard is available at 5.01 Quality of Business Services.

    Additional Information
    Source: Business Survey (Vendor: ETC Institute)
    Contact: Wydale Holmes
    Contact E-Mail: wydale_holmes@tempe.gov
    Data Source Type: .pdf, Excel
    Preparation Method: The City contracts with a vendor to conduct the survey, analyze the data, and prepare for publication.
    Publish Frequency: Every other year
    Publish Method: Manual, .pdf
    Data Dictionary

  6. 5.01 Quality of Business Services (dashboard)

    • catalog.data.gov
    • data.tempe.gov
    • +1 more
    Updated Mar 18, 2023
    Cite
    City of Tempe (2023). 5.01 Quality of Business Services (dashboard) [Dataset]. https://catalog.data.gov/dataset/5-01-quality-of-business-services-dashboard-a7f09
    Dataset updated
    Mar 18, 2023
    Dataset provided by
    City of Tempe
    Description

    This operations dashboard shows historic and current data related to this performance measure. The performance measure dashboard is available at 5.01 Quality of Business Services. Data Dictionary

  7. Minimum Data Set Quality Measure/Indicator Report

    • data.wu.ac.at
    Updated Apr 5, 2016
    Cite
    U.S. Department of Health & Human Services (2016). Minimum Data Set Quality Measure/Indicator Report [Dataset]. https://data.wu.ac.at/schema/data_gov/MzY1YzIyOTQtZjdhMC00MWNlLTkxNjktOGFhZDRlOGFlNDFh
    Dataset updated
    Apr 5, 2016
    Dataset provided by
    U.S. Department of Health & Human Services
    Description

    No description provided

  8. Using Descriptive Statistics to Analyse Data in R

    • kaggle.com
    zip
    Updated May 9, 2024
    Cite
    Enrico68 (2024). Using Descriptive Statistics to Analyse Data in R [Dataset]. https://www.kaggle.com/datasets/enrico68/using-descriptive-statistics-to-analyse-data-in-r
    Available download formats: zip (105561 bytes)
    Dataset updated
    May 9, 2024
    Authors
    Enrico68
    Description

    Load and view a real-world dataset in RStudio

    • Calculate “Measure of Frequency” metrics

    • Calculate “Measure of Central Tendency” metrics

    • Calculate “Measure of Dispersion” metrics

    • Use R’s in-built functions for additional data quality metrics

    • Create a custom R function to calculate descriptive statistics on any given dataset
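    The steps listed above could be sketched as follows. The dataset itself targets R; this sketch uses Python's standard library instead, and the function name `describe` and the sample values are illustrative assumptions rather than anything taken from the dataset.

```python
import statistics
from collections import Counter
from typing import Dict, Sequence


def describe(values: Sequence[float]) -> Dict[str, object]:
    """Frequency, central tendency, and dispersion summaries for a numeric sample."""
    if len(values) < 2:
        raise ValueError("need at least two values to compute dispersion")
    return {
        "count": len(values),
        "frequency": dict(Counter(values)),       # measure of frequency
        "mean": statistics.mean(values),          # central tendency
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "range": max(values) - min(values),       # dispersion
        "variance": statistics.variance(values),  # sample variance (n - 1)
        "stdev": statistics.stdev(values),
    }


sample = [2, 4, 4, 5, 7]
summary = describe(sample)
print(summary["mean"], summary["median"], summary["mode"], summary["range"])
```

    Wrapping the metrics in one function mirrors the last step in the list: the same call can then be applied to any numeric column.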

  9. Healthcare Data Quality Tools Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Cite
    Dataintelo (2025). Healthcare Data Quality Tools Market Research Report 2033 [Dataset]. https://dataintelo.com/report/healthcare-data-quality-tools-market
    Available download formats: csv, pptx, pdf
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Healthcare Data Quality Tools Market Outlook



    According to our latest research, the global healthcare data quality tools market size reached USD 1.82 billion in 2024. The market is expected to exhibit a strong compound annual growth rate (CAGR) of 16.9% from 2025 to 2033, driven by the increasing digitization of healthcare systems, regulatory mandates, and the rising emphasis on data-driven decision-making in healthcare. By 2033, the market is forecasted to achieve a value of USD 7.13 billion. This robust expansion is primarily fueled by the growing need for accurate, complete, and reliable health data to improve patient outcomes, streamline operations, and ensure compliance with evolving healthcare regulations.




    The healthcare data quality tools market is experiencing significant growth due to the surging adoption of electronic health records (EHRs) and the rapid digital transformation within the healthcare sector. As healthcare organizations increasingly transition from paper-based systems to digital platforms, the volume and complexity of healthcare data have grown exponentially. This shift has amplified the need for data quality tools that can cleanse, standardize, and validate large datasets, ensuring that critical clinical and administrative decisions are based on accurate and consistent information. The integration of advanced analytics and artificial intelligence (AI) in healthcare data management further accelerates the demand for robust data quality solutions, enabling organizations to unlock actionable insights from their data assets.




    Another key growth factor for the healthcare data quality tools market is the stringent regulatory environment governing healthcare data management. Regulatory bodies such as HIPAA in the United States and GDPR in Europe have established strict guidelines for data privacy, security, and accuracy, compelling healthcare organizations to invest in tools that ensure compliance. Non-compliance can result in severe penalties and reputational damage, making data quality management a top priority. Additionally, the increasing adoption of value-based care models and the emphasis on population health management require high-quality data to track patient outcomes, measure performance, and optimize resource allocation. This regulatory and operational landscape is driving sustained investments in healthcare data quality tools globally.




    The proliferation of connected medical devices, telemedicine platforms, and health information exchanges has further contributed to the complexity of healthcare data ecosystems. These advancements generate vast amounts of structured and unstructured data from diverse sources, including patient records, imaging systems, wearable devices, and administrative databases. Ensuring the interoperability and consistency of such heterogeneous data is a significant challenge, necessitating advanced data quality tools that can handle multiple data types and formats. As healthcare organizations strive to harness the full potential of big data and predictive analytics, the importance of data quality tools in enabling reliable and actionable insights cannot be overstated.




    From a regional perspective, North America currently dominates the healthcare data quality tools market, accounting for the largest revenue share in 2024. The region’s leadership is attributed to its advanced healthcare IT infrastructure, high adoption of EHRs, and strong regulatory frameworks. However, Asia Pacific is expected to register the fastest growth during the forecast period, supported by increasing healthcare digitization, government initiatives to modernize healthcare systems, and rising investments in health IT. Europe also remains a significant market, driven by stringent data protection regulations and the widespread implementation of digital health initiatives across the region.



    Component Analysis



    The healthcare data quality tools market by component is broadly segmented into software and services. The software segment comprises standalone and integrated solutions designed to automate data cleansing, profiling, integration, enrichment, and monitoring processes within healthcare organizations. These solutions are increasingly incorporating advanced technologies such as artificial intelligence, machine learning, and natural language processing to enhance data accuracy and streamline workflows. The growing need to manage large volumes of healthcare data efficiently and the rising

  10. DataSheet_1_Quality indicators: completeness, validity and timeliness of...

    • frontiersin.figshare.com
    pdf
    Updated Jul 28, 2023
    Cite
    Francesco Giusti; Carmen Martos; Raquel Negrão Carvalho; Liesbet Van Eycken; Otto Visser; Manola Bettio (2023). DataSheet_1_Quality indicators: completeness, validity and timeliness of cancer registry data contributing to the European Cancer Information System.pdf [Dataset]. http://doi.org/10.3389/fonc.2023.1219128.s001
    Available download formats: pdf
    Dataset updated
    Jul 28, 2023
    Dataset provided by
    Frontiers Media (http://www.frontiersin.org/)
    Authors
    Francesco Giusti; Carmen Martos; Raquel Negrão Carvalho; Liesbet Van Eycken; Otto Visser; Manola Bettio
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Population-based Cancer Registries (PBCRs) are tasked with collecting high-quality data, important for monitoring cancer burden and its trends, planning and evaluating cancer control activities, clinical and epidemiological research and development of health policies. The main indicators to measure data quality are validity, completeness, comparability and timeliness. The aim of this article is to evaluate the quality of PBCRs data collected in the first ENCR-JRC data call, dated 2015.

    Methods: All malignant tumours, except skin non-melanoma, and in situ and uncertain behaviour of bladder were obtained from 130 European general PBCRs for patients older than 19 years. Proportion of cases with death certificate only (DCO%), proportion of cases with unknown primary site (PSU%), proportion of microscopically verified cases (MV%), mortality to incidence (M:I) ratio, proportion of cases with unspecified morphology (UM%) and the median of the difference between the registration date and the incidence date were computed by sex, age group, cancer site, period and PBCR.

    Results: A total of 28,776,562 cases from 130 PBCRs, operating in 30 European countries were included in the analysis. The quality of incidence data reported by PBCRs has been improving across the study period. Data quality is worse for the oldest age groups and for cancer sites with poor survival. No differences were found between males and females. High variability in data quality was detected across European PBCRs.

    Conclusion: The results reported in this paper are to be interpreted as the baseline for monitoring PBCRs data quality indicators in Europe over time.
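    Several of the indicators named above are simple proportions over a group of cases. The sketch below shows how DCO%, MV%, PSU% and the M:I ratio might be computed for one group; the `Case` record layout and the sample data are hypothetical and much simpler than a real registry extract.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Case:
    """Hypothetical, simplified registry record; field names are illustrative."""
    death_certificate_only: bool
    microscopically_verified: bool
    primary_site_unknown: bool


def percentage(part: int, total: int) -> float:
    return 100.0 * part / total


def quality_indicators(cases: List[Case], deaths: int) -> Dict[str, float]:
    """DCO%, MV%, PSU% and the mortality-to-incidence ratio for one group of cases."""
    n = len(cases)
    if n == 0:
        raise ValueError("no incident cases in this group")
    return {
        "DCO%": percentage(sum(c.death_certificate_only for c in cases), n),
        "MV%": percentage(sum(c.microscopically_verified for c in cases), n),
        "PSU%": percentage(sum(c.primary_site_unknown for c in cases), n),
        "M:I": deaths / n,  # deaths are counted separately from incident cases
    }


cases = [
    Case(False, True, False),
    Case(False, True, False),
    Case(True, False, True),
    Case(False, False, False),
]
indicators = quality_indicators(cases, deaths=2)
print(indicators)
```

    Computing the indicators by sex, age group, cancer site, period and registry, as the abstract describes, would amount to calling such a function once per group.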

  11. City Website Quality Satisfaction (Performance Measure 2.04)

    • data.wu.ac.at
    csv
    Updated Mar 28, 2018
    + more versions
    Cite
    City of Tempe (2018). City Website Quality Satisfaction (Performance Measure 2.04) [Dataset]. https://data.wu.ac.at/schema/data_gov/M2FhMGFkYmMtODBiZS00ZGQ5LTg2NmQtOTIwNjJiYjgxOTky
    Available download formats: csv
    Dataset updated
    Mar 28, 2018
    Dataset provided by
    City of Tempe
    Description

    This dataset comes from the Annual Community Survey question related to satisfaction with the quality of the city website. Respondents are asked to provide their level of satisfaction related to the “Usefulness of the City's website” on a scale of 5 to 1, where 5 means "Very Satisfied" and 1 means "Very Dissatisfied" (without "don't know" as an option).

    The survey is mailed to a random sample of households in the City of Tempe and has a 95% confidence level.

    This page provides data for the City Website Quality Satisfaction performance measure. Click on the Showcases tab for any available stories or dashboards related to this data.

    The performance measure dashboard is available at PMD 2.04 City Website Satisfaction (Coming Soon)

    PMID: 2211

  12. India Air Quality Index (AQI) Dataset 2010-2024

    • kaggle.com
    zip
    Updated Sep 29, 2025
    Cite
    Om Patil (2025). India Air Quality Index (AQI) Dataset 2010-2024 [Dataset]. https://www.kaggle.com/datasets/omsandeeppatil/indian-aqi-stations
    Available download formats: zip (2059414090 bytes)
    Dataset updated
    Sep 29, 2025
    Authors
    Om Patil
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Area covered
    India
    Description

    Ever wondered how bad the air really is? Wonder no more! We've got 14 years of hourly data from 553 stations across India proving that yes, it's probably worse than you thought.

    Perfect for data scientists who want to predict the unpredictable and researchers who enjoy charts that trend upward in all the wrong ways.

    🏆 The Masterminds Behind This Chaos

    Curated by:
    • @omsandeeppatil - Guy who decided counting particles in air was fun
    • @durvadongre - Partner in crime

    Brought to you by: Project Parisar - Because someone has to keep track of this mess

    🎯 What Can You Do With This?

    • Predict the future: Will tomorrow's air be soup or just thick fog?
    • Machine Learning: Teach computers to be as pessimistic about air quality as we are
    • Health Research: Correlate coughing patterns with PM2.5 spikes
    • Climate Studies: Document the slow-motion environmental apocalypse
    • Urban Planning: Help cities figure out where NOT to put playgrounds

    📁 How We Organized This Disaster

    ├── stations.csv         # All 553 ways we measure disappointment
    └── data/
      ├── Andhra-Pradesh/
      │  ├── AP01.csv      # Local air quality: "Meh"
      └── [More States of Despair]/
    

    🗂️ What's Inside

    stations.csv - Station Hall of Fame

    Your guide to 553 locations where we scientifically measure "yikes":

    Column | What It Means | Example
    id | Unique ID for each monitoring disaster | 1
    station_name | Fancy name for "air sniffer" | "NSIT Dwarka Delhi CPCB"
    station_code | Bureaucratic shorthand | "DL01"
    city | Where dreams of clean air go to die | "Dwarka"
    state_code | Two letters of regional identity | "DL"
    pin_code | Postal code (for sending sympathy cards) | 110078
    latitude | GPS coords of suffering | 28.610947
    longitude | More GPS coords of suffering | 77.038456
    elevation_m | Height above sea level (not above smog) | 342
    topo_complexity | How confusing the terrain is | 1.5
    coastal_proximity | Distance to breathable sea air | 0.7
    valley_factor | How trapped the bad air is | 0.8

    Individual Station Files - The Daily Grind

    🌫️ The Main Villains:
    • pm2.5 - Tiny particles of regret (μg/m³)
    • pm10 - Bigger particles of regret (μg/m³)
    • no2 - Nitrogen's angry cousin (μg/m³)
    • so2 - Sulfur's contribution to chaos (μg/m³)
    • co - The silent but deadly friend (mg/m³)
    • ozone - Good upstairs, bad downstairs (μg/m³)

    🧪 The Chemical Ensemble Cast:
    • benzene, toluene, xylene - The aromatic troublemakers (μg/m³)
    • nh3 - Ammonia, because why not? (μg/m³)

    🌡️ Weather Accomplices:
    • rh - Humidity (makes everything stickier) (%)
    • ws - Wind speed (how fast help is blowing away) (m/s)
    • wd - Wind direction (where the blame is coming from) (°)
    • bp - Barometric pressure (atmospheric mood swings) (hPa)

    📅 Time & Place Stamps:
    • timestamp - When exactly everything went wrong
    • station_id - Which station witnessed this particular tragedy

    🌍 Geographic Coverage

    From the bustling metros to sleepy hill stations, we've got disappointing air quality data everywhere! Mumbai's industrial charm, Delhi's winter wonderland of smog, and even those "pristine" hill stations that aren't so pristine anymore.

    📈 Data Quality

    • 85% complete (The other 15% probably gave up measuring)
    • CPCB validated (Officially certified disappointment)
    • Missing values clearly marked (Honesty in despair)

    🛠️ Tech Specs

    Format: CSV (Because even environmental disasters need spreadsheets)
    Encoding: UTF-8 (International standard for documenting problems)
    Missing Values: When even the sensors couldn't handle it
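    To check a completeness figure like the one quoted above against an individual station file, something along these lines could work. It is only a sketch: how missing values are actually encoded is not stated, so the marker set below is an assumption, and the inline rows stand in for a real file.

```python
import csv
import io
from typing import Dict, Iterable

# Assumption: missing readings appear as empty fields or one of these markers.
# The dataset description says missing values are marked but not how.
MISSING_MARKERS = {"", "NA", "NaN", "nan", "null"}


def completeness(rows: Iterable[Dict[str, str]], column: str) -> float:
    """Share of rows (0.0 to 1.0) with a usable value in `column`."""
    total = 0
    present = 0
    for row in rows:
        total += 1
        value = (row.get(column) or "").strip()
        if value not in MISSING_MARKERS:
            present += 1
    if total == 0:
        raise ValueError("no rows to evaluate")
    return present / total


# Inline stand-in for a station file such as data/Andhra-Pradesh/AP01.csv;
# the rows, timestamp format, and column order are made up for illustration.
sample = io.StringIO(
    "timestamp,pm2.5,pm10,station_id\n"
    "2024-01-01 00:00,41.0,88.5,1\n"
    "2024-01-01 01:00,,91.2,1\n"
    "2024-01-01 02:00,NA,,1\n"
    "2024-01-01 03:00,39.4,80.1,1\n"
)
rows = list(csv.DictReader(sample))
print(completeness(rows, "pm2.5"))  # 0.5
print(completeness(rows, "pm10"))   # 0.75
```

    For a real file, the `io.StringIO` stand-in would be replaced by `open(path, newline="", encoding="utf-8")`, matching the UTF-8 encoding listed in the tech specs.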

    🔍 Perfect For

    • PhD students who hate themselves
    • Data scientists with a dark sense of humor
    • Anyone building the next "AirpocalypseNow" app
    • Researchers documenting the end times

    📚 How to Cite This Masterpiece

    Patil, O.S., Dongre, D. (2024). "India Air Quality Dataset: 
    14 Years of Scientifically Measuring How Screwed We Are." 
    Project Parisar. Available at: [kaggle-url]
    

    🤝 Want to Help?

    Project Parisar welcomes contributions! Because misery loves company, and data cleaning is a team sport.

    📞 Questions?

    Hit up @omsandeeppatil or @durvadongre - they're the brave souls who actually organized this chaos.

    🏷️ Tags

    environmental-disaster data-science time-series india air-pollution machine-learning public-health why-we-cant-have-nice-things

    Disclaimer: No air particles were harmed in the making of this dataset. They're doing just fine, unfortunately.

  13. Data Quality Scorecards Market Research Report 2033

    • researchintelo.com
    csv, pdf, pptx
    Updated Oct 1, 2025
    Cite
    Research Intelo (2025). Data Quality Scorecards Market Research Report 2033 [Dataset]. https://researchintelo.com/report/data-quality-scorecards-market
    Available download formats: pdf, csv, pptx
    Dataset updated
    Oct 1, 2025
    Dataset authored and provided by
    Research Intelo
    License

    https://researchintelo.com/privacy-and-policy

    Time period covered
    2024 - 2033
    Area covered
    Global
    Description

    Data Quality Scorecards Market Outlook



    According to our latest research, the Global Data Quality Scorecards Market size was valued at $1.4 billion in 2024 and is projected to reach $4.2 billion by 2033, expanding at a robust CAGR of 13.2% during the forecast period of 2025–2033. The primary growth driver for this market is the increasing reliance on data-driven decision-making across enterprises, which necessitates stringent data quality management to ensure accuracy, compliance, and business agility. As organizations globally accelerate digital transformation initiatives, the demand for comprehensive data quality scorecard solutions is surging, enabling businesses to monitor, measure, and improve data integrity and reliability across diverse operational environments.



    Regional Outlook



    North America currently dominates the Data Quality Scorecards Market, accounting for the largest market share in 2024. The region’s leadership stems from the early adoption of advanced data management technologies, a mature IT infrastructure, and stringent regulatory requirements, particularly in sectors such as BFSI, healthcare, and government. Organizations in the United States and Canada are investing heavily in robust data governance frameworks, which in turn drives the adoption of data quality scorecards. Major technology players headquartered in this region also contribute to rapid product innovation and ecosystem development. As a result, North America is expected to maintain its market leadership, with a projected market value exceeding $1.5 billion by 2033.



    The Asia Pacific region is anticipated to register the fastest growth in the Data Quality Scorecards Market, with a projected CAGR surpassing 15% during the forecast period. This growth is primarily fueled by rapid digitalization, expanding IT and telecommunications sectors, and increasing regulatory focus on data privacy and quality in countries such as China, India, Japan, and South Korea. Enterprises in this region are increasingly adopting cloud-based data quality solutions to support large-scale data integration and analytics projects. Furthermore, government-led digital transformation initiatives and significant investments in smart city projects are propelling the demand for efficient data quality management tools. The region’s burgeoning e-commerce and financial services industries are also key contributors to this robust growth trajectory.



    Emerging economies in Latin America, the Middle East, and Africa are gradually embracing data quality scorecards, although adoption remains at a nascent stage compared to developed markets. Challenges such as limited IT infrastructure, budget constraints, and a shortage of skilled data professionals hinder market penetration. However, the growing awareness of the importance of data quality for regulatory compliance and operational efficiency is driving gradual uptake. Localized demand is further influenced by sector-specific needs in banking, government, and retail, where accurate data is crucial for risk management and customer engagement. Policy reforms aimed at enhancing data security and digital transformation are expected to create new opportunities for market players in these regions over the coming years.



    Report Scope





    Attributes | Details
    Report Title | Data Quality Scorecards Market Research Report 2033
    By Component | Software, Services
    By Deployment Mode | On-Premises, Cloud
    By Organization Size | Small and Medium Enterprises, Large Enterprises
    By Application | Data Governance, Risk and Compliance Management, Data Integration and Migration, Business Intelligence and Analytics, Others
    By End-User | BFSI, Healthcare, Retail and E-commerce, IT and Tel

  14. 3.36 Quality of City Services (dashboard)

    • catalog.data.gov
    • s.cnmilf.com
    • +2more
    Updated Nov 15, 2025
    Cite
    City of Tempe (2025). 3.36 Quality of City Services (dashboard) [Dataset]. https://catalog.data.gov/dataset/3-36-quality-of-city-services-dashboard-5d8b1
    Explore at:
    Dataset updated
    Nov 15, 2025
    Dataset provided by
    City of Tempe
    Description

    This operations dashboard shows historic and current data related to this performance measure. The performance measure dashboard is available at 3.36 Quality of City Services. Data Dictionary

  15. Data from USGS National Water Quality Laboratory methods used to calculate...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 26, 2025
    + more versions
    Cite
    U.S. Geological Survey (2025). Data from USGS National Water Quality Laboratory methods used to calculate and compare detection limits estimated using single- and multi-concentration spike-based and blank-based procedures [Dataset]. https://catalog.data.gov/dataset/data-from-usgs-national-water-quality-laboratory-methods-used-to-calculate-and-compare-det
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    United States Geological Survey (http://www.usgs.gov/)
    Description

    This dataset provides the expected and determined concentrations of selected inorganic and organic analytes for spiked reagent-water samples (calibration standards and limit of quantitation standards) that were used to calculate detection limits by using the United States Environmental Protection Agency’s (USEPA) Method Detection Limit (MDL) version 1.11 or 2.0 procedures, ASTM International’s Within-Laboratory Critical Level standard procedure D7783-13, and, for five pharmaceutical compounds, by USEPA’s Lowest Concentration Minimum Reporting Level procedure. Also provided are determined concentration data for reagent-water laboratory blank samples, classified as either instrument blank or set blank samples, and reagent-water blind-blank samples submitted by the USGS Quality System Branch, that were used to calculate blank-based detection limits by using the USEPA MDL version 2.0 procedure or procedures described in National Water Quality Laboratory Technical Memorandum 2016.02, http://wwwnwql.cr.usgs.gov/tech_memos/nwql.2016-02.pdf. The determined detection limits are provided and compared in the related external publication at https://doi.org/10.1016/j.talanta.2021.122139.
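    For readers unfamiliar with the spike-based calculation, the USEPA MDL procedure is commonly summarized as the one-sided 99% Student's t value for n - 1 degrees of freedom multiplied by the sample standard deviation of n replicate spiked measurements. The sketch below illustrates only that single step. The function name and the replicate concentrations are hypothetical, the values are not taken from this dataset, and the blank-based, ASTM, and Lowest Concentration Minimum Reporting Level procedures mentioned above are not covered.

```python
from statistics import stdev

# One-sided Student's t critical values at 99% confidence,
# keyed by degrees of freedom (n - 1). Only a few common sizes are listed.
T_99 = {6: 3.143, 7: 2.998, 8: 2.896, 9: 2.821}


def spike_based_mdl(replicates: list[float]) -> float:
    """Spike-based detection limit: t(n-1, 0.99) times the sample std deviation."""
    dof = len(replicates) - 1
    if dof not in T_99:
        raise ValueError(f"no tabulated t value for {dof} degrees of freedom")
    return T_99[dof] * stdev(replicates)


# Hypothetical determined concentrations for seven spiked replicates
# (illustrative values only, not from the USGS dataset).
example_replicates = [1.9, 2.1, 2.0, 2.2, 1.8, 2.0, 2.1]
print(f"Spike-based MDL: {spike_based_mdl(example_replicates):.3f}")
```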

  16. Data from: Antecedents to website satisfaction, loyalty, and word-of-mouth

    • scielo.figshare.com
    jpeg
    Updated Jun 1, 2023
    Cite
    Brent Coker (2023). Antecedents to website satisfaction, loyalty, and word-of-mouth [Dataset]. http://doi.org/10.6084/m9.figshare.20011635.v1
    Explore at:
    Available download formats: jpeg
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELO (http://www.scielo.org/)
    Authors
    Brent Coker
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Satisfaction, loyalty, and likelihood of referral are regarded by marketers as the Big Three diagnostics leading to retail profitability. However, no one has yet developed a model that captures all three of these constructs in the context of the internet. Moreover, although several attempts have been made to develop models to measure the quality of website experience, no one has sought to develop an instrument short enough to be of practical use as a quick customer satisfaction feedback form. In this research we sought to fill this void by developing and psychometrically testing a parsimonious model to capture the Big Three diagnostics, brief enough to be used in a commercial environment as a modal popup feedback form.

  17. Air quality data from the measure stations of the city of Barcelona

    • datos.gob.es
    • opendata-ajuntament.barcelona.cat
    Updated Jun 15, 2018
    Cite
    Ayuntamiento de Barcelona (2018). Air quality data from the measure stations of the city of Barcelona [Dataset]. https://datos.gob.es/en/catalogo/l01080193-datos-de-las-estaciones-de-medida-de-la-calidad-del-aire-de-la-ciudad-de-barcelona
    Explore at:
    Dataset updated
    Jun 15, 2018
    Dataset authored and provided by
    Ayuntamiento de Barcelona
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Barcelona
    Description

    This dataset contains measurements of the pollutants recorded at the air quality stations of the city of Barcelona. It is updated hourly, and each value is flagged as validated or not validated. Data for the three days preceding the current day are also included.

  18. Inpatient Psychiatric Facility Quality Measure Data – by Facility

    • data.wu.ac.at
    csv, json, xls
    Updated Dec 21, 2017
    + more versions
    Cite
    Medicare (2017). Inpatient Psychiatric Facility Quality Measure Data – by Facility [Dataset]. https://data.wu.ac.at/schema/public_opendatasoft_com/aW5wYXRpZW50LXBzeWNoaWF0cmljLWZhY2lsaXR5LXF1YWxpdHktbWVhc3VyZS1kYXRhLWJ5LWZhY2lsaXR5
    Explore at:
    Available download formats: csv, xls, json
    Dataset updated
    Dec 21, 2017
    Dataset provided by
    Medicare
    Description

    Psychiatric facilities that are eligible for the Inpatient Psychiatric Facility Quality Reporting (IPFQR) program are required to meet all program requirements; otherwise, their Medicare payments may be reduced. Follow-Up After Hospitalization for Mental Illness (FUH) measure data in this table are marked as not available. Results for this measure are provided in a separate table.

  19. Quality of life measure - by state

    • data.virginia.gov
    csv
    Updated Oct 23, 2025
    Cite
    Datathon 2024 (2025). Quality of life measure - by state [Dataset]. https://data.virginia.gov/dataset/quality-of-life-by-state
    Explore at:
    Available download formats: csv (1738)
    Dataset updated
    Oct 23, 2025
    Dataset authored and provided by
    Datathon 2024
    Description

    Quality of life is a measure of the comfort, health, and happiness experienced by a person or a group of people. It is determined by both material factors, such as income and housing, and broader considerations like health, education, and freedom. Each year, U.S. News & World Report releases its “Best States to Live in” report, which ranks states on the quality of life each state provides its residents. To determine the rankings, U.S. News & World Report considers a wide range of factors, including healthcare, education, economy, infrastructure, opportunity, fiscal stability, crime and corrections, and the natural environment. More information on these categories and what is measured in each can be found below:

    Healthcare includes access, quality, and affordability of healthcare, as well as health measurements, such as obesity rates and rates of smoking.
    Education measures how well public schools perform in terms of testing and graduation rates, as well as tuition costs associated with higher education and college debt load.
    Economy looks at GDP growth, migration to the state, and new business.
    Infrastructure includes transportation availability, road quality, communications, and internet access.
    Opportunity includes poverty rates, cost of living, housing costs and gender and racial equality.
    Fiscal Stability considers the health of the government's finances, including how well the state balances its budget.
    Crime and Corrections ranks a state’s public safety and measures prison systems and their populations.
    Natural Environment looks at the quality of air and water and exposure to pollution.

  20. Data associated with: Measuring Quality and Characterizing Cuna Mas Home...

    • data.iadb.org
    csv
    Updated Apr 10, 2025
    Cite
    IDB Datasets (2025). Data associated with: Measuring Quality and Characterizing Cuna Mas Home Visits Validation of the HOVRS-A+2 in Peru [Dataset]. http://doi.org/10.60966/cyov6t8i
    Explore at:
    Available download formats: csv (609668)
    Dataset updated
    Apr 10, 2025
    Dataset provided by
    Inter-American Development Bank (http://www.iadb.org/)
    License

    Attribution-NonCommercial-NoDerivs 3.0 (CC BY-NC-ND 3.0): https://creativecommons.org/licenses/by-nc-nd/3.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2015
    Area covered
    Peru
    Description

    This dataset contains information on Programa Nacional Cuna Más (Cuna Mas, hereinafter), Peru’s largest early childhood development program, established in 2012. It focuses on one of the two services provided by Cuna Mas, known as Servicio de Acompanamiento a Familias (SAF), a home visiting program that operates in rural areas and provides one-hour weekly home visits to children aged 0-36 months and their caregivers. The objective of the study was to compare different instruments for measuring the quality of home visiting programs. Between August and October 2015, three instruments were administered to a sample of 554 children enrolled in Cuna Mas and receiving home visits at the time of data collection, and to their 176 home visitors, who regularly work with 80 supervisors.

Cite
Olga Kostopoulou; Brendan Delaney (2021). Measuring quality of routine primary care data [Dataset]. http://doi.org/10.5061/dryad.dncjsxkzh

Measuring quality of routine primary care data

Explore at:
Available download formats: zip
Dataset updated
Mar 12, 2021
Dataset provided by
Imperial College London
Authors
Olga Kostopoulou; Brendan Delaney
License

https://spdx.org/licenses/CC0-1.0.html

Description

Objective: Routine primary care data may be used for the derivation of clinical prediction rules and risk scores. We sought to measure the impact of a decision support system (DSS) on data completeness and freedom from bias.

Materials and Methods: We used the clinical documentation of 34 UK General Practitioners who took part in a previous study evaluating the DSS. They consulted with 12 standardized patients. In addition to suggesting diagnoses, the DSS facilitates data coding. We compared the documentation from consultations with the electronic health record (EHR) (baseline consultations) vs. consultations with the EHR-integrated DSS (supported consultations). We measured the proportion of EHR data items related to the physician’s final diagnosis. We expected that in baseline consultations, physicians would document only or predominantly observations related to their diagnosis, while in supported consultations, they would also document other observations as a result of exploring more diagnoses and/or ease of coding.

Results: Supported documentation contained significantly more codes (IRR=5.76 [4.31, 7.70] P<0.001) and less free text (IRR = 0.32 [0.27, 0.40] P<0.001) than baseline documentation. As expected, the proportion of diagnosis-related data was significantly lower (b=-0.08 [-0.11, -0.05] P<0.001) in the supported consultations, and this was the case for both codes and free text.

Conclusions: We provide evidence that data entry in the EHR is incomplete and reflects physicians’ cognitive biases. This has serious implications for epidemiological research that uses routine data. A DSS that facilitates and motivates data entry during the consultation can improve routine documentation.
