The Customer Data Quality Check consists of the Person Checker, Address Checker, Phone Checker and Email Checker as standard. All personal data, addresses, telephone numbers and email addresses within your file are validated, cleaned, corrected and supplemented. Optionally, we can also provide other data, such as company data or, for example, indicate whether your customer database contains deceased persons, whether relocations have taken place and whether it contains organizations that are bankrupt.
Benefits:
- An accurate customer base
- Always reach the right (potential) customers
- Reconnect with dormant accounts
- Increase your reach, and thus your conversion
- Avoid costs for returns
- Avoid damage to your image
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
Metrics used to give an indication of data quality across our test groups. These include whether documentation was used and what proportion of respondents rounded their answers. Unit and item non-response are also reported.
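As an illustration, two of these metrics might be computed along the following lines. This is a minimal sketch; the rounding rule (multiples of 10) and the data layout are assumptions, not the dataset's exact definitions.

```python
# Sketch of two survey data-quality metrics under assumed definitions.
import pandas as pd

def proportion_rounded(answers: pd.Series) -> float:
    """Share of valid numeric answers that look rounded (assumed rule:
    multiples of 10)."""
    valid = answers.dropna()
    return (valid % 10 == 0).mean()

def item_nonresponse(answers: pd.Series) -> float:
    """Share of respondents who did not answer this item (NaN = missing)."""
    return answers.isna().mean()
```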
https://www.marketresearchintellect.com/privacy-policy
Check out Market Research Intellect's report on the Data Quality Management Service Market, valued at USD 4.5 billion in 2024 and projected to reach USD 10.2 billion by 2033, growing at a CAGR of 12.3% (2026-2033).
https://www.6wresearch.com/privacy-policy
North America Data Quality Tools Market is expected to grow during 2025-2031
GIS quality control checks are intended to identify issues in the source data that may impact a variety of 9-1-1 end-use systems. The primary goal of the initial CalOES NG9-1-1 implementation is to facilitate 9-1-1 call routing. The secondary goal is to use the data for telephone record validation through the LVF and the GIS-derived MSAG. With these goals in mind, the GIS QC checks, and the impact of errors found by them, are categorized as follows in this document:
- Provisioning Failure Errors: GIS data issues resulting in ingest failures (results in no provisioning of one or more layers)
- Tier 1 Critical errors: Impact on initial 9-1-1 call routing and discrepancy reporting
- Tier 2 Critical errors: Transition to GIS-derived MSAG
- Tier 3 Warning-level errors: Impact on routing of call transfers
- Tier 4 Other errors: Impact on PSAP mapping and CAD systems
GeoComm's GIS Data Hub is configurable to stop GIS data that exceeds certain quality control check error thresholds from provisioning to the SI (Spatial Interface) and ultimately to the ECRFs, LVFs and the GIS-derived MSAG.
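As a rough illustration of the threshold-gate behavior described above, here is a hedged Python sketch. The tier labels follow the document, but the threshold values, function name, and data shapes are hypothetical, not GeoComm's actual configuration.

```python
# Sketch: tally QC errors by tier and block provisioning when any
# configured per-tier threshold is exceeded. Thresholds are made up.
from collections import Counter

THRESHOLDS = {
    "provisioning_failure": 0,   # any ingest failure blocks provisioning
    "tier1_critical": 0,         # call-routing errors are never tolerated
    "tier2_critical": 10,        # hypothetical budget for MSAG-transition errors
    "tier3_warning": 100,        # hypothetical budget for warning-level errors
}

def may_provision(errors: list[str]) -> bool:
    """errors: one tier label per error found by the QC checks."""
    counts = Counter(errors)
    return all(counts.get(tier, 0) <= limit
               for tier, limit in THRESHOLDS.items())

# Example: one Tier 1 error is enough to stop provisioning.
print(may_provision(["tier3_warning", "tier1_critical"]))  # False
```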
Research Ship Roger Revelle Underway Meteorological Data (delayed ~10 days for quality control) are from the Shipboard Automated Meteorological and Oceanographic System (SAMOS) program. IMPORTANT: ALWAYS USE THE QUALITY FLAG DATA! Each data variable's metadata includes a qcindex attribute which indicates a character number in the flag data. ALWAYS check the flag data for each row of data to see which data is good (flag='Z') and which data isn't. For example, to extract just data where time (qcindex=1), latitude (qcindex=2), longitude (qcindex=3), and airTemperature (qcindex=12) are 'good' data, include this constraint in your ERDDAP query: flag=~"ZZZ........Z.*". '=~' indicates this is a regular expression constraint. The 'Z's are literal characters. In this dataset, 'Z' indicates 'good' data. The '.'s say to match any character. The '*' says to match the previous character 0 or more times. (Don't include backslashes in your query.) See the tutorial for regular expressions at https://www.vogella.com/tutorials/JavaRegularExpressions/article.html
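As a concrete illustration of assembling such a flag constraint programmatically, here is a small Python sketch. Only the flag-regex convention comes from the description above; the helper function, server URL, and dataset ID are hypothetical placeholders, not the real SAMOS endpoints.

```python
# Build an ERDDAP flag-regex constraint from the 1-based qcindex positions
# that must be 'good' ('Z'); all other positions may be any character.

def flag_regex(good_qcindexes, flag_length):
    """Return e.g. 'ZZZ........Z.*': 'Z' at each required position,
    '.' elsewhere, and '.*' for any trailing flag characters."""
    chars = ["."] * flag_length
    for i in good_qcindexes:
        chars[i - 1] = "Z"            # qcindex is 1-based
    return "".join(chars) + ".*"

# time=1, latitude=2, longitude=3, airTemperature=12 must all be 'good':
pattern = flag_regex([1, 2, 3, 12], 12)
assert pattern == "ZZZ........Z.*"

# A tabledap query using the constraint (placeholder server and dataset ID):
url = ("https://EXAMPLE-erddap-server/erddap/tabledap/EXAMPLE_ID.csv"
       "?time,latitude,longitude,airTemperature"
       f'&flag=~"{pattern}"')
```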
Test Data Management Market Size 2025-2029
The test data management market size is forecast to increase by USD 727.3 million, at a CAGR of 10.5% between 2024 and 2029.
The market is experiencing significant growth, driven by the increasing adoption of automation by enterprises to streamline their testing processes. The automation trend is fueled by growing consumer spending on technological solutions, as businesses seek to improve efficiency and reduce costs. However, the market faces challenges, including a lack of awareness and standardization in test data management practices. This obstacle hinders the effective implementation of test data management solutions, requiring companies to invest in education and training to ensure successful integration. To capitalize on market opportunities and navigate challenges effectively, businesses must stay informed about emerging trends and best practices in test data management. By doing so, they can optimize their testing processes, reduce risks, and enhance overall quality.
What will be the Size of the Test Data Management Market during the forecast period?
Explore in-depth regional segment analysis with market size data - historical 2019-2023 and forecasts 2025-2029 - in the full report.
The market continues to evolve, driven by the ever-increasing volume and complexity of data. Data exploration and analysis are at the forefront of this dynamic landscape, with data ethics and governance frameworks ensuring data transparency and integrity. Data masking, cleansing, and validation are crucial components of data management, enabling data warehousing, orchestration, and pipeline development. Data security and privacy remain paramount, with encryption, access control, and anonymization key strategies. Data governance, lineage, and cataloging facilitate data management software automation and reporting. Hybrid data management solutions, including artificial intelligence and machine learning, are transforming data insights and analytics.
Data regulations and compliance are shaping the market, driving the need for data accountability and stewardship. Data visualization, mining, and reporting provide valuable insights, while data quality management, archiving, and backup ensure data availability and recovery. Data modeling, data integrity, and data transformation are essential for data warehousing and data lake implementations. Data management platforms are seamlessly integrated into these evolving patterns, enabling organizations to effectively manage their data assets and gain valuable insights. Data management services, both cloud-based and on-premises, are essential for organizations to adapt to continuous market changes and leverage their data resources effectively.
How is this Test Data Management Industry segmented?
The test data management industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.
Application: On-premises, Cloud-based
Component: Solutions, Services
End-user: Information technology, Telecom, BFSI, Healthcare and life sciences, Others
Sector: Large enterprise, SMEs
Geography: North America (US, Canada), Europe (France, Germany, Italy, UK), APAC (Australia, China, India, Japan), Rest of World (ROW)
By Application Insights
The on-premises segment is estimated to witness significant growth during the forecast period. In the realm of data management, on-premises testing represents a popular approach for businesses seeking control over their infrastructure and testing process. This approach involves establishing testing facilities within an office or data center, necessitating a dedicated team with the necessary skills. The benefits of on-premises testing extend beyond control, as it enables organizations to upgrade and configure hardware and software at their discretion, providing opportunities for exploratory testing. Furthermore, data security is a significant concern for many businesses, and on-premises testing alleviates the risk of exposing sensitive information to third-party companies. Data exploration, a crucial aspect of data analysis, can be carried out more effectively with on-premises testing, ensuring data integrity and security. Data masking, cleansing, and validation are essential data preparation techniques that can be executed efficiently in an on-premises environment. Data warehousing, data pipelines, and data orchestration are integral components of data management, and on-premises testing allows for seamless integration and management of these elements. Data governance frameworks, lineage, catalogs, and metadata are essential for maintaining data transparency and compliance. Data security, encryption, and access control are paramount, and on-premises testing offers greater control over these aspects.
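To make one of the techniques named above concrete, here is a minimal, illustrative Python sketch of deterministic data masking for test data. It is a generic example, not any vendor's implementation; the helper name and hashing scheme are assumptions for illustration only.

```python
# Sketch: mask an email address with a stable pseudonym so test data stays
# realistic while the original identity is hidden.
import hashlib

def mask_email(email: str) -> str:
    """Replace the local part with a deterministic token; keep the domain."""
    local, _, domain = email.partition("@")
    token = hashlib.sha256(local.encode()).hexdigest()[:8]
    return f"user_{token}@{domain}"

print(mask_email("jane.doe@example.com"))  # e.g. user_5ab2f1c3@example.com
```

Because the token is derived from the input, the same source record always masks to the same pseudonym, which preserves join keys across test tables.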
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21
Test data for the WMT17 QE task. Training data can be downloaded from http://hdl.handle.net/11372/LRT-1974.
This shared task will build on its previous five editions to further examine automatic methods for estimating the quality of machine translation output at run-time, without relying on reference translations. We include word-level, phrase-level and sentence-level estimation. All tasks will make use of a large dataset produced from post-editions by professional translators. The data will be domain-specific (IT and Pharmaceutical domains) and substantially larger than in previous years. In addition to advancing the state of the art at all prediction levels, our goals include:
- To test the effectiveness of larger (domain-specific and professionally annotated) datasets. We will do so by increasing the size of one of last year's training sets.
- To study the effect of language direction and domain. We will do so by providing two datasets created in similar ways, but for different domains and language directions.
- To investigate the utility of detailed information logged during post-editing. We will do so by providing post-editing time, keystrokes, and actual edits.
This year's shared task provides new training and test datasets for all tasks, and allows participants to explore any additional data and resources deemed relevant. An in-house MT system was used to produce translations for all tasks. MT system-dependent information can be made available upon request. The data is publicly available but since it has been provided by our industry partners it is subject to specific terms and conditions. However, these have no practical implications on the use of this data for research purposes.
Research Ship Tangaroa Underway Meteorological Data (delayed ~10 days for quality control) are from the Shipboard Automated Meteorological and Oceanographic System (SAMOS) program. IMPORTANT: ALWAYS USE THE QUALITY FLAG DATA! Each data variable's metadata includes a qcindex attribute which indicates a character number in the flag data. ALWAYS check the flag data for each row of data to see which data is good (flag='Z') and which data isn't. For example, to extract just data where time (qcindex=1), latitude (qcindex=2), longitude (qcindex=3), and airTemperature (qcindex=12) are 'good' data, include this constraint in your ERDDAP query: flag=~"ZZZ........Z.*". '=~' indicates this is a regular expression constraint. The 'Z's are literal characters. In this dataset, 'Z' indicates 'good' data. The '.'s say to match any character. The '*' says to match the previous character 0 or more times. (Don't include backslashes in your query.) See the tutorial for regular expressions at https://www.vogella.com/tutorials/JavaRegularExpressions/article.html
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Agreement between observed and reported DQQ responses. Pre-data-quality-check figures present the agreement of DQQ responses for enumerator (n = 154) and mobile-phone (n = 134) respondents compared to observed responses. Post-data-quality-check figures present the agreement of DQQ responses for enumerator (n = 150) and mobile-phone (n = 127) respondents following removal of respondents who exceeded the data quality threshold. Agreement rates (reported versus observed) are average rates for all respondents, across the 29 DQQ questions.
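For clarity, the agreement-rate calculation described above could be computed along these lines. This is a minimal sketch; the data layout (one row per respondent, one column per DQQ item) is an assumption.

```python
# Sketch: average reported-vs-observed agreement over respondents and items.
import pandas as pd

def agreement_rate(reported: pd.DataFrame, observed: pd.DataFrame) -> float:
    """Both frames: respondents x 29 DQQ items, identically indexed.
    Returns the mean share of matching answers."""
    matches = (reported == observed)      # element-wise agreement
    return matches.to_numpy().mean()      # average over respondents x items
```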
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21
Test data for the WMT18 QE task. Training data can be downloaded from http://hdl.handle.net/11372/LRT-2619.
This shared task will build on its previous six editions to further examine automatic methods for estimating the quality of machine translation output at run-time, without relying on reference translations. We include word-level, phrase-level and sentence-level estimation. All tasks make use of datasets produced from post-editions by professional translators. The datasets are domain-specific (IT and life sciences/pharma domains) and extend those used in previous years, with more instances and more languages. One important addition is that this year we also include datasets with neural MT outputs. In addition to advancing the state of the art at all prediction levels, our specific goals are:
- To study the performance of quality estimation approaches on the output of neural MT systems. We will do so by providing datasets for two language pairs where the same source segments are translated by both a statistical phrase-based and a neural MT system.
- To study the predictability of deleted words, i.e. words that are missing in the MT output. To do so, for the first time we provide data annotated for such errors at training time.
- To study the effectiveness of explicitly assigned labels for phrases. We will do so by providing a dataset where each phrase in the output of a phrase-based statistical MT system was annotated by human translators.
- To study the effect of different language pairs. We will do so by providing datasets created in similar ways for four language pairs.
- To investigate the utility of detailed information logged during post-editing. We will do so by providing post-editing time, keystrokes, and actual edits.
- To measure progress over the years at all prediction levels. We will do so by using last year's test set for comparative experiments.
In-house statistical and neural MT systems were built to produce translations for all tasks. MT system-dependent information can be made available upon request. The data is publicly available but since it has been provided by our industry partners it is subject to specific terms and conditions. However, these have no practical implications on the use of this data for research purposes. Participants are allowed to explore any additional data and resources deemed relevant.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Part of the dataset supplied in https://www.kaggle.com/datasets/uciml/red-wine-quality-cortez-et-al-2009 (original source: https://archive.ics.uci.edu/ml/datasets/wine+quality).
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Online surveys often include quantitative attention checks, but inattentive participants might also be identified using their qualitative responses. We used the software Turnitin™ to assess the originality of open-ended responses in four mixed-method online surveys that included validated multi-item rating scales. Across surveys, 18-35% of participants were identified as having copied responses from online sources. We assessed indicator reliability and internal consistency reliability and found that both were lower for participants identified as using copied text versus those who wrote more original responses. Those who provided more original responses also provided more consistent responses to the validated scales, suggesting that these participants were more attentive. We conclude that this process can be used to screen qualitative responses from online surveys. We encourage future research to replicate this screening process using similar tools, investigate strategies to reduce copying behaviour, and explore the motivation of participants to search for information online.
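As a sketch of the kind of internal-consistency comparison reported above, standard Cronbach's alpha could be computed per group as follows. The group split and variable names are hypothetical; the alpha formula itself is the standard one.

```python
# Sketch: compare scale reliability between 'copied' and 'original' groups.
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: respondents x scale items. Standard Cronbach's alpha:
    alpha = k/(k-1) * (1 - sum(item variances) / variance of total score)."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars / total_var)

# Hypothetical usage, with `copied_mask` flagging participants whose
# open-ended responses matched online sources:
# alpha_copied = cronbach_alpha(scale_scores[copied_mask])
# alpha_original = cronbach_alpha(scale_scores[~copied_mask])
```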
https://www.ontario.ca/page/open-government-licence-ontario
Ontario has a comprehensive set of measures and regulations to help ensure the safety of drinking water.
The following dataset contains information about the drinking water systems, laboratories and facilities the Ministry of the Environment, Conservation and Parks is responsible for monitoring to ensure compliance with Ontario's drinking water laws.
The dataset includes information about:
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
Ontologies play an important role in the representation, standardization, and integration of biomedical data, but are known to have data quality (DQ) issues. We aimed to understand if the Harmonized Data Quality Framework (HDQF), developed to standardize electronic health record DQ assessment strategies, could be used to improve ontology quality assessment. A novel set of 14 ontology checks was developed. These DQ checks were aligned to the HDQF and examined by HDQF developers. The ontology checks were evaluated using 11 Open Biomedical Ontology Foundry ontologies. 85.7% of the ontology checks were successfully aligned to at least 1 HDQF category. Accommodating the unmapped DQ checks (n = 2) required modifying an original HDQF category and adding a new Data Dependency category. The HDQF is a valuable resource within the clinical domain and this work demonstrates its ability to categorize ontology quality assessment strategies.
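For illustration only, here is a minimal Python sketch of one plausible ontology DQ check of the general kind described; it is not one of the paper's 14 checks, and the data layout is an assumption.

```python
# Sketch: flag ontology classes that share an identical label, a common
# consistency problem that ontology DQ checks look for.
from collections import defaultdict

def duplicate_label_check(labels: dict[str, str]) -> dict[str, list[str]]:
    """labels: class IRI -> rdfs:label. Returns labels used by several IRIs."""
    by_label = defaultdict(list)
    for iri, label in labels.items():
        by_label[label.strip().lower()].append(iri)
    return {label: iris for label, iris in by_label.items() if len(iris) > 1}
```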
This repository contains the following:
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
Quality characteristics for 21586 river flow time series from 13 datasets worldwide. The 13 datasets are: the Global Runoff Database from the Global Runoff Data Center (GRDC), the Global River Discharge Data (RIVDIS; Vörösmarty et al., 1998), Surface-Water Data from the United States Geological Survey (USGS), HYDAT from the Water Survey of Canada (WSC), WISKI from the Swedish Meteorological and Hydrological Institute (SMHI), Hidroweb from the Brazilian National Water Agency (ANA), National data from the Australian Bureau of Meteorology (BOM), Spanish river flow data from the Ecological Transition Ministry (Spain), R-ArcticNet v. 4.0 from the Pan-Arctic Project Consortium (R-ArcticNet), Russian River data (NCAR-UCAR; Bodo, 2000), Chinese river flow data from the China Hydrology Data Project (CHDP; Henck et al., 2010, 2011), the European Water Archive from GRDC - EURO-FRIEND-Water (EWA), and the GEWEX Asian Monsoon Experiment (GAME) – Tropics dataset provided by the Royal Irrigation Department of Thailand. Quality characteristics are based on availability, outliers, homogeneity and trends: overall availability (%), longest availability (%), continuity (%), monthly availability (%), outliers ratio (%), homogeneity of annual flows (number of statistical tests agreeing), trend in annual flows, trend in one month of the year.
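To make a few of these quality characteristics concrete, here is a hedged Python sketch under assumed definitions; the dataset's exact formulas (for example its outlier rule, assumed here to be a median-absolute-deviation test) are not reproduced here.

```python
# Sketch of three of the listed quality characteristics for one flow series.
import pandas as pd

def quality_characteristics(flow: pd.Series) -> dict:
    """flow: daily river flow indexed by date, NaN = missing value."""
    n = len(flow)
    available = flow.notna()
    # Overall availability (%): share of non-missing values.
    overall = 100 * available.sum() / n
    # Longest availability (%): longest gap-free run over record length.
    runs = (available != available.shift()).cumsum()
    longest = 100 * available.groupby(runs).sum().max() / n
    # Outliers ratio (%): values beyond 3 MADs from the median (assumed rule).
    med = flow.median()
    mad = (flow - med).abs().median()
    outliers = 100 * ((flow - med).abs() > 3 * mad).sum() / available.sum()
    return {"overall_availability_pct": overall,
            "longest_availability_pct": longest,
            "outliers_ratio_pct": outliers}
```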
Bodo, B. (2000) Russian River Flow Data by Bodo. Boulder CO: Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory. Retrieved from http://rda.ucar.edu/datasets/ds553.1/
Henck, A. C., Huntington, K. W., Stone, J. O., Montgomery, D. R. & Hallet, B. (2011) Spatial controls on erosion in the Three Rivers Region, southeastern Tibet and southwestern China. Earth and Planetary Science Letters 303(1–2), 71–83. doi:10.1016/j.epsl.2010.12.038
Henck, A. C., Montgomery, David R., Huntington, K. W. & Liang, C. (2010) Monsoon control of effective discharge, Yunnan and Tibet. Geology 38(11), 975–978. doi:10.1130/G31444.1
Vörösmarty, C. J., Fekete, B. M. & Tucker, B. A. (1998) Global River Discharge, 1807-1991, V[ersion]. 1.1 (RivDIS). doi:10.3334/ornldaac/199
https://www.marketresearchforecast.com/privacy-policy
The market for SAP Selective Test Data Management Tools is experiencing robust growth, driven by increasing regulatory compliance needs, the expanding adoption of agile and DevOps methodologies, and the rising demand for faster and more efficient software testing processes. The market size in 2025 is estimated at $1.5 billion, projecting a Compound Annual Growth Rate (CAGR) of 12% from 2025 to 2033. This growth is fueled by the increasing complexity of SAP systems and the associated challenges in managing test data effectively.

Large enterprises are the primary adopters of these tools, representing a significant portion of the market share, followed by medium-sized and small enterprises. The cloud-based deployment model is gaining traction due to its scalability, cost-effectiveness, and ease of access, surpassing on-premises solutions in growth rate. Key players like SAP, Informatica, and Qlik are actively shaping the market through continuous product innovation and strategic partnerships. However, challenges remain, including the high initial investment costs associated with implementing these tools, the need for specialized expertise, and data security concerns.

The geographic distribution reveals North America as a dominant region, followed by Europe and Asia Pacific. Growth in the Asia Pacific region is anticipated to be particularly strong, driven by increasing digitalization and the expanding adoption of SAP solutions across various industries. The competitive landscape is marked by both established vendors and emerging players, leading to increased innovation and a wider array of solutions to meet diverse customer needs. The market is expected to continue its trajectory of growth, driven by factors such as the increasing adoption of cloud-based solutions, the growing demand for data masking and anonymization techniques, and the rising emphasis on test data quality and compliance. Companies are actively seeking solutions that streamline their testing processes, reduce costs, and minimize risks associated with inadequate test data management.
Research Ship Oceanus Underway Meteorological Data (delayed ~10 days for quality control) are from the Shipboard Automated Meteorological and Oceanographic System (SAMOS) program. IMPORTANT: ALWAYS USE THE QUALITY FLAG DATA! Each data variable's metadata includes a qcindex attribute which indicates a character number in the flag data. ALWAYS check the flag data for each row of data to see which data is good (flag='Z') and which data isn't. For example, to extract just data where time (qcindex=1), latitude (qcindex=2), longitude (qcindex=3), and airTemperature (qcindex=12) are 'good' data, include this constraint in your ERDDAP query: flag=~"ZZZ........Z.*". '=~' indicates this is a regular expression constraint. The 'Z's are literal characters. In this dataset, 'Z' indicates 'good' data. The '.'s say to match any character. The '*' says to match the previous character 0 or more times. (Don't include backslashes in your query.) See the tutorial for regular expressions at https://www.vogella.com/tutorials/JavaRegularExpressions/article.html
NOAA Ship Fairweather Underway Meteorological Data (delayed ~10 days for quality control) are from the Shipboard Automated Meteorological and Oceanographic System (SAMOS) program. IMPORTANT: ALWAYS USE THE QUALITY FLAG DATA! Each data variable's metadata includes a qcindex attribute which indicates a character number in the flag data. ALWAYS check the flag data for each row of data to see which data is good (flag='Z') and which data isn't. For example, to extract just data where time (qcindex=1), latitude (qcindex=2), longitude (qcindex=3), and airTemperature (qcindex=12) are 'good' data, include this constraint in your ERDDAP query: flag=~"ZZZ........Z.*". "=~" indicates this is a regular expression constraint. The 'Z's are literal characters. In this dataset, 'Z' indicates 'good' data. The '.'s say to match any character. The '*' says to match the previous character 0 or more times. See the tutorial for regular expressions at https://www.vogella.com/tutorials/JavaRegularExpressions/article.html
NOAA Ship Oregon II Underway Meteorological Data (delayed ~10 days for quality control) are from the Shipboard Automated Meteorological and Oceanographic System (SAMOS) program. IMPORTANT: ALWAYS USE THE QUALITY FLAG DATA! Each data variable's metadata includes a qcindex attribute which indicates a character number in the flag data. ALWAYS check the flag data for each row of data to see which data is good (flag='Z') and which data isn't. For example, to extract just data where time (qcindex=1), latitude (qcindex=2), longitude (qcindex=3), and airTemperature (qcindex=12) are 'good' data, include this constraint in your ERDDAP query: flag=~"ZZZ........Z.*". "=~" indicates this is a regular expression constraint. The 'Z's are literal characters. In this dataset, 'Z' indicates 'good' data. The '.'s say to match any character. The '*' says to match the previous character 0 or more times. See the tutorial for regular expressions at https://www.vogella.com/tutorials/JavaRegularExpressions/article.html