98 datasets found
  1. National Energy Efficiency Data-Framework (NEED): anonymised data 2025

    • gov.uk
    Updated Jun 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Energy Security and Net Zero (2025). National Energy Efficiency Data-Framework (NEED): anonymised data 2025 [Dataset]. https://www.gov.uk/government/statistics/national-energy-efficiency-data-framework-need-anonymised-data-2025
    Explore at:
    Dataset updated
    Jun 26, 2025
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Energy Security and Net Zero
    Description

    These documents present updated record level data created from the National Energy Efficiency Data-Framework (NEED):

    • the 50,000 row dataset is small enough to use in spreadsheet software
    • the 4 million row dataset is a larger sample of the same NEED dataset
    • the metadata spreadsheet describes the fields included in both tables

    Please forward any feedback to energyefficiency.stats@energysecurity.gov.uk.

  2. National Energy Efficiency Data-Framework (NEED): anonymised data 2021

    • gov.uk
    Updated Apr 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Energy Security and Net Zero (2022). National Energy Efficiency Data-Framework (NEED): anonymised data 2021 [Dataset]. https://www.gov.uk/government/statistics/national-energy-efficiency-data-framework-need-anonymised-data-2021
    Explore at:
    Dataset updated
    Apr 28, 2022
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Energy Security and Net Zero
    Description

    These documents present updated record level data created from the National Energy Efficiency Data-Framework (NEED):

    • the 50,000 row dataset is small enough to use in spreadsheet software
    • the 4 million row dataset is a larger sample of the same NEED dataset
    • the metadata spreadsheet describes the fields included in both tables

    Please forward any feedback to energyefficiency.stats@beis.gov.uk.

  3. G

    Healthcare Data Anonymization Services Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Healthcare Data Anonymization Services Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/healthcare-data-anonymization-services-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Aug 4, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Healthcare Data Anonymization Services Market Outlook



    According to our latest research, the global healthcare data anonymization services market size reached USD 1.42 billion in 2024, reflecting a robust expansion driven by increasing regulatory demands and heightened focus on patient privacy. The market is projected to grow at a CAGR of 15.8% from 2025 to 2033, with the total market value expected to reach USD 5.44 billion by 2033. This impressive growth trajectory is underpinned by the rising adoption of digital health solutions, stringent data protection laws, and the ongoing digitalization of healthcare records worldwide.




    The primary growth factor fueling the healthcare data anonymization services market is the proliferation of electronic health records (EHRs) and the expanding use of big data analytics in healthcare. As healthcare providers and organizations increasingly leverage advanced analytics for improving patient outcomes, there is a corresponding surge in data generation. However, these vast datasets often contain sensitive patient information, making data anonymization essential to ensure compliance with regulations such as HIPAA, GDPR, and other regional privacy laws. The increasing frequency of data breaches and cyberattacks has further highlighted the importance of robust anonymization services, prompting healthcare organizations to prioritize investments in data privacy and security solutions. As a result, demand for both software and service-based anonymization solutions continues to rise, contributing significantly to market growth.




    Another key driver for the healthcare data anonymization services market is the growing emphasis on research and clinical trials, which require the sharing and analysis of large volumes of patient data. Pharmaceutical and biotechnology companies, as well as research organizations, are increasingly collaborating across borders, necessitating the anonymization of datasets to protect patient identities and comply with international data protection standards. The adoption of cloud-based healthcare solutions has also facilitated the secure and efficient sharing of anonymized data, supporting advancements in personalized medicine and population health management. As organizations seek to balance innovation with compliance, the demand for advanced anonymization technologies that offer high accuracy and scalability is expected to accelerate further.




    Technological advancements in artificial intelligence (AI) and machine learning (ML) are also shaping the future of the healthcare data anonymization services market. These technologies are enabling more sophisticated and automated anonymization processes, reducing the risk of re-identification while maintaining data utility for research and analytics. The integration of AI-driven tools into anonymization workflows is helping organizations streamline operations, minimize human error, and achieve greater compliance with evolving regulatory requirements. Additionally, the increasing availability of customizable and interoperable anonymization solutions is making it easier for healthcare organizations of all sizes to adopt and scale these services, thereby broadening the market’s reach and impact.




    From a regional perspective, North America continues to dominate the healthcare data anonymization services market, accounting for the largest share in 2024. This leadership position is attributed to the presence of advanced healthcare infrastructure, widespread adoption of EHRs, and strict regulatory frameworks governing patient data privacy. Europe follows closely, driven by the enforcement of the General Data Protection Regulation (GDPR) and a strong culture of data protection. The Asia Pacific region is witnessing the fastest growth, propelled by increasing healthcare digitalization, government initiatives to modernize healthcare systems, and rising awareness of data privacy among patients and providers. Latin America and the Middle East & Africa are also experiencing steady growth, albeit from a smaller base, as healthcare organizations in these regions begin to prioritize data security and compliance.



    &

  4. 2011 Census Microdata Sample of Anonymised Records Teaching File - Dataset -...

    • ckan.publishing.service.gov.uk
    Updated Feb 19, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2016). 2011 Census Microdata Sample of Anonymised Records Teaching File - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/2011-census-microdata-sample-of-anonymised-records-teaching-file
    Explore at:
    Dataset updated
    Feb 19, 2016
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    The main population base for published statistical tables from the 2011 Census in Northern Ireland is the usual resident population base as at Census day, 27 March 2011. By way of background, for 2011 Census purposes a usual resident of the United Kingdom (UK) is anyone who, on Census day, was in the UK and had stayed or intended to stay in the UK for a period of 12 months or more, or had a permanent UK address and was outside the UK and had intended to be outside the UK for less than 12 months. Against this background, the 2011 Census Microdata Sample of Anonymised Records (SARs) Teaching File comprises a sample of 19,862 records (approximately 1 per cent) relating to people who were usually resident in Northern Ireland at the time of the 2011 Census. For each individual, information is available for seventeen separate characteristics (for example, sex, age, marital status) to varying degrees of detail. Both the size of the sample and the content of the records in the file have been harmonised, wherever possible, with the equivalent SARs teaching file that the Office for National Statistics simultaneously released for England and Wales. Purpose The primary purpose of the teaching file, which comprises unit-record level data as opposed to statistical aggregates, is as an educational tool aimed at: encouraging wider use of Census data by facilitating another way of examining Census data, for example through the building of statistical models, over and above that already available through the raft of standard tabular output released to date; providing a broad insight into the sort of detail that is generally included in a SARs product, along with data formats and any associated metadata. This will enable users (arguably those less experienced at using SARs products) to ‘play’ with the data and increase their knowledge and skills in readiness for accessing the more detailed SARs products that are planned and will be available in, for example, a safe setting; and assisting with the teaching of statistics and geography at GCSE and higher levels.

  5. Fundraising Data

    • kaggle.com
    zip
    Updated Aug 17, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Pawlus (2018). Fundraising Data [Dataset]. https://www.kaggle.com/michaelpawlus/fundraising-data
    Explore at:
    zip(1087024 bytes)Available download formats
    Dataset updated
    Aug 17, 2018
    Authors
    Michael Pawlus
    Description

    Context

    This data set is a collection of anonymized sample fundraising data sets so that practitioners within our field can practice and share examples using a common data source

    Open Call for More Content

    If you have any anonymous data that you would like to include here let me know: Michael Pawlus (pawlus@usc.edu)

    Acknowledgements

    Thanks to everyone who has shared data so far to make this possible.

  6. Coding example for the anonymised response file.pdf

    • figshare.com
    pdf
    Updated Mar 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vashni Sewkarran; Emelda Zandile Gumede (2023). Coding example for the anonymised response file.pdf [Dataset]. http://doi.org/10.6084/m9.figshare.22256992.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Mar 11, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Vashni Sewkarran; Emelda Zandile Gumede
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    File is a coding example of the anonymised data

  7. Z

    CKW Smart Meter Data

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Sep 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Barahona Garzon, Braulio (2024). CKW Smart Meter Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13304498
    Explore at:
    Dataset updated
    Sep 22, 2024
    Dataset provided by
    Lucerne University of Applied Sciences and Arts
    Authors
    Barahona Garzon, Braulio
    Description

    Overview

    The CKW Group is a distribution system operator that supplies more than 200,000 end customers in Central Switzerland. Since October 2022, CKW publishes anonymised and aggregated data from smart meters that measure electricity consumption in canton Lucerne. This unique dataset is accessible in the ckw.ch/opendata platform.

    Data set A - anonimised smart meter data

    Data set B - aggregated smart meter data

    Contents of this data set

    This data set contains a small sample of the CKW data set A sorted per smart meter ID, stored as parquet files named with the id field of the corresponding smart meter anonymised data. Example: 027ceb7b8fd77a4b11b3b497e9f0b174.parquet

    The orginal CKW data is available for download at https://open.data.axpo.com/%24web/index.html#dataset-a as a (gzip-compressed) csv files, which are are split into one file per calendar month. The columns in the files csv are:

    id: the anonymized counter ID (text)

    timestamp: the UTC time at the beginning of a 15-minute time window to which the consumption refers (ISO-8601 timestamp)

    value_kwh: the consumption in kWh in the time window under consideration (float)

    In this archive, data from:

    | Dateigrösse | Export Datum | Zeitraum | Dateiname || ----------- | ------------ | -------- | --------- || 4.2GiB | 2024-04-20 | 202402 | ckw_opendata_smartmeter_dataset_a_202402.csv.gz || 4.5GiB | 2024-03-21 | 202401 | ckw_opendata_smartmeter_dataset_a_202401.csv.gz || 4.5GiB | 2024-02-20 | 202312 | ckw_opendata_smartmeter_dataset_a_202312.csv.gz || 4.4GiB | 2024-01-20 | 202311 | ckw_opendata_smartmeter_dataset_a_202311.csv.gz || 4.5GiB | 2023-12-20 | 202310 | ckw_opendata_smartmeter_dataset_a_202310.csv.gz || 4.4GiB | 2023-11-20 | 202309 | ckw_opendata_smartmeter_dataset_a_202309.csv.gz || 4.5GiB | 2023-10-20 | 202308 | ckw_opendata_smartmeter_dataset_a_202308.csv.gz || 4.6GiB | 2023-09-20 | 202307 | ckw_opendata_smartmeter_dataset_a_202307.csv.gz || 4.4GiB | 2023-08-20 | 202306 | ckw_opendata_smartmeter_dataset_a_202306.csv.gz || 4.6GiB | 2023-07-20 | 202305 | ckw_opendata_smartmeter_dataset_a_202305.csv.gz || 3.3GiB | 2023-06-20 | 202304 | ckw_opendata_smartmeter_dataset_a_202304.csv.gz || 4.6GiB | 2023-05-24 | 202303 | ckw_opendata_smartmeter_dataset_a_202303.csv.gz || 4.2GiB | 2023-04-20 | 202302 | ckw_opendata_smartmeter_dataset_a_202302.csv.gz || 4.7GiB | 2023-03-20 | 202301 | ckw_opendata_smartmeter_dataset_a_202301.csv.gz || 4.6GiB | 2023-03-15 | 202212 | ckw_opendata_smartmeter_dataset_a_202212.csv.gz || 4.3GiB | 2023-03-15 | 202211 | ckw_opendata_smartmeter_dataset_a_202211.csv.gz || 4.4GiB | 2023-03-15 | 202210 | ckw_opendata_smartmeter_dataset_a_202210.csv.gz || 4.3GiB | 2023-03-15 | 202209 | ckw_opendata_smartmeter_dataset_a_202209.csv.gz || 4.4GiB | 2023-03-15 | 202208 | ckw_opendata_smartmeter_dataset_a_202208.csv.gz || 4.4GiB | 2023-03-15 | 202207 | ckw_opendata_smartmeter_dataset_a_202207.csv.gz || 4.2GiB | 2023-03-15 | 202206 | ckw_opendata_smartmeter_dataset_a_202206.csv.gz || 4.3GiB | 2023-03-15 | 202205 | ckw_opendata_smartmeter_dataset_a_202205.csv.gz || 4.2GiB | 2023-03-15 | 202204 | ckw_opendata_smartmeter_dataset_a_202204.csv.gz || 4.1GiB | 2023-03-15 | 202203 | ckw_opendata_smartmeter_dataset_a_202203.csv.gz || 3.5GiB | 2023-03-15 | 202202 | ckw_opendata_smartmeter_dataset_a_202202.csv.gz || 3.7GiB | 2023-03-15 | 202201 | ckw_opendata_smartmeter_dataset_a_202201.csv.gz || 3.5GiB | 2023-03-15 | 202112 | ckw_opendata_smartmeter_dataset_a_202112.csv.gz || 3.1GiB | 2023-03-15 | 202111 | ckw_opendata_smartmeter_dataset_a_202111.csv.gz || 3.0GiB | 2023-03-15 | 202110 | ckw_opendata_smartmeter_dataset_a_202110.csv.gz || 2.7GiB | 2023-03-15 | 202109 | ckw_opendata_smartmeter_dataset_a_202109.csv.gz || 2.6GiB | 2023-03-15 | 202108 | ckw_opendata_smartmeter_dataset_a_202108.csv.gz || 2.4GiB | 2023-03-15 | 202107 | ckw_opendata_smartmeter_dataset_a_202107.csv.gz || 2.1GiB | 2023-03-15 | 202106 | ckw_opendata_smartmeter_dataset_a_202106.csv.gz || 2.0GiB | 2023-03-15 | 202105 | ckw_opendata_smartmeter_dataset_a_202105.csv.gz || 1.7GiB | 2023-03-15 | 202104 | ckw_opendata_smartmeter_dataset_a_202104.csv.gz || 1.6GiB | 2023-03-15 | 202103 | ckw_opendata_smartmeter_dataset_a_202103.csv.gz || 1.3GiB | 2023-03-15 | 202102 | ckw_opendata_smartmeter_dataset_a_202102.csv.gz || 1.3GiB | 2023-03-15 | 202101 | ckw_opendata_smartmeter_dataset_a_202101.csv.gz |

    was processed into partitioned parquet files, and then organised by id into parquet files with data from single smart meters.

    A small sample of all the smart meters data above, are archived in the cloud public cloud space of AISOP project https://os.zhdk.cloud.switch.ch/swift/v1/aisop_public/ckw/ts/batch_0424/batch_0424.zip and also here is this public record. For access to the complete data contact the authors of this archive.

    It consists of the following parquet files:

    | Size | Date | Name |

    |------|------|------|

    | 1.0M | Mar 4 12:18 | 027ceb7b8fd77a4b11b3b497e9f0b174.parquet |

    | 979K | Mar 4 12:18 | 03a4af696ff6a5c049736e9614f18b1b.parquet |

    | 1.0M | Mar 4 12:18 | 03654abddf9a1b26f5fbbeea362a96ed.parquet |

    | 1.0M | Mar 4 12:18 | 03acebcc4e7d39b6df5c72e01a3c35a6.parquet |

    | 1.0M | Mar 4 12:18 | 039e60e1d03c2afd071085bdbd84bb69.parquet |

    | 931K | Mar 4 12:18 | 036877a1563f01e6e830298c193071a6.parquet |

    | 1.0M | Mar 4 12:18 | 02e45872f30f5a6a33972e8c3ba9c2e5.parquet |

    | 662K | Mar 4 12:18 | 03a25f298431549a6bc0b1a58eca1f34.parquet |

    | 635K | Mar 4 12:18 | 029a46275625a3cefc1f56b985067d15.parquet |

    | 1.0M | Mar 4 12:18 | 0301309d6d1e06c60b4899061deb7abd.parquet |

    | 1.0M | Mar 4 12:18 | 0291e323d7b1eb76bf680f6e800c2594.parquet |

    | 1.0M | Mar 4 12:18 | 0298e58930c24010bbe2777c01b7644a.parquet |

    | 1.0M | Mar 4 12:18 | 0362c5f3685febf367ebea62fbc88590.parquet |

    | 1.0M | Mar 4 12:18 | 0390835d05372cb66f6cd4ca662399e8.parquet |

    | 1.0M | Mar 4 12:18 | 02f670f059e1f834dfb8ba809c13a210.parquet |

    | 987K | Mar 4 12:18 | 02af749aaf8feb59df7e78d5e5d550e0.parquet |

    | 996K | Mar 4 12:18 | 0311d3c1d08ee0af3edda4dc260421d1.parquet |

    | 1.0M | Mar 4 12:18 | 030a707019326e90b0ee3f35bde666e0.parquet |

    | 955K | Mar 4 12:18 | 033441231b277b283191e0e1194d81e2.parquet |

    | 995K | Mar 4 12:18 | 0317b0417d1ec91b5c243be854da8a86.parquet |

    | 1.0M | Mar 4 12:18 | 02ef4e49b6fb50f62a043fb79118d980.parquet |

    | 1.0M | Mar 4 12:18 | 0340ad82e9946be45b5401fc6a215bf3.parquet |

    | 974K | Mar 4 12:18 | 03764b3b9a65886c3aacdbc85d952b19.parquet |

    | 1.0M | Mar 4 12:18 | 039723cb9e421c5cbe5cff66d06cb4b6.parquet |

    | 1.0M | Mar 4 12:18 | 0282f16ed6ef0035dc2313b853ff3f68.parquet |

    | 1.0M | Mar 4 12:18 | 032495d70369c6e64ab0c4086583bee2.parquet |

    | 900K | Mar 4 12:18 | 02c56641571fc9bc37448ce707c80d3d.parquet |

    | 1.0M | Mar 4 12:18 | 027b7b950689c337d311094755697a8f.parquet |

    | 1.0M | Mar 4 12:18 | 02af272adccf45b6cdd4a7050c979f9f.parquet |

    | 927K | Mar 4 12:18 | 02fc9a3b2b0871d3b6a1e4f8fe415186.parquet |

    | 1.0M | Mar 4 12:18 | 03872674e2a78371ce4dfa5921561a8c.parquet |

    | 881K | Mar 4 12:18 | 0344a09d90dbfa77481c5140bb376992.parquet |

    | 1.0M | Mar 4 12:18 | 0351503e2b529f53bdae15c7fbd56fc0.parquet |

    | 1.0M | Mar 4 12:18 | 033fe9c3a9ca39001af68366da98257c.parquet |

    | 1.0M | Mar 4 12:18 | 02e70a1c64bd2da7eb0d62be870ae0d6.parquet |

    | 1.0M | Mar 4 12:18 | 0296385692c9de5d2320326eaa000453.parquet |

    | 962K | Mar 4 12:18 | 035254738f1cc8a31075d9fbe3ec2132.parquet |

    | 991K | Mar 4 12:18 | 02e78f0d6a8fb96050053e188bf0f07c.parquet |

    | 1.0M | Mar 4 12:18 | 039e4f37ed301110f506f551482d0337.parquet |

    | 961K | Mar 4 12:18 | 039e2581430703b39c359dc62924a4eb.parquet |

    | 999K | Mar 4 12:18 | 02c6f7e4b559a25d05b595cbb5626270.parquet |

    | 1.0M | Mar 4 12:18 | 02dd91468360700a5b9514b109afb504.parquet |

    | 938K | Mar 4 12:18 | 02e99c6bb9d3ca833adec796a232bac0.parquet |

    | 589K | Mar 4 12:18 | 03aef63e26a0bdbce4a45d7cf6f0c6f8.parquet |

    | 1.0M | Mar 4 12:18 | 02d1ca48a66a57b8625754d6a31f53c7.parquet |

    | 1.0M | Mar 4 12:18 | 03af9ebf0457e1d451b83fa123f20a12.parquet |

    | 1.0M | Mar 4 12:18 | 0289efb0e712486f00f52078d6c64a5b.parquet |

    | 1.0M | Mar 4 12:18 | 03466ed913455c281ffeeaa80abdfff6.parquet |

    | 1.0M | Mar 4 12:18 | 032d6f4b34da58dba02afdf5dab3e016.parquet |

    | 1.0M | Mar 4 12:18 | 03406854f35a4181f4b0778bb5fc010c.parquet |

    | 1.0M | Mar 4 12:18 | 0345fc286238bcea5b2b9849738c53a2.parquet |

    | 1.0M | Mar 4 12:18 | 029ff5169155b57140821a920ad67c7e.parquet |

    | 985K | Mar 4 12:18 | 02e4c9f3518f079ec4e5133acccb2635.parquet |

    | 1.0M | Mar 4 12:18 | 03917c4f2aef487dc20238777ac5fdae.parquet |

    | 969K | Mar 4 12:18 | 03aae0ab38cebcb160e389b2138f50da.parquet |

    | 914K | Mar 4 12:18 | 02bf87b07b64fb5be54f9385880b9dc1.parquet |

    | 1.0M | Mar 4 12:18 | 02776685a085c4b785a3885ef81d427a.parquet |

    | 947K | Mar 4 12:18 | 02f5a82af5a5ffac2fe7551bf4a0a1aa.parquet |

    | 992K | Mar 4 12:18 | 039670174dbc12e1ae217764c96bbeb3.parquet |

    | 1.0M | Mar 4 12:18 | 037700bf3e272245329d9385bb458bac.parquet |

    | 602K | Mar 4 12:18 | 0388916cdb86b12507548b1366554e16.parquet |

    | 939K | Mar 4 12:18 | 02ccbadea8d2d897e0d4af9fb3ed9a8e.parquet |

    | 1.0M | Mar 4 12:18 | 02dc3f4fb7aec02ba689ad437d8bc459.parquet |

    | 1.0M | Mar 4 12:18 | 02cf12e01cd20d38f51b4223e53d3355.parquet |

    | 993K | Mar 4 12:18 | 0371f79d154c00f9e3e39c27bab2b426.parquet |

    where each file contains data from a single smart meter.

    Acknowledgement

    The AISOP project (https://aisopproject.com/) received funding in the framework of the Joint Programming Platform Smart Energy Systems from European Union's Horizon 2020 research and innovation programme under grant agreement No 883973. ERA-Net Smart Energy Systems joint call on digital transformation for green energy transition.

  8. s

    Data from: Fostering cultures of open qualitative research: Dataset 1 –...

    • orda.shef.ac.uk
    docx
    Updated Oct 8, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Hanchard; Itzel San Roman Pineda (2025). Fostering cultures of open qualitative research: Dataset 1 – Survey Responses [Dataset]. http://doi.org/10.15131/shef.data.23567250.v1
    Explore at:
    docxAvailable download formats
    Dataset updated
    Oct 8, 2025
    Dataset provided by
    The University of Sheffield
    Authors
    Matthew Hanchard; Itzel San Roman Pineda
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 23-Jun-2023 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman Institute.

    The dataset forms part of three outputs from a project titled ‘Fostering cultures of open qualitative research’ which ran from January 2023 to June 2023:

    · Fostering cultures of open qualitative research: Dataset 1 – Survey Responses · Fostering cultures of open qualitative research: Dataset 2 – Interview Transcripts · Fostering cultures of open qualitative research: Dataset 3 – Coding Book

    The project was funded with £13,913.85 Research England monies held internally by the University of Sheffield - as part of their ‘Enhancing Research Cultures’ scheme 2022-2023.

    The dataset aligns with ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee (ref: 051118) on 23-Jan-2021.This includes due concern for participant anonymity and data management.

    ORDA has full permission to store this dataset and to make it open access for public re-use on the basis that no commercial gain will be made form reuse. It has been deposited under a CC-BY-NC license.

    This dataset comprises one spreadsheet with N=91 anonymised survey responses .xslx format. It includes all responses to the project survey which used Google Forms between 06-Feb-2023 and 30-May-2023. The spreadsheet can be opened with Microsoft Excel, Google Sheet, or open-source equivalents.

    The survey responses include a random sample of researchers worldwide undertaking qualitative, mixed-methods, or multi-modal research.

    The recruitment of respondents was initially purposive, aiming to gather responses from qualitative researchers at research-intensive (targetted Russell Group) Universities. This involved speculative emails and a call for participant on the University of Sheffield ‘Qualitative Open Research Network’ mailing list. As result, the responses include a snowball sample of scholars from elsewhere.

    The spreadsheet has two tabs/sheets: one labelled ‘SurveyResponses’ contains the anonymised and tidied set of survey responses; the other, labelled ‘VariableMapping’, sets out each field/column in the ‘SurveyResponses’ tab/sheet against the original survey questions and responses it relates to.

    The survey responses tab/sheet includes a field/column labelled ‘RespondentID’ (using randomly generated 16-digit alphanumeric keys) which can be used to connect survey responses to interview participants in the accompanying ‘Fostering cultures of open qualitative research: Dataset 2 – Interview transcripts’ files.

    A set of survey questions gathering eligibility criteria detail and consent are not listed with in this dataset, as below. All responses provide in the dataset gained a ‘Yes’ response to all the below questions (with the exception of one question, marked with an asterisk (*) below):

    · I am aged 18 or over · I have read the information and consent statement and above. · I understand how to ask questions and/or raise a query or concern about the survey. · I agree to take part in the research and for my responses to be part of an open access dataset. These will be anonymised unless I specifically ask to be named. · I understand that my participation does not create a legally binding agreement or employment relationship with the University of Sheffield · I understand that I can withdraw from the research at any time. · I assign the copyright I hold in materials generated as part of this project to The University of Sheffield. · * I am happy to be contacted after the survey to take part in an interview.

    The project was undertaken by two staff: Co-investigator: Dr. Itzel San Roman Pineda ORCiD ID: 0000-0002-3785-8057 i.sanromanpineda@sheffield.ac.uk

    Postdoctoral Research Assistant Principal Investigator (corresponding dataset author): Dr. Matthew Hanchard ORCiD ID: 0000-0003-2460-8638 m.s.hanchard@sheffield.ac.uk Research Associate iHuman Institute, Social Research Institutes, Faculty of Social Science

  9. National Energy Efficiency Data-Framework (NEED): anonymised data 2014

    • gov.uk
    Updated Jul 18, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Energy & Climate Change (2014). National Energy Efficiency Data-Framework (NEED): anonymised data 2014 [Dataset]. https://www.gov.uk/government/statistics/national-energy-efficiency-data-framework-need-anonymised-data-2014
    Explore at:
    Dataset updated
    Jul 18, 2014
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department of Energy & Climate Change
    Description

    It covers domestic properties in England and Wales. The dataset draws on data from a range of sources matched together at household level and anonymised to ensure no individual household can be identified from the data.

    Two datasets have been published by DECC:

    • Public Use File (PUF): A sample of 49,815 records selected to be representative of the housing stock (based on Region, property age, property type and floor area band). This dataset is available to download from this page and from http://data.gov.uk/">data.gov.uk.
    • End User Licence File (EUL): A sample of 4 million records (4,086,448). This dataset is available from the http://discover.ukdataservice.ac.uk/catalogue/?sn=7518">UK Data Archive, via an end user licence.

    More information on this project is available from the National Energy Efficiency Data-Framework: making data available consultation page.

    Information on how the datasets are being used is welcomed by DECC. For enquiries and feedback please contact: Energyefficiency.stats@decc.gsi.gov.uk

  10. National Energy Efficiency Data-Framework (NEED): anonymised data 2019

    • gov.uk
    Updated Dec 19, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Business, Energy & Industrial Strategy (2019). National Energy Efficiency Data-Framework (NEED): anonymised data 2019 [Dataset]. https://www.gov.uk/government/statistics/national-energy-efficiency-data-framework-need-anonymised-data-2019
    Explore at:
    Dataset updated
    Dec 19, 2019
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Business, Energy & Industrial Strategy
    Description

    These documents present updated record level data created from the National Energy Efficiency Data-Framework (NEED):

    • the 50,000 row dataset is small enough to use in spreadsheet software
    • the 4 million row dataset is a larger sample of the same NEED dataset
    • the metadata spreadsheet describes the fields included in both tables

    Please forward any feedback to energyefficiency.stats@beis.gov.uk.

  11. 2

    2001 Census: Household Sample of Anonymised Records (HSAR): Secure Access

    • datacatalogue.ukdataservice.ac.uk
    Updated Nov 24, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK Data Service (2016). 2001 Census: Household Sample of Anonymised Records (HSAR): Secure Access [Dataset]. http://doi.org/10.5255/UKDA-SN-8097-1
    Explore at:
    Dataset updated
    Nov 24, 2016
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    Area covered
    England and Wales
    Description

    The 2001 Census: Special Licence Household Sample of Anonymised Records (SL-HSAR) dataset comprises Sample of Anonymised Records (SARs) data that relate to 29 April 2001. They were created by the Office for National Statistics (ONS) as part of the 2001 Census of Population. All households were asked to complete a form giving information about the household and all individuals living in the household. Completion of the form was compulsory for the entire population. The Census schedule includes questions on housing and tenure, and demographic and socio-economic information for all household members.

    The dataset comprises SARs data for 1% of households in England and Wales, including imputed values for households which were not enumerated during the Census. Individual data for households larger than 11 residents have been suppressed. To protect confidentiality, age data have been grouped into 2-year bands and there is no geographical breakdown available. A small amount of perturbation has been applied to the data to protect confidentiality. As with the Individual Licensed SAR (see under SNs 7210 and 7211), separate variables indicate whether or not imputation or perturbation has been applied to any given variable for each case in the sample. Documentation, training and user support for these data is undertaken by the SARs team at the Cathie Marsh Centre for Census and Survey Research (CCSR). A further release of data, which contains additional derived variables, will be made available at a later date.

    The Secure Access version replaces the previous Special Licence version that was held under SN 5278, which is no longer available. Prospective users of the Secure Access data will need to fulfil additional requirements, including completion of face-to-face training and agreement to Secure Access' User Agreement and Breaches Penalties Policy, in order to obtain permission to use that version (see 'Access' section below).

    Detailed SARs data:
    A more detailed version of these data, containing geographical information at the level of Local Authority, is available as a Controlled Access Microdata Sample (CAMS). These can be accessed at all ONS sites. Applications to use these data should be made to ONS; further details can be found on their CAMS web page. The CAMS file includes data for Scotland and Northern Ireland as well as England and Wales.

  12. APHP anonymous sample data INSIDE

    • zenodo.org
    Updated Mar 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Berend Westerhof; Berend Westerhof (2025). APHP anonymous sample data INSIDE [Dataset]. http://doi.org/10.5281/zenodo.14938636
    Explore at:
    Dataset updated
    Mar 3, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Berend Westerhof; Berend Westerhof
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sample data of a subject collected in the APHP INSIDE study

  13. s

    Orphan Drugs - Dataset 1: Twitter issue-networks as excluded publics

    • orda.shef.ac.uk
    txt
    Updated Oct 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Hanchard (2021). Orphan Drugs - Dataset 1: Twitter issue-networks as excluded publics [Dataset]. http://doi.org/10.15131/shef.data.16447326.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Oct 22, 2021
    Dataset provided by
    The University of Sheffield
    Authors
    Matthew Hanchard
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset comprises of two .csv format files used within workstream 2 of the Wellcome Trust funded ‘Orphan drugs: High prices, access to medicines and the transformation of biopharmaceutical innovation’ project (219875/Z/19/Z). They appear in various outputs, e.g. publications and presentations.

    The deposited data were gathered using the University of Amsterdam Digital Methods Institute’s ‘Twitter Capture and Analysis Toolset’ (DMI-TCAT) before being processed and extracted from Gephi. DMI-TCAT queries Twitter’s STREAM Application Programming Interface (API) using SQL and retrieves data on a pre-set text query. It then sends the returned data for storage on a MySQL database. The tool allows for output of that data in various formats. This process aligns fully with Twitter’s service user terms and conditions. The query for the deposited dataset gathered a 1% random sample of all public tweets posted between 10-Feb-2021 and 10-Mar-2021 containing the text ‘Rare Diseases’ and/or ‘Rare Disease Day’, storing it on a local MySQL database managed by the University of Sheffield School of Sociological Studies (http://dmi-tcat.shef.ac.uk/analysis/index.php), accessible only via a valid VPN such as FortiClient and through a permitted active directory user profile. The dataset was output from the MySQL database raw as a .gexf format file, suitable for social network analysis (SNA). It was then opened using Gephi (0.9.2) data visualisation software and anonymised/pseudonymised in Gephi as per the ethical approval granted by the University of Sheffield School of Sociological Studies Research Ethics Committee on 02-Jun-201 (reference: 039187). The deposited dataset comprises of two anonymised/pseudonymised social network analysis .csv files extracted from Gephi, one containing node data (Issue-networks as excluded publics – Nodes.csv) and another containing edge data (Issue-networks as excluded publics – Edges.csv). Where participants explicitly provided consent, their original username has been provided. Where they have provided consent on the basis that they not be identifiable, their username has been replaced with an appropriate pseudonym. All other usernames have been anonymised with a randomly generated 16-digit key. The level of anonymity for each Twitter user is provided in column C of deposited file ‘Issue-networks as excluded publics – Nodes.csv’.

    This dataset was created and deposited onto the University of Sheffield Online Research Data repository (ORDA) on 26-Aug-2021 by Dr. Matthew S. Hanchard, Research Associate at the University of Sheffield iHuman institute/School of Sociological Studies. ORDA has full permission to store this dataset and to make it open access for public re-use without restriction under a CC BY license, in line with the Wellcome Trust commitment to making all research data Open Access.

    The University of Sheffield are the designated data controller for this dataset.

  14. TweetViz-40: Metadata Sample for Visualization

    • kaggle.com
    zip
    Updated Jul 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    O. Faruk Beygo (2025). TweetViz-40: Metadata Sample for Visualization [Dataset]. https://www.kaggle.com/datasets/farukece/tweetviz-40-metadata-sample-for-visualization/discussion
    Explore at:
    zip(1502 bytes)Available download formats
    Dataset updated
    Jul 26, 2025
    Authors
    O. Faruk Beygo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    🗂️ About this Dataset

    This dataset contains anonymized Twitter metadata collected for the purpose of data visualization and exploratory analysis. It includes a small, curated sample of 40 tweets, each represented by key engagement metrics and a binary label (heads_tag) for simple grouping.

    The dataset was scraped via the Twitter API and adheres to Twitter’s Developer Policy. It does not contain any tweet text or user-identifiable information, only metadata fields that are permitted for redistribution.

    📊 Use Case

    This dataset is designed for:

    • Demonstrating data visualization techniques
    • Practicing with plotting tools (e.g., Matplotlib, Seaborn, Plotly)
    • Creating simple binary classification visualizations
    • Quick exploratory analysis of tweet engagement metrics

    Due to its small size, it’s ideal for:

    • Beginner-friendly projects
    • Visualization demos
    • Educational tutorials or notebooks

    📌 Dataset Columns

    Column NameDescription
    tweet_idUnique identifier of the tweet
    user_idAnonymized user ID
    timestampDate and time the tweet was posted
    like_countNumber of likes the tweet received
    retweet_countNumber of times the tweet was retweeted
    reply_countNumber of replies to the tweet
    view_countNumber of views on the tweet
    word_countNumber of words in the tweet (text not included)
    heads_tagBinary label (0 or 1) for abstract grouping (not associated with real entities)

    📄 License & Compliance

    • Shared under the CC BY 4.0 license.
    • Includes only metadata allowed under Twitter’s data sharing policy.
    • No tweet text or personal information is included.
  15. d

    Anonymous IP Data | 250M+ Global IP Data| Audience Targeting and Attribution...

    • datarade.ai
    Updated Nov 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DRAKO (2024). Anonymous IP Data | 250M+ Global IP Data| Audience Targeting and Attribution | CCPA Compliant [Dataset]. https://datarade.ai/data-products/drako-anonymous-ip-data-usa-canada-secure-insights-fo-drako
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Nov 4, 2024
    Dataset authored and provided by
    DRAKO
    Area covered
    Macao, Guinea-Bissau, Turkey, State of, Panama, French Southern Territories, Hungary, Christmas Island, Dominican Republic, Tuvalu
    Description

    DRAKO specializes in delivering Anonymous IP Data, focusing on privacy-first approaches to consumer identity and behavior analysis. Our data allows businesses to track user interactions without compromising individual privacy, ensuring compliance with data protection regulations.

    Anonymous IP Data is crucial for effective audience targeting, analyzing traffic sources, and measuring campaign performance. By connecting digital audiences through various IPs, we enable a clearer understanding of user journeys across devices and platforms. Beyond IPs, we’re also able to connect these IDs to broader ID types like Mobile Advertising IDs and CTV Ids.

    Key Features: - IPV4 and IPV6 in hashed format - Detailed mapping of Anonymous IPs for secure user behavior analysis - Integration with Mobile IP Data for insights into mobile user interactions - Comprehensive Identity Data for enhanced audience profiling - Digital Audience Data to understand demographics and interests - Identity Linkage Data for connecting user profiles across different channels

    Use Cases: - Audience segmentation and targeting strategies - Traffic source analysis and optimization - Digital campaign performance measurement - User journey mapping across devices - Compliance-focused marketing solutions

    Data Compliance: Our Anonymous IP Data is fully compliant with industry standards for data privacy and security. We prioritize ethical data collection practices, ensuring that user identities remain anonymous while still providing valuable insights.

    Data Quality: DRAKO employs rigorous quality assurance protocols to maintain the accuracy and reliability of our Anonymous IP Data. We continuously update our datasets and utilize advanced validation techniques to ensure data integrity.

  16. a

    Portsmouth Water Drinking Water Quality Data 2022 2023 2024

    • hub.arcgis.com
    • streamwaterdata.co.uk
    • +1more
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AHughes_Portsmouth (2025). Portsmouth Water Drinking Water Quality Data 2022 2023 2024 [Dataset]. https://hub.arcgis.com/datasets/d3165fd17d624b22a9900d47677dfa45
    Explore at:
    Dataset updated
    Oct 1, 2025
    Dataset authored and provided by
    AHughes_Portsmouth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Overview

    Water companies in the UK are responsible for testing the quality of drinking water. This dataset contains the results of samples taken from the taps in domestic households to make sure they meet the standards set out by UK and European legislation. This data shows the location, date, and measured levels of determinands set out by the Drinking Water Inspectorate (DWI).

    Key Definitions

    Aggregation

    Process involving summarizing or grouping data to obtain a single or reduced set of information, often for analysis or reporting purposes

    Anonymisation

    Anonymised data is a type of information sanitization in which data anonymisation tools encrypt or remove personally identifiable information from datasets for the purpose of preserving a data subject's privacy

    Dataset

    Structured and organized collection of related elements, often stored digitally, used for analysis and interpretation in various fields.

    Determinand

    A constituent or property of drinking water which can be determined or estimated.

    DWI

    Drinking Water Inspectorate, an organisation “providing independent reassurance that water supplies in England and Wales are safe and drinking water quality is acceptable to consumers.”

    DWI Determinands

    Constituents or properties that are tested for when evaluating a sample for its quality as per the guidance of the DWI. For this dataset, only determinands with “point of compliance” as “customer taps” are included.

    Granularity

    Data granularity is a measure of the level of detail in a data structure. In time-series data, for example, the granularity of measurement might be based on intervals of years, months, weeks, days, or hours

    ID

    Abbreviation for Identification that refers to any means of verifying the unique identifier assigned to each asset for the purposes of tracking, management, and maintenance.

    LSOA

    Lower-Level Super Output Area is made up of small geographic areas used for statistical and administrative purposes by the Office for National Statistics. It is designed to have homogeneous populations in terms of population size, making them suitable for statistical analysis and reporting. Each LSOA is built from groups of contiguous Output Areas with an average of about 1,500 residents or 650 households allowing for granular data collection useful for analysis, planning and policy- making while ensuring privacy.

    ONS

    Office for National Statistics

    Open Data Triage

    The process carried out by a Data Custodian to determine if there is any evidence of sensitivities associated with Data Assets, their associated Metadata and Software Scripts used to process Data Assets if they are used as Open Data. <

    Sample

    A sample is a representative segment or portion of water taken from a larger whole for the purpose of analysing or testing to ensure compliance with safety and quality standards.

    Schema

    Structure for organizing and handling data within a dataset, defining the attributes, their data types, and the relationships between different entities. It acts as a framework that ensures data integrity and consistency by specifying permissible data types and constraints for each attribute.

    Units

    Standard measurements used to quantify and compare different physical quantities.

    Water Quality

    The chemical, physical, biological, and radiological characteristics of water, typically in relation to its suitability for a specific purpose, such as drinking, swimming, or ecological health. It is determined by assessing a variety of parameters, including but not limited to pH, turbidity, microbial content, dissolved oxygen, presence of substances and temperature.

    Data History

    Data Origin

    These samples were taken from customer taps. They were then analysed for water quality, and the results were uploaded to a database. This dataset is an extract from this database.

    Data Triage Considerations

    Granularity

    Is it useful to share results as averages or individual?

    We decided to share as individual results as the lowest level of granularity

    Anonymisation

    It is a requirement that this data cannot be used to identify a singular person or household. We discussed many options for aggregating the data to a specific geography to ensure this requirement is met. The following geographical aggregations were discussed:

    <!--·
    Water Supply Zone (WSZ) - Limits interoperability with other datasets

    <!--·
    Postcode – Some postcodes contain very few households and may not offer necessary anonymisation

    <!--·
    Postal Sector – Deemed not granular enough in highly populated areas

    <!--·
    Rounded Co-ordinates – Not a recognised standard and may cause overlapping areas

    <!--·
    MSOA – Deemed not granular enough

    <!--·
    LSOA – Agreed as a recognised standard appropriate for England and Wales

    <!--·
    Data Zones – Agreed as a recognised standard appropriate for Scotland

    Data Specifications

    Each dataset will cover a calendar year of samples

    This dataset will be published annually

    Historical datasets will be published as far back as 2016 from the introduction of of The Water Supply (Water Quality) Regulations 2016

    The Determinands included in the dataset are as per the list that is required to be reported to the Drinking Water Inspectorate.

    Context

    Many UK water companies provide a search tool on their websites where you can search for water quality in your area by postcode. The results of the search may identify the water supply zone that supplies the postcode searched. Water supply zones are not linked to LSOAs which means the results may differ to this dataset

    Some sample results are influenced by internal plumbing and may not be representative of drinking water quality in the wider area.

    Some samples are tested on site and others are sent to scientific laboratories.

    Data Publish Frequency

    Annually

    Data Triage Review Frequency

    Annually unless otherwise requested

    Supplementary information

    Below is a curated selection of links for additional reading, which provide a deeper understanding of this dataset.

    <!--1.
    Drinking Water Inspectorate Standards and Regulations:

    <!--2.
    https://www.dwi.gov.uk/drinking-water-standards-and-regulations/

    <!--3.
    LSOA (England and Wales) and Data Zone (Scotland):

    <!--4. https://www.nrscotland.gov.uk/files/geography/2011-census/geography-bckground-info-comparison-of-thresholds.pdf

    <!--5.
    Description for LSOA boundaries by the ONS: Census 2021 geographies - Office for National Statistics (ons.gov.uk)

    <!--[6.
    Postcode to LSOA lookup tables: Postcode to 2021 Census Output Area to Lower Layer Super Output Area to Middle Layer Super Output Area to Local Authority District (August 2023) Lookup in the UK (statistics.gov.uk)

    <!--7.
    Legislation history: Legislation - Drinking Water Inspectorate (dwi.gov.uk)

  17. G

    Anonymized Trip Data Exchange Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Anonymized Trip Data Exchange Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/anonymized-trip-data-exchange-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 6, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Anonymized Trip Data Exchange Market Outlook



    According to our latest research, the global anonymized trip data exchange market size reached USD 1.25 billion in 2024. The sector is experiencing robust expansion, registering a CAGR of 18.5% from 2025 to 2033, and is forecasted to attain a value of USD 6.16 billion by 2033. The primary growth factor driving this market is the increasing demand for data-driven insights to optimize urban mobility, enhance traffic management, and support smart city initiatives worldwide. As organizations and governments prioritize privacy-compliant data sharing, the anonymized trip data exchange market is positioned for significant advancements and adoption across various verticals.




    One of the most prominent growth factors for the anonymized trip data exchange market is the global shift towards smart city development and the digital transformation of urban mobility infrastructure. Governments and municipal authorities are increasingly leveraging anonymized trip data to gain actionable insights into traffic flow, congestion hotspots, and travel behavior. This data-driven approach allows for the optimization of public transit routes, reduction of commute times, and improved allocation of resources. With the proliferation of IoT devices and connected vehicles, the volume and granularity of trip data have surged, further fueling the need for sophisticated platforms that can securely exchange and analyze anonymized information while upholding stringent privacy regulations.




    Another critical driver is the rise of mobility-as-a-service (MaaS) platforms and ride-sharing applications, which rely heavily on accurate and real-time trip data to match riders with drivers, predict demand, and optimize pricing strategies. As these services expand into new geographies and diversify their offerings, the necessity for interoperable and anonymized data exchange frameworks becomes even more pronounced. Companies in the transportation and automotive sectors are forming strategic partnerships with data exchange providers to access comprehensive datasets that inform product development, fleet management, and customer experience enhancements. The integration of artificial intelligence and machine learning further amplifies the value of anonymized trip data by enabling predictive analytics and automation across the mobility ecosystem.




    The evolving regulatory landscape also plays a pivotal role in shaping the growth trajectory of the anonymized trip data exchange market. With increasing concerns over data privacy and the implementation of frameworks such as the General Data Protection Regulation (GDPR) in Europe and the California Consumer Privacy Act (CCPA) in the United States, organizations are compelled to adopt robust anonymization techniques to ensure compliance while facilitating data sharing. This dynamic has accelerated the development of advanced anonymization algorithms and secure data exchange protocols, fostering trust among stakeholders and encouraging broader participation in data-driven initiatives. As privacy-preserving technologies mature, the market is expected to witness greater adoption across both public and private sectors.




    From a regional perspective, North America currently leads the global anonymized trip data exchange market, driven by early adoption of smart mobility solutions, a well-established data infrastructure, and proactive regulatory frameworks. Europe follows closely, with significant investments in sustainable transportation projects and cross-border data collaboration. The Asia Pacific region is emerging as a high-growth market, propelled by rapid urbanization, government-led smart city programs, and the proliferation of ride-hailing platforms. Latin America and the Middle East & Africa are also witnessing increasing interest, albeit at a more gradual pace, as urban mobility challenges and digital transformation initiatives gain momentum. Overall, the regional outlook underscores a widespread and accelerating demand for anonymized trip data exchange solutions across the globe.





    Co

  18. e

    Example Textile Trade And Industry Anonymous Company Export Import Data |...

    • eximpedia.app
    Updated Nov 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Example Textile Trade And Industry Anonymous Company Export Import Data | Eximpedia [Dataset]. https://www.eximpedia.app/companies/example-textile-trade-and-industry-anonymous-company/11675481
    Explore at:
    Dataset updated
    Nov 22, 2025
    Description

    Example Textile Trade And Industry Anonymous Company Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

  19. Oslo City Bike Open Data

    • kaggle.com
    zip
    Updated Nov 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    stanislav_o27 (2025). Oslo City Bike Open Data [Dataset]. https://www.kaggle.com/datasets/stanislavo27/oslo-city-bike-open-data
    Explore at:
    zip(251012812 bytes)Available download formats
    Dataset updated
    Nov 8, 2025
    Authors
    stanislav_o27
    Area covered
    Oslo
    Description

    Source: https://oslobysykkel.no/en/open-data/historical

    I am not the author of the data, only сompiled and structured from here using python-script

    oslo-city-bike License: Norwegian Licence for Open Government Data (NLOD) 2.0 According to the license, we have full rights to collect, use, modify, and distribute this data, provided you clearly indicate the source (which I do).

    Dataset structure

    Folder oslobysykkel contains all available data from 2019 to 2025. Format: oslobysykkel-YYYY-MM.csv. why is oslo still appearing in the file names? because there is also similar data for Trondheim and Bergen

    Variables

    from oslobysykkel.no Variable Format Description started_at Timestamp Timestamp of when the trip started ended_at Timestamp Timestamp of when the trip ended duration Integer Duration of trip in seconds start_station_id String Unique ID for start station start_station_name String Name of start station start_station_description String Description of where start station is located start_station_latitude Decimal degrees in WGS84 Latitude of start station start_station_longitude Decimal degrees in WGS84 Longitude of start station end_station_id String Unique ID for end station end_station_name String Name of end station end_station_description String Description of where end station is located end_station_latitude Decimal degrees in WGS84 Latitude of end station end_station_longitude Decimal degrees in WGS84 Longitude of end station

    Please note: this data and my analysis focuses on the new data format, but historical data for the period April 2016 - December 2018 (Legacy Trip Data) has a different pattern.

    Motivation

    I myself was extremely fascinated by this open data of Oslo City Bike and in the process of deep analysis saw broad prospects. This interest turned into an idea to create a data-analytical problem book or even platfrom 'exercise bike'. Publishing this dataset to make it convenient for my own further use in the next phases of the project (Clustering, Forecasting), as well as so that anyone can participate in analysis and modeling based on this exciting data.

    **Autumn's remake of Oslo bike sharing data analysis ** https://colab.research.google.com/drive/1tAxrIWVK5V-ptKLJBdODjy10zHlsppFv?usp=sharing

    https://drive.google.com/file/d/17FP9Bd5opoZlw40LRxWtycgJJyXSAdC6/view

    Full notebooks with code, visualizations, and commentary will be published soon! This dataset is the backbone of an ongoing project — stay tuned for see a deeper dives into anomaly detection, station clustering, and interactive learning challenges.

    Index of my notebooks Phase 1: Cleaned Data & Core Insights Time-Space Dynamics Exploratory

    Challenge Ideas

    Clustering and Segmentation Demand Forecasting (Time Series) Geospatial Analysis (Network Analysis)

    Resources & Related Work

    Similar dataset https://www.kaggle.com/code/florestancharlaix/oslo-city-bikes-analysis

    links to works I have found or that have inspired me

    Exploring Open Data from Oslo City Bike Jon Olave — visualization of popular routes and seasonality analysis.

    Oslo City Bike Data Wrangling Karl Tryggvason — predicting bicycle availability at stations, focusing on everyday use (e.g., trips to kindergarten).

    Helsinki City Bikes: Exploratory Data Analysis Analysis of a similar system in Helsinki — useful for comparative studies and methodological ideas.

    External Data Sources

    The idea is to connect with other data. For example I did it for weather data - integrate temperature, precipitation, and wind speed to explain variations in daily demand. https://meteostat.net/en/place/no/oslo

    I also used data from Airbnb (that's where I took division into neighbourhoods) https://data.insideairbnb.com/norway/oslo/oslo/2025-06-27/visualisations/neighbourhoods.csv

    oslo bike-sharing eda feature-engineering geospatial time-series

  20. f

    "The BBC's Great Debate": Anonymised Data from a #BBCDebate Archive

    • city.figshare.com
    bin
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ernesto Priego (2023). "The BBC's Great Debate": Anonymised Data from a #BBCDebate Archive [Dataset]. http://doi.org/10.6084/m9.figshare.3457688.v2
    Explore at:
    binAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    City, University of London
    Authors
    Ernesto Priego
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    "The BBC's Great Debate" was broadcasted live in the UK by the BBC on Tuesday 21 June 2016 between 20:00 and 22:00 BST. It saw activity on Twitter with the #BBCDebate hashtag. I collected some of the Tweets tagged with #BBCDebate using a Google Spreadsheet.The raw data was downloaded as an Excel spreadsheet file containing an archive of 38,166 Tweets (38,066 Unique Tweets) publicly published with the queried hashtag (#BBCDebate) between 14/06/2016 22:03:18 and 22/06/2016 09:12:32 BST. Due to the expected high volume of Tweets only users with at least 10 followers were included in the archive. The Tweets contained in the Archive sheet were collected using Martin Hawksey’s TAGS 6.0. Given the relatively large volume of activity expected around #BBCDebate and the public and political nature of the hashtag, I have only shared indicative data. No full tweets nor any other associated metadata have been shared. The dataset contains a metrics summary as well as a table with column headings labeled created_at, time,
    geo_coordinates (anonymised; if there was data YES has been indicated; if no data was present the corresponding cell has been left blank), user_lang and user_followers_count data corresponding to each Tweet. Timestamps should suffice to prove the existence of the Tweets and could be useful to run analyses of activity on Twitter around a real-time media event.No Personally identifiable information (PII), nor Sensitive Personal Information (SPI) was collected nor was contained in the dataset.Some basic deduplication and refining of the collected data performed.I have shared the anonymised dataset including the extra tables as a sample and as an act of citizen scholarship in order to archive, document and encourage open educational and historical research and analysis. It is hoped that by sharing the data someone else might be able to run different analyses and ideally discover different or more significant insights.For more information including methodological and limitation issues etc. please click on the references listed below.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Department for Energy Security and Net Zero (2025). National Energy Efficiency Data-Framework (NEED): anonymised data 2025 [Dataset]. https://www.gov.uk/government/statistics/national-energy-efficiency-data-framework-need-anonymised-data-2025
Organization logo

National Energy Efficiency Data-Framework (NEED): anonymised data 2025

Explore at:
Dataset updated
Jun 26, 2025
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Department for Energy Security and Net Zero
Description

These documents present updated record level data created from the National Energy Efficiency Data-Framework (NEED):

  • the 50,000 row dataset is small enough to use in spreadsheet software
  • the 4 million row dataset is a larger sample of the same NEED dataset
  • the metadata spreadsheet describes the fields included in both tables

Please forward any feedback to energyefficiency.stats@energysecurity.gov.uk.

Search
Clear search
Close search
Google apps
Main menu