68 datasets found
  1. d

    Data Collaborations Across Boundaries (Slides)

    • data.depositar.io
    pdf
    Updated Jun 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    depositar (2025). Data Collaborations Across Boundaries (Slides) [Dataset]. https://data.depositar.io/dataset/data-collaborations-across-boundaries
    Explore at:
    pdf(3112569), pdf(4440122), pdf(1792282), pdf(1296859), pdf(10713394)Available download formats
    Dataset updated
    Jun 27, 2025
    Dataset provided by
    depositar
    Description

    This dataset collects the slides that were presented at the Data Collaborations Across Boundaries session in SciDataCon 2022, part of the International Data Week.

    The following session proposal was prepared by Tyng-Ruey Chuang and submitted to SciDataCon 2022 organizers for consideration on 2022-02-28. The proposal was accepted on 2022-03-28. Six abstracts were submitted and accepted to this session. Five presentations were delivered online in a virtual session on 2022-06-21.

    Data Collaborations Across Boundaries

    There are many good stories about data collaborations across boundaries. We need more. We also need to share the lessons each of us has learned from collaborating with parties and communities not in our familiar circles.

    By boundaries, we mean not just the regulatory borders in between the nation states about data sharing but the various barriers, readily conceivable or not, that hinder collaboration in aggregating, sharing, and reusing data for social good. These barriers to collaboration exist between the academic disciplines, between the economic players, and between the many user communities, just to name a few. There are also cross-domain barriers, for example those that lay among data practitioners, public administrators, and policy makers when they are articulating the why, what, and how of "open data" and debating its economic significance and fair distribution. This session aims to bring together experiences and thoughts on good data practices in facilitating collaborations across boundaries and domains.

    The success of Wikipedia proves that collaborative content production and service, by ways of copyleft licenses, can be sustainable when coordinated by a non-profit and funded by the general public. Collaborative code repositories like GitHub and GitLab demonstrate the enormous value and mass scale of systems-facilitated integration of user contributions that run across multiple programming languages and developer communities. Research data aggregators and repositories such as GBIF, GISAID, and Zenodo have served numerous researchers across academic disciplines. Citizen science projects and platforms, for instance eBird, Galaxy Zoo, and Taiwan Roadkill Observation Network (TaiRON), not only collect data from diverse communities but also manage and release datasets for research use and public benefit (e.g. TaiRON datasets being used to improve road design and reduce animal mortality). At the same time large scale data collaborations depend on standards, protocols, and tools for building registries (e.g. Archival Resource Key), ontologies (e.g. Wikidata and schema.org), repositories (e.g. CKAN and Omeka), and computing services (e.g. Jupyter Notebook). There are many types of data collaborations. The above lists only a few.

    This session proposal calls for contributions to bring forward lessons learned from collaborative data projects and platforms, especially about those that involve multiple communities and/or across organizational boundaries. Presentations focusing on the following (non-exclusive) topics are sought after:

    1. Support mechanisms and governance structures for data collaborations across organizations/communities.

    2. Data policies --- such as data sharing agreements, memorandum of understanding, terms of use, privacy policies, etc. --- for facilitating collaborations across organizations/communities.

    3. Traditional and non-traditional funding sources for data collaborations across multiple parties; sustainability of data collaboration projects, platforms, and communities.

    4. Data workflows --- collection, processing, aggregation, archiving, and publishing, etc. --- designed with considerations of (external) collaboration.

    5. Collaborative web platforms for data acquisition, curation, analysis, visualization, and education.

    6. Examples and insights from data trusts, data coops, as well as other formal and informal forms of data stewardship.

    7. Debates on the pros and cons of centralized, distributed, and/or federated data services.

    8. Practical lessons learned from data collaboration stories: failure, success, incidence, unexpected turn of event, aftermath, etc. (no story is too small!).

  2. Synthetic Data Contracts

    • kaggle.com
    Updated Aug 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luiz Bueno (2025). Synthetic Data Contracts [Dataset]. https://www.kaggle.com/datasets/juniorbueno/synthetic-data-contracts
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 21, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Luiz Bueno
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset contains 30 independent contracts in English, each with realistic structure, signatures, dates, and values. The contracts were generated synthetically to simulate real-world agreements across different domains, including:

    Real Estate (house and apartment financing contracts)

    Vehicles (car and motorcycle financing contracts)

    Education (student loan agreements)

    Health (medical and insurance-related contracts)

    General Financing (personal and business loan contracts)

    Each contract is between 2 and 3 pages, written in professional legal style, and includes fictitious names, addresses, values, and bank standards. They are useful for projects in:

    Natural Language Processing (NLP)

    Information Extraction (IE)

    LegalTech and FinTech AI applications

    Training/Testing OCR models

    Document classification and summarization research

    No personal or sensitive information is included — all data is fully synthetic.

  3. d

    Data Policy and Governance Guide

    • catalog.data.gov
    • data.tempe.gov
    • +2more
    Updated Jan 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Tempe (2024). Data Policy and Governance Guide [Dataset]. https://catalog.data.gov/dataset/data-policy-and-governance-guide
    Explore at:
    Dataset updated
    Jan 26, 2024
    Dataset provided by
    City of Tempe
    Description

    Use this guide to find information on Tempe data policy and standards.Open Data PolicyEthical Artificial Intelligence (AI) PolicyEvaluation PolicyExpedited Data Sharing PolicyData Sharing Agreement (General)Data Sharing Agreement (GIS)Data Quality Standard and ChecklistDisaggregated Data StandardsData and Analytics Service Standard

  4. G

    Information Sharing Agreement Template PB

    • open.canada.ca
    • ouvert.canada.ca
    docx, html
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Yukon (2025). Information Sharing Agreement Template PB [Dataset]. https://open.canada.ca/data/dataset/d6e552db-2a79-4923-b1c5-beb10be16938
    Explore at:
    html, docxAvailable download formats
    Dataset updated
    Nov 19, 2025
    Dataset provided by
    Government of Yukon
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Description

    Forms and Templates

  5. l

    ISA Sharing Register

    • data.leicester.gov.uk
    csv, excel, json
    Updated Jan 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). ISA Sharing Register [Dataset]. https://data.leicester.gov.uk/explore/dataset/isa-sharing-register/
    Explore at:
    json, excel, csvAvailable download formats
    Dataset updated
    Jan 13, 2023
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This datasets contains a register of the Information Sharing Agreements (ISA) the Leicester City Council holds with other organisations.Leicester City Council only shares information with outside organisations when it is necessary, for example effective service delivery and/or to protect the welfare of individuals and only when legislation states that sharing for a specific purpose is allowed.Copies of the agreements are available upon request from info.requests@leicester.gov.uk (with any necessary redactions e.g. signatures)

  6. D

    NSW Network Opportunities Map

    • data.nsw.gov.au
    • researchdata.edu.au
    Updated Oct 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Spatial Services (DCS) (2025). NSW Network Opportunities Map [Dataset]. https://data.nsw.gov.au/data/dataset/1-e0a6faa040e84ed785c03e3b89a23aad
    Explore at:
    Dataset updated
    Oct 24, 2025
    Dataset authored and provided by
    Spatial Services (DCS)
    Area covered
    New South Wales
    Description

    Metadata Portal Metadata Information

    Content TitleNSW Network Opportunities Map
    Content TypeWeb Application
    DescriptionNSW Network Opportunities Map Application created for Department of Climate Change, Energy, the Environment and Water (DCCEEW) using Ausgrid, Essential Energy and Endeavour Energy Assets.
    The data provided under this agreement is restricted to view-only access within the NSW Network Opportunities Map Application. No external use, extraction, or redistribution of the data is permitted under the current terms and conditions.
    Initial Publication Date28/02/2025
    Data Currency28/02/2025
    Data Update FrequencyOther
    Content SourceData provider files
    File TypeESRI File Geodatabase (*.gdb)
    Attribution
    Data Theme, Classification or Relationship to other Datasets
    Accuracy
    Spatial Reference System (dataset)WGS84
    Spatial Reference System (web service)EPSG:3857
    WGS84 Equivalent ToOther
    Spatial Extent
    Content Lineage
    Data ClassificationConfidential
    Data Access PolicyRestricted
    Data Quality
    Terms and ConditionsData Sharing Agreement
    The data provided under this agreement is restricted to view-only access within the NSW Network Opportunities Map Application. No external use, extraction, or redistribution of the data is permitted under the current terms and conditions.
    Standard and Specification
    Data CustodianSpatial Services
    Point of ContactSpatial Services
    Data Aggregator
    Data Distributor
    Additional Supporting Information
    TRIM Number

  7. d

    Department of Health Contracts Started or Amended During State Fiscal Year...

    • catalog.data.gov
    • data.wa.gov
    • +1more
    Updated Jun 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.wa.gov (2025). Department of Health Contracts Started or Amended During State Fiscal Year 2018 [Dataset]. https://catalog.data.gov/dataset/department-of-health-contracts-started-or-amended-during-state-fiscal-year-2018
    Explore at:
    Dataset updated
    Jun 29, 2025
    Dataset provided by
    data.wa.gov
    Description

    A partial list of contracts the State Department of Health started or amended between July 1, 2017 and June 30, 2018. Includes grants and contracts for goods and professional services tracked in the department's primary contract database, the Enterprise Contract Management System (ECMS). It does not include contracts many of the department's contracts with Washington's local health jurisdictions, or contracts for expert witnesses, purchase orders, data sharing agreements, contracts issued by the department but not tracked in ECMS, or contracts exempt from disclosure under state or federal regulation. In 2017, the department added a data element to categorize entities. This is not found in prior years' data sets. Acronyms commonly found in this data set are: CBO=Community Based Organizations/Non-Profits CLH=Local Health Jurisdictions EMS= EMS/Trauma Centers GVF=Government Federal GVL=Government Local (EXCEPT Con-Con/LHJ) GVS=Government State (EXCEPT Higher Ed) HED=Higher Education HSP=Hospitals POP = Period of Performance PRV=Private/For-Profits SCH=Schools, School Districts & Education Institutions (excluding Higher Ed) SOW = Statement of Work TRB=Tribal Entity

  8. Equitable Sharing Spending Dataset

    • kaggle.com
    zip
    Updated Nov 18, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Washington Post (2019). Equitable Sharing Spending Dataset [Dataset]. https://www.kaggle.com/washingtonpost/equitable-sharing-spending-dataset
    Explore at:
    zip(2461225 bytes)Available download formats
    Dataset updated
    Nov 18, 2019
    Dataset authored and provided by
    The Washington Post
    Description

    Context

    In early 2016, The Washington Post wrote that the Justice Department is "resuming a controversial practice that allows local police departments to funnel a large portion of assets seized from citizens into their own coffers under federal law.

    The "Equitable Sharing Program" gives police the option of prosecuting some asset forfeiture cases under federal instead of state law, particularly in instances where local law enforcement officers have a relationship with federal authorities as part of a joint task force. Federal forfeiture policies are more permissive than many state policies, allowing police to keep up to 80 percent of assets they seize." (link to the full article can be found here).

    This is the raw data from the Department of Justice’s Equitable Sharing Agreement and Certification forms that was released by the U.S. Department of Justice Asset Forfeiture and Money Laundering Section.

    Content

    • spending_master.csv is the main spending dataset that contains 58 variables.

    • notes.csv lists the descriptions for all variables.

    Acknowledgements

    The original dataset can be found here. The data was originally obtained from a Freedom of Information Act request fulfilled in December 2014.

    Inspiration

    • Which agency/sector/item received the most amount of funds from the Justice Department?

    • How many agencies received non-cash assets from the federal government through Equitable Sharing?

    • Are there any trends in the total equitable sharing fund across agencies?

  9. f

    Data_Sheet_1_A Blockchain Platform for User Data Sharing Ensuring User...

    • figshare.com
    • frontiersin.figshare.com
    bin
    Updated Oct 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ajay Kumar Shrestha; Julita Vassileva; Ralph Deters (2020). Data_Sheet_1_A Blockchain Platform for User Data Sharing Ensuring User Control and Incentives.docx [Dataset]. http://doi.org/10.3389/fbloc.2020.497985.s001
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 22, 2020
    Dataset provided by
    Frontiers
    Authors
    Ajay Kumar Shrestha; Julita Vassileva; Ralph Deters
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We propose a new platform for user modeling with blockchains that allows users to share data without losing control and ownership of it and applied it to the domain of travel booking. Our new platform provides solution to three important problems: ensuring privacy and user control, and incentives for sharing. It tracks who shared what, with whom, when, by what means and for what purposes in a verifiable fashion. The paper presents a case study of applying the framework for a hotel reservation system as one of the enterprise nodes of Multichain which collects users' profile data and allows users to receive rewards while sharing their data with other travel service providers according to their privacy preferences expressed in smart contracts. The user data from the repository is converted into an open data format and shared via stream in the blockchain so that other nodes can efficiently process and use the data. The smart contract verifies and executes the agreed terms of use of the data and transfers digital tokens as a reward to the user. The smart contract imposes double deposit collateral to ensure that all participants act honestly. The paper also presents a performance evaluation of the new platform by analyzing latency and memory consumption with selected three test-scenarios and measuring the transaction cost for smart contracts deployment. The results show that the node responded quickly in all our cases with a befitting transaction cost.

  10. g

    Information Sharing Agreement Template PB | gimi9.com

    • gimi9.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Information Sharing Agreement Template PB | gimi9.com [Dataset]. https://gimi9.com/dataset/ca_d6e552db-2a79-4923-b1c5-beb10be16938/
    Explore at:
    Description

    🇨🇦 캐나다

  11. Sample - Superstore

    • kaggle.com
    zip
    Updated Dec 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Panda Xiong (2023). Sample - Superstore [Dataset]. https://www.kaggle.com/datasets/degotluv/sample-superstore
    Explore at:
    zip(507936 bytes)Available download formats
    Dataset updated
    Dec 20, 2023
    Authors
    Panda Xiong
    License

    https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/

    Description

    Dataset

    This dataset was created by Panda Xiong

    Released under Community Data License Agreement - Sharing - Version 1.0

    Contents

  12. V

    CCWIS Technical Bulletin #8

    • data.virginia.gov
    • catalog.data.gov
    html
    Updated Sep 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Administration for Children and Families (2025). CCWIS Technical Bulletin #8 [Dataset]. https://data.virginia.gov/dataset/ccwis-technical-bulletin-8
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Sep 5, 2025
    Dataset provided by
    Administration for Children and Families
    Description

    This Technical Bulletin provides information about data exchange standards, which can reduce the costs and time required to develop a data exchange. The TB also reviews other factors for agencies to consider when developing data exchanges, such as data quality, data sharing agreements, data governance, and security.

    Metadata-only record linking to the original dataset. Open original dataset below.

  13. e

    Vereinbarungen über die gemeinsame Nutzung von Daten

    • data.europa.eu
    Updated Oct 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    London Borough of Barnet (2023). Vereinbarungen über die gemeinsame Nutzung von Daten [Dataset]. https://data.europa.eu/data/datasets/data-sharing-agreements1?locale=de
    Explore at:
    Dataset updated
    Oct 27, 2023
    Dataset authored and provided by
    London Borough of Barnet
    Description

    Dies sind die Data Sharing Agreements (DSAs), die der London Borough of Barnet mit anderen Organisationen geschlossen hat. Meistens wird ein DSA verwendet, wenn es keinen Vertrag zwischen dem Rat und einer anderen öffentlichen Organisation gibt. Personen, die eine DSA unterzeichnen, erhalten keine gesetzlichen Rechte zum Teilen von Daten; die Weitergabe von Daten muss bereits legal sein. In den DSAs werden die Regeln für die Weitergabe personenbezogener Daten aus den genannten Gründen festgelegt.

    Personenbezogene Daten, wie die Namen und Unterschriften von Beamten, wurden aus Gründen der DSGVO aus DSA entfernt.

  14. Confidentiality Toolkit

    • data.virginia.gov
    • catalog.data.gov
    html
    Updated Sep 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Administration for Children and Families (2025). Confidentiality Toolkit [Dataset]. https://data.virginia.gov/dataset/confidentiality-toolkit
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Administration for Children and Families
    Description

    This Toolkit is intended for staff at all levels of government who work within offices and agencies that promote the well-being of children and families and would like to know more about:

    This includes information about the:

    This toolkit also includes several samples of sharing agreements that offices and agencies can use in their own agreements.

    We hope it will help offices and agencies that want to develop and expand responsible sharing activities, and dispel common misconceptions about confidentiality requirements that are often raised when trying to responsibly share records.

    We would like to know about the individuals using this toolkit so we can further tailor our future work to you. Please take a moment to send us an email telling us:

    Send Email

    PAPERWORK REDUCTION ACT OF 1995 (Pub. L. 104-13) STATEMENT OF PUBLIC BURDEN: Through this information collection, ACF is gathering information to learn about the individuals using this toolkit so we can tailor our future work to better assist them. Public reporting burden for this collection of information is estimated to average of 10 minutes per respondent, including the time for reviewing instructions, gathering and maintaining the data needed, and reviewing the collection of information. This is a voluntary collection of information. agency may not conduct or sponsor, and a person is not required to respond to, a collection of information subject to the requirements of the Paperwork Reduction Act of 1995, unless it displays a currently valid OMB control number. The OMB # is 0970-0401 and the expiration date is 06/30/2024. If you have any comments on this collection of information, please contact Joshua.Williams@acf.hhs.gov

    Individuals who receive human services and other assistance often receive those services from several independent programs. Providing a case worker records from multiple programs can improve their understanding of a recipient and how to better serve them. Providing a researcher records from multiple programs can improve their understanding of the overall system and how to improve services for all recipients. However, sharing records with other offices and agencies often raises legitimate concerns, such as whether it (a) complies with applicable law, (b) meets individuals’ privacy expectations, and (c) creates a security risk.

    This toolkit discusses how to share records collected by human services and related programs. It summarizes the key federal requirements that determine when different records may be shared. It explains how leaders and workgroups can help resolve challenges that arise when developing sharing plans. It advises how offices and agencies might secure electronic records. It also includes success stories, documents used to facilitate record sharing, and links to helpful online resources.

    This toolkit will not replace the important practice of consulting legal counsel. However, we hope it will give ideas on what is possible, help resolve concerns that arise, and generally aid and inspire responsible record sharing.

    Developing processes to share records with other offices and agencies that balance the many competing priorities is complex but not insurmountable. Human services and related programs are often governed by an overlapping web of requirements designed to protect the confidentiality of the individuals connected to those services. Stakeholders often bring additional concerns and interests to the table.

    There are many ways to help ensure a responsible record sharing initiative is successful. These can include:

    Building Trust: Stakeholders often approach all new record sharing initiatives with a range of legitimate and unfounded concerns; concerns, and especially unfounded concerns, often dissipate once stakeholders begin to trust the other parties and their intents.

    Gabay, Mary et al. (2021). Confidentiality Toolkit, OPRE Report 2021-175, Washington, DC: Office of Planning, Research, and Evaluation, Admini

  15. G

    Information Sharing Agreement Template MPB

    • open.canada.ca
    • ouvert.canada.ca
    docx, html
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Yukon (2025). Information Sharing Agreement Template MPB [Dataset]. https://open.canada.ca/data/dataset/1ef62127-a2a1-4a8b-963d-944353622bda
    Explore at:
    html, docxAvailable download formats
    Dataset updated
    Nov 19, 2025
    Dataset provided by
    Government of Yukon
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Description

    Forms and Templates

  16. e

    Data on students' group project preferences

    • datarepository.eur.nl
    • dataverse.nl
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tim M. Benning (2023). Data on students' group project preferences [Dataset]. http://doi.org/10.25397/eur.20342649.v1
    Explore at:
    Dataset updated
    May 30, 2023
    Dataset provided by
    Erasmus University Rotterdam (EUR)
    Authors
    Tim M. Benning
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data files contain information about the preferences of bachelor 1 and 2 students obtained via a discrete choice experiment (12 choice tasks per respondent), demographic characteristics of the sample and population, experiences with free-riding, attitude towards teamwork, and a measure of individualism/collectivism. Students were presented a different grade weight before each choice task (i.e., 10%, 30%, or 100%). The data was collected from mid-June to mid-July 2021.

    Access to the data is subject to the approval of a data sharing agreement due to the personal information contained in the dataset.

    A summary of the publication can be found below: Reducing free-riding is an important challenge for educators who use group projects. In this study, we measure students’ preferences for group project characteristics and investigate if characteristics that better help to reduce free-riding become more important for students when stakes increase. We used a discrete choice experiment based on twelve choice tasks in which students chose between two group projects that differed on five characteristics of which each level had its own effect on free-riding. A different group project grade weight was presented before each choice task to manipulate how much there was at stake for students in the group project. Data of 257 student respondents were used in the analysis. Based on random parameter logit model estimates we find that students prefer (in order of importance) assignment based on schedule availability and motivation or self-selection (instead of random assignment), the use of one or two peer process evaluations (instead of zero), a small team size of three or two students (instead of four), a common grade (instead of a divided grade), and a discussion with the course coordinator without a sanction as a method to handle free-riding (instead of member expulsion). Furthermore, we find that the characteristic team formation approach becomes even more important (especially self-selection) when student stakes increase. Educators can use our findings to design group projects that better help to reduce free-riding by (1) avoiding random assignment as team formation approach, (2) using (one or two) peer process evaluations, and (3) creating small(er) teams.

  17. d

    Contracts from PASS

    • catalog.data.gov
    • opendata.dc.gov
    • +4more
    Updated Sep 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Contracting and Procurement (2025). Contracts from PASS [Dataset]. https://catalog.data.gov/dataset/contracts-from-pass-8ecbd
    Explore at:
    Dataset updated
    Sep 10, 2025
    Dataset provided by
    Office of Contracting and Procurement
    Description

    The Office of Contracting and Procurement (OCP) is required to post contract awards valued at $100,000 or more for agencies served by OCP. The awarded contracts database includes a caption describing the type of goods or services provided, the contract number, the ordering agency, the contract amount, the period of time covered by the contract award, the contractor receiving the award, and the market type. To obtain a copy of any contract, you may submit a FOIA request online via the DC government Public FOIA Portal. Data has been updated to include agency budget code, name, and acronym attributes. Budget codes were used to assign the agency name and acronym to each record. Agencies that share the same budget code, such as those under the Executive Office of the Mayor, were left blank in PASS records. For questions regarding details within the data, contact the Office of Contracting and Procurement at https://contracts.ocp.dc.gov/contact.

  18. G

    Data Contracts for AI Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Data Contracts for AI Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/data-contracts-for-ai-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Oct 7, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Data Contracts for AI Market Outlook



    According to our latest research, the global Data Contracts for AI market size reached USD 1.65 billion in 2024, reflecting robust expansion driven by increasing adoption of AI-powered data management solutions. The market is expected to grow at a remarkable CAGR of 23.7% from 2025 to 2033, positioning the industry to achieve a value of approximately USD 13.45 billion by 2033. This significant growth is propelled by the rising need for robust data governance, regulatory compliance, and secure data sharing frameworks across industries leveraging artificial intelligence.




    One of the primary growth factors fueling the Data Contracts for AI market is the escalating demand for data governance and compliance management in an era of stringent data privacy regulations. Organizations are increasingly recognizing the importance of establishing clear, enforceable agreements—known as data contracts—to ensure the responsible use, sharing, and processing of data within AI systems. The proliferation of global data protection laws such as the GDPR, CCPA, and emerging regulations in Asia Pacific and Latin America is compelling enterprises to adopt solutions that facilitate traceability, accountability, and transparency in AI-driven data workflows. As a result, the market is witnessing heightened investment from industries such as BFSI, healthcare, and government, where regulatory compliance is paramount.




    Another significant driver of market growth is the rapid advancement and integration of AI technologies across diverse business functions, which in turn amplifies the complexity and volume of data being processed. With AI models relying heavily on high-quality, well-governed datasets, data contracts are becoming essential tools for defining data ownership, access rights, and usage limitations. This is particularly crucial in collaborative environments where data is sourced from multiple internal and external stakeholders. The need to mitigate risks associated with data breaches, unauthorized access, and model bias is prompting organizations to invest in advanced data contract solutions that offer granular control and real-time monitoring capabilities, further accelerating market expansion.




    The increasing adoption of cloud-based AI platforms and the emergence of multi-cloud and hybrid environments are also contributing to the growth trajectory of the Data Contracts for AI market. As enterprises transition their data infrastructure to the cloud to enhance scalability and flexibility, they face new challenges related to data integration, interoperability, and security. Data contracts play a pivotal role in addressing these challenges by standardizing data exchange protocols and ensuring compliance across distributed environments. Moreover, the rise of data marketplaces and data-as-a-service models is creating new opportunities for monetizing data assets, thus underscoring the need for robust contractual frameworks to manage data rights and obligations in AI ecosystems.




    Regionally, North America continues to dominate the Data Contracts for AI market, accounting for the largest share in 2024, followed closely by Europe and Asia Pacific. The presence of leading AI technology providers, early adoption of digital transformation initiatives, and a mature regulatory landscape are key factors driving market leadership in these regions. Meanwhile, Asia Pacific is emerging as the fastest-growing region, fueled by rapid industrialization, increasing investments in AI research, and evolving data privacy regulations. Latin America and the Middle East & Africa are also witnessing steady growth, albeit from a smaller base, as enterprises in these regions begin to recognize the strategic value of data contracts in AI deployments.





    Component Analysis



    The Data Contracts for AI market is segmented by component into Software, Services, and Platforms, each playing a distinct role in the overall ecosystem. The software segment encompasses sol

  19. 4

    Dataset with a classification of papers, supporting the paper: Data: to...

    • data.4tu.nl
    zip
    Updated Jun 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rogier Harmelink (2024). Dataset with a classification of papers, supporting the paper: Data: to share or not to share? A Semi-Systematic Literature Review in (rational) data sharing in inter-organizational systems [Dataset]. http://doi.org/10.4121/45af87ba-7eaa-4bc9-b3e8-8192518ec14e.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 4, 2024
    Dataset provided by
    4TU.ResearchData
    Authors
    Rogier Harmelink
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This repository is used for archiving the classification of papers in the following article: Data: to share or not to share? A Semi-Systematic Literature Review in (rational) data sharing in inter-organizational systems.


    This archive is supporting the paper for the labels that have been given to the papers found in the Semi-Systematic Literature Review.


    The following categories and labels are possible in the classification of the literature:

    Category ["Paper"] - Labels: Summarized name(s) of the author(s).


    Category ["Knowledge dimension"] - Labels: ["Data", "Information", "Knowledge", "Data/information (paper is ambiguous on whether it is data or information that is shared)", "None", "Data/Knowledge (data is learned and transferred in to knowledge, the knowledge is shared)"]


    Category ["Type of industry"] - Labels: ["None", "Supply chain", "Healthcare", "Vehicles", "Agricultural", "Research", "Networks", "Automotive", "Innovation", "Smart Grid", "Social media", "R&D", "e-Governance", "Construction Sector", "Government-enterprise", "Manufacturing", "Power grid", "Smart Cities", "Personal data", "Cyber security", "E-commerce", "Maritime", "Online marketplaces", "Assembly", "Engineering", "Communities of Practice", "Knowledge Management Systems", "B2B commerce", "Fisheries", "Outsourcing", "High-tech firms", "Crisis", "e-Services", "Seaports", "Horticulture", "Data markets", "Media", "Cultural Heritage Institutions", "Fresh Products", "Knowledge market", "Ecological", "Ride Sharing", "Government", "Transit", "Medical", "Virtual Research Organization", "Energy", "Oil and gas", "Education"]


    Category ["Game theory approach"] - Labels: ["None", "Non-cooperative game", "Evolutionary", "Cooperative game", "Stackelberg", "Auction", "Unclear defined", "Diffusion kernels", "Markov game", "Negotiation", "(Non-)cooperative game", "Differential game", "Pricing", "Bayesian", "Contract theory", "Non-collusion", "Stochastic differential game", "Fisher’s market", "Hotelling", "Bargaining", "Access control"]


    Category ["Technologies of interest"] - Labels: ["blockchain", "None", "smart contracts", "federated learning", "internet of things", "machine learning", "cloud computing", "artificial intelligence", "ethereum", "5G", "data trust", "deep neural networks", "digital twins", "internet of (medical) things", "smart grid", "data governance", "artifical intelligence", "collaborative learning", "smart contract", "encryption", "data escrow", "NFT", "data mining", "semantic technologies", "transfer learning"]


    Category ["Level of trust"] - Labels: ["Calculus-Based Trust", "None", "security", "privacy", "integrity", "transparency", "authentication", "confidentiality", "traceability", "verification", "Knowledge-Based Trust", "privacy"]


    Category ["Type of contract"> - Labels: ["None", "Smart contracts", "Contract theory", "Linear wholesale price contract", "GMP and IPD contracts", "General contract", "Incentive contract", "Trust-embedded contract", "Wholesale price contract"]

  20. d

    Tree Contract Work Details

    • catalog.data.gov
    • data.cityofnewyork.us
    Updated Nov 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofnewyork.us (2025). Tree Contract Work Details [Dataset]. https://catalog.data.gov/dataset/tree-contract-work-details
    Explore at:
    Dataset updated
    Nov 22, 2025
    Dataset provided by
    data.cityofnewyork.us
    Description

    Tree Contract Work Details contains information related to work performed on specific tree work contracts. This dataset can be joined to the Forestry Work Orders dataset (https://data.cityofnewyork.us/Environment/Forestry-Work-Orders/bdjm-n7q4) by joining work_order_id from Tree Contract Work Details to OBJECTID from Forestry Work Orders. There are many work_order_ids per single contract_number. Not all work orders in Forestry Work Orders are represented in this dataset because it is only used to track work on specific contracts. Data Dictionary found here: https://docs.google.com/spreadsheets/d/1rAQL_d8yQ5axRIGl9vLxpr73rDmQUZ23VKQC8QQWIYI/edit?usp=sharing

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
depositar (2025). Data Collaborations Across Boundaries (Slides) [Dataset]. https://data.depositar.io/dataset/data-collaborations-across-boundaries

Data Collaborations Across Boundaries (Slides)

Explore at:
pdf(3112569), pdf(4440122), pdf(1792282), pdf(1296859), pdf(10713394)Available download formats
Dataset updated
Jun 27, 2025
Dataset provided by
depositar
Description

This dataset collects the slides that were presented at the Data Collaborations Across Boundaries session in SciDataCon 2022, part of the International Data Week.

The following session proposal was prepared by Tyng-Ruey Chuang and submitted to SciDataCon 2022 organizers for consideration on 2022-02-28. The proposal was accepted on 2022-03-28. Six abstracts were submitted and accepted to this session. Five presentations were delivered online in a virtual session on 2022-06-21.

Data Collaborations Across Boundaries

There are many good stories about data collaborations across boundaries. We need more. We also need to share the lessons each of us has learned from collaborating with parties and communities not in our familiar circles.

By boundaries, we mean not just the regulatory borders in between the nation states about data sharing but the various barriers, readily conceivable or not, that hinder collaboration in aggregating, sharing, and reusing data for social good. These barriers to collaboration exist between the academic disciplines, between the economic players, and between the many user communities, just to name a few. There are also cross-domain barriers, for example those that lay among data practitioners, public administrators, and policy makers when they are articulating the why, what, and how of "open data" and debating its economic significance and fair distribution. This session aims to bring together experiences and thoughts on good data practices in facilitating collaborations across boundaries and domains.

The success of Wikipedia proves that collaborative content production and service, by ways of copyleft licenses, can be sustainable when coordinated by a non-profit and funded by the general public. Collaborative code repositories like GitHub and GitLab demonstrate the enormous value and mass scale of systems-facilitated integration of user contributions that run across multiple programming languages and developer communities. Research data aggregators and repositories such as GBIF, GISAID, and Zenodo have served numerous researchers across academic disciplines. Citizen science projects and platforms, for instance eBird, Galaxy Zoo, and Taiwan Roadkill Observation Network (TaiRON), not only collect data from diverse communities but also manage and release datasets for research use and public benefit (e.g. TaiRON datasets being used to improve road design and reduce animal mortality). At the same time large scale data collaborations depend on standards, protocols, and tools for building registries (e.g. Archival Resource Key), ontologies (e.g. Wikidata and schema.org), repositories (e.g. CKAN and Omeka), and computing services (e.g. Jupyter Notebook). There are many types of data collaborations. The above lists only a few.

This session proposal calls for contributions to bring forward lessons learned from collaborative data projects and platforms, especially about those that involve multiple communities and/or across organizational boundaries. Presentations focusing on the following (non-exclusive) topics are sought after:

  1. Support mechanisms and governance structures for data collaborations across organizations/communities.

  2. Data policies --- such as data sharing agreements, memorandum of understanding, terms of use, privacy policies, etc. --- for facilitating collaborations across organizations/communities.

  3. Traditional and non-traditional funding sources for data collaborations across multiple parties; sustainability of data collaboration projects, platforms, and communities.

  4. Data workflows --- collection, processing, aggregation, archiving, and publishing, etc. --- designed with considerations of (external) collaboration.

  5. Collaborative web platforms for data acquisition, curation, analysis, visualization, and education.

  6. Examples and insights from data trusts, data coops, as well as other formal and informal forms of data stewardship.

  7. Debates on the pros and cons of centralized, distributed, and/or federated data services.

  8. Practical lessons learned from data collaboration stories: failure, success, incidence, unexpected turn of event, aftermath, etc. (no story is too small!).

Search
Clear search
Close search
Google apps
Main menu