100+ datasets found
  1. Thesis Data Repository

    • figshare.unimelb.edu.au
    zip
    Updated Oct 11, 2023
    Cite
    Gregory White (2023). Thesis Data Repository [Dataset]. http://doi.org/10.26188/24295243.v1
    Available download formats: zip
    Dataset updated
    Oct 11, 2023
    Dataset provided by
    The University of Melbourne
    Authors
    Gregory White
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Availability of data, code, and plot creation for various figures throughout my PhD thesis. Rough organisation currently. Pertains to Figures 5.4, 5.8, 6.11, 6.18, 7.3, 7.12, and Table 6.1.

  2. Statistics on the number of scholarships for masters and doctoral...

    • data.gov.tw
    csv
    Updated Jun 1, 2025
    Cite
    Department of Student Affairs and Special Education (2025). Statistics on the number of scholarships for masters and doctoral dissertations and journal papers in gender equality education [Dataset]. https://data.gov.tw/en/datasets/159100
    Available download formats: csv
    Dataset updated
    Jun 1, 2025
    Dataset authored and provided by
    Department of Student Affairs and Special Education
    License

    https://data.gov.tw/license

    Description

    To encourage academic research on gender equality education and raise the academic standard of work on the topic, the Ministry of Education has formulated the "Key Points for the Ministry of Education to Award Master's and Doctoral Thesis and Journal Papers on Gender Equality Education", under which the awards are granted.

  3. New Thesis Data Sets Dataset

    • universe.roboflow.com
    zip
    Updated Feb 10, 2024
    Cite
    Conveyor (2024). New Thesis Data Sets Dataset [Dataset]. https://universe.roboflow.com/conveyor/new-thesis-data-sets
    Available download formats: zip
    Dataset updated
    Feb 10, 2024
    Dataset authored and provided by
    Conveyor
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Fruits Pineapple Mango Papaya Bounding Boxes
    Description

    New Thesis Data Sets

    ## Overview
    
    New Thesis Data Sets is a dataset for object detection tasks - it contains Fruits Pineapple Mango Papaya annotations for 4,346 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
    ## License

    This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
    
  4. Master Thesis dataset

    • data.mendeley.com
    Updated Feb 26, 2024
    Cite
    Ekaterina Lazareva (2024). Master Thesis dataset [Dataset]. http://doi.org/10.17632/dr5d8mzzr4.1
    Dataset updated
    Feb 26, 2024
    Authors
    Ekaterina Lazareva
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is used in a Master's thesis on the topic "The impact of upholding environmental, social and governance principles on the market value of capital-intensive companies". The full dataset consists of data on public American companies included in the S&P 500 index and traded on the New York Stock Exchange. It contains the ESG score and its components (E, S, G), as well as the components of the Environmental pillar score. Additionally, the dataset includes financial data such as market capitalization, leverage, ROCE, and Capex. The main sources of data are the Thomson Reuters Eikon and Bloomberg terminals, along with Form 10-K filings with the SEC. The final sample consists of 52 capital-intensive companies over the time horizon 2012-2021 (520 observations in total).
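
    The stated sample size follows from a balanced panel: 52 companies, each observed in every year from 2012 to 2021. The sketch below only reproduces that arithmetic. It assumes every company appears in every year (the 520 figure implies this, but the description does not say so), and the integer company identifiers and the `panel_index` helper are hypothetical, not part of the dataset.

```python
# Hypothetical illustration of the firm-year structure implied by the description.
N_COMPANIES = 52              # stated in the description
YEARS = range(2012, 2022)     # 2012-2021 inclusive, i.e. 10 years


def panel_index(n_companies, years):
    """Return (company_id, year) pairs for a balanced panel (assumed layout)."""
    return [(company, year)
            for company in range(1, n_companies + 1)
            for year in years]


index = panel_index(N_COMPANIES, YEARS)
print(len(index))  # 52 companies * 10 years
```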

  5. Data underlying the master thesis: Exploring Copula-Based Models for the...

    • figshare.com
    txt
    Updated Jun 1, 2023
    Cite
    Dimitris Theodorakopoulos (2023). Data underlying the master thesis: Exploring Copula-Based Models for the Stochastic Simulation of Information Retrieval Evaluation Data [Dataset]. http://doi.org/10.4121/21739355.v1
    Available download formats: txt
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    4TU.ResearchData
    Authors
    Dimitris Theodorakopoulos
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains the results of the experiments that I ran for my master thesis. The full code (and more) can be found at https://github.com/dimitris93/msc-thesis

  6. Data from the thesis: Reduced-order models to predict mesoscale mechanical...

    • data.niaid.nih.gov
    Updated Apr 17, 2024
    Cite
    Reddy, Vineet K (2024). Data from the thesis: Reduced-order models to predict mesoscale mechanical behavior of polycrystalline materials [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10981039
    Dataset updated
    Apr 17, 2024
    Dataset provided by
    Reddy, Vineet K
    Adlakha, Ilaksh
    Gupta, Sayan
    Roychowdhury, Sushovan
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This record contains the data and code from the thesis: Reduced-order models to predict mesoscale mechanical behavior of polycrystalline materials. The contents of the chapter-wise zip files are described in the respective markdown files with the suffix _readme.md.

    A record containing only the code from the thesis is available at: 10.5281/zenodo.10983507.

  7. PhD thesis

    • figshare.com
    pdf
    Updated Jun 20, 2023
    Cite
    Anders Eklund (2023). PhD thesis [Dataset]. http://doi.org/10.6084/m9.figshare.704865.v1
    Available download formats: pdf
    Dataset updated
    Jun 20, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Anders Eklund
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    My PhD thesis

    Computational medical image analysis - With a focus on real-time fMRI and non-parametric statistics

  8. Master's Thesis Research Data: Integrating Explainability into Federated...

    • dataverse.harvard.edu
    Updated May 5, 2025
    Cite
    Nicolas Sebastian Schuler (2025). Master's Thesis Research Data: Integrating Explainability into Federated Learning: A Non-functional Requirement Perspective [Dataset]. http://doi.org/10.7910/DVN/PNMARJ
    Available download formats: Croissant (a format for machine-learning datasets; see mlcommons.org/croissant)
    Dataset updated
    May 5, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Nicolas Sebastian Schuler
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data set contains the research data for the master's thesis: Integrating Explainability into Federated Learning: A Non-functional Requirement Perspective. The master's thesis was written by Nicolas Sebastian Schuler at the Computer Science Department at Karlsruhe Institute of Technology (KIT) in Germany. The data set contains:

    - Associated Jupyter notebooks for reproducing the figures in the master's thesis.
    - Experiment data generated by the federated learning simulations.
    - Results of the user survey conducted for the master's thesis.
    - The Python libraries used.

    It also includes the submitted final thesis.

    Notice: The research data is split into multiple chunks, which can be combined after downloading with the following command:

        $ cat thesis-results-part-* > thesis-results.tar.zst

    and extracted with:

        $ tar --zstd -xvf thesis-results.tar.zst
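
    Where `cat` is not available (for example on a plain Windows shell), the chunk-combining step described above could be approximated in Python. This is a minimal sketch, not something shipped with the data set: the `combine_chunks` helper is hypothetical, and it assumes the chunk names sort into the correct order lexicographically, as the shell glob in the original command does.

```python
import shutil
from pathlib import Path


def combine_chunks(directory, pattern="thesis-results-part-*",
                   output="thesis-results.tar.zst"):
    """Concatenate chunk files in sorted name order, like `cat part-* > out`.

    Hypothetical helper: assumes lexicographic order is the intended order.
    """
    directory = Path(directory)
    parts = sorted(p for p in directory.glob(pattern) if p.is_file())
    if not parts:
        raise FileNotFoundError(f"no files matching {pattern!r} in {directory}")
    target = directory / output
    with target.open("wb") as out:
        for part in parts:
            with part.open("rb") as src:
                # Stream each chunk so large files are never fully in memory.
                shutil.copyfileobj(src, out)
    return target
```

    The combined archive would still be extracted with the `tar --zstd` command given in the description.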

  9. Data for the PhD thesis "Modeling Lexical Fields for Translation: a...

    • heidata.uni-heidelberg.de
    zip
    Updated Aug 4, 2025
    Cite
    Meri Dallakyan (2025). Data for the PhD thesis "Modeling Lexical Fields for Translation: a Corpus-Based Study of Armenian, German, and English Culinary Verbs" [Dataset]. http://doi.org/10.11588/DATA/3MPL7E
    Available download formats: zip(166634), zip(1130199), zip(617108), zip(167898), zip(4471905), zip(5882160), zip(1203076), zip(334871), zip(3353340), zip(2699455), zip(436611), zip(412972), zip(125927), zip(22647800)
    Dataset updated
    Aug 4, 2025
    Dataset provided by
    heiDATA
    Authors
    Meri Dallakyan
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains high-resolution versions of all graphical visualizations of the data analysis presented in my doctoral dissertation. The graphs are organized by chapter and subchapter and titled accordingly. Additionally, this dataset provides all dataframes (German, English, and Armenian) of the manual semantic annotation, in XLSX format, from which the graphs were generated. The visualizations include (Multiple) Correspondence Analysis (MCA vs. CA), mosaic plots, Conditional Inference Trees (CIT), and Context-Conditional Correlations Graphs (CCCG).

  10. PhD Thesis: Development of Equitable Algorithms for Road Funds Allocation...

    • figshare.com
    application/cdfv2
    Updated Jan 19, 2016
    Cite
    Andrew Naimanye (2016). PhD Thesis: Development of Equitable Algorithms for Road Funds Allocation and Road Scheme Priritization in Developing Countries: A Case Study of Sub-Saharan Africa [Dataset]. http://doi.org/10.6084/m9.figshare.1396244.v1
    Available download formats: application/cdfv2
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Andrew Naimanye
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Africa, Sub-Saharan Africa
    Description

    Uganda Road Fund Allocation Formula application 2014 and 2015

  11. Data underlying the thesis: Multiparty Computation: The effect of multiparty...

    • data.4tu.nl
    zip
    Updated Nov 6, 2020
    Cite
    Masud Petronia (2020). Data underlying the thesis: Multiparty Computation: The effect of multiparty computation on firms' willingness to contribute protected data [Dataset]. http://doi.org/10.4121/13102430.v1
    Available download formats: zip
    Dataset updated
    Nov 6, 2020
    Dataset provided by
    4TU.ResearchData
    Authors
    Masud Petronia
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This thesis-mpc-dataset-public-readme.txt file was generated on 2020-10-20 by Masud Petronia

    GENERAL INFORMATION
    1. Title of Dataset: Data underlying the thesis: Multiparty Computation: The effect of multiparty computation on firms' willingness to contribute protected data
    2. Author Information
       A. Principal Investigator Contact Information
          Name: Masud Petronia
          Institution: TU Delft, Faculty of Technology, Policy and Management
          Address: Mekelweg 5, 2628 CD Delft, Netherlands
          Email: masud.petronia@gmail.com
          ORCID: https://orcid.org/0000-0003-2798-046X
    3. Description of dataset: This dataset contains perceptual data of firms' willingness to contribute protected data through multiparty computation (MPC). Petronia (2020, ch. 6) draws several conclusions from this dataset and provides recommendations for future research (Petronia, 2020, ch. 7.4).
    4. Date of data collection: July-August 2020
    5. Geographic location of data collection: Netherlands
    6. Information about funding sources that supported the collection of the data: Horizon 2020 Research and Innovation Programme, Grant Agreement no 825225 – Safe Data Enabled Economic Development (SAFE-DEED), from the H2020-ICT-2018-2

    SHARING/ACCESS INFORMATION
    1. Licenses/restrictions placed on the data: CC 0
    2. Links to publications that cite or use the data: Petronia, M. N. (2020). Multiparty Computation: The effect of multiparty computation on firms' willingness to contribute protected data (Master's thesis). Retrieved from http://resolver.tudelft.nl/uuid:b0de4a4b-f5a3-44b8-baa4-a6416cebe26f
    3. Was data derived from another source? No
    4. Citation for this dataset: Petronia, M. N. (2020). Multiparty Computation: The effect of multiparty computation on firms' willingness to contribute protected data (Master's thesis). Retrieved from https://data.4tu.nl/. doi:10.4121/13102430

    DATA & FILE OVERVIEW
    1. File List: thesis-mpc-dataset-public.xlsx, thesis-mpc-dataset-public-readme.txt (this document)
    2. Relationship between files: Dataset metadata and instructions
    3. Additional related data collected that was not included in the current data package: Occupation and role of respondents (traceable to unique reference), removed for privacy reasons.
    4. Are there multiple versions of the dataset? No

    METHODOLOGICAL INFORMATION
    1. Description of methods used for collection/generation of data: A pre- and post-test experimental design. For more information, see Petronia (2020, ch. 5).
    2. Methods for processing the data: Full instructions are provided by Petronia (2020, ch. 6)
    3. Instrument- or software-specific information needed to interpret the data: Microsoft Excel can be used to convert the dataset to other formats.
    4. Environmental/experimental conditions: This dataset comprises three datasets collected through three channels. These channels are Prolific (incentivised), LinkedIn/Twitter (voluntary), and respondents in a lab setting (voluntary). For more information, see Petronia (2020, ch. 6.1).
    5. Describe any quality-assurance procedures performed on the data: A thorough examination of consistency and reliability was performed. For more information, see Petronia (2020, ch. 6).
    6. People involved with sample collection, processing, analysis and/or submission: See Petronia (2020, ch. 6)

    DATA-SPECIFIC INFORMATION
    1. Number of variables: see worksheet experiment_matrix of thesis-mpc-dataset-public.xlsx
    2. Number of cases/rows: see worksheet experiment_matrix of thesis-mpc-dataset-public.xlsx
    3. Variable List: see worksheet labels of thesis-mpc-dataset-public.xlsx
    4. Missing data codes: see worksheet comments of thesis-mpc-dataset-public.xlsx
    5. Specialized formats or other abbreviations used: Multiparty computation (MPC) and Trusted Third Party (TTP).

    INSTRUCTIONS
    1. Petronia (2020, ch. 6) describes associated tests and respective syntax.

  12. Data from: UK Doctoral Thesis Metadata from EThOS

    • marketplace.sshopencloud.eu
    Updated Jan 3, 2015
    + more versions
    Cite
    (2015). UK Doctoral Thesis Metadata from EThOS [Dataset]. http://doi.org/10.23636/1137
    Dataset updated
    Jan 3, 2015
    Area covered
    United Kingdom
    Description

    The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We estimate the data covers around 98% of all PhDs ever awarded by UK Higher Education institutions, dating back to 1787. Thesis metadata from every PhD-awarding university in the UK is included. You can investigate and re-use this unique collection of UK universities' PhD thesis data to analyse trends in postgraduate research, make connections between researchers, apply large data analysis, improve citation of theses and many more applications.

  13. Coding Set: Social Network Analysis Data for the PhD Thesis "More than...

    • pubdata.leuphana.de
    xlsx
    Updated 2024
    + more versions
    Cite
    Roman Isaac; Berta Martín-López (2024). Coding Set: Social Network Analysis Data for the PhD Thesis "More than trees" [Dataset]. http://doi.org/10.48548/pubdata-217
    Available download formats: xlsx(18596)
    Dataset updated
    2024
    Authors
    Roman Isaac; Berta Martín-López
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Dataset funded by
    Deutsche Forschungsgemeinschaft (DFG)
    Description

    To identify relevant actors for the governance of co-produced forest nature's contributions to people (NCP), the researchers conducted a social-network analysis based on 39 semi-structured interviews with foresters and conservation managers. These interviews were conducted across three case study sites in Germany: Schorfheide-Chorin in the Northeast, Hainich-Dün in the Centre, and Schwäbische Alb in the Southwest. All three case study sites belong to the large-scale and long-term research platform Biodiversity Exploratories. The researchers employed a predefined coding set to analyse the interviews and grasp the relationships between different actors based on the anthropogenic capitals they used to co-produce forest NCP. To protect the interviewees' anonymity, the coded interviews cannot be published. Therefore, this data set is limited to the coding set.

  14. Sample Dataset - HR Subject Areas

    • data.niaid.nih.gov
    Updated Jan 18, 2023
    Cite
    Weber, Marc (2023). Sample Dataset - HR Subject Areas [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7447111
    Dataset updated
    Jan 18, 2023
    Dataset authored and provided by
    Weber, Marc
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset created as part of the Master Thesis "Business Intelligence – Automation of Data Marts modeling and its data processing".

    Lucerne University of Applied Sciences and Arts

    Master of Science in Applied Information and Data Science (MScIDS)

    Autumn Semester 2022

    Change log Version 1.1:

    The following SQL scripts were added:

        Index  Type              Name
        1      View              pg.dictionary_table
        2      View              pg.dictionary_column
        3      View              pg.dictionary_relation
        4      View              pg.accesslayer_table
        5      View              pg.accesslayer_column
        6      View              pg.accesslayer_relation
        7      View              pg.accesslayer_fact_candidate
        8      Stored Procedure  pg.get_fact_candidate
        9      Stored Procedure  pg.get_dimension_candidate
        10     Stored Procedure  pg.get_columns

    Scripts are based on Microsoft SQL Server Version 2017 and compatible with a data warehouse built with Datavault Builder. Data warehouse objects scripts of the sample data warehouse are restricted and cannot be shared.

  15. Research data supporting PhD thesis: "Automating Assembly on Construction...

    • repository.cam.ac.uk
    xls
    Updated Feb 21, 2024
    Cite
    Butterfield, Timothy (2024). Research data supporting PhD thesis: "Automating Assembly on Construction Sites" [Dataset]. http://doi.org/10.17863/CAM.95106
    Available download formats: xls(41257 bytes)
    Dataset updated
    Feb 21, 2024
    Dataset provided by
    University of Cambridge
    Apollo
    Authors
    Butterfield, Timothy
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Spreadsheet containing the following data tables in separate tabs: 1) Table of construction components with assembly-related properties obtained from off-the-shelf product ranges. 2) Data table summarising evidence of information technology use in five construction project case studies. 3) Table of technologies which could potentially be applied to adapt existing construction plant to provide robotic handling capabilities. 4) Table of typical reach and payload capabilities of suitable construction plant.

  16. Stock-Thesis-Data

    • huggingface.co
    Cite
    Dimitris, Stock-Thesis-Data [Dataset]. https://huggingface.co/datasets/J1mb0o/Stock-Thesis-Data
    Authors
    Dimitris
    Description

    J1mb0o/Stock-Thesis-Data dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. Reduced Order Models Chapter - N.C. Clementi PhD Thesis (problem data set)

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 24, 2021
    Cite
    Natalia C. Clementi (2021). Reduced Order Models Chapter - N.C. Clementi PhD Thesis (problem data set) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4558104
    Dataset updated
    Feb 24, 2021
    Dataset authored and provided by
    Natalia C. Clementi
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Problem folders containing all the input files needed to reproduce the computations behind the results in the Reduced Order Models chapter of N.C. Clementi's PhD thesis.

  18. Data from: Advanced Topics in Differentially Private Statistical Learning

    • curate.nd.edu
    pdf
    Updated Jul 14, 2025
    Cite
    Spencer Tate Giddens (2025). Advanced Topics in Differentially Private Statistical Learning [Dataset]. http://doi.org/10.7274/29498438.v1
    Available download formats: pdf
    Dataset updated
    Jul 14, 2025
    Dataset provided by
    University of Notre Dame
    Authors
    Spencer Tate Giddens
    License

    https://www.law.cornell.edu/uscode/text/17/106

    Description

    Collecting and utilizing data to understand population trends, make predictions, and guide decisions is becoming increasingly common in today's world. In particular, statistical learning allows users to infer relationships between variables, learn patterns, and predict outcomes for previously unseen data via concepts and techniques from statistics and machine learning. Although many of the results of this practice have been beneficial, the data used often contain sensitive information, such as medical records or financial information, so maintaining privacy is of paramount importance when releasing statistics, parameter estimates, and other results. Differential privacy (DP) is the state-of-the-art framework for guaranteeing privacy when releasing aggregate information and statistics from a dataset. It provides a provable bound on the incurred privacy loss via the injection of random noise, at the cost of a reduction in utility. While many works have been devoted to establishing DP guarantees for various analysis tools in the past two decades since DP's introduction, many popular statistical learning approaches still lack a DP counterpart. This dissertation addresses this issue in three original research topics, as listed below.

    First, the dissertation presents the first differentially private algorithm for general weighted empirical risk minimization (wERM), along with theoretical DP guarantees. It evaluates the performance of the DP-wERM framework applied to outcome weighted learning (OWL), a method for learning individualized treatment rules, in both simulation studies and in a real clinical trial. The results demonstrate the feasibility of training OWL models via wERM with DP guarantees while maintaining sufficiently robust model performance.

    Second, the dissertation presents several original approaches with proven DP guarantees for linear mixed-effects (LME) models. LME models are popular, especially among statisticians, but lack sufficient work on integrating DP. The work leverages some recent advancements in the DP literature, particularly in DP stochastic gradient descent (SGD), to estimate LME model parameters with DP guarantees with better privacy-utility trade-offs. Theoretical results for an upper bound for the mean squared error between private parameter estimates vs the true parameters for DP-SGD-based approaches are provided, and a simulation study and a real-world case study provide further empirical evidence for the feasibility of the approaches at practically reasonable privacy budgets.

    Third, this dissertation introduces SAFES, a Sequential PrivAcy and Fairness Enhancing data Synthesis procedure that sequentially combines DP data synthesis with a fairness-aware data transformation. Alongside privacy, the fairness of decisions made by a statistical learning model is also crucial to address, though the vast majority of existing literature treats the two concerns independently. For methods that do consider privacy and fairness simultaneously, they often only apply to a specific machine learning task, limiting their generalizability. SAFES allows full control over the privacy-fairness-utility trade-off via tunable privacy and fairness parameters. SAFES is illustrated by combining a graphical model-based DP data synthesizer with a popular fairness-aware data pre-processing transformation, and empirical evaluations on two popular benchmark datasets demonstrate that for reasonable privacy loss, SAFES-generated synthetic data achieve significantly improved fairness metrics with relatively low utility loss.
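
    The noise-injection idea behind DP mentioned in the abstract can be illustrated with the textbook Laplace mechanism. The sketch below is not taken from the dissertation and does not implement DP-wERM, the LME approaches, or SAFES. The `private_mean` helper is hypothetical, and it assumes the dataset size is public, that neighbouring datasets differ by replacing one record, and that values are clipped to known bounds.

```python
import random


def private_mean(values, lower, upper, epsilon, rng=None):
    """Release a mean with epsilon-DP via the Laplace mechanism (textbook sketch).

    Assumptions (not from the source): n is public, neighbouring datasets
    differ by replacing one record, and values are clipped to [lower, upper].
    """
    if not values:
        raise ValueError("values must be non-empty")
    if epsilon <= 0 or upper <= lower:
        raise ValueError("need epsilon > 0 and upper > lower")
    rng = rng or random.Random()
    n = len(values)
    clipped = [min(max(v, lower), upper) for v in values]
    # Replacing one clipped record moves the mean by at most this much.
    sensitivity = (upper - lower) / n
    # Laplace(0, b) with b = sensitivity / epsilon, drawn as the difference
    # of two independent exponentials with rate 1 / b.
    rate = epsilon / sensitivity
    noise = rng.expovariate(rate) - rng.expovariate(rate)
    return sum(clipped) / n + noise
```

    A smaller `epsilon` gives a larger noise scale, which is the privacy-utility trade-off the abstract refers to.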

  19. Software Engineering PhD and Licentiate Theses in Sweden: Publication...

    • zenodo.org
    • data.niaid.nih.gov
    bin, csv
    Updated Mar 3, 2021
    Cite
    Grischa Liebel; Robert Feldt (2021). Software Engineering PhD and Licentiate Theses in Sweden: Publication statistics [Dataset]. http://doi.org/10.5281/zenodo.4573263
    Available download formats: csv, bin
    Dataset updated
    Mar 3, 2021
    Dataset provided by
    Zenodo
    Authors
    Grischa Liebel; Robert Feldt
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Sweden
    Description

    This simple dataset contains publication statistics of Swedish PhD and Licentiate theses in Software Engineering from 1999 to 2018. The contents of this dataset were discussed in a blog post on https://grischaliebel.de.

    The data is offered in two formats, xlsx and csv, but with the same content. Names and affiliation are anonymised in the data set to prevent identification of subjects. In the following, we describe the content of the different columns in the table.

    • Level: 'lic' for Licentiate theses or 'phd' for PhD theses
    • Year: The year of publication of the thesis
    • Included: The total number of papers included in the compilation-style thesis.
    • Listed: Number of papers listed in addition to the included papers (basically "I have also published these, but they are not relevant to the thesis"). Note that we cannot distinguish between cases where no papers are listed because none are published and cases where the author decided not to list them.
    • IncludedPublished: The number of included papers that are published or accepted for publication.
    • IncludedSubmitted: The number of included papers that are in submission/under review.
    • IncludedPublishedISI: The amount of included, published papers that are in ISI-ranked journals.
    • IncludedPublishedNonISIJ: The amount of included, published papers that are in non ISI-ranked journals.
    • IncludedPublishedConf: The amount of included, published papers that are in CORE-ranked conferences (any grade).
    • IncludedPublishedWS: The amount of included, published papers that are in workshops. Non CORE-ranked conferences are counted as workshops as well.
    • IncludedPublishedOther: The amount of included, published papers that do not fit in any other category (e.g., book chapters, technical reports).
    • IncludedSubmitted*: Amount of included, submitted papers broken down by category (Journal, conference, workshop, and other).
    • ListedPublished*: Amount of listed, published papers broken down by category (ISI/Non-ISI Journal, conference, workshop, and other).
    • ListedSubmitted*: Amount of listed, submitted papers broken down by category (Journal, conference, workshop, and other).
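
    The column descriptions above suggest simple aggregate questions, such as what share of included papers was already published, per thesis level. The following is a minimal sketch under stated assumptions: the rows in `SAMPLE_CSV` are invented for illustration and are not from the dataset, the header names are assumed to match the column names listed above exactly, and `published_share_by_level` is a hypothetical helper.

```python
import csv
import io
from collections import defaultdict

# Made-up rows for illustration only; the real file is not reproduced here.
SAMPLE_CSV = """Level,Year,Included,Listed,IncludedPublished,IncludedSubmitted
lic,2015,3,1,2,1
phd,2016,6,4,5,1
phd,2018,5,0,4,1
"""


def published_share_by_level(csv_text):
    """Return {level: published included papers / all included papers}."""
    totals = defaultdict(lambda: [0, 0])  # level -> [published, included]
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["Level"]][0] += int(row["IncludedPublished"])
        totals[row["Level"]][1] += int(row["Included"])
    # Skip levels with no included papers to avoid dividing by zero.
    return {level: pub / inc for level, (pub, inc) in totals.items() if inc}


shares = published_share_by_level(SAMPLE_CSV)
```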

  20. Custom code created for the purposes of the thesis: "Applications of...

    • data.4tu.nl
    zip
    Cite
    Michał Ciszewski, Custom code created for the purposes of the thesis: "Applications of statistical theory to sensor data analysis" [Dataset]. http://doi.org/10.4121/d082e14d-6d92-44c9-9791-64b74dce3470.v1
    Available download formats: zip
    Dataset provided by
    4TU.ResearchData
    Authors
    Michał Ciszewski
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This is the custom code repository for replicating the results of the thesis. Three main routines are contained within this repository.

    A new quality measure is proposed in the thesis for the purposes of assessing the quality of predictors in human activity recognition problems. The related code can be found in the file: measures.py

    A postprocessing scheme is proposed in the thesis to remove unrealistically short activities from the classification given by the predictor. The related code can be found in the file: postprocessing.py
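
    To make the idea of removing unrealistically short activities concrete, here is one simple way such a step could look. It is an assumption-laden sketch and not the scheme from the thesis or the contents of postprocessing.py: the `remove_short_runs` helper and its rule (absorb any run shorter than `min_len` into the preceding run) are hypothetical.

```python
from itertools import groupby


def remove_short_runs(labels, min_len):
    """Relabel runs shorter than min_len with the preceding run's label.

    Hypothetical rule, not the thesis's scheme. A short run at the very
    start has no predecessor and is left unchanged.
    """
    runs = [(label, len(list(group))) for label, group in groupby(labels)]
    merged = []
    for label, length in runs:
        if length < min_len and merged:
            # Absorb the short run into the previous run.
            merged[-1] = (merged[-1][0], merged[-1][1] + length)
        elif merged and merged[-1][0] == label:
            # Two runs with the same label became adjacent: join them.
            merged[-1] = (label, merged[-1][1] + length)
        else:
            merged.append((label, length))
    return [label for label, length in merged for _ in range(length)]
```

    The output always has the same length as the input, so it can be compared sample by sample with the original predictions.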

    A new formulation of the null hypothesis in a permutation test for no effect is proposed in the thesis. The viability of the test is presented based on the simulation study. This simulation study can be found in the files: sim_study_lin_reg.ipynb and sim_study_nn.ipynb.
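
    For readers unfamiliar with permutation tests for no effect, the standard version can be sketched as follows. This is the classical test, not the new null-hypothesis formulation proposed in the thesis (that lives in the notebooks named above). The `permutation_p_value` helper and the choice of absolute covariance as the test statistic are assumptions made for illustration.

```python
import random


def permutation_p_value(x, y, n_permutations=2000, seed=0):
    """Two-sided permutation test of no association between x and y.

    Classical sketch: shuffling y breaks any link to x, so the shuffled
    statistics approximate the null distribution.
    """
    def abs_cov(a, b):
        mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
        return abs(sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)))

    rng = random.Random(seed)
    observed = abs_cov(x, y)
    shuffled = list(y)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs_cov(x, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_permutations + 1)
```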

Thesis Data Repository (Gregory White, 2023): 15 scholarly articles cite this dataset.