9 datasets found
  1. Non-IID scenario: 10-fold cross validation results with varying C.

    • plos.figshare.com
    • figshare.com
    xls
    Updated Jun 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu (2023). Non-IID scenario: 10-fold cross validation results with varying C. [Dataset]. http://doi.org/10.1371/journal.pone.0230706.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Non-IID scenario: 10-fold cross validation results with varying C.

  2. f

    Experimental parameter table.

    • plos.figshare.com
    xls
    Updated Apr 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Caiyu Su; Jinri Wei; Yuan Lei; Hongkun Xuan; Jiahui Li (2024). Experimental parameter table. [Dataset]. http://doi.org/10.1371/journal.pone.0298261.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Apr 10, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Caiyu Su; Jinri Wei; Yuan Lei; Hongkun Xuan; Jiahui Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the realm of targeted advertising, the demand for precision is paramount, and the traditional centralized machine learning paradigm fails to address this necessity effectively. Two critical challenges persist in the current advertising ecosystem: the data privacy concerns leading to isolated data islands and the complexity in handling non-Independent and Identically Distributed (non-IID) data and concept drift due to the specificity and diversity in user behavior data. Current federated learning frameworks struggle to overcome these hurdles satisfactorily. This paper introduces Fed-GANCC, an innovative federated learning framework that synergizes Generative Adversarial Networks (GANs) and Group Clustering. The framework incorporates a user data augmentation algorithm predicated on adversarial generative networks to enrich user behavior data, curtail the impact of non-uniform data distribution, and enhance the applicability of the global machine learning model. Unlike traditional approaches, our framework offers user data augmentation algorithms based on adversarial generative networks, which not only enriches user behavior data but also reduces the challenges posed by non-uniform data distribution, thereby enhancing the applicability of the global machine learning (ML) model. The effectiveness of Fed-GANCC is distinctly showcased through experimental results, outperforming contemporary methods like FED-AVG and FED-SGD in terms of accuracy, loss value, and receiver operating characteristic (ROC) indicators within the same computing time. Experimental results vindicate the effectiveness of Fed-GANCC, revealing substantial enhancements in accuracy, loss value, and receiver operating characteristic (ROC) metrics compared to FED-AVG and FED-SGD given the same computational time. These outcomes underline Fed-GANCC’s exceptional prowess in mitigating issues such as isolated data islands, non-IID data, and concept drift. With its novel approach to addressing the prevailing challenges in targeted advertising such as isolated data islands, non-IID data, and concept drift, the Fed-GANCC framework stands as a benchmark, paving the way for future advancements in federated learning solutions tailored for the advertising domain. The Fed-GANCC framework promises to offer pivotal insights for the future development of efficient and advanced federated learning solutions for targeted advertising.

  3. Example rows and columns of DRUGS.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu (2023). Example rows and columns of DRUGS. [Dataset]. http://doi.org/10.1371/journal.pone.0230706.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Example rows and columns of DRUGS.

  4. u

    Single Window Initiative, Integrated Import Declaration (IID) - Regulated...

    • data.urbandatacentre.ca
    Updated Sep 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Single Window Initiative, Integrated Import Declaration (IID) - Regulated Commodities - Data Element Matching Criteria Table - Catalogue - Canadian Urban Data Catalogue (CUDC) [Dataset]. https://data.urbandatacentre.ca/dataset/gov-canada-791de1ec-7fc8-48f1-a4e4-8bbaebc67080
    Explore at:
    Dataset updated
    Sep 30, 2024
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Area covered
    Canada
    Description

    The Single Window Initiative (SWI) enables Importers and Customs Brokers to use Service Option 911 to provide an electronic Integrated Import Declaration (IID) to the Canada Border Services Agency (CBSA) for the nine Participating Government Departments and Agencies (PGAs). The IID can be for both non-regulated and regulated commodities. Trade Chain Partners (TCPs) can use the Data Element Matching Criteria Tables to identify commodities regulated by a particular PGA program. The Regulated Commodities Data Element Matching Criteria Tables consist of two components: A Boolean logic statement and a set of data elements related to each program administered by the Participating Government Departments and Agencies (PGAs). A match between the two confirms that a particular commodity is regulated by a specific program.

  5. f

    Global accuracy with more communiation rounds.

    • figshare.com
    xls
    Updated May 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Babar; Basit Qureshi; Anis Koubaa (2024). Global accuracy with more communiation rounds. [Dataset]. http://doi.org/10.1371/journal.pone.0302539.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Babar; Basit Qureshi; Anis Koubaa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, Federated Learning (FL) has gained traction as a privacy-centric approach in medical imaging. This study explores the challenges posed by data heterogeneity on FL algorithms, using the COVIDx CXR-3 dataset as a case study. We contrast the performance of the Federated Averaging (FedAvg) algorithm on non-identically and independently distributed (non-IID) data against identically and independently distributed (IID) data. Our findings reveal a notable performance decline with increased data heterogeneity, emphasizing the need for innovative strategies to enhance FL in diverse environments. This research contributes to the practical implementation of FL, extending beyond theoretical concepts and addressing the nuances in medical imaging applications. This research uncovers the inherent challenges in FL due to data diversity. It sets the stage for future advancements in FL strategies to effectively manage data heterogeneity, especially in sensitive fields like healthcare.

  6. f

    A detailed predictive analysis of round-wise performance outcomes of 5...

    • plos.figshare.com
    xls
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shagun Sharma; Kalpna Guleria; Ayush Dogra; Deepali Gupta; Sapna Juneja; Swati Kumari; Ali Nauman (2025). A detailed predictive analysis of round-wise performance outcomes of 5 clients federated framework with non-IID data. [Dataset]. http://doi.org/10.1371/journal.pone.0316543.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Shagun Sharma; Kalpna Guleria; Ayush Dogra; Deepali Gupta; Sapna Juneja; Swati Kumari; Ali Nauman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A detailed predictive analysis of round-wise performance outcomes of 5 clients federated framework with non-IID data.

  7. f

    A detailed predictive analysis of round-wise performance outcomes of 10...

    • plos.figshare.com
    xls
    Updated Feb 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shagun Sharma; Kalpna Guleria; Ayush Dogra; Deepali Gupta; Sapna Juneja; Swati Kumari; Ali Nauman (2025). A detailed predictive analysis of round-wise performance outcomes of 10 clients federated framework with non-IID data. [Dataset]. http://doi.org/10.1371/journal.pone.0316543.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 11, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Shagun Sharma; Kalpna Guleria; Ayush Dogra; Deepali Gupta; Sapna Juneja; Swati Kumari; Ali Nauman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A detailed predictive analysis of round-wise performance outcomes of 10 clients federated framework with non-IID data.

  8. f

    Comparison with the existing methods.

    • figshare.com
    xls
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuhui Luo; Peilan Liu; Xulun Ye (2024). Comparison with the existing methods. [Dataset]. http://doi.org/10.1371/journal.pone.0307146.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Shuhui Luo; Peilan Liu; Xulun Ye
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    As a widely studied model in the machine learning and data processing society, graph convolutional network reveals its advantage in non-grid data processing. However, existing graph convolutional networks generally assume that the node features can be fully observed. This may violate the fact that many real applications come with only the pairwise relationships and the corresponding node features are unavailable. In this paper, a novel graph convolutional network model based on Bayesian framework is proposed to handle the graph node classification task without relying on node features. First, we equip the graph node with the pseudo-features generated from the stochastic process. Then, a hidden space structure preservation term is proposed and embedded into the generation process to maintain the independent and identically distributed property between the training and testing dataset. Although the model inference is challenging, we derive an efficient training and predication algorithm using variational inference. Experiments on different datasets demonstrate the proposed graph convolutional networks can significantly outperform traditional methods, achieving an average performance improvement of 9%.

  9. MAIN notations and descriptions.

    • plos.figshare.com
    xls
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuhui Luo; Peilan Liu; Xulun Ye (2024). MAIN notations and descriptions. [Dataset]. http://doi.org/10.1371/journal.pone.0307146.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Shuhui Luo; Peilan Liu; Xulun Ye
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    As a widely studied model in the machine learning and data processing society, graph convolutional network reveals its advantage in non-grid data processing. However, existing graph convolutional networks generally assume that the node features can be fully observed. This may violate the fact that many real applications come with only the pairwise relationships and the corresponding node features are unavailable. In this paper, a novel graph convolutional network model based on Bayesian framework is proposed to handle the graph node classification task without relying on node features. First, we equip the graph node with the pseudo-features generated from the stochastic process. Then, a hidden space structure preservation term is proposed and embedded into the generation process to maintain the independent and identically distributed property between the training and testing dataset. Although the model inference is challenging, we derive an efficient training and predication algorithm using variational inference. Experiments on different datasets demonstrate the proposed graph convolutional networks can significantly outperform traditional methods, achieving an average performance improvement of 9%.

  10. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu (2023). Non-IID scenario: 10-fold cross validation results with varying C. [Dataset]. http://doi.org/10.1371/journal.pone.0230706.t005
Organization logo

Non-IID scenario: 10-fold cross validation results with varying C.

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Jun 4, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Li Huang; Yifeng Yin; Zeng Fu; Shifa Zhang; Hao Deng; Dianbo Liu
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Non-IID scenario: 10-fold cross validation results with varying C.

Search
Clear search
Close search
Google apps
Main menu