2 datasets found
  1. f

    L-based spectral clustering scores under diverse settings of affinity...

    • plos.figshare.com
    xls
    Updated Feb 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski (2025). L-based spectral clustering scores under diverse settings of affinity parameter (column names). [Dataset]. http://doi.org/10.1371/journal.pone.0313238.t009
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 4, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All the metrics used are available in the sklearn package, see the documentation at https://scikit-learn.org/stable/api/sklearn.metrics.html.

  2. t

    Rainfall Prediction: Comparison of 7 Popular Models

    • test.researchdata.tuwien.ac.at
    bin, png +1
    Updated Apr 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaya Ali Kus; Kaya Ali Kus (2025). Rainfall Prediction: Comparison of 7 Popular Models [Dataset]. http://doi.org/10.70124/p7rh4-0g783
    Explore at:
    png, text/markdown, binAvailable download formats
    Dataset updated
    Apr 28, 2025
    Dataset provided by
    TU Wien
    Authors
    Kaya Ali Kus; Kaya Ali Kus
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 28, 2025
    Description

    Rainfall Prediction using 7 Popular Models

    Context and Methodology

    Research Domain/Project:

    This dataset is part of a machine learning project focused on predicting rainfall, a critical task for sectors like agriculture, water resource management, and disaster prevention. The project employs machine learning algorithms to forecast rainfall occurrences based on historical weather data, including features like temperature, humidity, and pressure.

    Purpose:

    The primary goal of the dataset is to train multiple machine learning models to predict rainfall and compare their performances. The insights gained will help identify the most accurate models for real-world predictions of rainfall events.

    Creation Process:

    The dataset is derived from various historical weather observations, including temperature, humidity, wind speed, and pressure, collected by weather stations across Australia. These observations are used as inputs for training machine learning models. The dataset is publicly available on platforms like Kaggle and is often used in competitions and research to advance predictive analytics in meteorology.

    Technical Details


    Dataset Structure:

    The dataset consists of weather data from multiple Australian weather stations, spanning various time periods. Key features include:

    Temperature
    Humidity
    Wind Speed
    Pressure
    Rainfall (target variable)
    These features are tracked for each weather station over different times, with the goal of predicting rainfall.

    Software Requirements:

    Python: The primary programming language for data analysis and machine learning.
    scikit-learn: For implementing machine learning models.
    XGBoost, LightGBM, and CatBoost: Popular libraries for building more advanced ensemble models.
    Matplotlib/Seaborn: For data visualization.
    These libraries and tools help in data manipulation, modeling, evaluation, and visualization of results.
    DBRepo Authorization: Required to access datasets via the DBRepo API for dataset retrieval.

    Additional Resources

    Model Comparison Charts: The project includes output charts comparing the performance of seven popular machine learning models.
    Trained Models (.pkl files): Pre-trained models are saved as .pkl files for reuse without retraining.
    Documentation and Code: A Jupyter notebook guides through the process of data analysis, model training, and evaluation.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski (2025). L-based spectral clustering scores under diverse settings of affinity parameter (column names). [Dataset]. http://doi.org/10.1371/journal.pone.0313238.t009

L-based spectral clustering scores under diverse settings of affinity parameter (column names).

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Feb 4, 2025
Dataset provided by
PLOS ONE
Authors
Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

All the metrics used are available in the sklearn package, see the documentation at https://scikit-learn.org/stable/api/sklearn.metrics.html.

Search
Clear search
Close search
Google apps
Main menu