2 datasets found

f
L-based spectral clustering scores under diverse settings of affinity...
plos.figshare.com
xls
Updated Feb 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski (2025). L-based spectral clustering scores under diverse settings of affinity parameter (column names). [Dataset]. http://doi.org/10.1371/journal.pone.0313238.t009
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0313238.t009
Dataset updated
Feb 4, 2025
Dataset provided by
PLOS ONE
Authors
Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
All the metrics used are available in the sklearn package, see the documentation at https://scikit-learn.org/stable/api/sklearn.metrics.html.
t
Rainfall Prediction: Comparison of 7 Popular Models
test.researchdata.tuwien.ac.at
bin, png +1
Updated Apr 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kaya Ali Kus; Kaya Ali Kus (2025). Rainfall Prediction: Comparison of 7 Popular Models [Dataset]. http://doi.org/10.70124/p7rh4-0g783
Explore at:
png, text/markdown, binAvailable download formats
Unique identifier
https://doi.org/10.70124/p7rh4-0g783
Dataset updated
Apr 28, 2025
Dataset provided by
TU Wien
Authors
Kaya Ali Kus; Kaya Ali Kus
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Apr 28, 2025
Description
Rainfall Prediction using 7 Popular Models

Context and Methodology

Research Domain/Project:

This dataset is part of a machine learning project focused on predicting rainfall, a critical task for sectors like agriculture, water resource management, and disaster prevention. The project employs machine learning algorithms to forecast rainfall occurrences based on historical weather data, including features like temperature, humidity, and pressure.

Purpose:

The primary goal of the dataset is to train multiple machine learning models to predict rainfall and compare their performances. The insights gained will help identify the most accurate models for real-world predictions of rainfall events.

Creation Process:

The dataset is derived from various historical weather observations, including temperature, humidity, wind speed, and pressure, collected by weather stations across Australia. These observations are used as inputs for training machine learning models. The dataset is publicly available on platforms like Kaggle and is often used in competitions and research to advance predictive analytics in meteorology.

Technical Details

Dataset Structure:

The dataset consists of weather data from multiple Australian weather stations, spanning various time periods. Key features include:

Temperature
Humidity
Wind Speed
Pressure
Rainfall (target variable)
These features are tracked for each weather station over different times, with the goal of predicting rainfall.

Software Requirements:

Python: The primary programming language for data analysis and machine learning.
scikit-learn: For implementing machine learning models.
XGBoost, LightGBM, and CatBoost: Popular libraries for building more advanced ensemble models.
Matplotlib/Seaborn: For data visualization.
These libraries and tools help in data manipulation, modeling, evaluation, and visualization of results.
DBRepo Authorization: Required to access datasets via the DBRepo API for dataset retrieval.

Additional Resources

Model Comparison Charts: The project includes output charts comparing the performance of seven popular machine learning models.
Trained Models (.pkl files): Pre-trained models are saved as .pkl files for reuse without retraining.
Documentation and Code: A Jupyter notebook guides through the process of data analysis, model training, and evaluation.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski (2025). L-based spectral clustering scores under diverse settings of affinity parameter (column names). [Dataset]. http://doi.org/10.1371/journal.pone.0313238.t009

L-based spectral clustering scores under diverse settings of affinity parameter (column names).

Explore at:

xlsAvailable download formats

Unique identifier

https://doi.org/10.1371/journal.pone.0313238.t009

Dataset updated

Feb 4, 2025

Dataset provided by

PLOS ONE

Authors

Bartłomiej Starosta; Mieczysław A. Kłopotek; Sławomir T. Wierzchoń; Dariusz Czerski; Marcin Sydow; Piotr Borkowski

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

All the metrics used are available in the sklearn package, see the documentation at https://scikit-learn.org/stable/api/sklearn.metrics.html.

Clear search

Close search

Google apps

Main menu

L-based spectral clustering scores under diverse settings of affinity...

Rainfall Prediction: Comparison of 7 Popular Models

Rainfall Prediction using 7 Popular Models

Context and Methodology

Research Domain/Project:

Purpose:

Creation Process:

Technical Details

Dataset Structure:

Software Requirements:

Additional Resources

L-based spectral clustering scores under diverse settings of affinity parameter (column names).