Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
PLISM dataset
This preprocessed dataset was generated directly from owkin/plism-dataset-tiles. It is meant to make feature extraction more convenient. As such, this dataset contains 91 .h5 files, each holding 16,278 images converted into numpy arrays. This allows for easy resuming but requires 225 GB of storage.
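The exact internal layout of these .h5 files is not specified here, so the file name and iteration pattern below are assumptions; a minimal h5py sketch for inspecting one of the 91 files might look like this:

```python
import h5py
import numpy as np

# Hypothetical file name; the dataset keys inside are also assumptions --
# inspect one file to learn the actual layout before batch processing.
with h5py.File("plism_slide_000.h5", "r") as f:
    for key in f.keys():                 # top-level datasets in the file
        arr = np.asarray(f[key])         # tile image(s) as a numpy array
        print(key, arr.shape, arr.dtype)
```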
How to extract features
[!IMPORTANT] 🎉 Check plismbench to perform feature extraction on the PLISM dataset and run our robustness benchmark… See the full description on the dataset page: https://huggingface.co/datasets/owkin/plism-dataset.
This is an auto-generated index table corresponding to a folder of files in this dataset with the same name. This table can be used to extract a subset of files based on their metadata, which can then be used for further analysis. You can view the contents of specific files by navigating to the "cells" tab and clicking on an individual file_id.
Etalab Open License 2.0: https://spdx.org/licenses/etalab-2.0.html
This dataset contains replication data for the paper "Comparison of Solar Imaging Feature Extraction Methods in the Context of Space Weather Prediction with Deep Learning-Based Models". It includes files stored in HDF5 (Hierarchical Data Format) format using HDFStore. One file, solar_extracted_features_v01_2010-2020.h5, contains the features extracted with the 6 different techniques for the 19.3 nm wavelength; the second, serenade_predictions_v01.h5, contains the SERENADE outputs. Both files contain several datasets labeled with 'keys' corresponding to the extraction method. Here is a list of the key names:
gn_1024: the GoogLeNet extractor with 1024 components.
pca_1024: the Principal Component Analysis technique, keeping 1024 components.
ae_1024: the AutoEncoder with a latent space of 1024.
gn_256 (only in solar_extracted_features_v01_2010-2020.h5): the GoogLeNet extractor with 256 components.
pca_256: the Principal Component Analysis technique, keeping 256 components.
ae_256: the AutoEncoder technique with a latent space of 256.
vae_256 (only in solar_extracted_features_v01_2010-2020.h5): the Variational AutoEncoder technique with a latent space of 256.
vae_256_old (only in serenade_predictions_v01.h5): the output predictions of SERENADE using the VAE-extracted features with the hyperparameters optimized for GoogLeNet.
vae_256_new (only in serenade_predictions_v01.h5): the output predictions of SERENADE using the VAE-extracted features with the alternative architecture.
All the above-mentioned models are explained and detailed in the paper. The files can be read with the pandas package for Python as follows:
import pandas as pd
df = pd.read_hdf('file_name.h5', key='model_name')
replacing file_name with either solar_extracted_features_v01_2010-2020.h5 or serenade_predictions_v01.h5, and model_name with one of the keys in the list above.
The extracted-features file yields a pandas DataFrame indexed by datetime with either 1024 or 256 feature columns; an additional column indicates to which subset (train, validation, or test) each row belongs. The SERENADE outputs file yields a DataFrame indexed by datetime with 4 columns:
Observations: the true daily maximum of the Kp index.
Predictions: the predicted mean of the daily maximum of the Kp index.
Standard Deviation: the standard deviation of the predictions, which are probabilistic.
Model: the feature extractor model whose inputs were used to generate the predictions.
We add the AE and VAE feature-extractor class code in AEs_class.py and VAE_class.py, together with their weights in the checkpoints best_AE_1024.ckpt, best_AE_256.ckpt, and best_VAE.ckpt. The figures in the manuscript can be reproduced using the scripts named after the corresponding figures. The files 6_mins_predictions and seed_variation contain the SERENADE predictions needed to reproduce figures 7, 8, 9, and 10.
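As a usage sketch grounded in the description above, the snippet below loads one extractor's features and the matching SERENADE predictions, then computes a simple error statistic. The key gn_1024 and the column names follow the lists above; that the same key exists in both files is an assumption to verify against the files themselves.

```python
import pandas as pd

# Features: DataFrame indexed by datetime, with 1024 feature columns plus
# a column giving the train/validation/test subset (see description above).
features = pd.read_hdf("solar_extracted_features_v01_2010-2020.h5", key="gn_1024")

# SERENADE outputs for the same extractor: columns 'Observations',
# 'Predictions', 'Standard Deviation', and 'Model' (names as listed above).
preds = pd.read_hdf("serenade_predictions_v01.h5", key="gn_1024")

# Mean absolute error of the predicted daily maximum Kp index.
mae = (preds["Predictions"] - preds["Observations"]).abs().mean()
print(f"MAE: {mae:.3f}")
```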
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset and codes for "Observation of Acceleration and Deceleration Periods at Pine Island Ice Shelf from 1997–2023"
The MATLAB codes and related datasets are used to generate the figures for the paper "Observation of Acceleration and Deceleration Periods at Pine Island Ice Shelf from 1997–2023".
Files and variables
File 1: Data_and_Code.zip
Directory: Main_function
**Description:** Includes MATLAB scripts and functions. Each script includes a description that guides the user on how to use it and how to find the dataset used for processing.
MATLAB main scripts: include the complete steps to process the data, output figures, and output videos.
Script_1_Ice_velocity_process_flow.m
Script_2_strain_rate_process_flow.m
Script_3_DROT_grounding_line_extraction.m
Script_4_Read_ICESat2_h5_files.m
Script_5_Extraction_results.m
MATLAB functions: six directories containing MATLAB functions that support the main scripts:
1_Ice_velocity_code: Includes MATLAB functions for ice velocity post-processing: removing outliers, filtering, correcting for atmospheric and tidal effects, inverse-weighted averaging, and error estimation.
2_strain_rate: Includes MATLAB functions for strain rate calculation.
3_DROT_extract_grounding_line_code: Includes MATLAB functions that convert the range offset results output by GAMMA to differential vertical displacement and use the result to extract the grounding line.
4_Extract_data_from_2D_result: Includes MATLAB functions used to extract profiles from 2D data.
5_NeRD_Damage_detection: Modified code from Izeboud et al. 2023. When applying this code, please also cite Izeboud et al. 2023 (https://www.sciencedirect.com/science/article/pii/S0034425722004655).
6_Figure_plotting_code: Includes MATLAB functions for plotting the figures in the paper and supporting information.
Directory: data_and_result
**Description:** Includes directories that store the results output from MATLAB. Users only need to change the paths in the MATLAB scripts to their own paths.
1_origin: Sample data ("PS-20180323-20180329", "PS-20180329-20180404", "PS-20180404-20180410") output from the GAMMA software in GeoTIFF format, which can be used to calculate DROT and velocity. Includes displacement, theta, phi, and ccp.
2_maskccpN: Removes outliers where ccp < 0.05 and converts displacement to velocity (m/day); a rough Python equivalent is sketched after this list.
3_rockpoint: Extracts velocities in the non-moving region.
4_constant_detrend: Removes the orbit error.
5_Tidal_correction: Removes atmospheric and tidally induced errors.
6_rockpoint: Extracts non-aggregated velocities in the non-moving region.
6_vx_vy_v: Transforms velocities from va/vr to vx/vy.
7_rockpoint: Extracts aggregated velocities in the non-moving region.
7_vx_vy_v_aggregate_and_error_estimate: Inverse-weighted averaging of the three ice velocity maps and calculation of the error maps.
8_strain_rate: Strain rate calculated from the aggregated ice velocity.
9_compare: Stores the results before and after tidal correction and aggregation.
10_Block_result: Time series results extracted from the 2D data.
11_MALAB_output_png_result: Stores .png files and time series results.
12_DROT: Differential Range Offset Tracking results.
13_ICESat_2: ICESat-2 .h5 and .mat files can be placed here (this archive only includes samples from tracks 0965 and 1094).
14_MODIS_images: MODIS images can be stored here.
shp: Grounding line, rock region, ice front, and other shapefiles.
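For illustration, here is a rough Python equivalent of the 2_maskccpN step (masking by ccp < 0.05 and converting displacement to velocity). The processing itself is done by the MATLAB scripts above; the file names and the 6-day pair interval are assumptions inferred from sample names such as "PS-20180323-20180329".

```python
import numpy as np
import rasterio

# Hypothetical GeoTIFF names for one image pair; the real GAMMA outputs
# may be organised differently (see 1_origin for the sample data).
with rasterio.open("PS-20180323-20180329_displacement.tif") as src:
    disp = src.read(1).astype(float)     # displacement in metres
with rasterio.open("PS-20180323-20180329_ccp.tif") as src:
    ccp = src.read(1).astype(float)      # cross-correlation peak

disp[ccp < 0.05] = np.nan                # drop unreliable offsets
velocity = disp / 6.0                    # 6-day pair -> velocity in m/day
```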
File 2: PIG_front_1947_2023.zip
Includes ice front position shapefiles from 1947 to 2023, used for plotting Figure 1 in the paper.
File 3: PIG_DROT_GL_2016_2021.zip
Includes grounding line position shapefiles from 2016 to 2021, used for plotting Figure 1 in the paper.
Data was derived from the following sources:
These links can be found in the MATLAB scripts or in the paper's "Open Research" section.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository contains the dataset needed for the paper "Simulation of plasmaspheric plume impact on dayside magnetic reconnection", i.e. the magnetic reconnection rates at each time and the quantities needed to normalize them. All the data are stored in the file "rates_norm.h5". The file "si_content.pdf" explains how the data are stored and how to extract them.
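Since si_content.pdf documents the exact layout, a generic inspection sketch with h5py (which assumes nothing about the key names) is a safe starting point:

```python
import h5py

# Walk the HDF5 tree and print every group/dataset with its shape, if any;
# consult si_content.pdf for the meaning of each key.
with h5py.File("rates_norm.h5", "r") as f:
    f.visititems(lambda name, obj: print(name, getattr(obj, "shape", "")))
```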
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Principal data supporting "Scaling theory for the statistics of slip at frictional interfaces"
T. W. J. de Geus and M. Wyart (2022), Phys. Rev. E, 106(6):065001.
See code at doi: 10.5281/zenodo.10723197 (and its documentation) for workflow, detailed information of the data, and further dependencies.
The files N=*_Run*.zip contain fully restorable events for event-driven athermal quasi-static shear. Sequentially numbered files contain different parts of a single dataset.
The file summary.zip contains an extract of the key variables of these runs and of triggers at different stresses. It also contains "flow" data acquired by driving at a finite rate.
The files N=3^6x4_Trigger_EnsemblePack.zip contain fully restorable triggers at different stresses in the largest system. The sequentially numbered files correspond to a single dataset split across multiple .h5 files.
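The members of these archives can be listed without full extraction; a minimal sketch follows (the archive name is taken from the pattern above, but the exact numbering of the sequential files is an assumption):

```python
import zipfile

# List the .h5 members of one trigger archive without extracting it.
with zipfile.ZipFile("N=3^6x4_Trigger_EnsemblePack.zip") as z:
    parts = sorted(n for n in z.namelist() if n.endswith(".h5"))
    print(parts)
```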
Highly specific (and poorly documented) plotting functions are available upon request.