Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the accompanying dataset to the following paper https://www.nature.com/articles/s41597-023-01975-w
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge daat for catchments around the world. Additionally, Caravan provides code to derive meteorological forcing data and catchment attributes from the same data sources in the cloud, making it easy for anyone to extend Caravan to new catchments. The vision of Caravan is to provide the foundation for a truly global open source community resource that will grow over time.
If you use Caravan in your research, it would be appreciated to not only cite Caravan itself, but also the source datasets, to pay respect to the amount of work that was put into the creation of these datasets and that made Caravan possible in the first place.
All current development and additional community extensions can be found at https://github.com/kratzert/Caravan
Channel Log:
23 May 2022: Version 0.2 - Resolved a bug when renaming the LamaH gauge ids from the LamaH ids to the official gauge ids provided as "govnr" in the LamaH dataset attribute files.
24 May 2022: Version 0.3 - Fixed gaps in forcing data in some "camels" (US) basins.
15 June 2022: Version 0.4 - Fixed replacing negative CAMELS US values with NaN (-999 in CAMELS indicates missing observation).
1 December 2022: Version 0.4 - Added 4298 basins in the US, Canada and Mexico (part of HYSETS), now totalling to 6830 basins. Fixed a bug in the computation of catchment attributes that are defined as pour point properties, where sometimes the wrong HydroATLAS polygon was picked. Restructured the attribute files and added some more meta data (station name and country).
16 January 2023: Version 1.0 - Version of the official paper release. No changes in the data but added a static copy of the accompanying code of the paper. For the most up to date version, please check https://github.com/kratzert/Caravan
10 May 2023: Version 1.1 - No data change, just update data description.
17 May 2023: Version 1.2 - Updated a handful of attribute values that were affected by a bug in their derivation. See https://github.com/kratzert/Caravan/issues/22 for details.
16 April 2024: Version 1.4 - Added 9130 gauges from the original source dataset that were initially not included because of the area thresholds (i.e. basins smaller than 100sqkm or larger than 2000sqkm). Also extended the forcing period for all gauges (including the original ones) to 1950-2023. Added two different download options that include timeseries data only as either csv files (Caravan-csv.tar.xz) or netcdf files (Caravan-nc.tar.xz). Including the large basins also required an update in the earth engine code
16 Jan 2025: Version 1.5 - Added FAO Penman-Monteith PET (potential_evaporation_sum_FAO_PENMAN_MONTEITH) and renamed the ERA5-LAND potential_evaporation band to potential_evaporation_sum_ERA5_LAND. Also added all PET-related climated indices derived with the Penman-Monteith PET band (suffix "_FAO_PM") and renamed the old PET-related indices accordingly (suffix "_ERA5_LAND").
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Large-sample datasets are essential in hydrological science to support modelling studies and global assessments. This dataset is an extension to Caravan, a global community dataset of meteorological forcing data, catchment attributes, and discharge data for catchments around the world (Kratzert et al. 2023).
The extension includes a subset of those hydrological discharge data and station-based watersheds from the Global Runoff Data Centre (GRDC), which are covered by an open data policy (Attribution 4.0 International; CC BY 4.0). In total, the dataset covers stations from 5356 catchments and 25 countries worldwide with a time series record from 1950 – 2023.
GRDC is an international data centre operating under the auspices of the World Meteorological Organization (WMO) at the German Federal Institute of Hydrology (BfG). Established in 1988, it holds the most substantive collection of quality assured river discharge data worldwide. Primary providers of river discharge data and associated metadata are the National Hydrological and Hydro-Meteorological Services of WMO Member States.
Reference:
Kratzert, F., Nearing, G., Addor, N. et al. Caravan - A global community dataset for large-sample hydrology. Sci Data 10, 61 (2023). https://doi.org/10.1038/s41597-023-01975-w
Update:
With version 0.2 a bug has been fixed that affected the time series of four bands of all GRDC gauges in the GRDC extension. The affected bands were total_precipitation, surface_net_solar_radiation, surface_net_thermal_radiation and potential_evaporation, i.e. all features that are accumulated over the day, as per definition of ERA5-Land.For details look at https://github.com/kratzert/Caravan/issues/26.
Version 0.3: Data description file added.Version 0.4: Added FAO Penman-Monteith PET (potential_evaporation_sum_FAO_PENMAN_MONTEITH) in the meteorological forcing data and renamed the ERA5-LAND potential_evaporation band to potential_evaporation_sum_ERA5_LAND. Also added all PET-related climated indices derived with the Penman-Monteith PET band (suffix "_FAO_PM") and renamed the old PET-related indices accordingly (suffix "_ERA5_LAND").Version 0.5: License overview of the respective countries has been added.Dataset description has been modified and improved.
Dataset structure:
The dataset is provided in the following two file formats:1. caravan-grdc-extension-csv.zip: provides the time series data as comma-separated text files (CSV) (downloadable as 8.8 GB zip archive)2. caravan-grdc-extension-nc.zip: provides the time series data in the Network Common Data Form (NetCDF) (downloadable as 7.6 GB zip archive)
The data in the versions 0.1-0.3 are identical. Version 0.4 added FAO Penman-Monteith PET (potential_evaporation_sum_FAO_PENMAN_MONTEITH) and renamed the ERA5-LAND potential_evaporation band to potential_evaporation_sum_ERA5_LAND.
Further details of the structure of the dataset are described in the data description file.
LamaH-Ice (LArge-SaMple DAta for Hydrology and Environmental Sciences for Iceland) is a large-sample hydro-meteorological dataset for Iceland. The dataset includes daily and hourly hydro-meteorological timeseries, including observed streamflow, and catchment characteristics for 107 river basins in Iceland. The catchment characteristics describe the topographic, hydroclimatic, land cover, vegetation, soils, geological and glaciological attributes of the river catchments, as well as the human influence on streamflow in the catchments. LamaH-Ice conforms to the structure of existing large-sample hydrology datasets and includes most variables offered in these datasets, as well as additional information relevant to cold-region hydrology, e.g., timeseries of snow cover, glacier mass balance and albedo. A large majority of the watersheds in LamaH-Ice are not subject to human activities, such as diversions and flow regulations. The dataset is described in detail in a paper in the journal "Earth System Science Data" (ESSD - https://doi.org/10.5194/essd-2023-349). The code used to assemble the dataset is available in folder "F_appendix" in the dataset as well as on GitHub (https://github.com/hhelgason/LamaH-Ice).
We offer two downloadable files for the LamaH-Ice dataset: 1) Hydrometeorological time series with both daily and hourly resolutions (30 GB after decompression) and 2) Hydrometeorological time series with daily resolution only (2 GB). Other than the temporal resolution, there are no differences between the two downloadable files. This HydroShare resource also hosts the "LamaH-Ice Caravan extension" (1 GB), which complements the "Caravan - A global community dataset for large-sample hydrology" Caravan dataset (Kratzert et al., 2023). The data is formatted in the same manner as the data currently existing in Caravan. To process the Caravan extension, the following guide was used: https://github.com/kratzert/Caravan/wiki/Extending-Caravan-with-new-basins. Some features, e.g. hourly atmospheric and streamflow series, glacier mass balance and MODIS timeseries data are thus only available in the LamaH-Ice dataset.
Data disclaimer: The Icelandic Meteorological Office (IMO) and the National Power Company of Iceland (NPC) own the data from most streamflow gauges in the dataset. The streamflow data is published on Hydroshare with permission of all data owners. Neither we nor the provider of the streamflow dataset can be liable for the data provided. The IMO and the NPC reserve the rights to retrospectively check and update the streamflow timeseries at any time, and these changes will not be reflected in this published dataset. If up-to-date data is needed, users are encouraged to contact the IMO and the NPC.
License: The streamflow data is subject to the CC BY-NC 4.0 (creativecommons.org/licenses/by-nc/4.0/). The streamflow data cannot be used for commercial purposes. All data except for the streamflow measurements are subject to the CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Users can share and adapt the dataset only if appropriate credit is given (the ESSD data description paper is cited, the version of the dataset and all data sources are listed which are declared in the folder "Info") and any changes are clearly indicated, and a link to the original license is provided.
Updates since the HydroShare repository was first created on August 18, 2023: May 31, 2024: • Streamflow series were corrected (replaced) for gauges with IDs 31, 70 and 72, and hydrological signatures and water balance files recalculated using the corrected streamflow series. In the Caravan extension, gauges 43 and 51 were also corrected. March 12, 2024: Dataset Revision: In line with the ESSD manuscript revision, significant updates have been made. For a detailed list, visit https://doi.org/10.5194/essd-2023-349-AC1. Key changes include: • A timeseries for reference ET has been computed using RAV-II reanalysis meteorological timeseries. • Climate indices recalculated with RAV-II reanalysis; ERA5-Land indices remain under an "_ERA5L" suffix. • Hydrological signatures are now derived from RAV-II reanalysis precipitation. • Standardized .csv column separators to semicolons. • Enhanced metadata for all shapefiles. • Added attributes (g_lon, g_lat, g_frac_dyn, g_area_dyn) to the dataset. • Reordered catchment attributes table columns for consistency with the LamaH-Ice paper. • Corrected ERA5-Land reanalysis errors for shortwave and longwave flux timeseries. • Streamflow series were corrected for gauges with IDs 43 and 51 (in LamaH, not Caravan extension) February 22, 2024 • Caravan Extension Fix: Corrected latitude and longitude mix-up. October 1, 2023 • GeoPackages added as an alternative to shapefiles, readme files added in all subfolders for guidance.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge data for catchments around the world: https://doi.org/10.1038/s41597-023-01975-w; https://zenodo.org/record/6578598.
We have employed the published code to derive daily meteorological forcing data and catchment attributes from global data sources to extend Caravan with data for 95 catchments from Israel.
For the data processing, the following guide was followed: https://github.com/kratzert/Caravan/wiki/Extending-Caravan-with-new-basins.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the accompanying dataset to the following paper https://www.nature.com/articles/s41597-023-01975-w
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge daat for catchments around the world. Additionally, Caravan provides code to derive meteorological forcing data and catchment attributes from the same data sources in the cloud, making it easy for anyone to extend Caravan to new catchments. The vision of Caravan is to provide the foundation for a truly global open source community resource that will grow over time.
If you use Caravan in your research, it would be appreciated to not only cite Caravan itself, but also the source datasets, to pay respect to the amount of work that was put into the creation of these datasets and that made Caravan possible in the first place.
All current development and additional community extensions can be found at https://github.com/kratzert/Caravan
IMPORTANT: Due to size limitations for individual repositories, the netCDF version and the CSV version of Caravan (since Version 1.6) are split into two different repositories. You can find the netCDF version at https://zenodo.org/records/14673536
Channel Log:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
THIS IS A PRE-RELEASE, WHILE THE CARAVAN IS UNDER REVISION.
Check out the preprint at: https://eartharxiv.org/repository/view/3345/
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge daat for catchments around the world. Additionally, Caravan provides code to derive meteorological forcing data and catchment attributes from the same data sources in the cloud, making it easy for anyone to extend Caravan to new catchments. The vision of Caravan is to provide the foundation for a truly global open source community resource that will grow over time.
Channel Log:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge data for catchments around the world: https://zenodo.org/record/6578598.
We have employed the published code to derive meteorological forcing data and catchment attributes from global data sources to extend Caravan with data for 1887 catchments in Germany. The time series data are in daily resolution and span up to 70 years, from January 1951 to December 2020.
Most of the catchments in Caravan-DE are also part of the CAMELS-DE dataset (10.5281/zenodo.13837553, 1582 catchments). As CAMELS-DE relies on meteorological forcing data that is only available within the borders of Germany, catchments going beyond the German national borders had to be discarded. Caravan uses global data products for the meteorological forcing data and catchment attributes, which is why Caravan-DE includes these catchments that are partly located outside of Germany. As catchments in Caravan-DE and CAMELS-DE are identified by the same ID, the datasets can be used together.
Please refer to the CAMELS-DE paper (https://doi.org/10.5194/essd-2024-318) for information about discharge data and the catchment geometries used for both Caravan-DE and CAMELS-DE.
For the processing of the data the following guide was followed step by step: https://github.com/kratzert/Caravan/wiki/Extending-Caravan-with-new-basins
english:
The state agencies do not guarantee the accuracy or completeness of the discharge or water level data provided. In addition, all hydrological data may be subject to future revisions, including adjustments to the rating curves or corrections of errors. Therefore, it is necessary to obtain the most recent discharge time series directly from the federal state authorities for projects that require water law permits. Additionally, the regulations of the respective federal state apply and specific enquiries should be made as needed. It is also important to note that the state agencies explicitly disclaim any warranty as to the accuracy or completeness of the data and therefore any liability claims against any of the federal states are also excluded.
german:
Die Ländesämter gewährleisten nicht die Genauigkeit oder Vollständigkeit der bereitgestellten Abfluss oder Wasserstandsdaten. Zudem können alle hydrologischen Daten zukünftigen Überarbeitungen unterliegen, einschließlich Anpassungen der Wasserstands-Abflussbeziehung oder der Korrektur von Fehlern. Daher ist es notwendig, die aktuellsten Abflusszeitreihen direkt bei den Landesbehörden zu beziehen, falls Wasserrechtsgenehmigungen erforderlich sind. Zusätzlich gelten die Vorschriften des jeweiligen Bundeslandes, und spezifische Anfragen sollten bei Bedarf gestellt werden. Es ist ebenfalls wichtig zu beachten, dass die staatlichen Behörden ausdrücklich jegliche Gewährleistung hinsichtlich der Genauigkeit oder Vollständigkeit der Daten ausschließen und somit auch jegliche Haftungsansprüche gegenüber einem der Bundesländer ausgeschlossen sind.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
CAMELS-CH (Catchment Attributes and MEteorology for large-sample Studies - Switzerland) is a large-sample hydro-meteorological data set for hydrological Switzerland in Central Europe that covers 331 basins within Switzerland and neighboring countries (Austria, France, Germany and Italy). CAMELS-CH comprises dynamic hydro-meteorological variables and static catchment attributes.
The data set covers 40 years of data between 1st January 1981 and 31st December 2020 for each catchment: daily time series of stream flow and water levels, of meteorological data such as precipitation and air temperature and of daily snow water equivalent data. Additionally, CAMELS-CH encompasses annual time series of land cover change and glacier evolution per catchment. The static catchment attributes comprise the following categories: location and topography, climate, hydrology, soil, hydrogeology, geology, land use, human impact and glaciers.
The corresponding manuscript is published at the journal "Earth System Science Data" (ESSD) and available here. The code used to generate the dataset is available on Github.
The data description file below contains a comprehensive list of all time series and attribute variables covered by the dataset and references to the original data sources. Further, this repository contains the "Caravan extension CH" for the "Caravan - A global community dataset for large-sample hydrology" Caravan dataset (see the list of extensions). This extension has the same format like other Caravan parts and is based on the same data sources. Note that some features like the annual glacier time series, etc. are therefore only available in the original CAMELS-CH dataset.
Updates:
Update version 0.9: affects "Caravan_extension_CH" - In version 1.5 of the Caravan dataset, Penman-Monteith PET was added as an additional time series feature. Additional to the new time series feature, also all pet-related climate indices were recomputed using the new Penman-Monteith PET. For consistency, the old ERA5-Land potential_evaporation time series and climate indices were kept, but renamed for a better identification of the differences.
Update version 0.8: resolving projection issue for shapefiles in "Caravan_extension_CH" using EPSG:4326 (WGS84); updating readme file of "camels_ch" regarding the Dischma catchment
Update version 0.7: update corresponding to the revision of the manuscript at "Earth System Science Data" (ESSD)
dataset file delimiters have been changed to commas from semicolons
the "time_series" folder was renamed to "timeseries"
in the simulation-based data, there was an error in the previous aggregation of precipitation and evapotranspiration. The corresponding time series, affected hydrologic signatures and climatic indices were corrected
the order of simulation-based variables in the timeseries files was changed to resemble the order shown in the tables of the corresponding publication in ESSD
blank values that were masked by "NA" are now consistently indicated by "NaN"
the readme file has been extended
Update version 0.6: updating links to related material (all links and references are available in the preprint/manuscript) and abstract
Update version 0.5: adding the "camels_ch_data_description.pdf" file
Update version 0.4: update of several static attributes in "Caravan_extension_CH" following a general update in Caravan and all its extensions + adopting the geographic coordinate system to Caravan-standard EPSG:4326
Update version 0.3: renaming single files/entries in "Caravan_extension_CH" to start with "camelsch" as unique Caravan extension identifier
Update version 0.2: CH extension to Caravan added
Version 1.5 – Updated in February 2025 Citation: If you use this dataset, please cite the following data description paper: Helgason, H. B. and Nijssen, B.: LamaH-Ice: LArge-SaMple DAta for Hydrology and Environmental Sciences for Iceland, Earth System Science Data, 16, 2741–2771, https://doi.org/10.5194/ESSD-16-2741-2024, 2024.
Key Updates in Version 1.5. This update extends the streamflow and meteorological time series in LamaH-Ice: - Streamflow series from operational gauges now extend to 2024-09-30 or 2023-09-30. - For select gauges (IDs: 8, 36, 66, 98), streamflow data has also been extended further back in time. - Hydrological signatures have been updated accordingly. - ERA5-Land meteorological series in LamaH-Ice have been extended to 2024-09-30 for basin delineation A. These are now sourced from the LamaH-Ice Caravan extension. - Hourly series in the LamaH-Ice hourly dataset were not updated.
Key Updates in the LamaH-Ice Caravan Extension: - Penman-Monteith PET has been added. - All climate indices influenced by PET have been recomputed using the new Penman-Monteith time series. - The ERA5-Land potential evaporation remains available, along with the old indices, but has been renamed for clarity: - New time series names: "potential_evaporation_sum_ERA5_LAND", "potential_evaporation_sum_FAO_PENMAN_MONTEITH" - Updated climate indices naming convention: Suffix _ERA5_LAND for indices using ERA5-Land PET, Suffix _FAO_PM for indices using FAO Penman-Monteith PET. - Streamflow observations have been added for two additional gauges (IDs: 1010, 9900).
Original dataset description: See https://www.hydroshare.org/resource/86117a5f36cc4b7c90a5d54e18161c91/
We offer two downloadable files for the LamaH-Ice dataset: 1) Hydrometeorological time series with both daily and hourly resolutions (30 GB after decompression) and 2) Hydrometeorological time series with daily resolution only (2 GB). Other than the temporal resolution, there are no differences between the two downloadable files. This HydroShare resource also hosts the "LamaH-Ice Caravan extension" (1 GB), which complements the "Caravan - A global community dataset for large-sample hydrology" Caravan dataset (Kratzert et al., 2023). The data is formatted in the same manner as the data currently existing in Caravan. To process the Caravan extension, the following guide was used: https://github.com/kratzert/Caravan/wiki/Extending-Caravan-with-new-basins. Some features, e.g. hourly atmospheric and streamflow series, glacier mass balance and MODIS timeseries data are thus only available in the LamaH-Ice dataset.
Data disclaimer: The Icelandic Meteorological Office (IMO) and the National Power Company of Iceland (NPC) own the data from most streamflow gauges in the dataset. The streamflow data is published on Hydroshare with permission of all data owners. Neither we nor the provider of the streamflow dataset can be liable for the data provided. The IMO and the NPC reserve the rights to retrospectively check and update the streamflow timeseries at any time, and these changes will not be reflected in this published dataset. If up-to-date data is needed, users are encouraged to contact the IMO and the NPC.
License: The streamflow data is subject to the CC BY-NC 4.0 (creativecommons.org/licenses/by-nc/4.0/). The streamflow data cannot be used for commercial purposes. All data except for the streamflow measurements are subject to the CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Users can share and adapt the dataset only if appropriate credit is given (the ESSD data description paper is cited, the version of the dataset and all data sources are listed which are declared in the folder "Info") and any changes are clearly indicated, and a link to the original license is provided.
Updates since the HydroShare repository was first created in February, 2025: August 11, 2025: - Meteorological time series from the CARRA reanalysis were added to the daily lamah-ice dataset - A .pdf file with metadata for all meteorological variables was added (LamaH-Ice Meteorological Data Documentation Table.pdf) - MODIS timeseries for snow cover and albedo were extended to 2024-09-30
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
CAMELS-DE provides a comprehensive collection of hydro-meteorological timeseries data (e.g. discharge, water level, precipitation, air temperature) and catchment attributes for 1582 streamflow gauges across Germany. The time series data is in daily resolution and spans up to 70 years, from January 1951 to December 2020. The static catchment attributes include information on topography, soils, land cover, hydrogeology and human influences. Additionally, the dataset includes discharge simulations from a regional Long-Short Term Memory (LSTM) network and a conceptual hydrological model (HBV), providing benchmark data for future hydrological modelling studies in Germany.
The accompanying data description gives information on data sources, the structure of the data set and contains extensive information on time series and catchment attribute variables. In addition, up-to-date benchmark results of the LSTM and HBV are provided.
Important: As CAMELS-DE is continuously developed and updated, please ensure that you cite the correct version of the dataset that you are using.
The CAMELS-DE data description paper is available here: https://doi.org/10.5194/essd-16-5625-2024.
Information about the code and methods for generating CAMELS-DE can be found here: https://doi.org/10.5281/zenodo.12760336" target="_blank" rel="noopener">CAMELS-DE Processing Pipeline.
CAMELS-DE is also part of the Caravan project, a global hydrological dataset. Due to the use of data products that are available beyond the Germany national boundaries, Caravan-DE includes 305 additional streamflow gauges, resulting in a total of 1887 streamflow gauges: https://doi.org/10.5281/zenodo.13320514.
english:
The state agencies do not guarantee the accuracy or completeness of the discharge or water level data provided. In addition, all hydrological data may be subject to future revisions, including adjustments to the rating curves or corrections of errors. Therefore, it is necessary to obtain the most recent discharge time series directly from the federal state authorities for projects that require water law permits. Additionally, the regulations of the respective federal state apply and specific enquiries should be made as needed. It is also important to note that the state agencies explicitly disclaim any warranty as to the accuracy or completeness of the data and therefore any liability claims against any of the federal states are also excluded.
german:
Die Ländesämter gewährleisten nicht die Genauigkeit oder Vollständigkeit der bereitgestellten Abfluss oder Wasserstandsdaten. Zudem können alle hydrologischen Daten zukünftigen Überarbeitungen unterliegen, einschließlich Anpassungen der Wasserstands-Abflussbeziehung oder der Korrektur von Fehlern. Daher ist es notwendig, die aktuellsten Abflusszeitreihen direkt bei den Landesbehörden zu beziehen, falls Wasserrechtsgenehmigungen erforderlich sind. Zusätzlich gelten die Vorschriften des jeweiligen Bundeslandes, und spezifische Anfragen sollten bei Bedarf gestellt werden. Es ist ebenfalls wichtig zu beachten, dass die staatlichen Behörden ausdrücklich jegliche Gewährleistung hinsichtlich der Genauigkeit oder Vollständigkeit der Daten ausschließen und somit auch jegliche Haftungsansprüche gegenüber einem der Bundesländer ausgeschlossen sind.
CAMELS_DE_topographic_attributes.csv Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the accompanying dataset to the following paper https://www.nature.com/articles/s41597-023-01975-w
Caravan is an open community dataset of meteorological forcing data, catchment attributes, and discharge daat for catchments around the world. Additionally, Caravan provides code to derive meteorological forcing data and catchment attributes from the same data sources in the cloud, making it easy for anyone to extend Caravan to new catchments. The vision of Caravan is to provide the foundation for a truly global open source community resource that will grow over time.
If you use Caravan in your research, it would be appreciated to not only cite Caravan itself, but also the source datasets, to pay respect to the amount of work that was put into the creation of these datasets and that made Caravan possible in the first place.
All current development and additional community extensions can be found at https://github.com/kratzert/Caravan
Channel Log:
23 May 2022: Version 0.2 - Resolved a bug when renaming the LamaH gauge ids from the LamaH ids to the official gauge ids provided as "govnr" in the LamaH dataset attribute files.
24 May 2022: Version 0.3 - Fixed gaps in forcing data in some "camels" (US) basins.
15 June 2022: Version 0.4 - Fixed replacing negative CAMELS US values with NaN (-999 in CAMELS indicates missing observation).
1 December 2022: Version 0.4 - Added 4298 basins in the US, Canada and Mexico (part of HYSETS), now totalling to 6830 basins. Fixed a bug in the computation of catchment attributes that are defined as pour point properties, where sometimes the wrong HydroATLAS polygon was picked. Restructured the attribute files and added some more meta data (station name and country).
16 January 2023: Version 1.0 - Version of the official paper release. No changes in the data but added a static copy of the accompanying code of the paper. For the most up to date version, please check https://github.com/kratzert/Caravan
10 May 2023: Version 1.1 - No data change, just update data description.
17 May 2023: Version 1.2 - Updated a handful of attribute values that were affected by a bug in their derivation. See https://github.com/kratzert/Caravan/issues/22 for details.
16 April 2024: Version 1.4 - Added 9130 gauges from the original source dataset that were initially not included because of the area thresholds (i.e. basins smaller than 100sqkm or larger than 2000sqkm). Also extended the forcing period for all gauges (including the original ones) to 1950-2023. Added two different download options that include timeseries data only as either csv files (Caravan-csv.tar.xz) or netcdf files (Caravan-nc.tar.xz). Including the large basins also required an update in the earth engine code
16 Jan 2025: Version 1.5 - Added FAO Penman-Monteith PET (potential_evaporation_sum_FAO_PENMAN_MONTEITH) and renamed the ERA5-LAND potential_evaporation band to potential_evaporation_sum_ERA5_LAND. Also added all PET-related climated indices derived with the Penman-Monteith PET band (suffix "_FAO_PM") and renamed the old PET-related indices accordingly (suffix "_ERA5_LAND").