Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Contact: Md Safat Sikder (msikder@ksu.edu), Jida Wang (jidawang@ksu.edu; gdbruins@ucla.edu)
Citation
If you use Lake-TopoCat, please cite the following ESSD preprint for now:
Sikder, M. S., Wang, J., Allen, G. H., Sheng, Y., Yamazaki, D., Song, C., Ding, M., Crétaux, J.-F., and Pavelsky, T. M., 2023. Lake-TopoCat: A global lake drainage topology and catchment dataset. Earth System Science Data Discussion, in review, https://doi.org/10.5194/essd-2022-433.
Data description and components
This version of Lake-TopoCat was constructed using the HydroLAKES v1.0 (Messager et al., 2016) lake mask and the 3-arc-second-resolution hydrography dataset MERIT Hydro v1.0.1 (Yamazaki et al., 2019). The drainage type of each HydroLAKES lake, such as isolated, inflow-headwater, headwater, flow-through, terminal, and coastal, was determined with assistance of MERIT Hydro-Vector (Lin et al., 2021), a high-resolution river network dataset with spatially-variable drainage densities.
For convenience, the global landmass (excluding Antarctica) was partitioned to 68 Pfafstetter Level-2 basins or regions, and the Lake-TopoCat data products were also organized based on these 68 regions, with their region or basin IDs shown in the Fig. 'Pfaf2_basins.jpg', attached to this database.
Lake-TopoCat consists of five feature components, each with multiple attributes depicting lake drainage relationships. The five features are:
1. Lake boundaries: polygons of 1,426,967 HydroLAKES lakes, larger than 10 ha.
File name: Lakes_pfaf_xx where, 'pfaf_xx' indicates the Pfafstetter Level-2 basin ID (shown in Fig. 'Pfaf2_basins.jpg')
2. Lake outlets: points representing outlet or pour points of each individual lake. There are multiple outlets from a multifurcation lake. We identified 1,459,201 outlets for 1,426,967 lakes, where 29,190 lakes (~2% of the global lakes) show bi/multifurcation.
File name: Outlets_pfaf_xx
3. Unit catchment: boundary polygons of catchment defining the drainage areas between cascading (i.e., immediately upstream and downstream) lake outlets. The count of unit catchments equal to the count of lake outlets, and bifurcation or multifurcation lakes have multiple local catchments. In total, the delineated catchments in Lake-TopoCat cover about 77.5 million km2, which is about 57% of the Earth’s land mass excluding the Antarctic.
File name: Catchments_pfaf_xx
4. Inter-lake reaches: line features defining the drainage networks that connect the lake outlets to the inland sinks or the ocean. About 3 million connecting reaches were generated among ~1.4 million outlets. The total length of these inter-lake connecting reaches is ~10 million km.
File name: Reaches_pfaf_xx
5. Lake-network basins: boundary polygons of the entire drainage area containing each inter-lake network (i.e., a complete basin from the headwater to an inland sink or the ocean for all basins containing lakes). A total of 47,340 lake-network basins were identified. Among them, endorheic basins account for 5.1% by count and 18% by area of all lake-network basins. These endorheic basins cover ~15.4% of global surface excluding Antarctica.
File name: Basins_pfaf_xx
The attribute tables for each of the feature components are explained in Section 4 of the product description document. For user convenience, we release the preliminary Lake-TopoCat lake outlets, unit catchments, and inter-lake reaches, with the affix '_prelim' in the file names (explained in the attached product description document). We also provide the polygon boundaries of the 68 Pfafstetter basins or regions in the file named 'Pfaf2_regions'. All files of Lake-TopoCat are available in both shapefile and geodatabase formats.
Disclaimer
Authors of this dataset claim no responsibility or liability for any consequences related to the use, citation, or dissemination of Lake-TopoCat.
http://www.opendefinition.org/licenses/cc-by-sahttp://www.opendefinition.org/licenses/cc-by-sa
Lake-TopoCat provides a global-scale dataset that links lakes with their drainage topology, delineates catchments, defines inter-lake pathways, and quantifies topological attributes (upstream/downstream relationships, drainage distances, outlet multiplicity, etc.). The input is based on HydroLAKES v1.0 and MERIT Hydro (3 arcsec), with algorithms to detect possible lake bifurcation, unit catchment delineation, reach segmentation, and network attribution. The database includes:
It covers ~1.43 million lakes (≥10 ha) and ~1.46 million outlets globally, and ~3 million inter-lake reaches, spanning ~77.5 × 10⁶ km² of catchment coverage (≈ 57 % of global landmass, excluding Antarctica). The dataset is freely available via Zenodo (DOI: 10.5281/zenodo.7916729) and is licensed under CC BY 4.0.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Contact: Md Safat Sikder (mssikder@illinois.edu), Jida Wang (jidaw@illinois.edu)
Citation
Sikder, M. S., Wang, J., Allen, G. H., Sheng, Y., Yamazaki, D., Crétaux, J.-F., and Pavelsky, T. M., 2024. HarP: Harmonized Prior river-lake database. Zenodo, https://doi.org/10.5281/zenodo.14205131.
If you only use the PLD-TopoCat dataset, please cite the following paper:
Sikder, M. S., Wang, J., Allen, G. H., Sheng, Y., Yamazaki, D., Song, C., Ding, M., Crétaux, J.-F., and Pavelsky, T. M., 2023. Lake-TopoCat: A global lake drainage topology and catchment dataset. Earth System Science Data, 15, 3483-3511, https://doi.org/10.5194/essd-15-3483-2023.
Data description and components
The Harmonized Prior river-lake database (HarP) for SWOT integrated the SWOT River Database (SWORD) (Altenau et al., 2021) and the SWOT Prior Lake Database (PLD) (Wang et al., 2023) into a geometrically (lake/river) explicit but topologically harmonized vector database to allow for coupled fluvial-lacustrine applications, including a synergistic use of both river and lake products from SWOT.
In addition to the input river network (SWORD v16) and lake database (PLD v106), we used the MERIT Hydro v1.0.1 (Yamazaki et al., 2019), a high-resolution (~90 m) global hydrography dataset, to develop this database.
The SWORD-PLD harmonization process involves three major steps, with Step 3 being divided into three sub-steps. The processing chain is illustrated in the attached Figure "SWORD-PLD_harmonization_steps.jpg", as well as in Section 2 of the product description document. The HarP database consists of the outputs from each of the steps. For convenience, the global landmass (excluding Antarctica) was partitioned to 68 Pfafstetter Level-2 basins/regions, with their IDs shown in Figure "Pfaf2_basins.jpg" attached.
The HarP database consists of five datasets or components (outputs from each step), each with multiple features. The five datasets are described below, and more details are elaborated in the product description document.
1. Harmonized SWORD-PLD (file name "Harmonized_SWORD_PLD"): This is the fully harmonized SWORD-PLD dataset, the primary product of HarP (i.e., output of Step 3.3 in Figure "SWORD-PLD_harmonization_steps.jpg"). This dataset couples SWORD and PLD into a geometrically segmented but topologically integrated dataset at the node, reach, and catchment scales (stored by three feature layers, respectively):
(a) Harmonized feature nodes: Harmonized_feature_nodes_pfaf_xx
(b) Harmonized river network: Harmonized_river_network_pfaf_xx
(c) Harmonized feature catchments: Harmonized_feature_catchments_pfaf_xx
Note: ''pfaf_xx'' indicates the Pfafstetter Level-2 basin ID (shown in Fig. 'Pfaf2_basins.jpg').
Figure "HarP_example.jpg", attached to this database, is an example of the fully harmonized SWORD-PLD dataset for the Ohio River Basin. The example shows three main features of the dataset: feature nodes (i.e., reach downstream ends, lake inlets, and lake outlets; see Fig. 3 in the product description document for definitions), river reaches (i.e., reaches characterized by SWORD alone, characterized by TopoCat alone, and shared by both SWORD and TopoCat), and catchments segmented by each of the feature nodes.
2. Intersected SWORD-PLD drainage configuration (file name "Intersected_SWORD_PLD"): This dataset is the intersected SWORD-PLD (prior river-lake) features (i.e., output of Step 2 in Figure "SWORD-PLD_harmonization_steps.jpg"). This dataset was constructed independently from Step 1 and Step 3. In this dataset, the original geometries of SWORD and PLD are not altered, but instead, their geometric and drainage topological relationships are configured in the attribute tables. This dataset consists of three features:
(a) Intersected reaches: Intersected_SWORD_reaches_pfaf_xx
(b) Intersected nodes: Intersected_SWORD_nodes_pfaf_xx
(c) Intersected lakes: Intersected_PLD_lakes_pfaf_xx
3. PLD-TopoCat (file name "PLD_TopoCat"): This dataset is the lake drainage topology and catchments (TopoCat) for PLD lakes (i.e., output of Step 1 in Figure "SWORD-PLD_harmonization_steps.jpg"). PLD-TopoCat was developed to generate detailed lake drainage topology and connecting paths, which were later used to configure the off-SWORD-network PLD lakes into the tributaries that drain to SWORD. PLD-TopoCat was generated from PLD v106 and MERIT Hydro. Details of the developiong process and algorithm for TopoCat can be found at Sikder at al., (2023). PLD-TopoCat dataset contains six features:
(a) Lake original polygon: PLD_lakes_pfaf_xx
(b) Lake raster polygon: Lake_raster_polygons_pfaf_xx
(c) Lake outlets: Lake_outlets_pfaf_xx
(d) Lake catchments: Lake_catchments_pfaf_xx
(e) Inter-lake reaches: Inter_lake_reaches_pfaf_xx
(f) Lake-network basins: Lake_network_basins_pfaf_xx
Note: full version of the PLD-TopoCat is available here.
4. SWORD-mirror network (file name "SWORD_mirror"): The SWORD-mirror network was constructed to facilitate the SWORD-TopoCat network merging process (i.e., output of Step 3.1 in Figure "SWORD-PLD_harmonization_steps.jpg"). It is essentially a replica of SWORD except that the original SWORD reaches are geometrically modified to be aligned with the topological/hydrographic information depicted in MERIT Hydro. The SWORD-mirror network consists of four features:
(a) SWORD-original reaches: SWORD_original_reaches_pfaf_xx
(b) SWORD-mirror prelim. reaches: SWORD_mirror_prelim_reaches_pfaf_xx
(c) SWORD-mirror reaches: SWORD_mirror_reaches_pfaf_xx
(d) SWORD-mirror reach catchments: SWORD_mirror_reach_catchments_pfaf_xx
5. Merged SWORD-mirror – TopoCat network (file name "SWORD_TopoCat_merged"): This dataset is the output of Step 3.2 in Figure "SWORD-PLD_harmonization_steps.jpg". It is essentially the merged product of the inter-lake reaches (from Step 2) and SWORD-mirror reaches (from Step 3.1). The merged SWORD-mirror – TopoCat network consists of three features:
(a) Merged SWORD-TopoCat reaches: SWORD_TopoCat_merged_reaches_pfaf_xx
(b) SWORD nodes at SWORD-TopoCat confluence: SWORD_TopoCat_confluence_nodes_pfaf_xx
(c) Reach catchments for merged network: SWORD_TopoCat_reach_catchments_pfaf_xx
The attribute tables for each of the feature components are explained in Section 4 of the product description document. All files of HarP are available in both shapefile and geodatabase formats.
Disclaimer
Authors of this dataset claim no responsibility or liability for any consequences related to the use, citation, or dissemination of HarP. For any quesitons, please contact Safat Sikder and Jida Wang.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Contact: Md Safat Sikder (msikder@ksu.edu), Jida Wang (jidawang@ksu.edu; gdbruins@ucla.edu)
Data description
This data can be considered a supplement to the Georeferenced global Dams And Reservoirs (GeoDAR) dataset (doi:10.5281/zenodo.6163413).
Here in GeoDAR-TopoCat, the method of TopoCat (doi:10.5281/zenodo.7420810) has been applied on GeoDAR reservoirs in order to construct the drainage topology and catchments for global reservoirs.
To avoid ambiguity, please refer to this version of GeoDAR-TopoCat as “GeoDAR-TopoCat v1.1-1.0”, where “1.1” specifies the version of GeoDAR reservoirs, whose drainage topology and catchments are constructed using the method in version “1.0” of TopoCat.
Relevant datasets
Attribute description
Description of the attributes of GeoDAR-TopoCat is the same as those of TopoCat v1.0. The unique ID of each GeoDAR reservoir is specified in “id_v11” (consistent with the GeoDAR dataset). Please refer to the attributes of TopoCat and GeoDAR for more details.
Data and code availability
All datasets are available under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license (https://creativecommons.org/licenses/by/4.0).
Please refer to GeoDAR and TopoCat datasets for other details and disclaimers.
Citation
We request anyone who uses GeoDAR-TopoCat to cite both GeoDAR and TopoCat papers:
Wang, J., Walter, B. A., Yao, F., Song, C., Ding, M., Maroof, A. S., Zhu, J., Fan, C., McAlister, J. M., Sikder, M. S., Sheng, Y., Allen, G. H., Crétaux, J.-F., and Wada, Y.: GeoDAR: georeferenced global dams and reservoirs database for bridging attributes and geolocations. Earth System Science Data, 14, 1869-1899, 2022, https://doi.org/10.5194/essd-14-1869-2022.
Sikder, M. S., Wang, J., Allen, G. H., Sheng, Y., Yamazaki, D., Song, C., Ding, M., Crétaux, J.-F., and Pavelsky, T. M., 2023. Lake-TopoCat: A global lake drainage topology and catchment dataset. Earth System Science Data Discussion, in review, https://doi.org/10.5194/essd-2022-433.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Documented March 19, 2023
!!NEW!!!
GeoDAR reservoirs were registered to the drainage network! Please see the auxiliary data "GeoDAR-TopoCat" at https://zenodo.org/records/7750736. "GeoDAR-TopoCat" contains the drainage topology (reaches and upstream/downstream relationships) and catchment boundary for each reservoir in GeoDAR, based on the algorithm used for Lake-TopoCat (doi:10.5194/essd-15-3483-2023).
Documented April 1, 2022
Citation
Wang, J., Walter, B. A., Yao, F., Song, C., Ding, M., Maroof, A. S., Zhu, J., Fan, C., McAlister, J. M., Sikder, M. S., Sheng, Y., Allen, G. H., Crétaux, J.-F., and Wada, Y.: GeoDAR: georeferenced global dams and reservoirs database for bridging attributes and geolocations. Earth System Science Data, 14, 1869–1899, 2022, https://doi.org/10.5194/essd-14-1869-2022.
Please cite the reference above (which was fully peer-reviewed), NOT the preprint version. Thank you.
Contact
Dr. Jida Wang, jidawang@ksu.edu, gdbruins@ucla.edu
Data description and components
Data folder “GeoDAR_v10_v11” (.zip) contains two consecutive, peer-reviewed versions (v1.0 and v1.1) of the Georeferenced global Dams And Reservoirs (GeoDAR) dataset:
As by-products of GeoDAR harmonization, folder “GeoDAR_v10_v11” also contains:
Attribute description
Attribute |
Description and values |
v1.0 dams (file name: GeoDAR_v10_dams; format: comma-separated values (csv) and point shapefile) | |
id_v10 |
Dam ID for GeoDAR version 1.0 (type: integer). Note this is not the same as the International Code in ICOLD WRD but is linked to the International Code via encryption. |
lat |
Latitude of the dam point in decimal degree (type: float) based on datum World Geodetic System (WGS) 1984. |
lon |
Longitude of the dam point in decimal degree (type: float) on WGS 1984. |
geo_mtd |
Georeferencing method (type: text). Unique values include “geo-matching CanVec”, “geo-matching LRD”, “geo-matching MARS”, “geo-matching NID”, “geo-matching ODC”, “geo-matching ODM”, “geo-matching RSB”, “geocoding (Google Maps)”, and “Wada et al. (2017)”. Refer to Table 2 in Wang et al. (2022) for abbreviations. |
qa_rank |
Quality assurance (QA) ranking (type: text). Unique values include “M1”, “M2”, “M3”, “C1”, “C2”, “C3”, “C4”, and “C5”. The QA ranking provides a general measure for our georeferencing quality. Refer to Supplementary Tables S1 and S3 in Wang et al. (2022) for more explanation. |
rv_mcm |
Reservoir storage capacity in million cubic meters (type: float). Values are only available for large dams in Wada et al. (2017). Capacity values of other WRD records are not released due to ICOLD’s proprietary restriction. Also see Table S4 in Wang et al. (2022). |
val_scn |
Validation result (type: text). Unique values include “correct”, “register”, “mismatch”, “misplacement”, and “Google Maps”. Refer to Table 4 in Wang et al. (2022) for explanation. |
val_src |
Primary validation source (type: text). Values include “CanVec”, “Google Maps”, “JDF”, “LRD”, “MARS”, “NID”, “NPCGIS”, “NRLD”, “ODC”, “ODM”, “RSB”, and “Wada et al. (2017)”. Refer to Table 2 in Wang et al. (2022) for abbreviations. |
qc |
Roles and name initials of co-authors/participants during data quality control (QC) and validation. Name initials are given to each assigned dam or region and are listed generally in chronological order for each role. Collation and harmonization of large dams in Wada et al. (2017) (see Table S4 in Wang et al. (2022)) were performed by JW, and this information is not repeated in the qc attribute for a reduced file size. Although we tried to track the name initials thoroughly, the lists may not be always exhaustive, and other undocumented adjustments and corrections were most likely performed by JW. |
v1.1 dams (file name: GeoDAR_v11_dams; format: comma-separated values (csv) and point shapefile) | |
id_v11 |
Dam ID for GeoDAR version 1.1 (type: integer). Note this is not the same as the International Code in ICOLD WRD but is linked to the International Code via encryption. |
id_v10 |
v1.0 ID of this dam/reservoir (as in id_v10) if it is also included in v1.0 (type: integer). |
id_grd_v13 |
GRanD ID of this dam if also included in GRanD v1.3 (type: integer). |
lat |
Latitude of the dam point in decimal degree (type: float) on WGS 1984. Value may be different from that in v1.0. |
lon |
Longitude of the dam point in decimal degree (type: float) on WGS 1984. Value may be different from that in v1.0. |
geo_mtd |
Same as the value of geo_mtd in v1.0 if this dam is included in v1.0. |
qa_rank |
Same as the value of qa_rank in v1.0 if this dam is included in v1.0. |
val_scn |
Same as the value of val_scn in v1.0 if this dam is included in v1.0. |
val_src |
Same as the value of val_src in v1.0 if this dam is included in v1.0. |
rv_mcm_v10 |
Same as the value of rv_mcm in v1.0 if this dam is included in v1.0. |
rv_mcm_v11 |
Reservoir storage capacity in million cubic meters (type: float). Due to ICOLD’s proprietary restriction, provided values are limited to dams in Wada et al. (2017) and GRanD v1.3. If a dam is in both Wada et al. (2017) and GRanD v1.3, the value from the latter (if valid) takes precedence. |
har_src |
Source(s) to harmonize the dam points. Unique values include “GeoDAR v1.0 alone”, “GRanD v1.3 and GeoDAR 1.0”, “GRanD v1.3 and other ICOLD”, and “GRanD v1.3 alone”. Refer to Table 1 in Wang et al. (2022) for more details. |
pnt_src |
Source(s) of the dam point spatial coordinates. Unique values include “GeoDAR v1.0”, “original |
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Contact: Md Safat Sikder (msikder@ksu.edu), Jida Wang (jidawang@ksu.edu; gdbruins@ucla.edu)
Citation
If you use Lake-TopoCat, please cite the following ESSD preprint for now:
Sikder, M. S., Wang, J., Allen, G. H., Sheng, Y., Yamazaki, D., Song, C., Ding, M., Crétaux, J.-F., and Pavelsky, T. M., 2023. Lake-TopoCat: A global lake drainage topology and catchment dataset. Earth System Science Data Discussion, in review, https://doi.org/10.5194/essd-2022-433.
Data description and components
This version of Lake-TopoCat was constructed using the HydroLAKES v1.0 (Messager et al., 2016) lake mask and the 3-arc-second-resolution hydrography dataset MERIT Hydro v1.0.1 (Yamazaki et al., 2019). The drainage type of each HydroLAKES lake, such as isolated, inflow-headwater, headwater, flow-through, terminal, and coastal, was determined with assistance of MERIT Hydro-Vector (Lin et al., 2021), a high-resolution river network dataset with spatially-variable drainage densities.
For convenience, the global landmass (excluding Antarctica) was partitioned to 68 Pfafstetter Level-2 basins or regions, and the Lake-TopoCat data products were also organized based on these 68 regions, with their region or basin IDs shown in the Fig. 'Pfaf2_basins.jpg', attached to this database.
Lake-TopoCat consists of five feature components, each with multiple attributes depicting lake drainage relationships. The five features are:
1. Lake boundaries: polygons of 1,426,967 HydroLAKES lakes, larger than 10 ha.
File name: Lakes_pfaf_xx where, 'pfaf_xx' indicates the Pfafstetter Level-2 basin ID (shown in Fig. 'Pfaf2_basins.jpg')
2. Lake outlets: points representing outlet or pour points of each individual lake. There are multiple outlets from a multifurcation lake. We identified 1,459,201 outlets for 1,426,967 lakes, where 29,190 lakes (~2% of the global lakes) show bi/multifurcation.
File name: Outlets_pfaf_xx
3. Unit catchment: boundary polygons of catchment defining the drainage areas between cascading (i.e., immediately upstream and downstream) lake outlets. The count of unit catchments equal to the count of lake outlets, and bifurcation or multifurcation lakes have multiple local catchments. In total, the delineated catchments in Lake-TopoCat cover about 77.5 million km2, which is about 57% of the Earth’s land mass excluding the Antarctic.
File name: Catchments_pfaf_xx
4. Inter-lake reaches: line features defining the drainage networks that connect the lake outlets to the inland sinks or the ocean. About 3 million connecting reaches were generated among ~1.4 million outlets. The total length of these inter-lake connecting reaches is ~10 million km.
File name: Reaches_pfaf_xx
5. Lake-network basins: boundary polygons of the entire drainage area containing each inter-lake network (i.e., a complete basin from the headwater to an inland sink or the ocean for all basins containing lakes). A total of 47,340 lake-network basins were identified. Among them, endorheic basins account for 5.1% by count and 18% by area of all lake-network basins. These endorheic basins cover ~15.4% of global surface excluding Antarctica.
File name: Basins_pfaf_xx
The attribute tables for each of the feature components are explained in Section 4 of the product description document. For user convenience, we release the preliminary Lake-TopoCat lake outlets, unit catchments, and inter-lake reaches, with the affix '_prelim' in the file names (explained in the attached product description document). We also provide the polygon boundaries of the 68 Pfafstetter basins or regions in the file named 'Pfaf2_regions'. All files of Lake-TopoCat are available in both shapefile and geodatabase formats.
Disclaimer
Authors of this dataset claim no responsibility or liability for any consequences related to the use, citation, or dissemination of Lake-TopoCat.