Last updated: 2021-06-14

Checks: 2 0

Knit directory: globalIRmap/

This reproducible R Markdown analysis was created with workflowr (version 1.6.2). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.


Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.

The results in this page were generated with repository version 5e433c3. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .Rhistory
    Ignored:    .Rproj.user/
    Ignored:    R/.Rhistory
    Ignored:    analysis/.Rhistory
    Ignored:    renv/library/
    Ignored:    renv/staging/

Untracked files:
    Untracked:  .Rbuildignore
    Untracked:  .drake/
    Untracked:  .gitignore
    Untracked:  Compare_models_20201026.Rmd
    Untracked:  Rplots.pdf
    Untracked:  assets/
    Untracked:  figtabres.docx
    Untracked:  figtabres_20201220_1.docx
    Untracked:  figtabres_20210216.docx
    Untracked:  log/
    Untracked:  schema.ini
    Untracked:  tabs_quick.Rmd
    Untracked:  tabs_quick.docx
    Untracked:  tabs_quick.html
    Untracked:  test.html

Unstaged changes:
    Modified:   .Rprofile
    Modified:   R/IRmapping_plan.R
    Modified:   analysis/_site.yml
    Deleted:    analysis/results_diagnostics.Rmd
    Modified:   figtabres.Rmd
    Modified:   interactive.R
    Modified:   workflowr_commands.R

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.


These are the previous versions of the repository in which changes were made to the R Markdown (analysis/methods_riveratlas.Rmd) and HTML (docs/methods_riveratlas.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File Version Author Date Message
Rmd 5e433c3 messamat 2021-06-14 Publish new pages

Global underpinning hydrography

We predicted the distribution of IRES for river reaches in the global RiverATLAS database. RiverATLAS is a widely used representation of the global river network built on the hydrographic database HydroSHEDS. Rivers are delineated on the basis of drainage direction and flow accumulation maps derived from elevation data at a pixel resolution of 3 arcseconds (~90 m at the equator) and subsequently upscaled to 15 arcseconds (~500 m at the equator). In this study, we only included river reaches with a modelled MAF ≥ 0.1 m3 s−1 and excluded: i) smaller streams (owing to increasing uncertainties in their geospatial location and flow estimates derived from global datasets and models); and ii) sections of river reaches within lakes (identified based on HydroLAKES polygons).

Hydro-environmental predictor variables

The primary source of predictor variables was the global RiverATLAS database, version 1.0, which is a subset of the broader HydroATLAS product. RiverATLAS provides hydro-environmental information for all rivers of the world, both within their contributing local reach catchment and across the entire upstream drainage area of every reach. This information was derived by aggregating and reformatting original data from well established global digital maps, and by accumulating them along the drainage network from headwaters to ocean outlets.

We complemented the RiverATLAS v1.0 data with three additional sets of variables. The first set of variables describes the inter-annual open surface water dynamics as determined by remote sensing imagery from 1999 to 2019 (Pickens et al. 2019). In the original dataset, each 30-m-resolution pixel has been covered by water sometime during this time period was assigned one of seven ‘interannual dynamic classes’ (for example, permanent water, stable seasonal, high-frequency changes) on the basis of a time series analysis of the annual percentage of open water in the pixel. We computed the percent coverage of each of these interannual dynamic classes relative to the total area of surface water within the contributing local catchment and across the entire upstream drainage area of every river reach.

Second, we replaced the soil and climate characteristics in RiverATLAS v1.0 with updated datasets. Specifically, we computed the average texture of the top 100 cm of soil based on SoilGrids250m v2. We also updated the climate variables with WorldClim v2 (adding all bioclimatic variables to the existing set of variables) as well as the second version of the Global Aridity Index and Global Reference Evapotranspiration (Global-PET) datasets. Finally, we updated the Climate Moisture Index (CMI), computed from the annual precipitation and potential evapotranspiration datasets provided by the WorldClim v2 and Global-PET v2 databases, respectively.

We derived a third set of variables by combining multiple variables already included in the model through algebraic operations. These metrics included the runoff coefficient (that is, the ratio of MAF and mean annual precipitation), specific discharge (that is, MAF per unit drainage area), and various temporal (for example, minimum annual/ maximum annual discharge) and spatial (for example, mean elevation in local reach catchment/mean elevation in upstream drainage area) ratios.

Pre-formatting of spatial data

This GitHub repository contains Python code used for pre-processing of data. The output of this pre-processing is available and compiled as the attributes in the global river network and streamflow gauging stations datasets (available in the figshare data repository).

Here, we briefly explain how this code was used, but additional data (~100GB) not currently available publicly are needed to fully reproduce the analysis. Please contact and/or for additional information should you want to re-produce the results from this study. In addition please note that processing these data takes weeks of continuous computing on a normal workstation.