Soil moisture is a state variable of the land that crosses the interfaces of several disciplines, including meteorology, hydrology, climatology and ecology. Soil moisture content controls the partitioning of net radiation at the land surface between sensible and latent heat fluxes. This occurs primarily in the subtopics and tropics where rainfall is not abundant, and in the mid-latitudes during the warm season. Thus, accurate representation of soil moisture and its controls on surface fluxes are essential for water and energy cycle understanding and simulation, weather and climate forecasting (particularly at subseasonal to seasonal time scales) and the representation of changing feedbacks in a changing climate (Seneviratne et al. 2010; Dirmeyer et al. 2012, 2015).

Each type of soil moisture data set has its strengths and weaknesses, which are summarized in the table below. The best choice of product depends on the application. For instance, soil moisture for initializing a forecast model or Earth system model should be consistent with the statistics of the LSM in that model – it is never advisable to use a gridded soil moisture analysis as the soil moisture state in an LSM without first renormalizing the data in terms of standard deviations (anomalies) from the climatological annual cycle (Koster et al. 2009). Ideally, one should use a soil moisture analysis generated by the same LSM as used in the forecast model. Similarly, observational data should not be used to validate the absolute value of soil moisture in a model, as they do not necessarily correspond – model soil moisture is the result of soil moisture change calculated as the residual of the water balance (cf. Koster and Milly 1997). It is prudent to compare anomalies (in time) or spatial structures, and not RMSE or other 1st moment error statistics.

[The above text and table below were contributed by Dr. Paul Dirmeyer. Please see the Expert Guidance tab for the complete review and links to specific datasets.]

  Strengths and Limitations of different types of soil moisture data sets  
Category Strengths Limitations
 
Ground observations
  • In situ (in the ground) measurement (most)
  • Provides subsurface data (most)
  • Represents "ground truth"
  • Point measurements; limited spatial and temporal coverage
  • Prone to random and systematic errors
  • May not match model soil moisture in terms of scale, characteristics
Satellite observations
  • Global coverage
  • Level 3+ products are on a regular grid
  • Generally not obscured by clouds untless they are precipitating

• Gaps in coverage due to satellite orbits; complete coverage typically every ~8 days, but locally can be more frequent.

• Prone to random and systematic errors

• Microwave sensors only see surface (~3cm), do not see soil where vegetation is dense

• Microwave sensors prone to radio interference, signals especially corrupted over urban areas

• Gravity sensors have low spatial resolution, sense all water including water table fluctuations.

Blended observations
 

• Global coverage

• Data are gridded

• Fewer coverage gaps and can achieve higher temporal resolution than individual satellites

• Time series inconsistencies can arise when different sensors come in/out of operation/inclusion.

• Prone to random and systematic errors

LSM-based analyses

• Global coverage, no gaps in space or time

• Data are gridded, include subsurface data

• Somewhat constrained by observations (especially precipitation)

• Not prone to random errors

• Prone to systematic errors due to errors in model physics, soils or vegetation data sets used by models

• Not direct or indirect observations – model characteristics will strongly flavor the soil moisture analyses

• Most do not assimilate soil moisture observations

• Likely not representative of variations at spatial scales smaller than model grid cell

Reanalyses

• Global coverage, no gaps in space or time

• Data are gridded, include subsurface data

• Prone to systematic errors due to errors in atmospheric and land model physics, soils and vegetation data sets used by models