THE HOMOGENIZATION OF GPS INTEGRATED WATER VAPOUR TIME SERIES: METHODOLOGY AND BENCHMARKING THE ALGORITHMS ON SYNTHETIC DATASETS


Creative Commons License

Van Malderen R., Eric P., Klos A., Bock O., Bogusz J., Chimani B., ...More

9th Seminar for Homogenization and Quality Control in Climatological Databases and 4th Conference on Spatial Interpolation Techniques in Climatology and Meteorology, Budapest, Hungary, 3 - 07 April 2017

  • Publication Type: Conference Paper / Summary Text
  • City: Budapest
  • Country: Hungary
  • Karadeniz Technical University Affiliated: Yes

Abstract

Within the COST Action ES1206 “Advanced Global Navigation Satellite Systems tropospheric products for monitoring severe weather events and climate” (GNSS4SWEC), there was a clear interest and need to homogenize a worldwide Integrated Water Vapour (IWV) dataset retrieved with Global Positioning System (GPS), by correcting (artificial) break points due to e.g. instrumental changes. The first activity concentrated on a worldwide GPS dataset covering 100+ stations, with a homogeneous data processing from 1995 to March 2011. As at most of these stations, the ERA-interim reanalysis IWV field output correlates well with the IWV retrieved by GPS, the ERA-interim IWV time series are used as reference and the IWV differences between both sets are considered. The characterization of these IWV differences provided us with typical trend values, seasonal oscillations and noise models, to build a synthetic benchmark IWV dataset of differences. As a matter of fact, three different synthetic datasets have been generated, with different levels of complexity (w/o autoregressive noise, w/o gaps and trends), to assess the impact of these different factors on the performance of homogenization algorithms. In this presentation, we will show the results of different homogenization algorithms on those 3 different sets of synthetic time series. We present their detection scores and compare the estimated trends (and uncertainties) from their homogenized time series with the true input trends. We analyze the sensitivity of each detection method w.r.t. the complexity of the synthetic datasets. Finally, an outlook of the future activities and a call for participating in our activity will be given.