Reduction of Ground-based Sensor Sites for Spatio-temporal Analysis of Aerosols

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data, Paris, France, 2009

Vladan Radosavljevic, Slobodan Vucetic, and Zoran Obradovic

“In many remote sensing applications it is important to use multiple sensors to be able to understand the major spatio-temporal distribution patterns of an observed phenomenon. A particular remote sensing application addressed in this study is estimation of an important property of atmosphere, called Aerosol Optical Depth (AOD). Remote sensing data for AOD estimation are collected from ground and satellite-based sensors. Satellite-based measurements can be used as attributes for estimation of AOD and in this way could lead to better understanding of spatio-temporal aerosol patterns on a global scale. Ground-based AOD estimation is more accurate and is traditionally used as ground-truth information in validation of satellite-based AOD estimations. In contrast to this traditional role of ground-based sensors, a data mining approach allows more active use of ground-based measurements as labels in supervised learning of a regression model for AOD estimation from satellite measurements. Considering the high operational costs of ground-based sensors, we are studying a budget-cut scenario that requires a reduction in a number of ground-based sensors. To minimize loss of information, the objective is to retain sensors that are the most useful as a source of labeled data. The proposed goodness criterion for the selection is how close the accuracy of a regression model built on data from a reduced sensor set is to the accuracy of a model built of the entire set of sensors. We developed an iterative method that removes sensors one by one from locations where AOD can be predicted most accurately using training data from the remaining sites. Extensive experiments on two years of globally distributed AERONET ground-based sensor data provide strong evidence that sensors selected using the proposed algorithm are more informative than the competing approaches that select sensors at random or that select sensors based on spatial diversity.”