2D mapping by Kohonen networks of the air quality data from a large city

J Chem Inf Comput Sci. 2004 Mar-Apr;44(2):339-46. doi: 10.1021/ci030418r.

Abstract

The 15-variable environmental data (7 concentrations: CO, SO2, O3, NOx, NO, NO2, particulate matter smaller than 10 micron (PM10), and 8 weather data: cloudiness, rainfall, insolation factor (Isfi), temperature, pressure at two locations, and wind intensity with direction) in a period of 45 days with 1-h intervals were extracted from a larger database of concentrations recorded in minute intervals for the same time period. The monitoring site was located in the City of Buenos Aires in a relatively heavy traffic crossroad of two avenues. The data required special pretreatment where the hourly content of rain, wind intensity, wind velocity, and cloudiness were concerned. The new variable named insolation factor (relative UV radiation) calculated on the basis of the general meteorological data, the geographic position of the monitoring site, cloudiness, date, and the time of the recording was composed. The relative intensity of UV radiation was modeled by a Gaussian function, multiplied by a cloudiness factor. Based on the 14-variable input and the 1-variable output (ozone) data, first, the clustering of all 980 data records was made. The top map clustering showing the ozone concentration was related to the maps of all 14 variables. The link between O3 clusters, NO2, and Isfi weight levels is shown and discussed. As a preliminary result of this study some of the most interesting correlations between the maps and remaining variables are given.