Review and evaluation of Monte Carlo algorithms and neural networks to predict air pollution in the environment of a spatial information system

Number of pages: 139 File Format: word File Code: 31426
Year: 2011 University Degree: Master's degree Category: Biology - Environment
  • Part of the Content
  • Contents & Resources
  • Summary of Review and evaluation of Monte Carlo algorithms and neural networks to predict air pollution in the environment of a spatial information system

    Master's Thesis

    Abstract

    The necessity of having a healthy environment and raising the health level of the society makes the need to have proper planning to reduce the sources of air pollutants production and predict these pollutants to prevent its harmful effects inevitable. Prediction of pollutants can be useful in air pollution management and control. In this research, O3 pollutant due to its harmful effects on human health, as well as CO pollutant due to the use of non-standard vehicles and the problem of incomplete fuel combustion in cars in the city of Tehran have been taken into consideration. Meanwhile, the use of neural networks due to their appropriate ability to model systems with non-linear behavior can be useful for predicting changes in air pollutants. With such an approach, in this thesis, prediction and modeling of hourly changes in the concentration of CO and O3 pollutants using MLP and Elman neural networks and Bayesian regression have been investigated. In order to pre-process the data before entering the neural network, principal component analysis has been used. In this case, the use of the components obtained from the principal component analysis method has led to a reduction in the number of features, an increase in the degree of freedom, and a reduction in the training time of the network.

    Prediction of the two pollutants CO and O3 using Bayesian regression and estimation of its parameters by the Monte Carlo Markov chain method have also been considered.

    The results of the implementation of two types of neural networks and Bayesian regression show that MLP network with coefficient of determination (R2) equal to 0.6307 for CO prediction and Elman network with coefficient of determination equal to 0.6186 for O3 prediction have the best accuracy. Therefore, the results of the research confirm the superiority of the proposed neural networks over Bayesian regression.

    Key words: neural network, principal component analysis, Bayesian regression, Markov Monte Carlo chains. It is worrying to predict. Unfortunately, the ever-increasing activities of humans, especially after the industrial revolution, have caused air pollution on a large scale.

    It is clear that knowing biological behaviors in the production of air pollutants can help in managing and controlling air quality, and as a result, raising the level of social health and reducing the effects of air pollution; Because with this knowledge, one can think about the necessary planning to reduce the sources of air pollution production and to have a healthy environment. Primary pollutants are substances that enter the ambient air directly from sources and include five pollutants: carbon monoxide CO, nitrogen dioxide NO2, sulfur dioxide SO2, suspended particles with a diameter of less than 10 microns PM10 and volatile hydrocarbons VOCs. Secondary pollutants refer to substances that arise as a result of interactions in the air around the earth, and ozone O3 can be mentioned in this group. In this research, among the mentioned pollutants, the prediction of two pollutants, CO and O3, is the basis of the work. We know the necessity of ozone prediction due to its negative effects on human, animal and plant health and that with ozone modeling, it is possible to issue a quick warning in places where its concentration increases. Also, since cars are the main source of carbon monoxide production, due to the heavy traffic volume caused by transportation in the city of Tehran, the use of non-standard cars and the problem of incomplete combustion of the fuels used in cars, we have paid attention to the prediction of CO. Considering the deadly effects that carbon monoxide can have on human health, it seems necessary to make the necessary decisions for proper planning in dealing with this problem.

    As it is necessary to have a suitable decision in the future, we must obtain appropriate information about the behavior of our system so that we can check how it functions in other times by modeling the behavior of the system. In such a way, after proper modeling of the system, we can make a proper prediction of its behavior in the future and, as a result, make more optimal decisions to prevent unwanted incidents.In the way of systems modeling, knowing the influencing parameters in the system, the relationship of these parameters and the type of influence of each one in the system is one of the main discussions in the analysis and identification of the system.

    With this approach, in this research, we are looking for a suitable analysis of the environment so that we can predict its behavior and have a more accurate drawing of the future for ourselves. Neural networks, fuzzy logic, regressions and statistical methods are often used to model the behavior of air pollutants. In this research, we are looking to model and predict two pollutants, CO and O3, using the method of neural networks and linear regressions.

    Neural networks have a high ability to extract patterns from data and also solve complex problems of a natural nature. The accuracy of the implementation of these networks is appropriate in the case of dependence of the input parameters and even the presence of noise in the data. In this research, among the different architectures of neural networks, two multilayer perceptron neural networks [1] and Elman neural network [2] have been used to predict CO and O3 pollutants. The multi-layer perceptron neural network, despite its classicity in modeling intelligent systems, has been chosen due to its high flexibility in process modeling and its wide application in the discussion of air pollutant forecasting. Also, in this research, we have used Elman's neural network due to its structural and functional nature in modeling time series. 

    In regression methods, the need to accurately estimate the results and obtain relationships between parameters and variables affecting the results has created a wide range of these methods, among these regression methods, Bayes regression method [3] can be mentioned. In this research, we use Markov Monte Carlo chain method to determine the posterior distribution of Bayes estimation. 

    In this research, we aim to model the process of the two pollutants CO and O3 using neural network and Bayesian regression methods, investigate the accuracy of the neural network method as a smart nonlinear method and Bayesian regression as a classical linear method and examine the efficiency and flexibility of the methods used in modeling the two pollutants CO and O3. Also, by implementing MLP and Elman neural networks in order to predict the two pollutants CO and O3, we compare the accuracy of these two networks for modeling. 1-2- Background and objectives of the thesis Using neural networks and Bayesian regression to predict the two pollutants CO and O3 is the main goal of this thesis. Considering the multiplicity of neural network structures, comparing the effectiveness of two types of neural networks, MLP and Elman, in predicting air pollution, as well as comparing their accuracy with the Bayesian regression method, constitute the other goals of this thesis. According to the above, the sub-goals of this thesis are as follows: Introducing neural networks as an efficient method in predicting air pollutants.

    Investigation of the relationship between the amounts of pollutants and the error resulting from their prediction by neural networks.

    Investigation and analysis of the spatial and temporal variability of the two pollutants CO and O3.

    1-3-      Overview of the researches

    Researches carried out in the field of prediction of air pollutants have been expanded in various aspects. In this research, the prediction of meteorological pollutants has been challenged mainly from the aspect of the data used to predict each pollutant, the pre-processing performed on the input data to the forecasting system, the structure of the model used for prediction and the accuracy of these models.

    In the path of predicting air pollutants, the use of different data to enter the neural network has been proposed. In some of the conducted researches, according to the relationship between the desired pollutants and meteorological parameters, the use of meteorological parameters to predict the desired pollutants has been suggested [1], [2] and [3]. As an example, the use of meteorological data including: wind speed, relative humidity, wind direction, temperature, rainfall, air pressure, radiation amount and evaporation amount has been suggested in order to predict the O3 concentration hourly [1]. The prediction of 3 pollutants PM10, SO2 and CO in the next 24, 48 and 72 hours, using meteorological parameters including: wind direction, air pressure, day temperature, night temperature, relative humidity and wind speed is also suggested [2].

  • Contents & References of Review and evaluation of Monte Carlo algorithms and neural networks to predict air pollution in the environment of a spatial information system

    List:

    Chapter 1: Introduction. 1

    1-1- Introduction. 2

    1-2- Thesis fields and objectives. 5

    1-3- An overview of the conducted research 5

    1-4- Research method. 10

    1-5- Thesis structure. 12

    Chapter 2: Theoretical foundations. 13

    2-1- What is air pollution? 14

    2-1-1- Types of pollutants 14

               Polluted particles or airborne substances (PM10) 15

               Carbon monoxide. 15

    Sulfur oxides 16 Nitrogen oxides. 17. Ozone..18. Volatile hydrocarbons (VOCs) 19. 2-1-2- Air pollution standard index 19. Definition of ppm and ppb. 21

    2-2- Meteorological parameters and their effects on air polluting factors 21

    2-3- Time spatial information system. 26

    2-4- Time series. 28

    Chapter 3: Materials and methods used in research. 30

    3-1- Introduction of stations and data 31

    3-2- Examining the predictability of data 33

    3-2-1- The change analysis test of the basis of the change area (R/S analysis) 34

    3-3- The use of principal component analysis and principal factor analysis in order to examine the data affecting CO and O3 for Entering a CO and O3 prediction system 36 3-3-1- Main component analysis and main factor analysis. 36 3-4- Data time series analysis in order to extract effective time delays of each data series in forecasting O3 and CO 39 3-4-1 Using autocorrelation functions (ACF) and partial autocorrelation (PACF) in order to find the appropriate model for the time series 40 Autocorrelation function. 40

                Partial autocorrelation function. 41

               Autoregressive processes. 41

               Moving average processes. 42

               Moving average autoregressive processes. 42

               Stationary processes 43

               Converting non-stationary processes to stationary processes 44

    3-5- Neural network architectures. 45

    3-5-1- An artificial neuron model. 47

    3-6- Multilayer perceptron neural network. 48

    3-6-1- The structure of multilayer perceptron neural networks. 49

    3-6-2- Error backpropagation algorithm in multilayer perceptron networks. 51

    3-7- Elman network. 52

    3-7-1- Elaman network training. 54

    3-8- Linear regression. 55

    3-9- Generalized linear models. 55

    3-10- Bayes generalized linear models. 59

    3-11- Monte Carlo Markov chains. 60

    Chapter 4: Evaluation of neural networks and Bayesian regression with Monte Carlo approach in predicting CO and O3 pollutants.    ..62

    4-1- Introduction. 63

    4-2- Examining the spatial changes of the two pollutants CO and O3 65

    4-3- Examining the predictability of the data 67

    4-4- Examining the results of principal component analysis and principal factor analysis in order to examine the data affecting the two pollutants CO and O3 69

    4-5- Examining each of the time series to The purpose of determining the effective time delays for predicting a time step ahead 73

               Investigating the humidity time series. 74

    4-6- Data preprocessing to enter the neural network. 77

    4-7- Prediction of CO and O3 pollutants using neural networks. 79

    4-7-1- Prediction using MLP networks. 80

               O3 prediction 81

                CO prediction. 84

    4-7-2- Prediction using the Elman network. 88

               O3 prediction 88

                CO prediction. 91

    4-8- Prediction of CO and O3 pollutants using Bayes regression with Monte Carlo approach. 94

    Chapter 5: Conclusion and suggestions. 101

    5-1- Conclusion. 102

    5-2- Proposals. 107

    Appendix    108

    Appendix 1- Hirst view charts for meteorological parameters and air pollutants. 109

    Appendix 2- Graphs of autocorrelation and partial autocorrelation functions of meteorological parameters and air pollutants, as well as graphs of these functions for the rest of the fitted AR model. 112

    Appendix 3- An example of the components obtained from the principal component analysis method for the effective time delays of each parameter. . 119

    Appendix119

    Appendix 4- Formula for calculating RMSE and R2 122

    Source:

    [1] Sadr Mousavi, M., Rahimi, A., (Comparison of the results of multilayer perceptron neural networks with multiple regression in predicting ozone concentration in Tabriz city), Natural Geography Research, No. 71, Spring 1388.

    [2] A. Kurt, B. Gulbagci, F. Karaca, O. Alagha, "An online air pollution forecasting system using neural networks", Environment International, vol.34, pp.592-598, 2008.

    [3] Y. Feng, W. Zhang, D. Sun, L. Zhang, "Ozone concentration forecast method based on genetic algorithm optimized back propagation neural networks and support vector machine data classification", Environment International, vol.45, pp.1979-1985, 2011.

     

    [4] P. Perez, J. reyes, "An integrated neural network model for PM10 forecasting", Atmospheric Environment, vol.40, pp.2845-2851, 2006.

    [5] S. Wei Lin, C. Hong Sun, C. Han Chen, "Temporal data mining using genetic algorithm and neural network – A case study of air pollutant forecasts", IEEE transaction, vol.2, no.6, pp.568-576, 1991.

    [6] L. Osrodka, M. Wojtylac, E. Krajny, "Forecasting of high-level air pollution in urban-industrial agglomeration by means of numerical weather forecasting", journal of Natural Resource Research, vol.8, pp.93-109, 2003.

     

    [7] H. Jef, M. Clemens, D. Gerwin, F. Frans, B. Olivier "A neural network forecast for daily average PM10 concentrations in Belgium", Atmospheric Environment , vol.39, no.18, pp.3279-3289, 2005.

     

    [8] C. Carnevale, G. Finzi, E. Pisoni, V. Singh, M. Volta, "Neural networks and co-kriging techniques to forecast ozone concentrations in urban areas", International Environmental Modeling and Software Society (iEMSs), 2008.

     

    [9] E. Pasero, L. Mesin, "Artificial Neural Network to forecast air pollution", journal of Remote Sensing, vol.24, no.5, pp.1151-1156, 2003.

     

    [10] Aliari, M., (Prediction of the behavior of air pollution in Arak using neural networks), Master's thesis, University of Electrical Engineering Khwaja Nasiruddin Toosi Industry. 1382

    [11] E. Kalapanidas, N. Avouris, "Feature selection using a genetic algorithm applied on an air quality forecasting problem", Australian journal of earth science, vol.47, pp.757-770, 2000.

    [12] M. H. Gholizadeh, M. Darand, "Forecasting the air pollution with using artificial neural networks: The case study; Tehran city", Journal of Applied Sciences, vol.9, No.21, pp.3882-3887, 2009.

    [13] H. K. Cigizoglu, K. Alp, M. Komurcu, "Two neural network methods in estimation of air pollution time", Geoscience Canada, vol.17 (2), pp.79-89 2006.

    [14] W. Zhen Lu, W. Jian Wang, X. Kang Wang, S. Hang Yan, J. C. Lam, "Potential assessment of a neural network model with PCA/RBF approach for forecasting pollutant trends in Mong Kok urban air, Hong Kong", Environmental Research, vol.96, pp. 79–87, 2004.

     

    [15] S. I. V. Sousa, F. G. Martins, M. C. M. Alvim-Ferraz, M. C. Pereira, "Multiple linear regression and artificial neural networks based on principal components to predict ozone concentrations",

    Environmental Modeling & Software, vol. 22, pp. 97-103, 2007. [16] G. Soja, A.M. Soja, "Ozone indices based on simple meteorological parameters: potentials and limitations of regression and neural network models", Atmospheric Environment, vol. 33, pp. 4299-4307, 1999.

    [17] W.Z Lu, D. Wang, "Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme" Science of the Total Environment, vol. 395, pp. 109-116, 2008. [18] E.G. Ortiz-Garc?a, S. Salcedo-Sanz, ?.M. Pérez-Bellido, J.A. Portilla-Figueras, L.

Review and evaluation of Monte Carlo algorithms and neural networks to predict air pollution in the environment of a spatial information system