Air pollution hazard assessment using decision tree algorithms and bivariate probability cluster polar function: evaluating inter-correlation clusters of PM10 and other air pollutants

Pages 207-226 | Received 29 Jun 2019, Accepted 18 Dec 2019, Published online: 10 Jan 2020

Figures & data

Table 1. Minimum and maximum concentrations of pollutants around air monitoring stations.

Figure 1. Air pollution sources and industry density map of the study area.

Figure 2. Missing data summary for (a) CA0054 and (b) CA0016, and the pattern of missing data for (c) CA0054 and (d) CA0016 over the record period of the mean pollutant values, 2007 to 2016. (Yellow indicates missing data; blue indicates existing data.)

Figure 3. PM10 percentile distribution at (a) CA0016 and (b) CA0054. The x-axis shows percentile units and the y-axis shows the frequency of those units.

Figure 4. Elbow method showing the best number of clusters at the CA0016 station, using normalized percentile quantiles of (a) SO2; (b) PM10; (c) CO.

Figure 5. Summary plot showing the mean values of the pollutants for the period 2007 to 2016 at (a) CA0016 and (b) CA0054.

The percentage in brackets after the missing value (for example, “0%”) is the ratio of missing data ([number of missing days’ record]/[total record period]), computed with real-number precision.
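The ratio described in this note can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the function name `missing_ratio` and the sample values are assumptions made up for the example, and a record is treated as missing when it is `None` or NaN.

```python
def missing_ratio(values):
    """Return the fraction of records that are missing (None or NaN)."""
    if not values:
        raise ValueError("record is empty")
    # NaN is the only float that is not equal to itself.
    missing = sum(1 for v in values if v is None or v != v)
    return missing / len(values)


# Hypothetical daily PM10 record; these numbers are invented for illustration.
daily_pm10 = [41.0, None, 38.5, float("nan"), 52.1, 47.3, 44.0, 39.9]
ratio = missing_ratio(daily_pm10)
print(f"{ratio:.1%}")  # 2 of 8 records are missing
```

Rounding such a ratio to a whole percentage can display “0%” even when a few records are missing, which is presumably why the note specifies real-number precision.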

Figure 6. Pair plot of the relationships between CO, PM10, humidity, and SO2 at (a) CA0016 and (b) CA0054.

Figure 7. Decision tree built with the rpart algorithm from 1000 randomly selected hours for (a) CA0016 and (b) CA0054.

Table 2. Relative importance of pollutants with respect to PM10, based on random forest results.

Figure 8. Polar cluster plot for CA0016 and CA0054 stations.

Figure 9. Polar plot with cluster instead of wind speed.

Figure 10. The Theil-Sen function plot for CA0016 and CA0054 stations. Y-axis represents the values of concentrations and X-axis represents the time in years.

Note also that the symbols shown next to each trend estimate indicate how statistically significant the estimate is: p < 0.001 = ∗∗∗, p < 0.01 = ∗∗, p < 0.05 = ∗, and p < 0.1 = +.
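The symbol convention in this note amounts to a simple threshold lookup. The sketch below is illustrative only: `significance_symbol` is a hypothetical helper, not a function from the plotting package used for the figure, and it uses the ASCII `*` in place of the typeset ∗.

```python
def significance_symbol(p):
    """Map a p-value to the significance symbol described in the note."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p-value must lie in [0, 1]")
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    if p < 0.1:
        return "+"
    return ""  # not significant at the 0.1 level: no symbol


# Made-up p-values, chosen only to exercise each threshold.
for p in (0.0005, 0.004, 0.03, 0.07, 0.5):
    print(p, repr(significance_symbol(p)))
```

The thresholds are strict inequalities, so a p-value of exactly 0.001 receives two stars rather than three.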

Figure 11. TimeVariation function for normalized concentrations of NOx, CO, SO2, and PM10 at (a) CA0016 and (b) CA0054. The y-axis shows the normalized concentration values and the x-axis shows time in hours, months, and days.
