TY - JOUR
T1 - Identification of odor emission sources in urban areas using machine learning-based classification models
AU - Choi, Yelim
AU - Kim, Kyunghoon
AU - Kim, Seonghwan
AU - Kim, Daekeun
N1 - Publisher Copyright:
© 2022
PY - 2022/1
Y1 - 2022/1
N2 - Odor-causing substances are generated by various emission sources in urban areas. Recently, urbanization has greatly increased the density of odor emission facilities, implying the identification of odorants emission source is challenging. Identifying emission source is multifactorial, and a machine learning approach is considered useful for these complicated matters. The objectives of this study were to propose a method using machine learning-based classification models to identify odor sources in urban areas. We collected 34,539 data points regarding quantitative data of 22 compounds emitting from 11 types of facilities in urban areas (i.e., automobile industry, bio factory, wastewater treatment plant, landfill, construction site, farm industrial complex area, restaurant, gas station, roadside, park) and odor intensity of these 11 facilities. Decision tree (DT) and random forest (RF) algorithms were used as classification models for identifying odor sources with 23 variables (22 compounds + odor intensity). The DT model identified 7 out of 11 emission sources with 87.15% accuracy. The RF model identified all 11 emission sources with 99.23% accuracy. When including 6 important variables only (i.e., hydrogen sulfide, ammonia, trimethylamine, methyl mercaptan, acetaldehyde, odor intensity) in the RF model, accuracy (99.15%) was almost same with that (99.23%) obtained from all 23 variables included as variables in the model. Our findings imply that a machine learning approach can help to identify odor emission sources with high accuracy and we can save time and cost in the identification of odor emission sources by including the 6 important variables only.
AB - Odor-causing substances are generated by various emission sources in urban areas. Recently, urbanization has greatly increased the density of odor emission facilities, implying the identification of odorants emission source is challenging. Identifying emission source is multifactorial, and a machine learning approach is considered useful for these complicated matters. The objectives of this study were to propose a method using machine learning-based classification models to identify odor sources in urban areas. We collected 34,539 data points regarding quantitative data of 22 compounds emitting from 11 types of facilities in urban areas (i.e., automobile industry, bio factory, wastewater treatment plant, landfill, construction site, farm industrial complex area, restaurant, gas station, roadside, park) and odor intensity of these 11 facilities. Decision tree (DT) and random forest (RF) algorithms were used as classification models for identifying odor sources with 23 variables (22 compounds + odor intensity). The DT model identified 7 out of 11 emission sources with 87.15% accuracy. The RF model identified all 11 emission sources with 99.23% accuracy. When including 6 important variables only (i.e., hydrogen sulfide, ammonia, trimethylamine, methyl mercaptan, acetaldehyde, odor intensity) in the RF model, accuracy (99.15%) was almost same with that (99.23%) obtained from all 23 variables included as variables in the model. Our findings imply that a machine learning approach can help to identify odor emission sources with high accuracy and we can save time and cost in the identification of odor emission sources by including the 6 important variables only.
KW - Classification
KW - Emission source
KW - Machine learning
KW - Odor
UR - https://www.scopus.com/pages/publications/85124048986
U2 - 10.1016/j.aeaoa.2022.100156
DO - 10.1016/j.aeaoa.2022.100156
M3 - Article
AN - SCOPUS:85124048986
SN - 2590-1621
VL - 13
JO - Atmospheric Environment: X
JF - Atmospheric Environment: X
M1 - 100156
ER -