TY - JOUR
T1 - A Systematic Study of Hyperparameter Tuning for Environmental Text Classification
T2 - Implications for Environmental Management
AU - Kim, J. J.
AU - Adamowski, J.
AU - Park, S.
AU - Lim, K.
AU - Jeong, H.
N1 - Publisher Copyright:
© 2025 ISEIS All rights reserved.
PY - 2025
Y1 - 2025
N2 - Environmental management increasingly relies on rapid and precise information analysis to resolve critical environmental problems. This study evaluated the effectiveness of hyperparameter tuning and its impact on automatic environmental text classification performance using different Machine Learning (ML) classifiers and term-weighting schemes. Our results indicated that hyperparameter tuning generally enhanced classification performance, with the eXtreme Gradient Boosting (XGBoost) classifier showing the highest performance. The study also highlighted the trade-off between performance improvement and computational cost, i.e., enhanced classification accuracy at the expense of increased execution time. Notably, hyperparameter sensitivity varied among ML classifiers. For example, the Multinomial Naive Bayes classifier was less sensitive to hyperparameter tuning under certain term-weighting schemes. These findings provide new insights into the relationships between hyperparameter optimization, classification performance, and computational efficiency in environmental text classification. They offer valuable guidance for selecting and tuning classifiers to support better-informed decisions in environmental management.
AB - Environmental management increasingly relies on rapid and precise information analysis to resolve critical environmental problems. This study evaluated the effectiveness of hyperparameter tuning and its impact on automatic environmental text classification performance using different Machine Learning (ML) classifiers and term-weighting schemes. Our results indicated that hyperparameter tuning generally enhanced classification performance, with the eXtreme Gradient Boosting (XGBoost) classifier showing the highest performance. The study also highlighted the trade-off between performance improvement and computational cost, i.e., enhanced classification accuracy at the expense of increased execution time. Notably, hyperparameter sensitivity varied among ML classifiers. For example, the Multinomial Naive Bayes classifier was less sensitive to hyperparameter tuning under certain term-weighting schemes. These findings provide new insights into the relationships between hyperparameter optimization, classification performance, and computational efficiency in environmental text classification. They offer valuable guidance for selecting and tuning classifiers to support better-informed decisions in environmental management.
KW - environmental big data
KW - Korean news articles
KW - machine learning classifier
KW - term-weighting schemes
KW - text mining in environmental informatics
UR - https://www.scopus.com/pages/publications/105012962910
U2 - 10.3808/jei.202500545
DO - 10.3808/jei.202500545
M3 - Article
AN - SCOPUS:105012962910
SN - 1726-2135
VL - 46
JO - Journal of Environmental Informatics
JF - Journal of Environmental Informatics
IS - 1
ER -