TY - JOUR
T1 - Development of empirical formula for imbalanced transverse dispersion coefficient data set using SMOTE
AU - Lee, Sunmi
AU - Yoon, Taewon
AU - Park, Inhwan
N1 - Publisher Copyright:
© 2021 Korea Water Resources Association.
PY - 2021/12
Y1 - 2021/12
N2 - In this study, a new empirical formula for 2D transverse dispersion coefficient was developed using the results of previous tracer test studies, and the performance of the formula was evaluated. Since many tracer test studies have been conducted under the conditions where the width-to-depth ratio is less than 50, the existing empirical formulas developed using these imbalanced tracer test results have limitations in applying to rivers with a width-to-depth ratio greater than 50. Therefore, in order to develop an empirical formula for transverse dispersion coefficient using the imbalanced tracer test data, the Synthetic Minority Oversampling TEchnique (SMOTE) was used to oversample new data representing the properties of the existing tracer test data. The hydraulic data and the transverse dispersion coefficients in conditions of width-to-depth ratio greater than 50 were oversampled using the SMOTE. The reliability of the oversampled data was evaluated using the ROC (Receiver Operating Characteristic) curve. The empirical formula of transverse dispersion coefficient was developed including the oversampled data, and the performance of the results were compared with the empirical formulas suggested in previous studies using R2. From the comparison results, the value of R2 was 0.81 for the range of W/H < 50 and 0.92 for 50 < W/H, which were improved accuracy compared to the previous studies.
AB - In this study, a new empirical formula for 2D transverse dispersion coefficient was developed using the results of previous tracer test studies, and the performance of the formula was evaluated. Since many tracer test studies have been conducted under the conditions where the width-to-depth ratio is less than 50, the existing empirical formulas developed using these imbalanced tracer test results have limitations in applying to rivers with a width-to-depth ratio greater than 50. Therefore, in order to develop an empirical formula for transverse dispersion coefficient using the imbalanced tracer test data, the Synthetic Minority Oversampling TEchnique (SMOTE) was used to oversample new data representing the properties of the existing tracer test data. The hydraulic data and the transverse dispersion coefficients in conditions of width-to-depth ratio greater than 50 were oversampled using the SMOTE. The reliability of the oversampled data was evaluated using the ROC (Receiver Operating Characteristic) curve. The empirical formula of transverse dispersion coefficient was developed including the oversampled data, and the performance of the results were compared with the empirical formulas suggested in previous studies using R2. From the comparison results, the value of R2 was 0.81 for the range of W/H < 50 and 0.92 for 50 < W/H, which were improved accuracy compared to the previous studies.
KW - Empirical formula
KW - Imbalanced data
KW - SMOTE
KW - Transverse dispersion coefficient
UR - https://www.scopus.com/pages/publications/85159379301
U2 - 10.3741/JKWRA.2021.54.12.1305
DO - 10.3741/JKWRA.2021.54.12.1305
M3 - Article
AN - SCOPUS:85159379301
SN - 2799-8746
VL - 54
SP - 1305
EP - 1316
JO - Journal of Korea Water Resources Association
JF - Journal of Korea Water Resources Association
IS - 12
ER -