TY - JOUR
T1 - Evaluation of beef quality using machine learning based on the CIELAB color space
AU - Kim, Somin
AU - Kim, Woo Ju
AU - Doh, Hansol
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2026/2
Y1 - 2026/2
N2 - This study presents a Python-based, machine learning approach to predict spoilage indicators of beef using non-destructive color measurements from the CIELAB color space. A two-stage modeling strategy was employed, where spoilage indicators predicted from a∗ values in the first-stage were used as input features in a second-stage model to estimate spoilage rate. First, a strong correlation between the a∗ value and spoilage indicators, validated through Pearson correlation analysis, highlights the feasibility of color data for quality prediction. Among the regression models tested, the random forest (RF) regression model demonstrated superior performance, achieving R2 values of 0.912 and 0.804 for metmyoglobin (Met.Mb) and peroxide value (PV), respectively. Gradient boosting (GB) regression was most effective for pH prediction, while the k-nearest neighbors (KNN) regression model excelled in thiobarbituric acid reactive substances (TBARS) estimation. Then, the RF regression model was further optimized to predict spoilage rates using Met.Mb, pH, PV, and TBARS as input features, achieving an R2 of 0.988 and RMSE of 2.120. SHapley Additive exPlanations (SHAP) analysis identified TBARS as the most influential variable, followed by Met.Mb and pH. These findings demonstrate the potential of combining machine learning models with non-destructive methods for real-time beef quality assessment, offering a practical and resource-efficient alternative to traditional analytical approaches.
AB - This study presents a Python-based, machine learning approach to predict spoilage indicators of beef using non-destructive color measurements from the CIELAB color space. A two-stage modeling strategy was employed, where spoilage indicators predicted from a∗ values in the first-stage were used as input features in a second-stage model to estimate spoilage rate. First, a strong correlation between the a∗ value and spoilage indicators, validated through Pearson correlation analysis, highlights the feasibility of color data for quality prediction. Among the regression models tested, the random forest (RF) regression model demonstrated superior performance, achieving R2 values of 0.912 and 0.804 for metmyoglobin (Met.Mb) and peroxide value (PV), respectively. Gradient boosting (GB) regression was most effective for pH prediction, while the k-nearest neighbors (KNN) regression model excelled in thiobarbituric acid reactive substances (TBARS) estimation. Then, the RF regression model was further optimized to predict spoilage rates using Met.Mb, pH, PV, and TBARS as input features, achieving an R2 of 0.988 and RMSE of 2.120. SHapley Additive exPlanations (SHAP) analysis identified TBARS as the most influential variable, followed by Met.Mb and pH. These findings demonstrate the potential of combining machine learning models with non-destructive methods for real-time beef quality assessment, offering a practical and resource-efficient alternative to traditional analytical approaches.
KW - Beef
KW - CIELAB color space
KW - Food quality
KW - Machine learning
KW - Prediction
KW - Python
UR - https://www.scopus.com/pages/publications/105013201273
U2 - 10.1016/j.foodcont.2025.111642
DO - 10.1016/j.foodcont.2025.111642
M3 - Article
AN - SCOPUS:105013201273
SN - 0956-7135
VL - 180
JO - Food Control
JF - Food Control
M1 - 111642
ER -