Coffee Quality Prediction: A Comparative Analysis of Machine Learning Techniques Using CQI Data in Sensory Score Estimation

Citation

Bau, Yoon Teck and Sianjaya, Raymond and Lee, Kian Chin (2025) Coffee Quality Prediction: A Comparative Analysis of Machine Learning Techniques Using CQI Data in Sensory Score Estimation. Journal of Logistics, Informatics and Service Science, 12 (4). pp. 91-110. ISSN 2409-2665

[img] Text
5.pdf - Published Version
Restricted to Repository staff only

Download (219kB)

Abstract

Coffee quality assessment plays a critical role in determining market value, directly influencing pricing and income for producers. However, traditional evaluation methods are often subjective and inconsistent. This study investigates the use of machine learning techniques to predict coffee quality scores represented as total cup points based on objective sensory and physical attributes such as flavor, acidity, aroma, aftertaste, balance, body, and overall impression. Using manually collected data from the Coffee Quality Institute (CQI), we conducted comprehensive preprocessing and exploratory data analysis to identify trends and relevant patterns. Feature importance analysis revealed that flavor was the most influential factor in predicting coffee quality, followed by category one defects and overall. Machine learning techniques including random forest, multiple linear regression (MLR), support vector machine (SVM), and decision tree were trained and evaluated using four performance metrics: mean absolute error (MAE), mean squared error (MSE), root mean squared error (RMSE), and R-squared (R²). Random forest achieved the best performance with MAE of 0.1598 ± 0.0712 and R² of 0.8242 ± 0.1907, followed by multiple linear regression (MAE: 0.1712 ± 0.0522, R²: 0.8149 ± 0.1829), support vector machine (MAE: 0.1623 ± 0.0776, R²: 0.7951 ± 0.2334), and decision tree (MAE: 0.2469 ± 0.0745, R²: 0.6944 ± 0.1719). These findings demonstrate the effectiveness of machine learning in producing reliable, data-driven assessments of coffee quality. The implementation of such models can support more consistent grading practices, reduce human bias, and enhance transparency across the coffee supply chain particularly beneficial in markets where specialty coffee commands premium prices.

Item Type: Article
Uncontrolled Keywords: Machine Learning Techniques
Subjects: Q Science > Q Science (General) > Q300-390 Cybernetics
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Rosnani Abd Wahab
Date Deposited: 27 Aug 2025 03:19
Last Modified: 03 Sep 2025 07:30
URII: http://shdl.mmu.edu.my/id/eprint/14421

Downloads

Downloads per month over past year

View ItemEdit (login required)