Automatic Food Label Detection in Images Using Convolutional Neural Network with Food-101 Dataset
DOI:
https://doi.org/10.56313/jictas.v4i1.432Keywords:
Convolutional Neural Network, Food Label Detection, Food Recognition, Food-101 Dataset, Image ClassificationAbstract
automatic detection of food labels from digital images has emerged as a crucial application in dietary analysis, nutrition monitoring, and smart culinary systems. This study presents the implementation of a Convolutional Neural Network (CNN) model for food label recognition using the Food-101 dataset, which consists of over 101,000 images from 101 distinct food categories. The proposed system follows a systematic pipeline that includes image resizing, normalization, and data augmentation to enhance model robustness and performance. The CNN architecture is designed with multiple convolutional and pooling layers, followed by dense and softmax output layers for final classification. The training was conducted using the Adam optimizer with a learning rate of 0.0001, batch size of 32, and dropout regularization to prevent overfitting. Experimental results demonstrate a classification accuracy of 24.45% after one training epoch, highlighting both the capability and limitations of the baseline CNN model. Despite moderate accuracy, the model successfully identifies visually distinguishable food items and sets a foundation for future improvements through transfer learning and fine-tuning. This research confirms the potential of CNN-based models for food label detection and provides insights for the development of more accurate food recognition systems in health, dietary, and culinary applications
References
C. Bossard, M. Guillaumin, and L. Van Gool, “Food-101 – Mining Discriminative Components with Random Forests,” in Proc. European Conf. on Computer Vision (ECCV), Zurich, Switzerland, 2014, pp. 446–461.
D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” in Proc. Int. Conf. on Learning Representations (ICLR), San Diego, CA, USA, 2015.
S. Ioffe and C. Szegedy, “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift,” in Proc. ICML, 2015, pp. 448–456.
K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” arXiv preprint arXiv:1409.1556, 2014.
A. Shorten and T. M. Khoshgoftaar, “A Survey on Image Data Augmentation for Deep Learning,” J. Big Data, vol. 6, no. 1, p. 60, 2019.
B. Yanto, E. Rouza, L. Fimawahib, B. H. Hayadi, and R. R. Pratama, “Penerapan Algoritma Deep Learning Convolutional Neural Network dalam Menentukan Kematangan Buah Jeruk Manis Berdasarkan Citra Red Green Blue (RGB),” J. Teknol. Inform. dan Ilmu Komputer, vol. 10, no. 1, pp. 59–66, 2023.
M. Tan and Q. Le, “EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks,” in Proc. ICML, 2019, pp. 6105–6114.
B. Yanto, A. Lubis, B. H. Hayadi, and N. S. T. E. Armita, “Klarifikasi Kematangan Buah Nanas dengan Ruang Warna Hue Saturation Intensity (HSI),” INOVTEK Polbeng Seri Informatika, vol. 6, no. 1, pp. 135–146, 2021.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” in Proc. CVPR, Las Vegas, NV, USA, 2016, pp. 779–788.
Y. Zhang, X. Wang, and H. Ma, “Deep Learning in Food Category Recognition,” Information Fusion, vol. 98, p. 101859, May 2023.
B. Yanto, M. A. Kartawidjaja, R. Sukwadi, and M. Bachtiar, “Implementation of Hue Saturation Intensity (HSI) Color Space Transformation Algorithm with Red, Green, Blue (RGB) Color Brightness in Assessing Tomato Fruit Maturity,” Riau J. Comput. Sci., vol. 9, no. 2, pp. 167–178, 2023.
G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger, “Densely Connected Convolutional Networks,” in Proc. CVPR, 2017, pp. 2261–2269.
H. Meyers et al., “Food Recognition Using Mobile Devices: A Review,” IEEE Access, vol. 9, pp. 21532–21549, 2021.
B. Yanto, B. Basorudin, J. Jufri, B. H. Hayadi, and N. S. T. E. Armita, “Indentifikasi Pola Aksara Arab Melayu Dengan Jaringan Syaraf Tiruan Convolutional Neural Network (CNN),” J. Scientific and Applied Informatics, vol. 3, no. 3, pp. 106–114, 2020.
Y. Liu, L. Zhang, and X. Wang, “Fine-Grained Food Recognition with Multi-Scale Attention,” IEEE Access, vol. 8, pp. 85804–85815, 2020.
W. Liu et al., “SSD: Single Shot Multibox Detector,” in Proc. ECCV, Amsterdam, Netherlands, 2016, pp. 21–37.
B. Yanto, L. Fimawahib, A. Supriyanto, B. H. Hayadi, and R. R. Pratama, “Klasifikasi Tekstur Kematangan Buah Jeruk Manis Berdasarkan Tingkat Kecerahan Warna dengan Metode Deep Learning Convolutional Neural Network,” J. Inovtek Polbeng Seri Informatika, vol. 6, no. 2, pp. 259–268, 2021.
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, Cambridge, MA, USA: MIT Press, 2016.
T. Fawcett, “An Introduction to ROC Analysis,” Pattern Recognit. Lett., vol. 27, no. 8, pp. 861–874, 2006.
R. E. Pawening et al., “Food Object Detection Using Mask R-CNN for Diet Recommendation,” J. Ilm. Tek. Inform. dan Komun., vol.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Journal of ICT Aplications and System

This work is licensed under a Creative Commons Attribution 4.0 International License.

