Deep Ensemble Methods for Food Ingredient Entity Recognition in Natural Language Processing
- Alternative Title
- 자연어 처리에서 식품 성분 용어 인식을 위한 심층 앙상블 방법
- Abstract
- In recent years, recipe-sharing websites are becoming popular among those who wish to learn how to cook or plan their menu. Individuals can choose ingredients that suit their lifestyle and health condition using online food recipes. The information from online recipes can be used to build various food, nutrition, and healthcare applications. However, the information collected from online food recipes lacks structured information. To extract such information into well-structured data, we can use a technique in natural language processing called Named Entity Recognition or NER. NER is a technique of recognizing key information or entities in a text and categorizing them into a predetermined category. However, three major issues arise when developing named-entity recognition in the food domain: (1) The availability of datasets for the food domain is still quite limited; (2) How to design a machine learning model that is effective and efficient in recognizing food entities; and (3) Existing NER models relied solely on a single model, and just a few studies employ ensemble learning for NER, particularly none for the food domain.
This study aims to solve these problems via an ensemble learning technique, combining various learning algorithms to obtain a collective performance beyond existing models' performance. Drawing upon the ensemble technique, we propose a solution to the challenges mentioned above in two stages: first, we built an iterative self-training approach called SMPT (Semi-supervised Multi-model Prediction Technique). SMPT is a deep ensemble learning model that employs the concept of self-training and builds on multiple pre-trained language models in the iterative data labeling process, with a voting mechanism used as the final decision to determine the entity's label. Utilizing the SMPT, we have created a new annotated dataset of ingredient entities named the FINER dataset; and second, we proposed a food ingredient NER model called the Recurrent Network-based Ensemble model or RNE. RNE is a novel model for extracting food-related entities by incorporating deep ensemble learning with recurrent network models, including RNN, GRU, and LSTM. The experimental findings demonstrate that the proposed RNE model could extract information from food recipes more effectively than a single model. In future development, such information can support numerous food-related information systems.
- Author(s)
- KOKOYSITIKOMARIAH
- Issued Date
- 2023
- Awarded Date
- 2023-02
- Type
- Dissertation
- Keyword
- natural language processing, information extration, named entity recognition, food entity recognition, deep ensemble method
- Publisher
- 부경대학교
- URI
- https://repository.pknu.ac.kr:8443/handle/2021.oak/32912
http://pknu.dcollection.net/common/orgView/200000663736
- Affiliation
- Pukyong National University, Graduate School
- Department
- 대학원 인공지능융합학과
- Advisor
- SinBongKee
- Table Of Contents
- Chapter I Introduction 1
1.1 Background 1
1.2 Motivations 3
1.3 Thesis Contributions 5
1.4 Outline of the Thesis 6
Chapter II Literature Reviews 8
2.1 Natural Language Processing 8
2.2 Named Entity Recognition 10
2.3 Ensemble Deep Learning 15
2.4 Transfer Learning 17
2.5 Self Training 21
2.6 Deep Learning Approaches for NER 23
2.6.1 Transformer Model 23
2.6.2 Transformer-based NER 27
2.6.2.1 SpaCy NER 27
2.6.2.2 BERT 28
2.6.2.3 DistilBERT 29
2.6.3 Recurrent Neural Networks (RNNs) based NER 31
2.6.3.1 Recurrent Neural Network (RNN) 31
2.6.3.2 Long-Short Term Memory (LSTM) 32
2.6.3.3 Gated Recurrent Unit (GRU) 34
Chapter III Food Ingredient Named-Entity Data Construction using Semi-Supervised Multi-Model Prediction Technique 36
3.1 Background and Related Works 36
3.2 Data Construction Workflow 38
3.3 Data Preparation 40
3.4 Named Entity Labeling 45
3.5 Semi-Supervised Multi-Model Prediction Technique (SMPT) 46
3.5.1 Training 48
3.5.2 Dataset Building Schemes 48
Chapter IV Ensemble-based Recurrent Networks for Food Ingredient Named Entity Recognition 51
4.1 Background and Related Works 51
4.1.1 Food-Related NER 51
4.1.2 Ensemble Method for NER 52
4.1.3 Recurrent Model for NER 53
4.2 Dataset 54
4.3 Hyperparameter Optimization 55
4.4 Recurrent Network-based Ensemble (RNE) 56
Chapter V Results and Analysis 59
5.1 Experimental Setup 59
5.2 Evaluation Metrics 59
5.3 Analysis and Evaluation for SMPT method 60
5.3.1 Test Results with Training Schemes 61
5.3.2 Evaluation on ML Models 66
5.4 Analysis and Evaluation for RNE model 70
Chapter VI Conclusions and Future Work 78
6.1 Conclusions 78
6.2 Future Work 79
References 81
- Degree
- Doctor
-
Appears in Collections:
- 대학원 > 인공지능융합학과
- Authorize & License
-
- Authorize공개
- Embargo2023-02-17
- Files in This Item:
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.