Unsupervised and Semi-supervised Learning Methods based on Deep Clustering and Explainable Active Learning
- Alternative Title
- 딥 클러스터링 및 설명 가능한 능동 학습을 기반으로 한 비지도 및 반지도 학습 방법
- Abstract
- In the recent years, the progress of deep learning in terms of performance in various areas has enabled the extensive use of big data. More and more intelligent systems, either in industrial or medical areas, are based on the information extracted by machine learning algorithms from thousands of data. As deep learning based methods require an excessively large amount of data in order to be efficient, we can expect that the number of data needed in order to build intelligent systems will only continue to increase exponentially. This situation poses a challenge in terms of data labeling. In fact, most of the deep learning applications, especially in the medical area, are based on the supervised learning. Even though the supervised learning paradigm reaches tremendous results in terms of discrimination, the necessity of having thousands of labeled images represents a serious drawback for this kind of learning system.
The aim of the present study is to explore different methods, from deep clustering to semi-supervised learning techniques, that can completely eliminate or substantially diminish the need of data labeling process while maintaining a high performance in terms of discrimination. Our study focuses on the classification of the different types of the Human Epithelial type-2 (HEp-2) cells. We propose two models that are based on different scenarios.
The first model is based on an unsupervised learning paradigm (scheme). We can say that the method is fully unsupervised in the sense that there is no need of labeled data in order to perform an end-to-end training of the model. On that purpose, we use deep clustering, a technique that combines conventional clustering methods with deep learning structures. A clustering layer is incorporated in the middle of a deep convolutional autoencoder (DCAE) in order to perform clustering on the latent space's features at every iteration of the training process. This technique allows the DCAE to produce latent features that are easily discriminable. Furtherly, unlike in the actual state-of-the-art deep clustering methods, where the reconstruction of the DCAE is not maximized, we propose a novel architecture for deep clustering, named attention-based deep clustering, where the DACE's reconstruction accuracy is efficiently maximized. Finally, for this first model, we present a systematic analysis of the results in order to explore the influence of the reconstruction accuracy on the latent features.
The second model is based on a semi-supervised learning scheme. We adopt the techniques of active learning in order to alleviate the data labeling process. Active learning consists of applying some methods in order to select, among the thousands of available data, only those that really need to be labeled. It consists of selecting a limited number of informative data that can help to maximize the overall model's accuracy without the exigency of labeling the totality of the available data. In this work, we redefine active learning by proposing a data selection process based on explainable artificial intelligence (XAI). Relevance maps produced by some pre-selected XAI methods are used in order to find the informative data that need to be labeled. This proposed active learning scheme is performance-agnostic, since the data selection process does not depend on the accuracy of the model like in the conventional active learning methods. Secondly, by using XAI, we ensure the explainability of the selection process, which can contribute to the overall understating of the model. The two proposed models, the deep clustering and the explainable active learning, are extensively tested on the existing HEp-2 cell public datasets.
The key contributions of the present work can be summarized as follows. For the deep clustering model, (1) we propose different techniques (pooling indices storage, copy and concatenation and attention network) in order to boost the reconstruction of the DCAE. Our method outperforms the state-of-the-art deep clustering method in terms of reconstruction accuracy. (2) We investigate if the reconstruction quality can affect the clustering accuracy. (3) We demonstrate that a better reconstruction of the DCAE increases the quality of the learned latent features. Our best model (attention network) outperforms the conventional handcrafted features and reaches similar level (97.56% on one of the datasets) with the supervised learning methods in terms of discrimination of the HEp-2 cells.
For the proposed explainable active learning, (1) we propose a new definition of the selection method. Instead of using the model's output, as normally done in the conventional methods, we propose the use of the relevance maps. (2) Following the obtained results during the experiments, this selection method provides better results and, more importantly, has the advantage of the explainability. This is very important in the context of interpretability issues posed by the deep learning models. (3) We demonstrate that, using our proposed active learning method, an XAI-based selection contributes to boost the performance of the model even with a quite limited number of labeled data. With only 20% of the training data, the discrimination results achieve 92.77% of accuracy in one of the large HEp-2 cell dataset, while random and conventional selections give, respectively, 82% and 90%.
- Author(s)
- VUNUNU CALEB BRUCE NGANDU
- Issued Date
- 2021
- Awarded Date
- 2021. 8
- Type
- Dissertation
- Keyword
- Machine Learning Deep Learning Active Learning Explainable Artificial Intelligence HEp-2 cells
- Publisher
- 부경대학교
- URI
- https://repository.pknu.ac.kr:8443/handle/2021.oak/1149
http://pknu.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=200000508729
- Affiliation
- 부경대학교 대학원
- Department
- 대학원 IT융합응용공학과
- Advisor
- 권기룡
- Table Of Contents
- 1. INTRODUCTION 1
1.1 Research motivation 1
1.2 Thesis plan 4
2. LITERATURE REVIEW 6
2.1 Unsupervised learning: General view 6
2.2 Deep clustering 10
2.3 Explainable artificial intelligence 12
2.3.1 Overview 12
2.3.2 Relevance maps 14
2.3.3 Evaluation of relevance maps 17
2.4 Active learning 19
2.5 Unsupervised learning in medical images analysis 23
2.6 HEp-2 cell classification in literature 24
3. PROPOSED METHODS 28
3.1 Proposed deep clustering method 28
3.1.1 Method overview 28
3.1.2 Deep embedded clustering network 29
3.1.3 Deep embedded clustering with dual autoencoder 38
3.1.4 Attention-based deep clustering 41
3.2 Proposed active learning method 45
3.2.1 Method overview 45
3.2.2 Deep parallel residual networks 45
3.2.3 Active learning using the pre-trained parallel residual networks 54
3.2.4 Explainable active learning 57
4. EXPERIMENTAL RESULTS AND DISCUSSION 64
4.1 Deep clustering results 64
4.1.1 Datasets and experimental setups 64
4.1.2 Results for Case-1 68
4.1.3 Results for Case-2 70
4.1.4 Results for Case-3 72
4.2 Active learning results 82
4.2.1 Datasets and initial setups 82
4.2.2 Results of the "RS" case 88
4.2.3 Results of the "AL" case 91
4.2.4 Results of the "IN-RS" case 93
4.2.5 Results of the "IN-AL" case 95
4.2.6 Results of the explainable active learning 102
5. CONCLUSIONS 119
6. REFERENCES 122
7. ACKNOWLEDGEMENT 137
- Degree
- Doctor
-
Appears in Collections:
- 대학원 > IT융합응용공학과
- Authorize & License
-
- Files in This Item:
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.