PUKYONG

GUIDE OAK 국가리포지터리 부경대학교

검색

PUKYONG Repository 대학원 지구환경시스템과학부-공간정보시스템공학전공

Application of Deep Reinforcement Learning for Measuring the Efficiency of Autonomous Vehicles under a Mixed-Traffic Condition in Non-Signalized Intersections

Metadata Downloads

Abstract: The objective of this dissertation is to develop deep reinforcement learning for multiple autonomous vehicles under mixed traffic conditions in non-signalized junctions. To achieve the objective, firstly, the responsibility-sensitive safety-based partially observable Markov decision process model can be proposed to enhance the efficiency of one autonomous vehicle at a non-signalized intersection according to traffic safety guarantee, delay time, and smooth driving. Secondly, a deep reinforcement learning method evaluated the efficiency of multiple autonomous vehicles at a non-signalized intersection. This proposed method integrated reinforcement learning with multilayer perceptron algorithms. Additionally, a set of proximal policy optimization hyperparameters improved the performance of deep reinforcement learning training. Lastly, an advanced deep reinforcement learning was conducted to analyze the efficiency of multiple autonomous vehicles in the urban network with nine non-signalized intersections and propose a set of proximal policy optimization hyperparameters for enhancing deep reinforcement learning training's performance. The experimental results showed that an advanced deep reinforcement learning was getting a promising approach for optimizing self-driving behaviors under a mixed traffic condition regarding average reward, average speed, delay time, fuel consumptions, and emissions. A significant improvement in traffic perturbations was demonstrated as a higher the market penetration rate of autonomous vehicles.

Author(s): TRAN QUANG DUY

Issued Date: 2021

Awarded Date: 2021. 8

Type: Dissertation

Keyword: Autonomous Driving Self-driving vehicles Deep reinforcement learning partially observable Markov decision process Responsibility sensitive safety

Publisher: 부경대학교

URI: https://repository.pknu.ac.kr:8443/handle/2021.oak/1060
http://pknu.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=200000504253

Affiliation: 부경대학교 대학원

Department: 대학원 지구환경시스템과학부공간정보시스템공학전공

Advisor: SangHoonBae

Table Of Contents: CHAPTER 1 1
INTRODUCTION 1
1.1 Backgrounds 1
1.2 Goal and Objectives 3
1.3 Research Flow 5
CHAPTER 2 9
LITERATURE REVIEW 9
2.1 Safe Driving Issues 9
2.2 Autonomous Vehicles and Controllers 11
2.3 Autonomous Driving Platform 24
2.4 Deep Reinforcement Learning for Autonomous Driving 37
2.5 Research Implication 45
CHAPTER 3 48
RESEARCH METHODOLOGY 48
3.1 Adaptive Model Predictive Control system 48
3.2 Car-Following Models 54
3.3 Responsibility-Sensitive Safety (RSS) Paradigm 56
3.4 Partially Observable Markov Decision Process (POMDP) 59
3.5 Deep Reinforcement Learning (DRL) 60
CHAPTER 4 64
IMPROVEMENT OF THE EFFICIENCY OF ONE AUTONOMOUS VEHICLE AT A NON-SIGNALIZED INTERSECTION 64
4.1 Overview 64
4.2 Proposed RSS-Based POMDP Method 66
4.2.1 Belief state 67
4.2.2 Action 68
4.2.3 Observation 68
4.2.4 Reward function and optimal action 69
4.3 Efficiency evaluation 72
4.3.1 Simulation Scenarios 72
4.3.2 Delay Time Performance 77
4.3.3 Smooth Driving Performance 78
4.3.4 Simulation Results 79
4.4 Summary 94
CHAPTER 5 96
IMPROVEMENT OF THE EFFICIENCY OF MULTIPLE AUTONOMOUS VEHICLES AT A NON-SIGNALIZED INTERSECTION 96
5.1 Overview 96
5.2 Policy optimization 98
5.3 Proposed method's Architecture 102
5.3.1 Initialized Simulation 105
5.3.2 Observation 105
5.3.3 State 106
5.3.4 Action 107
5.3.5 Reward Function 108
5.3.6 Termination 109
5.3.7 Controller 109
5.4 Efficiency evaluation 109
5.4.1 Hyperparameter Setting and Evaluation Metrics 109
5.4.2 Experimental Scenarios 112
5.4.3 Experimental Results and Analysis 115
5.5 Summary 124
CHAPTER 6 126
IMPROVEMENT OF THE EFFICIENCY OF MULTIPLE AUTONOMOUS VEHICLES IN AN URBANNETWORK WITH MULTIPLE NON-SIGNALIZED INTERSECTIONS 126
6.1 Overview 126
6.2 Policy Optimization 127
6.3 Proposed method's Architecture 131
6.4 Efficiency evaluation 139
6.4.1 Hyperparameter Setting and Evaluation Metrics 139
6.4.2 Experimental Scenarios 142
6.4.3 Experimental Evaluation 149
6.5 Summary 160
CHAPTER 7 162
CONCLUSIONS AND FURTHER STUDIES 162
7.1 Conclusions 162
7.2 Discussions 166
7.3 Further Studies 168
REFERENCES 171
ACKNOWLEDGMENTS 193

Degree: Doctor

Appears in Collections:: 대학원 > 지구환경시스템과학부-공간정보시스템공학전공

Show Simple Item RecordShow Full Item Record

Authorize & License

Authorize공개

qrcode

트윗하기