PUKYONG

Application of Deep Reinforcement Learning for Measuring the Efficiency of Autonomous Vehicles under a Mixed-Traffic Condition in Non-Signalized Intersections

Metadata Downloads
Abstract
The objective of this dissertation is to develop deep reinforcement learning for multiple autonomous vehicles under mixed traffic conditions in non-signalized junctions. To achieve the objective, firstly, the responsibility-sensitive safety-based partially observable Markov decision process model can be proposed to enhance the efficiency of one autonomous vehicle at a non-signalized intersection according to traffic safety guarantee, delay time, and smooth driving. Secondly, a deep reinforcement learning method evaluated the efficiency of multiple autonomous vehicles at a non-signalized intersection. This proposed method integrated reinforcement learning with multilayer perceptron algorithms. Additionally, a set of proximal policy optimization hyperparameters improved the performance of deep reinforcement learning training. Lastly, an advanced deep reinforcement learning was conducted to analyze the efficiency of multiple autonomous vehicles in the urban network with nine non-signalized intersections and propose a set of proximal policy optimization hyperparameters for enhancing deep reinforcement learning training's performance. The experimental results showed that an advanced deep reinforcement learning was getting a promising approach for optimizing self-driving behaviors under a mixed traffic condition regarding average reward, average speed, delay time, fuel consumptions, and emissions. A significant improvement in traffic perturbations was demonstrated as a higher the market penetration rate of autonomous vehicles.
Author(s)
TRAN QUANG DUY
Issued Date
2021
Awarded Date
2021. 8
Type
Dissertation
Keyword
Autonomous Driving Self-driving vehicles Deep reinforcement learning partially observable Markov decision process Responsibility sensitive safety
Publisher
부경대학교
URI
https://repository.pknu.ac.kr:8443/handle/2021.oak/1060
http://pknu.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=200000504253
Affiliation
부경대학교 대학원
Department
대학원 지구환경시스템과학부공간정보시스템공학전공
Advisor
SangHoonBae
Table Of Contents
CHAPTER 1 1
INTRODUCTION 1
1.1 Backgrounds 1
1.2 Goal and Objectives 3
1.3 Research Flow 5
CHAPTER 2 9
LITERATURE REVIEW 9
2.1 Safe Driving Issues 9
2.2 Autonomous Vehicles and Controllers 11
2.3 Autonomous Driving Platform 24
2.4 Deep Reinforcement Learning for Autonomous Driving 37
2.5 Research Implication 45
CHAPTER 3 48
RESEARCH METHODOLOGY 48
3.1 Adaptive Model Predictive Control system 48
3.2 Car-Following Models 54
3.3 Responsibility-Sensitive Safety (RSS) Paradigm 56
3.4 Partially Observable Markov Decision Process (POMDP) 59
3.5 Deep Reinforcement Learning (DRL) 60
CHAPTER 4 64
IMPROVEMENT OF THE EFFICIENCY OF ONE AUTONOMOUS VEHICLE AT A NON-SIGNALIZED INTERSECTION 64
4.1 Overview 64
4.2 Proposed RSS-Based POMDP Method 66
4.2.1 Belief state 67
4.2.2 Action 68
4.2.3 Observation 68
4.2.4 Reward function and optimal action 69
4.3 Efficiency evaluation 72
4.3.1 Simulation Scenarios 72
4.3.2 Delay Time Performance 77
4.3.3 Smooth Driving Performance 78
4.3.4 Simulation Results 79
4.4 Summary 94
CHAPTER 5 96
IMPROVEMENT OF THE EFFICIENCY OF MULTIPLE AUTONOMOUS VEHICLES AT A NON-SIGNALIZED INTERSECTION 96
5.1 Overview 96
5.2 Policy optimization 98
5.3 Proposed method's Architecture 102
5.3.1 Initialized Simulation 105
5.3.2 Observation 105
5.3.3 State 106
5.3.4 Action 107
5.3.5 Reward Function 108
5.3.6 Termination 109
5.3.7 Controller 109
5.4 Efficiency evaluation 109
5.4.1 Hyperparameter Setting and Evaluation Metrics 109
5.4.2 Experimental Scenarios 112
5.4.3 Experimental Results and Analysis 115
5.5 Summary 124
CHAPTER 6 126
IMPROVEMENT OF THE EFFICIENCY OF MULTIPLE AUTONOMOUS VEHICLES IN AN URBANNETWORK WITH MULTIPLE NON-SIGNALIZED INTERSECTIONS 126
6.1 Overview 126
6.2 Policy Optimization 127
6.3 Proposed method's Architecture 131
6.4 Efficiency evaluation 139
6.4.1 Hyperparameter Setting and Evaluation Metrics 139
6.4.2 Experimental Scenarios 142
6.4.3 Experimental Evaluation 149
6.5 Summary 160
CHAPTER 7 162
CONCLUSIONS AND FURTHER STUDIES 162
7.1 Conclusions 162
7.2 Discussions 166
7.3 Further Studies 168
REFERENCES 171
ACKNOWLEDGMENTS 193
Degree
Doctor
Appears in Collections:
대학원 > 지구환경시스템과학부-공간정보시스템공학전공
Authorize & License
  • Authorize공개
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.