PUKYONG

강화학습을 위한 3차원 물리 기반 시뮬레이터 설계 및 구현

Metadata Downloads
Alternative Title
Design and Implementation of 3D Physics-Based Simulator for Reinforcement Learning
Abstract
In recent years, the development of deep artificial neural network technology has made it possible to deal with high-dimensional information without domain knowledge. As a result, reinforcement learning has been actively researched in the field of solving problems by interacting with three-dimensional environment such as autonomous vehicle driving and drone. Recent reinforcement learning studies have been proving the validity of the results through computer simulators such as Bellemare's ALE and OpenAI's Gym. Learning and experimenting with a robotic agent in the real world is due to financial losses such as damage, accidents, and data collection costs. In the physical simulator, however, floating point or numerical integration errors can cause errors in the physical engine and bugs in various programs. The problem with these simulators is the agent can learn through physically impossible movements in reality, which means that users need to visually check and monitor them because wrong results are learned. It is also necessary to restore an agent if it learns from the wrong information. However, the tools for implementing the existing reinforcement learning environment do not provide functions related to computer graphics and do not support the restoration function.
Therefore, in this paper, we provide an implementation method to visualize the screen with computer graphics, which can be reinforcement learning, and a 3D physics based simulator architecture to restore experience data and policy weights. By providing the above architecture, it is possible to reduce the cost of simulator implementation time and expect stable reinforcement learning through the restoration function.
Author(s)
박건국
Issued Date
2019
Awarded Date
2019. 2
Type
Dissertation
Keyword
인공지능 시뮬레이터 데이터관리
Publisher
부경대학교
URI
https://repository.pknu.ac.kr:8443/handle/2021.oak/23206
http://pknu.dcollection.net/common/orgView/200000185958
Affiliation
부경대학교 대학원
Department
대학원 IT융합응용공학과
Advisor
김영봉
Table Of Contents
1. 서론 1
1.1. 연구 배경 1
1.2. 연구 내용 4
1.3. 논문 구성 5
2. 관련 연구 6
2.1. 강화학습 6
2.1.1. Markov Decision Process(MDP) 7
2.1.2. Q-Learning 8
2.2. 기존 강화 학습 시뮬레이터 11
2.2.1. ALE 11
2.2.2. OpenAI Gym 12
3. 제안한 시뮬레이터 아키텍처 13
3.1. Environment Manager 13
3.1.1. Policy Script와의 통신을 위한 인터페이스 14
3.1.2. Physics Manager와의 통신을 위한 인터페이스 16
3.1.3. 경험 데이터 관리 구조 18
3.1.4. 경험 데이터 저장 19
3.1.5. 경험 데이터 복구 20
3.2. Resource Manager 22
3.3. Physics Manager 23
3.3.1. Object 23
3.3.2. Sensor 24
3.3.3. Physics Manager 처리 단계 25
3.4. Rendering Manager 26
3.4.1. Generate Model Matrix 26
3.4.2. Render Screen 27
4. 시뮬레이터 구현 29
4.1. 구현 대상 29
4.1.1. 시뮬레이터 진행 과정 30
4.1.2. 강화학습 구성별 정의 31
4.2. 구성별 구현과 결과 33
4.3. 구현 결과 37
4.3.1. 결과 이미지 37
4.3.2. 타 플랫폼과의 기능 비교 39
5. 결론 및 향후 연구 40
5.1. 결론 40
5.2. 향후 연구 41
Degree
Master
Appears in Collections:
대학원 > IT융합응용공학과
Authorize & License
  • Authorize공개
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.