PUKYONG

GUIDE OAK 국가리포지터리 부경대학교

검색

PUKYONG Repository 대학원 IT융합응용공학과

강화학습을 위한 3차원 물리 기반 시뮬레이터 설계 및 구현

Metadata Downloads

Alternative Title: Design and Implementation of 3D Physics-Based Simulator for Reinforcement Learning

Abstract: In recent years, the development of deep artificial neural network technology has made it possible to deal with high-dimensional information without domain knowledge. As a result, reinforcement learning has been actively researched in the field of solving problems by interacting with three-dimensional environment such as autonomous vehicle driving and drone. Recent reinforcement learning studies have been proving the validity of the results through computer simulators such as Bellemare's ALE and OpenAI's Gym. Learning and experimenting with a robotic agent in the real world is due to financial losses such as damage, accidents, and data collection costs. In the physical simulator, however, floating point or numerical integration errors can cause errors in the physical engine and bugs in various programs. The problem with these simulators is the agent can learn through physically impossible movements in reality, which means that users need to visually check and monitor them because wrong results are learned. It is also necessary to restore an agent if it learns from the wrong information. However, the tools for implementing the existing reinforcement learning environment do not provide functions related to computer graphics and do not support the restoration function.
Therefore, in this paper, we provide an implementation method to visualize the screen with computer graphics, which can be reinforcement learning, and a 3D physics based simulator architecture to restore experience data and policy weights. By providing the above architecture, it is possible to reduce the cost of simulator implementation time and expect stable reinforcement learning through the restoration function.

Author(s): 박건국

Issued Date: 2019

Awarded Date: 2019. 2

Type: Dissertation

Keyword: 인공지능 시뮬레이터 데이터관리

Publisher: 부경대학교

URI: https://repository.pknu.ac.kr:8443/handle/2021.oak/23206
http://pknu.dcollection.net/common/orgView/200000185958

Affiliation: 부경대학교 대학원

Department: 대학원 IT융합응용공학과

Advisor: 김영봉

Table Of Contents: 1. 서론 1
1.1. 연구 배경 1
1.2. 연구 내용 4
1.3. 논문 구성 5
2. 관련 연구 6
2.1. 강화학습 6
2.1.1. Markov Decision Process(MDP) 7
2.1.2. Q-Learning 8
2.2. 기존 강화 학습 시뮬레이터 11
2.2.1. ALE 11
2.2.2. OpenAI Gym 12
3. 제안한 시뮬레이터 아키텍처 13
3.1. Environment Manager 13
3.1.1. Policy Script와의 통신을 위한 인터페이스 14
3.1.2. Physics Manager와의 통신을 위한 인터페이스 16
3.1.3. 경험 데이터 관리 구조 18
3.1.4. 경험 데이터 저장 19
3.1.5. 경험 데이터 복구 20
3.2. Resource Manager 22
3.3. Physics Manager 23
3.3.1. Object 23
3.3.2. Sensor 24
3.3.3. Physics Manager 처리 단계 25
3.4. Rendering Manager 26
3.4.1. Generate Model Matrix 26
3.4.2. Render Screen 27
4. 시뮬레이터 구현 29
4.1. 구현 대상 29
4.1.1. 시뮬레이터 진행 과정 30
4.1.2. 강화학습 구성별 정의 31
4.2. 구성별 구현과 결과 33
4.3. 구현 결과 37
4.3.1. 결과 이미지 37
4.3.2. 타 플랫폼과의 기능 비교 39
5. 결론 및 향후 연구 40
5.1. 결론 40
5.2. 향후 연구 41

Degree: Master

Appears in Collections:: 대학원 > IT융합응용공학과

Show Simple Item RecordShow Full Item Record

Authorize & License

Authorize공개

qrcode

트윗하기