PUKYONG

Computational Studies of Learning, Inference, and Assessment

Metadata Downloads
Alternative Title
학습, 추론, 평가의 계산적 연구: 사회적 상호작용에서의 불확실성 이해
Abstract
This thesis addresses two topics in evolutionary game theory. The first topic discusses how a player can infer other players’ strategies in the iterated prisoner’s dilemma game. We construct two methods for strategy inference. In the first method, we consider players with memory-one stochastic strategies in the iterated prisoner’s dilemma, with an assumption that they cannot directly access each other’s strategy but only observe the actual moves for a certain number of rounds. Based on the observation, the observer has to infer the resident strategy in a Bayesian way and chooses his or her own strategy accordingly. By examining the best-response relations, we argue that players can escape from full defection into a cooperative equilibrium supported by Win-Stay-Lose-Shift in a self-confirming manner, provided that the cost of cooperation is
low and the observational learning supplies sufficiently large uncertainty. In the second method, the focal player infers the co-player’s strategy by applying the maximum likelihood estimation (MLE) to the observed sequence of actions. Our first finding is that the focal player’s inference is accurate when both players take only their last actions into consideration if the observed sequence is sufficiently long. To see the case in which inference must be inaccurate, we also set the focal player’s memory length to be shorter than the co-player’s. In this case, we choose a combination of Tit-for-tat and Anti-tit-for-tat (TA) as the co-player’s long-memory strategy. TA satisfies the following three conditions: 1) mutual cooperation is achieved when all player use the strategy, 2) the strategy exploits unconditional cooperation, and 3) a player using this strategy is not exploited repeatedly by any co-player. The short-memory player inaccurately infers TA as either Tit-for-tat, Win-Stay-Lose-Shift, or Grim trigger, depending on his or her own strategy. This work presents how a long-memory strategy is projected onto a short-memory one by inference with information loss. In addition, we suggest that each of those three well-known strategies could be a facet of a single successful strategy with a higher cognitive capacity. The second topic discusses the characteristics of cooperative stationarily stable strategies (CESS)
with ternary reputations. In this topic, we introduce the third reputation, ‘Neutral’ (N), to the existing ‘Good’ (G) and ‘Bad’ (B) reputations and find CESSs. We categorize the CESSs into three types according to the proportions of the three reputations in equilibrium. We found that the proportion of N is finite in one of the three types. This result suggests that cooperation can be maintained with only ‘G’ but also in the presence of multiple reputations (‘G’ and ‘N’).
Author(s)
김민재
Issued Date
2023
Awarded Date
2023-08
Type
Dissertation
Publisher
부경대학교
URI
https://repository.pknu.ac.kr:8443/handle/2021.oak/33244
http://pknu.dcollection.net/common/orgView/200000697504
Alternative Author(s)
Minjae Kim
Affiliation
부경대학교 대학원
Department
대학원 물리학과
Advisor
백승기
Table Of Contents
I. Introduction 1
II. Evolutionary game theory 5
1. Two-player game and payoff 6
2. Prisoner’s dilemma game 6
3. Iterated prisoner’s dilemma game in a population 7
4. Evolutionary Stable Strategy 7
5. Imitation dynamics 8
6. Indirect reciprocity 9
III. Win-Stay-Lose-Shift as a self-confirming equilibrium in the iterated Prisoner’s Dilemma 11
1. Introduction 11
2. Method and Result 13
2.1. Best-response relations without observational uncertainty 13
2.2. Observational learning 18
3. Summary and Discussion 26
IV. Strategy inference using the maximum likelihood estimation in the iterated prisoner’s dilemma game 30
1. INTRODUCTION 30
2. Methods 31
3. Results and discussion 37
3.1. m1 vs m1 37
3.2. m1 vs TFT-ATFT (m2) 39
4. Summary and discussion 41
V. Social norms in indirect reciprocity with ternary reputations 43
1. INTRODUCTION 43
2. Model 46
2.1. Description 46
2.2. Calculation 48
3. Methods 50
3.1. Calculation of the stationary-state population 50
3.2. Calculation of the cooperation level and the payoffs 52
3.3. Enumeration of norms 54
4. Results 54
4.1. Leading eight in the binary-reputation model 54
4.2. CESS’s in the ternary-reputation model 57
4.3. Details of each type 60
4.4. Dynamics of C3 norms 62
5. Summary and Discussion 64
VI. Conclusions 68
References 74
Degree
Doctor
Appears in Collections:
대학원 > 물리학과
Authorize & License
  • Authorize공개
  • Embargo2023-08-07
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.