IEEE Computational Intelligence Magazine - November 2021 - 60
input x .,ci1
The lower the distance, the higher the value of I ,i
and the clearer the memory.
To elaborate the algorithm, first
di 112 f== is
/,
Li ,, ,
initialized (see line 1 in Algorithm 2). For each step in a learning
trial, the most suitable memotype is activated for instructing
agent decision making according to the meme activation process.
Once a learning trial succeeds, all the activated memotypes are
defined as the useful memes and the corresponding memory
intensity I per step is transferred into the memory intensity set
MIS (see line 11 in Algorithm 2). After certain learning trials,
the intensity values
iL,12f=
,
and the median values Ii
fied foriL1 f= ,, .
It is assumed that there is a causal relationship between the
values of memory intensities in all trials and an agent's learning
performance (i.e., probability to succeed). If an agent remembers
certain channels of state information more clearly than the
others, they are expected to have a higher probability to succeed.
For example, II
AB
Median
2
Median means that the agent is
more likely to succeed if they remember the information from
the Ath sensory channel more clearly. Recall that d controls
the intensity of attention for information from the different
sensory channels. If
dd the information from the Ath
AB ,2
sensory channel tends to have a larger impact on calculating
the activation values Tj
of the memotype j, causing more memotypes
to be generated for remembering the information from
the Ath sensory channel more clearly, which in the end
improves the agent's learning performance.
Intuitively, Ii
Median
represents the intensity of the successful
j
II /R 1
Median = Median
=
ii Ij
L
memory on the information from different sensory channels.
Based on the normalization of
Median for
iL the attention intensity id is then updated (see
line 16 in Algorithm 2).
= 12 f ,, , ,
dd )=- d+ () fitness
() Ii
new
i
()
di 12 f=
iL
oldold
Median ii
Note if the fitness (i.e., success rate) of an agent is poor, Ii
can be inaccurate, and ,, ,, is updated slowly.
()
(11)
Median
in MIS {, ,} are sorted for
Median of MISi are identii
III
iii
= 123
f
IV. Bidirectional Imitation in Meme Selection
In the past, many researchers have focused on answering the
question, " (how) can agents benefit from the useful knowledge
of other agents in order to achieve better individual and overall
system performance? " A significant amount of research has
been proposed to exploit domain expertise with varying
amounts of human-provided knowledge. Many knowledge
sharing strategies such as instance transfer [28], model transfer
[29], and advice exchanging [30] have been proposed, which
benefits a wide range of learning and decision-making tasks.
However, most of these strategies assume that a well-trained or
predefined domain expert is provided. Consequently, they are
not feasible in multi-agent learning scenarios where all agents
begin the learning process tabula rasa [31].
In multi-agent learning, each agent can be unique both memetically
and genetically. While any given agent may not be an expert
during learning, they may have useful local knowledge that all others
are not aware of due to their independent learning experiences
(in the form of memotypes). Similar to the human social learning
scheme, these learning agents would likely benefit from knowledge
shared by their partners, thereby improving the effectiveness of system-wide
learning. Current research is beginning to take steps
toward addressing the problem of knowledge sharing. Several studies
have investigated learning from demonstrations [32], action
advising [33], [34], knowledge broadcasting [35], and imitation [36].
Among them, the superiority of the imitation strategy has been
verified in MeMAS and its enhanced versions for increasing the
learning performance of memetic agents by considering the essential
human learning principles, i.e., " imitate-from-elitist " [10] and
" like-attracts-like " [11]. Still, these approaches always assume that an
agent is capable of constantly monitoring the performance of all
others. The imitation process occurs whenever a potential expert is
identified, that is, an individual with better learning fitness or bearing
certain similarities under the given circumstances. This approach
suffers from drawbacks, as constantly paying attention imposes
increasing communication costs and can be simply unrealistic.
In this study, the objective is to develop a more effective
State Judgment
Imitation
The Imitator Policy
Request
Respond
Own Action ai
Imitated Action aj
Action
Memetic Agent i
FIGURE 4 Illustration of the imitation process.
Memetic Agent j
where A is the space of all available actions, and (, )Qs a l is the
normalized Q-value of performing the action a at a state s:
60 IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE | NOVEMBER 2021
State Judgment
Imitation
The Imitated Policy
knowledge sharing scheme in multi-agent learning. Different
from the existing MeMAS research, a bidirectionally interactive
imitation strategy (shown in Figure 4) is proposed based on an
agent's estimation of the importance and/or uncertainty of
decision making at a given environmental state.
Recall that a memotype inside an agent's mind universe essentially
encodes a mapping of the input state s and the action a,
through the reward measurement
R = (( ,),Qs a
1 -Qs a (, ))
which is updated using Q-learning functions. Accordingly, a
method is proposed to measure the state importance, Imp(s), of the
current state s using a Gini coefficient regarding the Q-value distribution
for all possible actions, which is formulated as follows:
A
Imps
() / ll(, )( (, ))
1
=-#
=
1
a
QsaQsa
(12)
IEEE Computational Intelligence Magazine - November 2021
Table of Contents for the Digital Edition of IEEE Computational Intelligence Magazine - November 2021
IEEE Computational Intelligence Magazine - November 2021 - Cover1
IEEE Computational Intelligence Magazine - November 2021 - Cover2
IEEE Computational Intelligence Magazine - November 2021 - 1
IEEE Computational Intelligence Magazine - November 2021 - 2
IEEE Computational Intelligence Magazine - November 2021 - 3
IEEE Computational Intelligence Magazine - November 2021 - 4
IEEE Computational Intelligence Magazine - November 2021 - 5
IEEE Computational Intelligence Magazine - November 2021 - 6
IEEE Computational Intelligence Magazine - November 2021 - 7
IEEE Computational Intelligence Magazine - November 2021 - 8
IEEE Computational Intelligence Magazine - November 2021 - 9
IEEE Computational Intelligence Magazine - November 2021 - 10
IEEE Computational Intelligence Magazine - November 2021 - 11
IEEE Computational Intelligence Magazine - November 2021 - 12
IEEE Computational Intelligence Magazine - November 2021 - 13
IEEE Computational Intelligence Magazine - November 2021 - 14
IEEE Computational Intelligence Magazine - November 2021 - 15
IEEE Computational Intelligence Magazine - November 2021 - 16
IEEE Computational Intelligence Magazine - November 2021 - 17
IEEE Computational Intelligence Magazine - November 2021 - 18
IEEE Computational Intelligence Magazine - November 2021 - 19
IEEE Computational Intelligence Magazine - November 2021 - 20
IEEE Computational Intelligence Magazine - November 2021 - 21
IEEE Computational Intelligence Magazine - November 2021 - 22
IEEE Computational Intelligence Magazine - November 2021 - 23
IEEE Computational Intelligence Magazine - November 2021 - 24
IEEE Computational Intelligence Magazine - November 2021 - 25
IEEE Computational Intelligence Magazine - November 2021 - 26
IEEE Computational Intelligence Magazine - November 2021 - 27
IEEE Computational Intelligence Magazine - November 2021 - 28
IEEE Computational Intelligence Magazine - November 2021 - 29
IEEE Computational Intelligence Magazine - November 2021 - 30
IEEE Computational Intelligence Magazine - November 2021 - 31
IEEE Computational Intelligence Magazine - November 2021 - 32
IEEE Computational Intelligence Magazine - November 2021 - 33
IEEE Computational Intelligence Magazine - November 2021 - 34
IEEE Computational Intelligence Magazine - November 2021 - 35
IEEE Computational Intelligence Magazine - November 2021 - 36
IEEE Computational Intelligence Magazine - November 2021 - 37
IEEE Computational Intelligence Magazine - November 2021 - 38
IEEE Computational Intelligence Magazine - November 2021 - 39
IEEE Computational Intelligence Magazine - November 2021 - 40
IEEE Computational Intelligence Magazine - November 2021 - 41
IEEE Computational Intelligence Magazine - November 2021 - 42
IEEE Computational Intelligence Magazine - November 2021 - 43
IEEE Computational Intelligence Magazine - November 2021 - 44
IEEE Computational Intelligence Magazine - November 2021 - 45
IEEE Computational Intelligence Magazine - November 2021 - 46
IEEE Computational Intelligence Magazine - November 2021 - 47
IEEE Computational Intelligence Magazine - November 2021 - 48
IEEE Computational Intelligence Magazine - November 2021 - 49
IEEE Computational Intelligence Magazine - November 2021 - 50
IEEE Computational Intelligence Magazine - November 2021 - 51
IEEE Computational Intelligence Magazine - November 2021 - 52
IEEE Computational Intelligence Magazine - November 2021 - 53
IEEE Computational Intelligence Magazine - November 2021 - 54
IEEE Computational Intelligence Magazine - November 2021 - 55
IEEE Computational Intelligence Magazine - November 2021 - 56
IEEE Computational Intelligence Magazine - November 2021 - 57
IEEE Computational Intelligence Magazine - November 2021 - 58
IEEE Computational Intelligence Magazine - November 2021 - 59
IEEE Computational Intelligence Magazine - November 2021 - 60
IEEE Computational Intelligence Magazine - November 2021 - 61
IEEE Computational Intelligence Magazine - November 2021 - 62
IEEE Computational Intelligence Magazine - November 2021 - 63
IEEE Computational Intelligence Magazine - November 2021 - 64
IEEE Computational Intelligence Magazine - November 2021 - 65
IEEE Computational Intelligence Magazine - November 2021 - 66
IEEE Computational Intelligence Magazine - November 2021 - 67
IEEE Computational Intelligence Magazine - November 2021 - 68
IEEE Computational Intelligence Magazine - November 2021 - 69
IEEE Computational Intelligence Magazine - November 2021 - 70
IEEE Computational Intelligence Magazine - November 2021 - 71
IEEE Computational Intelligence Magazine - November 2021 - 72
IEEE Computational Intelligence Magazine - November 2021 - 73
IEEE Computational Intelligence Magazine - November 2021 - 74
IEEE Computational Intelligence Magazine - November 2021 - 75
IEEE Computational Intelligence Magazine - November 2021 - 76
IEEE Computational Intelligence Magazine - November 2021 - 77
IEEE Computational Intelligence Magazine - November 2021 - 78
IEEE Computational Intelligence Magazine - November 2021 - 79
IEEE Computational Intelligence Magazine - November 2021 - 80
IEEE Computational Intelligence Magazine - November 2021 - 81
IEEE Computational Intelligence Magazine - November 2021 - 82
IEEE Computational Intelligence Magazine - November 2021 - 83
IEEE Computational Intelligence Magazine - November 2021 - 84
IEEE Computational Intelligence Magazine - November 2021 - 85
IEEE Computational Intelligence Magazine - November 2021 - 86
IEEE Computational Intelligence Magazine - November 2021 - 87
IEEE Computational Intelligence Magazine - November 2021 - 88
IEEE Computational Intelligence Magazine - November 2021 - 89
IEEE Computational Intelligence Magazine - November 2021 - 90
IEEE Computational Intelligence Magazine - November 2021 - 91
IEEE Computational Intelligence Magazine - November 2021 - 92
IEEE Computational Intelligence Magazine - November 2021 - 93
IEEE Computational Intelligence Magazine - November 2021 - 94
IEEE Computational Intelligence Magazine - November 2021 - 95
IEEE Computational Intelligence Magazine - November 2021 - 96
IEEE Computational Intelligence Magazine - November 2021 - 97
IEEE Computational Intelligence Magazine - November 2021 - 98
IEEE Computational Intelligence Magazine - November 2021 - 99
IEEE Computational Intelligence Magazine - November 2021 - 100
IEEE Computational Intelligence Magazine - November 2021 - Cover3
IEEE Computational Intelligence Magazine - November 2021 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202311
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202308
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202305
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202302
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202211
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202208
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202205
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202202
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202111
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202108
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202105
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202102
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202011
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202008
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202005
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_202002
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201911
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201908
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201905
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201902
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201811
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201808
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201805
https://www.nxtbook.com/nxtbooks/ieee/computationalintelligence_201802
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter17
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall17
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_summer17
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_spring17
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter16
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall16
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_summer16
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_spring16
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter15
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall15
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_summer15
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_spring15
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter14
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall14
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_summer14
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_spring14
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter13
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall13
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_summer13
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_spring13
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_winter12
https://www.nxtbook.com/nxtbooks/ieee/computational_intelligence_fall12
https://www.nxtbookmedia.com