IEEE Robotics & Automation Magazine - December 2021 - 26
Penalty for Collisions Rc
Collisions are the very last thing expected to happen to the
agent vehicle, so we define the punishment for them as
R = '
c
!
-rc
where, again, r Rl
+
.
where wk
Update Mechanism
The first three reward functions are updated in the branches.
Because collision avoidance should be conducted by all the
policies of lane keeping, lane change, and overtaking, it has a
higher priority, which means Rc
are updated. Finally, the reward in Renv
Rs as (, ,)
env
ks ol,
is updated after the branches
is formulated as
ll
"
=+
!
/ Rs as Rc
,,
k(, ,) .
(12)
Specific information about the reward parameters is presented
in the " Experiment and Results " section.
Detailed Implementation of the MRA DQN Model
For the framework of the MRA DQN, the Q value functions
corresponding to the reward functions
subnetworks are denoted by ,Qs
R ,s R ,o and Rl
in the
Q ,o and Q .l The output
action of the car agent is decided by multiple driving
PER
Qs Rs
Prioritized Encoding With the TD Error
The deviation extent from the estimated Q value to the
targe one indicates the importance of the training sample.
Therefore, we apply TD error PMRA
of samples:
S
Qo
Ro
QMRA
Ql Rl
Psay Qs a(, ;) .
,, ,
MRA ii (, ;)
ikik i
ks ol
=! "
/
,
(16)
To
avoid overfitting caused by overbiased sampling, it is necessary
to build a nonzero probability sampling strategy. What
is more, the priority values should possess significant disparity
so that the important data can be sampled enough times.
The sampling probability Si
of sample i is
Shared
Low-Level
Network
Separated
High-Level Network
Branches for MRA
where Pi
Figure 2. The proposed MRA with the decomposed reward
architecture for three branches of the Q network.
Table 1. The kernel parameter configuration.
Name rv vmax
vmin
rl
S = n
i
/ ()a
i=1
()a
Pi
Pi
,
is the priority of sample i fine-tuned with a coefficient
a ! (, ].01 For uniform sampling, a 0= in (17).
ro rc ii c
Value 1 80 km/h 40 km/h 0.25 0.5 20 1/3 0.99
Prioritized Sampling Based on a Sum Tree
The time complexity of storing linear data is O(),n which is
unaffordable for large-scale training. In this article, the solution
is to apply a sum tree to optimize the storage process, by
which the complexity is reduced to O().
logn In a sum tree,
the leaf nodes record the priority, the branch nodes record the
26 * IEEE ROBOTICS & AUTOMATION MAGAZINE * DECEMBER 2021
(17)
is the weight assigned to indicate the importance of
every driving preference. The loss function to minimize in
training and the target value are formulated as
L ()
ii
ii (14)
"
=!
E
,,,,
,,
ks ol,
yR sasQ sa
(, ,)
x
sars / yQ (, ;) ,
2
l ^h;
sa
ki ki
E
ki, kk i 1-ll l
=+ci$ ma (, ;).
(15)
The proposed MRA method can improve the driving performance
of the DQN because the learning objective
becomes diverse. The single reward function limits the network
to a single objective task. With the MRA approach, the
network can simultaneously learn multiple objectives, which
is similar to multiagent reinforcement learning. Because different
learning tasks emphasize features representing varying
information, the MRA DQN will comprehensively consider
a driving scenario.
the agentcar collides
otherwise,
(11)
preferences in discrepant aspects, while the driving preferences
are optimized by the branches in the network by the
reward functions. Therefore, the final QMRA
DQN is calculated by combining all the Q value functions:
:
QsawQs a
,,
MRA ii=
! "
(, ;)
/
ks ol,
k
k(, ;),
(13)
in the MRA
to encode the priority
IEEE Robotics & Automation Magazine - December 2021
Table of Contents for the Digital Edition of IEEE Robotics & Automation Magazine - December 2021
Contents
IEEE Robotics & Automation Magazine - December 2021 - Cover1
IEEE Robotics & Automation Magazine - December 2021 - Cover2
IEEE Robotics & Automation Magazine - December 2021 - Contents
IEEE Robotics & Automation Magazine - December 2021 - 2
IEEE Robotics & Automation Magazine - December 2021 - 3
IEEE Robotics & Automation Magazine - December 2021 - 4
IEEE Robotics & Automation Magazine - December 2021 - 5
IEEE Robotics & Automation Magazine - December 2021 - 6
IEEE Robotics & Automation Magazine - December 2021 - 7
IEEE Robotics & Automation Magazine - December 2021 - 8
IEEE Robotics & Automation Magazine - December 2021 - 9
IEEE Robotics & Automation Magazine - December 2021 - 10
IEEE Robotics & Automation Magazine - December 2021 - 11
IEEE Robotics & Automation Magazine - December 2021 - 12
IEEE Robotics & Automation Magazine - December 2021 - 13
IEEE Robotics & Automation Magazine - December 2021 - 14
IEEE Robotics & Automation Magazine - December 2021 - 15
IEEE Robotics & Automation Magazine - December 2021 - 16
IEEE Robotics & Automation Magazine - December 2021 - 17
IEEE Robotics & Automation Magazine - December 2021 - 18
IEEE Robotics & Automation Magazine - December 2021 - 19
IEEE Robotics & Automation Magazine - December 2021 - 20
IEEE Robotics & Automation Magazine - December 2021 - 21
IEEE Robotics & Automation Magazine - December 2021 - 22
IEEE Robotics & Automation Magazine - December 2021 - 23
IEEE Robotics & Automation Magazine - December 2021 - 24
IEEE Robotics & Automation Magazine - December 2021 - 25
IEEE Robotics & Automation Magazine - December 2021 - 26
IEEE Robotics & Automation Magazine - December 2021 - 27
IEEE Robotics & Automation Magazine - December 2021 - 28
IEEE Robotics & Automation Magazine - December 2021 - 29
IEEE Robotics & Automation Magazine - December 2021 - 30
IEEE Robotics & Automation Magazine - December 2021 - 31
IEEE Robotics & Automation Magazine - December 2021 - 32
IEEE Robotics & Automation Magazine - December 2021 - 33
IEEE Robotics & Automation Magazine - December 2021 - 34
IEEE Robotics & Automation Magazine - December 2021 - 35
IEEE Robotics & Automation Magazine - December 2021 - 36
IEEE Robotics & Automation Magazine - December 2021 - 37
IEEE Robotics & Automation Magazine - December 2021 - 38
IEEE Robotics & Automation Magazine - December 2021 - 39
IEEE Robotics & Automation Magazine - December 2021 - 40
IEEE Robotics & Automation Magazine - December 2021 - 41
IEEE Robotics & Automation Magazine - December 2021 - 42
IEEE Robotics & Automation Magazine - December 2021 - 43
IEEE Robotics & Automation Magazine - December 2021 - 44
IEEE Robotics & Automation Magazine - December 2021 - 45
IEEE Robotics & Automation Magazine - December 2021 - 46
IEEE Robotics & Automation Magazine - December 2021 - 47
IEEE Robotics & Automation Magazine - December 2021 - 48
IEEE Robotics & Automation Magazine - December 2021 - 49
IEEE Robotics & Automation Magazine - December 2021 - 50
IEEE Robotics & Automation Magazine - December 2021 - 51
IEEE Robotics & Automation Magazine - December 2021 - 52
IEEE Robotics & Automation Magazine - December 2021 - 53
IEEE Robotics & Automation Magazine - December 2021 - 54
IEEE Robotics & Automation Magazine - December 2021 - 55
IEEE Robotics & Automation Magazine - December 2021 - 56
IEEE Robotics & Automation Magazine - December 2021 - 57
IEEE Robotics & Automation Magazine - December 2021 - 58
IEEE Robotics & Automation Magazine - December 2021 - 59
IEEE Robotics & Automation Magazine - December 2021 - 60
IEEE Robotics & Automation Magazine - December 2021 - 61
IEEE Robotics & Automation Magazine - December 2021 - 62
IEEE Robotics & Automation Magazine - December 2021 - 63
IEEE Robotics & Automation Magazine - December 2021 - 64
IEEE Robotics & Automation Magazine - December 2021 - 65
IEEE Robotics & Automation Magazine - December 2021 - 66
IEEE Robotics & Automation Magazine - December 2021 - 67
IEEE Robotics & Automation Magazine - December 2021 - 68
IEEE Robotics & Automation Magazine - December 2021 - 69
IEEE Robotics & Automation Magazine - December 2021 - 70
IEEE Robotics & Automation Magazine - December 2021 - 71
IEEE Robotics & Automation Magazine - December 2021 - 72
IEEE Robotics & Automation Magazine - December 2021 - 73
IEEE Robotics & Automation Magazine - December 2021 - 74
IEEE Robotics & Automation Magazine - December 2021 - 75
IEEE Robotics & Automation Magazine - December 2021 - 76
IEEE Robotics & Automation Magazine - December 2021 - 77
IEEE Robotics & Automation Magazine - December 2021 - 78
IEEE Robotics & Automation Magazine - December 2021 - 79
IEEE Robotics & Automation Magazine - December 2021 - 80
IEEE Robotics & Automation Magazine - December 2021 - 81
IEEE Robotics & Automation Magazine - December 2021 - 82
IEEE Robotics & Automation Magazine - December 2021 - 83
IEEE Robotics & Automation Magazine - December 2021 - 84
IEEE Robotics & Automation Magazine - December 2021 - 85
IEEE Robotics & Automation Magazine - December 2021 - 86
IEEE Robotics & Automation Magazine - December 2021 - 87
IEEE Robotics & Automation Magazine - December 2021 - 88
IEEE Robotics & Automation Magazine - December 2021 - 89
IEEE Robotics & Automation Magazine - December 2021 - 90
IEEE Robotics & Automation Magazine - December 2021 - 91
IEEE Robotics & Automation Magazine - December 2021 - 92
IEEE Robotics & Automation Magazine - December 2021 - 93
IEEE Robotics & Automation Magazine - December 2021 - 94
IEEE Robotics & Automation Magazine - December 2021 - 95
IEEE Robotics & Automation Magazine - December 2021 - 96
IEEE Robotics & Automation Magazine - December 2021 - 97
IEEE Robotics & Automation Magazine - December 2021 - 98
IEEE Robotics & Automation Magazine - December 2021 - 99
IEEE Robotics & Automation Magazine - December 2021 - 100
IEEE Robotics & Automation Magazine - December 2021 - 101
IEEE Robotics & Automation Magazine - December 2021 - 102
IEEE Robotics & Automation Magazine - December 2021 - 103
IEEE Robotics & Automation Magazine - December 2021 - 104
IEEE Robotics & Automation Magazine - December 2021 - 105
IEEE Robotics & Automation Magazine - December 2021 - 106
IEEE Robotics & Automation Magazine - December 2021 - 107
IEEE Robotics & Automation Magazine - December 2021 - 108
IEEE Robotics & Automation Magazine - December 2021 - 109
IEEE Robotics & Automation Magazine - December 2021 - 110
IEEE Robotics & Automation Magazine - December 2021 - 111
IEEE Robotics & Automation Magazine - December 2021 - 112
IEEE Robotics & Automation Magazine - December 2021 - 113
IEEE Robotics & Automation Magazine - December 2021 - 114
IEEE Robotics & Automation Magazine - December 2021 - 115
IEEE Robotics & Automation Magazine - December 2021 - 116
IEEE Robotics & Automation Magazine - December 2021 - 117
IEEE Robotics & Automation Magazine - December 2021 - 118
IEEE Robotics & Automation Magazine - December 2021 - 119
IEEE Robotics & Automation Magazine - December 2021 - 120
IEEE Robotics & Automation Magazine - December 2021 - 121
IEEE Robotics & Automation Magazine - December 2021 - 122
IEEE Robotics & Automation Magazine - December 2021 - 123
IEEE Robotics & Automation Magazine - December 2021 - 124
IEEE Robotics & Automation Magazine - December 2021 - 125
IEEE Robotics & Automation Magazine - December 2021 - 126
IEEE Robotics & Automation Magazine - December 2021 - 127
IEEE Robotics & Automation Magazine - December 2021 - 128
IEEE Robotics & Automation Magazine - December 2021 - 129
IEEE Robotics & Automation Magazine - December 2021 - 130
IEEE Robotics & Automation Magazine - December 2021 - 131
IEEE Robotics & Automation Magazine - December 2021 - 132
IEEE Robotics & Automation Magazine - December 2021 - Cover3
IEEE Robotics & Automation Magazine - December 2021 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2010
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2010
https://www.nxtbookmedia.com