IEEE Robotics & Automation Magazine - December 2021 - 25

where al means the value of variable a is updated, which is the
same as variable s, and r is the measured value of
value as rr=
r .t
define the optimal Qr
t r
argmax Qs al
aAl!
"
If we
t (, :)(,) ,
Qs maxaQ sa ,
the determini s t i c policy can be obtained by a =
(, ).
To im plement the PMRA DQN, the
model should embed a multireward mechanism, and the
PER function is applied to the training process. The following
sections will introduce details about the observation, reward,
PER, action, and Q network.
Observation of the Agent Car
On a simulation platform, we can collect data from the firstperson
perspective (the sensor) and the third-person perspective
(the environment). Considering that image and
point cloud data are the most popular input format in autonomous
driving, the agent car is equipped with a front view
camera and a lidar sensor. The red-green-blue front view
camera provides information about the road and surrounding
vehicles, while the single-line lidar sensor measures 2D
depth information in 360ยบ.
Multireward for Driving
We propose an MRA for highway driving to train a better
function approximator to accomplish complex tasks. In the
DQN framework, when the nonlinear function is used to
approximate the Q value function, the update of the Q value
is prone to oscillation, which leads to unstable learning
behavior. For this reason, a target network is introduced. It
can provide a target value, and the online network can provide
an estimate value; the two values can used to obtain the
learning loss. We name the model the MRA DQN. In our
case, the value to approximate is
Q ,r
t
and the approximator is
implemented in the form of a deep neural network. When
training the MRA DQN, the loss function and target value are
formulated as
L ()=-l^h6 yQ(, ;) ,@ii (5)
DQN
yr xQs a
ii E ,, ,sars
DQN
where yi
2
i
i =+ci$ ma (, ;),
ll
the function approximator. Here, i 1i -
sa i
i-1
(6)
DQN is the target value and i is the weight vector for
is encoded by a separated
target network. The meaning of the other notations
remains as previously explained.
The general design idea is to decompose a single learning
task into several subtasks. Subreward functions are proposed
to train different branches of the deep neural network, which
makes the branches optimize toward independent goals. The
original Q network is divided into branches for high-dimensional
feature representation and shares the same feature
map for the low-dimensional network. Therefore, the total
reward Renv
the benefits obtained by various actions:
(, ,)
env
=
!
Rs asll/ (, ,),
kP
Rs ask
(7)
where P is the set of policies. As shown in (7), the subnetworks
are trained with the reward function emphasizing
different aspects. The advantage of this design is that the optimization
of the whole
network is constrained to
a few possible directions
by observable variables
[10], which makes the
model's decision more
interpretable for humans,
and the training burden
is lightened.
Safety and efficiency
The advantage of this
design is that the
are essential factors that
should be taken into account
for highway driving.
We consider four driving
preferences: higher speed,
more overtaking, less lane changing, and no collisions. In
other words,
optimization of the whole
network is constrained to a
few possible directions by
observable variables.
P := {speed changing, overtaking, lane changing,
collision}. The reasonability and formulation of these actions
are as follows.
Reward for Speed Rs
The higher the speed is, the more time is saved. Encouraging
faster driving brings an improvement of efficiency. The speed
reward is closely related to velocity v as
R =
s
()$
-
-
vv
vv r
maxmin
min
v
.
In (8), the basic reward value for the speed is ;rv
(8)
vmax and
vmin are the maximum and minimum velocity in a range of
time, so vmin may not be zero.
Reward for Overtaking Ro
Overtaking helps to maintain high speeds and brings
about more efficiency, so we define the reward for passing
vehicles as
R = '
o
!
r
where r Ro
o
the agentcar overtakesother cars
otherwise,
+
.
Penalty for Lane Changing Rl
Safety must be guaranteed when pursuing efficiency. In case
the reward for overtaking grows enthusiastically high, which
may instigate unnecessary passing and cause accidents, a limitation
on lane changing frequency is reasonable. Therefore,
we define the discouragement for lane changing as
returned from the environment is the sum of all
R = '
l
where r Rl
!
+
.
DECEMBER 2021 * IEEE ROBOTICS & AUTOMATION MAGAZINE *
25
-rl
the agent car changeslane
otherwise,
(10)
(9)

IEEE Robotics & Automation Magazine - December 2021

Table of Contents for the Digital Edition of IEEE Robotics & Automation Magazine - December 2021

Contents
IEEE Robotics & Automation Magazine - December 2021 - Cover1
IEEE Robotics & Automation Magazine - December 2021 - Cover2
IEEE Robotics & Automation Magazine - December 2021 - Contents
IEEE Robotics & Automation Magazine - December 2021 - 2
IEEE Robotics & Automation Magazine - December 2021 - 3
IEEE Robotics & Automation Magazine - December 2021 - 4
IEEE Robotics & Automation Magazine - December 2021 - 5
IEEE Robotics & Automation Magazine - December 2021 - 6
IEEE Robotics & Automation Magazine - December 2021 - 7
IEEE Robotics & Automation Magazine - December 2021 - 8
IEEE Robotics & Automation Magazine - December 2021 - 9
IEEE Robotics & Automation Magazine - December 2021 - 10
IEEE Robotics & Automation Magazine - December 2021 - 11
IEEE Robotics & Automation Magazine - December 2021 - 12
IEEE Robotics & Automation Magazine - December 2021 - 13
IEEE Robotics & Automation Magazine - December 2021 - 14
IEEE Robotics & Automation Magazine - December 2021 - 15
IEEE Robotics & Automation Magazine - December 2021 - 16
IEEE Robotics & Automation Magazine - December 2021 - 17
IEEE Robotics & Automation Magazine - December 2021 - 18
IEEE Robotics & Automation Magazine - December 2021 - 19
IEEE Robotics & Automation Magazine - December 2021 - 20
IEEE Robotics & Automation Magazine - December 2021 - 21
IEEE Robotics & Automation Magazine - December 2021 - 22
IEEE Robotics & Automation Magazine - December 2021 - 23
IEEE Robotics & Automation Magazine - December 2021 - 24
IEEE Robotics & Automation Magazine - December 2021 - 25
IEEE Robotics & Automation Magazine - December 2021 - 26
IEEE Robotics & Automation Magazine - December 2021 - 27
IEEE Robotics & Automation Magazine - December 2021 - 28
IEEE Robotics & Automation Magazine - December 2021 - 29
IEEE Robotics & Automation Magazine - December 2021 - 30
IEEE Robotics & Automation Magazine - December 2021 - 31
IEEE Robotics & Automation Magazine - December 2021 - 32
IEEE Robotics & Automation Magazine - December 2021 - 33
IEEE Robotics & Automation Magazine - December 2021 - 34
IEEE Robotics & Automation Magazine - December 2021 - 35
IEEE Robotics & Automation Magazine - December 2021 - 36
IEEE Robotics & Automation Magazine - December 2021 - 37
IEEE Robotics & Automation Magazine - December 2021 - 38
IEEE Robotics & Automation Magazine - December 2021 - 39
IEEE Robotics & Automation Magazine - December 2021 - 40
IEEE Robotics & Automation Magazine - December 2021 - 41
IEEE Robotics & Automation Magazine - December 2021 - 42
IEEE Robotics & Automation Magazine - December 2021 - 43
IEEE Robotics & Automation Magazine - December 2021 - 44
IEEE Robotics & Automation Magazine - December 2021 - 45
IEEE Robotics & Automation Magazine - December 2021 - 46
IEEE Robotics & Automation Magazine - December 2021 - 47
IEEE Robotics & Automation Magazine - December 2021 - 48
IEEE Robotics & Automation Magazine - December 2021 - 49
IEEE Robotics & Automation Magazine - December 2021 - 50
IEEE Robotics & Automation Magazine - December 2021 - 51
IEEE Robotics & Automation Magazine - December 2021 - 52
IEEE Robotics & Automation Magazine - December 2021 - 53
IEEE Robotics & Automation Magazine - December 2021 - 54
IEEE Robotics & Automation Magazine - December 2021 - 55
IEEE Robotics & Automation Magazine - December 2021 - 56
IEEE Robotics & Automation Magazine - December 2021 - 57
IEEE Robotics & Automation Magazine - December 2021 - 58
IEEE Robotics & Automation Magazine - December 2021 - 59
IEEE Robotics & Automation Magazine - December 2021 - 60
IEEE Robotics & Automation Magazine - December 2021 - 61
IEEE Robotics & Automation Magazine - December 2021 - 62
IEEE Robotics & Automation Magazine - December 2021 - 63
IEEE Robotics & Automation Magazine - December 2021 - 64
IEEE Robotics & Automation Magazine - December 2021 - 65
IEEE Robotics & Automation Magazine - December 2021 - 66
IEEE Robotics & Automation Magazine - December 2021 - 67
IEEE Robotics & Automation Magazine - December 2021 - 68
IEEE Robotics & Automation Magazine - December 2021 - 69
IEEE Robotics & Automation Magazine - December 2021 - 70
IEEE Robotics & Automation Magazine - December 2021 - 71
IEEE Robotics & Automation Magazine - December 2021 - 72
IEEE Robotics & Automation Magazine - December 2021 - 73
IEEE Robotics & Automation Magazine - December 2021 - 74
IEEE Robotics & Automation Magazine - December 2021 - 75
IEEE Robotics & Automation Magazine - December 2021 - 76
IEEE Robotics & Automation Magazine - December 2021 - 77
IEEE Robotics & Automation Magazine - December 2021 - 78
IEEE Robotics & Automation Magazine - December 2021 - 79
IEEE Robotics & Automation Magazine - December 2021 - 80
IEEE Robotics & Automation Magazine - December 2021 - 81
IEEE Robotics & Automation Magazine - December 2021 - 82
IEEE Robotics & Automation Magazine - December 2021 - 83
IEEE Robotics & Automation Magazine - December 2021 - 84
IEEE Robotics & Automation Magazine - December 2021 - 85
IEEE Robotics & Automation Magazine - December 2021 - 86
IEEE Robotics & Automation Magazine - December 2021 - 87
IEEE Robotics & Automation Magazine - December 2021 - 88
IEEE Robotics & Automation Magazine - December 2021 - 89
IEEE Robotics & Automation Magazine - December 2021 - 90
IEEE Robotics & Automation Magazine - December 2021 - 91
IEEE Robotics & Automation Magazine - December 2021 - 92
IEEE Robotics & Automation Magazine - December 2021 - 93
IEEE Robotics & Automation Magazine - December 2021 - 94
IEEE Robotics & Automation Magazine - December 2021 - 95
IEEE Robotics & Automation Magazine - December 2021 - 96
IEEE Robotics & Automation Magazine - December 2021 - 97
IEEE Robotics & Automation Magazine - December 2021 - 98
IEEE Robotics & Automation Magazine - December 2021 - 99
IEEE Robotics & Automation Magazine - December 2021 - 100
IEEE Robotics & Automation Magazine - December 2021 - 101
IEEE Robotics & Automation Magazine - December 2021 - 102
IEEE Robotics & Automation Magazine - December 2021 - 103
IEEE Robotics & Automation Magazine - December 2021 - 104
IEEE Robotics & Automation Magazine - December 2021 - 105
IEEE Robotics & Automation Magazine - December 2021 - 106
IEEE Robotics & Automation Magazine - December 2021 - 107
IEEE Robotics & Automation Magazine - December 2021 - 108
IEEE Robotics & Automation Magazine - December 2021 - 109
IEEE Robotics & Automation Magazine - December 2021 - 110
IEEE Robotics & Automation Magazine - December 2021 - 111
IEEE Robotics & Automation Magazine - December 2021 - 112
IEEE Robotics & Automation Magazine - December 2021 - 113
IEEE Robotics & Automation Magazine - December 2021 - 114
IEEE Robotics & Automation Magazine - December 2021 - 115
IEEE Robotics & Automation Magazine - December 2021 - 116
IEEE Robotics & Automation Magazine - December 2021 - 117
IEEE Robotics & Automation Magazine - December 2021 - 118
IEEE Robotics & Automation Magazine - December 2021 - 119
IEEE Robotics & Automation Magazine - December 2021 - 120
IEEE Robotics & Automation Magazine - December 2021 - 121
IEEE Robotics & Automation Magazine - December 2021 - 122
IEEE Robotics & Automation Magazine - December 2021 - 123
IEEE Robotics & Automation Magazine - December 2021 - 124
IEEE Robotics & Automation Magazine - December 2021 - 125
IEEE Robotics & Automation Magazine - December 2021 - 126
IEEE Robotics & Automation Magazine - December 2021 - 127
IEEE Robotics & Automation Magazine - December 2021 - 128
IEEE Robotics & Automation Magazine - December 2021 - 129
IEEE Robotics & Automation Magazine - December 2021 - 130
IEEE Robotics & Automation Magazine - December 2021 - 131
IEEE Robotics & Automation Magazine - December 2021 - 132
IEEE Robotics & Automation Magazine - December 2021 - Cover3
IEEE Robotics & Automation Magazine - December 2021 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2010
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2010
https://www.nxtbookmedia.com