IEEE Robotics & Automation Magazine - December 2021 - 14

between O and the projections of the COG on the x- and
y-axis, respectively.
Besides safety, we further observed that the contact of the
robot tracks with the surface cannot alone guarantee proper
traction. In reality, it is essential that sufficient force is exerted
from the robot to the ground, which is possible when the projection
of the COG Cx
is located near the perpendicular projection
of the geometrical center O on the staircase. This leads
to more uniformly distributed traction along the entire platform.
Finally, we further penalize the robot when it loses stability
and tips over, which happens when its pitch reaches /.2r
Platform Shaking Reduction
The effect of gravity on the movement of a robot switches
between ascent and descent by hindering ascent and accelerating
descent, which may endanger the robot and lead to tipovers.
Even if no accident happens, every staircase edge
traversal causes platform shaking and collisions, which influence
the robot and potentially an object that is being transported.
These events are repetitive when the robot traverses
every step, but the greatest impact can be observed at the
beginning of the traversal when the robot COG crosses the
uppermost stair edge. At that moment, the robot velocity can
be instantaneously increased, leading to a shock, slip, and,
worst, tip-over. The end of traversal is also crucial because if
the rear flippers are pushed down, the rear part of the robot
just falls from the step.
Such behaviors lead to distinctive pitch velocity peaks;
therefore, the controller should mitigate such events. Letting
Wt
denote pitch velocity and KW
rt = '
W
-
-KW
Wt
)
cient, the corresponding negative reward is
,
,
1if tip over
otherwise
the normalization coeffi(6)
This
penalty is assigned only in descent tasks and when a
robot traverses a set of stairs, namely, when the COG projection
on the steps is located between the staircase nosing limits.
Controller Learning Details
In designing a controller learned via RL, there are several
alternatives that can be considered [17]. Approximators such
as tile and coarse coding tend to suffer from the curse of
dimensionality and do not generalize well due to grain size
sensitivity. The main advantage of artificial NNs over other
function approximators is their generalization capability and
straightforward increase of depth, which has been well studied.
Another common choice is radial basis function (RBF)
NNs, which were widely applied before the emergence of
deep NNs. RBF NNs converge quickly to global optima with
fewer trials and errors; still, they have proved to be of limited
use due to the absence of tractable and stable integration into
more complex deep NN architectures.
Shallow networks with one hidden layer tend to have more
parameters to approximate the same function, in contrast to
deep networks with fewer parameters and the additional ability
to learn different representations at intermediate levels.
14 * IEEE ROBOTICS & AUTOMATION MAGAZINE * DECEMBER 2021
Thus, we opted for a multilayer perceptron (see Figure 1). The
first layer is composed of nine neurons on which the state
vector s is received every time step. The input is forwarded
through two dense layers to the output layer, which forms the
action vector a. Following up on the discussion in the " Control
Methods in Obstacle Negotiation " section, the SAC and
TD3 are considered more sample efficient than PPO since
they are off policy and converge faster to higher returns. Still,
we use PPO in our work to optimize the policy parameters
because it is more straightforward to implement, less sensitive
to hyperparameter changes, and has the most favorable
results in deep RL.
KL Divergence-Based Policy Comparison
We can quantitatively compare a pair of trained policies via a
commonly used probability distribution divergence measure,
namely, the KL divergence. Often, RL algorithms exploit divergence
as a metric to be optimized or a constraint of policy
updates, as, for example, in PPO, where updates of the policy
parameters are constrained through a penalty on KL divergence.
We denote the reference policy for a given task as
rq ,
and we wish to find how the policy pr diverges from qr
(assuming the same observation and action spaces). The process
of learning a policy amounts to learning the mean and
variance statistics of the underlying probability distribution
rather than the exact probability distribution for each possible
state-action pair. Using the hypothesis of a normally distributed
policy with a mean and variance that depend on the state, a
policy can be randomly sampled across the total observation
space, with actions chosen according to that policy.
Thus, once the total set of observations has been sampled,
the corresponding actions are prescribed by one of the two
policies being compared, namely, the reference policy
rq .
After having determined the state-action pairs across which
the policies will be compared, we calculate the divergence as
D (, )(,)
sa,
KL pq
rr = / pr sa log
r
r
q
p
(, )
(, )
sa
sa
.
(7)
Training a policy is always a stochastic procedure, and thus a
policy is a random variable itself, together with the number of
state-action pairs we choose for calculating divergence. We
empirically determine the number of times needed to train a
policy as well as the number of samples taken in the calculation
of the divergence, though a more consistent estimation is
possible if required.
Controller Deployment on a Real Robot
System Description
We begin by presenting the experiment setup used to obtain a
ground truth robot and environment state. This enables us to
establish an upper bound on the effectiveness of a controller
in reality, with moderate state errors, while facilitating the
development and evaluation of alternative robot localization
approaches later on. On the other hand, we opted for injecting
noise into the state during policy learning in simulation

IEEE Robotics & Automation Magazine - December 2021

Table of Contents for the Digital Edition of IEEE Robotics & Automation Magazine - December 2021

Contents
IEEE Robotics & Automation Magazine - December 2021 - Cover1
IEEE Robotics & Automation Magazine - December 2021 - Cover2
IEEE Robotics & Automation Magazine - December 2021 - Contents
IEEE Robotics & Automation Magazine - December 2021 - 2
IEEE Robotics & Automation Magazine - December 2021 - 3
IEEE Robotics & Automation Magazine - December 2021 - 4
IEEE Robotics & Automation Magazine - December 2021 - 5
IEEE Robotics & Automation Magazine - December 2021 - 6
IEEE Robotics & Automation Magazine - December 2021 - 7
IEEE Robotics & Automation Magazine - December 2021 - 8
IEEE Robotics & Automation Magazine - December 2021 - 9
IEEE Robotics & Automation Magazine - December 2021 - 10
IEEE Robotics & Automation Magazine - December 2021 - 11
IEEE Robotics & Automation Magazine - December 2021 - 12
IEEE Robotics & Automation Magazine - December 2021 - 13
IEEE Robotics & Automation Magazine - December 2021 - 14
IEEE Robotics & Automation Magazine - December 2021 - 15
IEEE Robotics & Automation Magazine - December 2021 - 16
IEEE Robotics & Automation Magazine - December 2021 - 17
IEEE Robotics & Automation Magazine - December 2021 - 18
IEEE Robotics & Automation Magazine - December 2021 - 19
IEEE Robotics & Automation Magazine - December 2021 - 20
IEEE Robotics & Automation Magazine - December 2021 - 21
IEEE Robotics & Automation Magazine - December 2021 - 22
IEEE Robotics & Automation Magazine - December 2021 - 23
IEEE Robotics & Automation Magazine - December 2021 - 24
IEEE Robotics & Automation Magazine - December 2021 - 25
IEEE Robotics & Automation Magazine - December 2021 - 26
IEEE Robotics & Automation Magazine - December 2021 - 27
IEEE Robotics & Automation Magazine - December 2021 - 28
IEEE Robotics & Automation Magazine - December 2021 - 29
IEEE Robotics & Automation Magazine - December 2021 - 30
IEEE Robotics & Automation Magazine - December 2021 - 31
IEEE Robotics & Automation Magazine - December 2021 - 32
IEEE Robotics & Automation Magazine - December 2021 - 33
IEEE Robotics & Automation Magazine - December 2021 - 34
IEEE Robotics & Automation Magazine - December 2021 - 35
IEEE Robotics & Automation Magazine - December 2021 - 36
IEEE Robotics & Automation Magazine - December 2021 - 37
IEEE Robotics & Automation Magazine - December 2021 - 38
IEEE Robotics & Automation Magazine - December 2021 - 39
IEEE Robotics & Automation Magazine - December 2021 - 40
IEEE Robotics & Automation Magazine - December 2021 - 41
IEEE Robotics & Automation Magazine - December 2021 - 42
IEEE Robotics & Automation Magazine - December 2021 - 43
IEEE Robotics & Automation Magazine - December 2021 - 44
IEEE Robotics & Automation Magazine - December 2021 - 45
IEEE Robotics & Automation Magazine - December 2021 - 46
IEEE Robotics & Automation Magazine - December 2021 - 47
IEEE Robotics & Automation Magazine - December 2021 - 48
IEEE Robotics & Automation Magazine - December 2021 - 49
IEEE Robotics & Automation Magazine - December 2021 - 50
IEEE Robotics & Automation Magazine - December 2021 - 51
IEEE Robotics & Automation Magazine - December 2021 - 52
IEEE Robotics & Automation Magazine - December 2021 - 53
IEEE Robotics & Automation Magazine - December 2021 - 54
IEEE Robotics & Automation Magazine - December 2021 - 55
IEEE Robotics & Automation Magazine - December 2021 - 56
IEEE Robotics & Automation Magazine - December 2021 - 57
IEEE Robotics & Automation Magazine - December 2021 - 58
IEEE Robotics & Automation Magazine - December 2021 - 59
IEEE Robotics & Automation Magazine - December 2021 - 60
IEEE Robotics & Automation Magazine - December 2021 - 61
IEEE Robotics & Automation Magazine - December 2021 - 62
IEEE Robotics & Automation Magazine - December 2021 - 63
IEEE Robotics & Automation Magazine - December 2021 - 64
IEEE Robotics & Automation Magazine - December 2021 - 65
IEEE Robotics & Automation Magazine - December 2021 - 66
IEEE Robotics & Automation Magazine - December 2021 - 67
IEEE Robotics & Automation Magazine - December 2021 - 68
IEEE Robotics & Automation Magazine - December 2021 - 69
IEEE Robotics & Automation Magazine - December 2021 - 70
IEEE Robotics & Automation Magazine - December 2021 - 71
IEEE Robotics & Automation Magazine - December 2021 - 72
IEEE Robotics & Automation Magazine - December 2021 - 73
IEEE Robotics & Automation Magazine - December 2021 - 74
IEEE Robotics & Automation Magazine - December 2021 - 75
IEEE Robotics & Automation Magazine - December 2021 - 76
IEEE Robotics & Automation Magazine - December 2021 - 77
IEEE Robotics & Automation Magazine - December 2021 - 78
IEEE Robotics & Automation Magazine - December 2021 - 79
IEEE Robotics & Automation Magazine - December 2021 - 80
IEEE Robotics & Automation Magazine - December 2021 - 81
IEEE Robotics & Automation Magazine - December 2021 - 82
IEEE Robotics & Automation Magazine - December 2021 - 83
IEEE Robotics & Automation Magazine - December 2021 - 84
IEEE Robotics & Automation Magazine - December 2021 - 85
IEEE Robotics & Automation Magazine - December 2021 - 86
IEEE Robotics & Automation Magazine - December 2021 - 87
IEEE Robotics & Automation Magazine - December 2021 - 88
IEEE Robotics & Automation Magazine - December 2021 - 89
IEEE Robotics & Automation Magazine - December 2021 - 90
IEEE Robotics & Automation Magazine - December 2021 - 91
IEEE Robotics & Automation Magazine - December 2021 - 92
IEEE Robotics & Automation Magazine - December 2021 - 93
IEEE Robotics & Automation Magazine - December 2021 - 94
IEEE Robotics & Automation Magazine - December 2021 - 95
IEEE Robotics & Automation Magazine - December 2021 - 96
IEEE Robotics & Automation Magazine - December 2021 - 97
IEEE Robotics & Automation Magazine - December 2021 - 98
IEEE Robotics & Automation Magazine - December 2021 - 99
IEEE Robotics & Automation Magazine - December 2021 - 100
IEEE Robotics & Automation Magazine - December 2021 - 101
IEEE Robotics & Automation Magazine - December 2021 - 102
IEEE Robotics & Automation Magazine - December 2021 - 103
IEEE Robotics & Automation Magazine - December 2021 - 104
IEEE Robotics & Automation Magazine - December 2021 - 105
IEEE Robotics & Automation Magazine - December 2021 - 106
IEEE Robotics & Automation Magazine - December 2021 - 107
IEEE Robotics & Automation Magazine - December 2021 - 108
IEEE Robotics & Automation Magazine - December 2021 - 109
IEEE Robotics & Automation Magazine - December 2021 - 110
IEEE Robotics & Automation Magazine - December 2021 - 111
IEEE Robotics & Automation Magazine - December 2021 - 112
IEEE Robotics & Automation Magazine - December 2021 - 113
IEEE Robotics & Automation Magazine - December 2021 - 114
IEEE Robotics & Automation Magazine - December 2021 - 115
IEEE Robotics & Automation Magazine - December 2021 - 116
IEEE Robotics & Automation Magazine - December 2021 - 117
IEEE Robotics & Automation Magazine - December 2021 - 118
IEEE Robotics & Automation Magazine - December 2021 - 119
IEEE Robotics & Automation Magazine - December 2021 - 120
IEEE Robotics & Automation Magazine - December 2021 - 121
IEEE Robotics & Automation Magazine - December 2021 - 122
IEEE Robotics & Automation Magazine - December 2021 - 123
IEEE Robotics & Automation Magazine - December 2021 - 124
IEEE Robotics & Automation Magazine - December 2021 - 125
IEEE Robotics & Automation Magazine - December 2021 - 126
IEEE Robotics & Automation Magazine - December 2021 - 127
IEEE Robotics & Automation Magazine - December 2021 - 128
IEEE Robotics & Automation Magazine - December 2021 - 129
IEEE Robotics & Automation Magazine - December 2021 - 130
IEEE Robotics & Automation Magazine - December 2021 - 131
IEEE Robotics & Automation Magazine - December 2021 - 132
IEEE Robotics & Automation Magazine - December 2021 - Cover3
IEEE Robotics & Automation Magazine - December 2021 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2023
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2022
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2021
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2020
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2019
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2018
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2017
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2016
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2015
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2014
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2013
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2012
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_june2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_march2011
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_december2010
https://www.nxtbook.com/nxtbooks/ieee/roboticsautomation_september2010
https://www.nxtbookmedia.com