IEEE Robotics & Automation Magazine - December 2021 - 17

mean episode return starts at -0.23 and -0.44 for the two tasks
because of the negative penalty and the failure of the robots to
advance to the goal, and it converges to zero by the end of
learning for both policies. The upper bands reach episode
returns of 0.26 and 0.28, respectively. This means
the robots learn to control flipper angles and linear velocity.
The episode return for Abs-Asc-3-COG is lower in the beginning
because the Absolem COG is higher, and the robot can
tip over more easily. Neither of the two learning curves reaches
the maximum return of one, as it is impossible to put the COG
closer than a certain distance to the point O (see Figure 2).
This is expected, since the robots cannot be entirely parallel to
the ground while they traverse the staircase.
Figure 5(b) graphs the evolution of the COG during learning.
Both policies reduce the COG deviation at the same pace
and reach minimal values after 8,000 time steps, or 120 episodes.
The Absolem policy exhibits a greater COG deviation,
equal to 0.19 m, due to the higher COG placement in the initial
configuration, where the flippers are extended; still, it decreases
the initial COG deviation by 17%. The Jaguar policy decreases the
COG deviation by 36%, to 0.075 m, which approximately represents
the COG deviation along the y-axis.
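
The percentages above can be turned back into approximate starting values. The short sketch below does that arithmetic; the helper name `implied_initial` is hypothetical, and reading 0.19 m as the Absolem deviation at the end of learning is an assumption about the text rather than a figure confirmed by the authors.

```python
def implied_initial(final_value: float, relative_reduction: float) -> float:
    """Return the starting value implied by a final value and a fractional reduction."""
    if not 0.0 <= relative_reduction < 1.0:
        raise ValueError("relative_reduction must be in [0, 1)")
    return final_value / (1.0 - relative_reduction)


# Numbers quoted in the text. Treating 0.19 m as Absolem's end-of-learning
# deviation is an assumption made for this illustration.
absolem_initial = implied_initial(0.19, 0.17)
jaguar_initial = implied_initial(0.075, 0.36)

# Average episode length implied by "8,000 time steps, or 120 episodes".
steps_per_episode = 8000 / 120

print(f"Absolem implied initial COG deviation: {absolem_initial:.3f} m")
print(f"Jaguar implied initial COG deviation:  {jaguar_initial:.3f} m")
print(f"Average steps per episode:             {steps_per_episode:.1f}")
```

Under these assumptions, both implied starting deviations fall within the 0.05 to 0.25 m range covered by the vertical axis of Figure 5(b).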
The learning process for the two robots is fairly similar.
Furthermore, the same episode return value is achieved by
the end of training. The COG curves exhibit comparable
behavior and converge at different values due to differences in
mass distributions. Those differences also change robot control
features; yet, both robots achieve the goal by minimizing
the COG deviation. Note, however, that the absolute COG
deviation difference between Absolem and Jaguar at the end
of learning in Figure 5(b) is not transposed to an equivalent
absolute difference in the total reward in Figure 5(a), thanks
to the normalization of each negative penalty. These results
strongly indicate that the framework is applicable to different
robot models for the ascent task.

Figure 5. The ascent task learning analysis. (a) Total reward and
(b) Dt safety measure evolution. [Plots: reward in (a) and COG deviation (m) in (b) versus time step, 1k to 10k, for Abs-Asc-3-COG and Jag-Asc-3-COG.]

Performance for Descent Tasks
Figure 6(a) presents learning curves for tasks Jag-Des-3-Ang
and Abs-Des-3-Ang. The first starts at a -0.1 episode return
and converges to 0.1. The second policy converges similarly
but starts at a -0.3 episode return and ends at -0.1. Both platform
policies exhibit similar learning behavior, separated by an offset of
around 0.2 that can be explained by robot dynamics, because
Absolem is exposed to a higher pitch angular velocity. This is
seen in Figure 6(b), where the platform mean angular velocity
during an episode is greater for Absolem, due to a higher centroid
position over the surface in the resting state. The Absolem
policy decreases its angular velocity by the end of learning by
13%, while the Jaguar policy drops it by 10%. Minimal angular
velocity values are attained by 5,000 steps, but from this
moment onward, the learning curves slightly increase. This
may happen due to optimization of the traversal time, as the
robot spends less time traversing the staircase. As before, these
results suggest that the framework is effective for both robots.
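
As a quick check on the descent figures quoted above, the sketch below tabulates the gain of each policy and the gap between the two curves at the start and end of learning. Only the four start and end returns come from the text; the variable names are illustrative, and rounding to two decimals is a choice made here to match the precision of the quoted values.

```python
# Start and end episode returns quoted in the text for the descent tasks.
curves = {
    "Jag-Des-3-Ang": (-0.1, 0.1),
    "Abs-Des-3-Ang": (-0.3, -0.1),
}

# Improvement of each policy over learning (end minus start).
gains = {task: round(end - start, 2) for task, (start, end) in curves.items()}

# Gap between the Jaguar and Absolem curves at the start and at the end.
jag_start, jag_end = curves["Jag-Des-3-Ang"]
abs_start, abs_end = curves["Abs-Des-3-Ang"]
start_gap = round(jag_start - abs_start, 2)
end_gap = round(jag_end - abs_end, 2)

for task, gain in gains.items():
    print(f"{task}: gain {gain:+.2f}")
print(f"Gap at start: {start_gap:.2f}, gap at end: {end_gap:.2f}")
```

With the quoted values, both policies gain the same amount and the gap between them stays constant, which is consistent with the description of two similar curves separated by around 0.2.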
KL Divergence Between Policies
Figure 7 gives KL divergences for learned controllers in the
3-DoF tasks. We omitted 5-DoF tasks since the simulated
Absolem model does not include an arm. An (i, j)th heat
map cell represents the divergence between a controller with

Figure 6. The descent task learning analysis. (a) Total reward and
(b) Wt safety measure evolution. [Plots: reward in (a) and angular velocity (rad/s) in (b) versus time step, 1k to 10k, for Abs-Des-3-Ang and Jag-Des-3-Ang.]
