IEEE Circuits and Systems Magazine - Q4 2020 - 71
generated at time step t. Then, the derivative of L t with
respect to W can be written as follows:
Output Layer
2L t = 2L t 2y t 2s t .(4)
2W
2y t 2s t 2W
Considering that s t = h ^Ux t - i + Ws t - 1 + b h h and s t - 1
is also influenced by W and s t - 2 . According to the chain
rule, we have
Hidden Layer
2s t = t 2s t 2s i . (5)
/
2W i = 1 2s i 2W
In addition, we also have
Input Layer
2s t =
% 2s k .(6)
2s i k = i + 1 2s k - 1
Therefore, 2L t 2W can be formulated as
t
C. Evolution of RNNs
Despite the effectiveness in sequential data processing,
RNN still suffers from gradient vanishing or explosion.
This is because that there exists a lot of partial derivatives in model training (see Eq. 7), especially when the
number of time steps is very large. If ^2s k 2s k - 1 h2 1, then
the gradient approaches infinity when the length of input time series is too large. In contrast, if ^2s k 2s k - 1 h 1 1.
then the gradient will be close to 0. To solve this problem, adopting ReLu as activation function instead of
tanh can slightly alleviate the vanishing gradient [27]. A
better method is to change the propagation structure of
RNNs to keep the partial derivatives equal to 1. In this
case, the product of partial derivatives would no longer
converge to 0 or diverge gradually. To satisfy this demand, Hochreiter and Schmidhuber [27] present a long
short-term memory (LSTM) model with several " gate "
structures. Inherently, a " gate " is a specific function.
There are generally three kinds of gates: forget gate for,
input gate in and output gate out. Then, the inference
process of LSTM can be formulated as:
for ^ x t , s t - 1 h = v ^W f · conc ^ x t , s t - 1 h + b f h,(9)
in ^ x t , s t - 1 h = v ^W i · conc ^ x t , s t - 1 h + b i h,(10)
clt = h ^W c · conc ^ x t , s t - 1 h + b c h,(11)
c t = for ^ x t , s t - 1 h ) c t - 1 + in ^ x t , s t - 1 h ) clt ,(12)
out ^ x t , s t - 1 h = v ^W o · conc ^ x t , s t - 1 h + b o h,(13)
s t = out ^ x t , s t - 1 h ) h ^c t h,(14)
FOURTH QUARTER 2020
W
V
t
2L t =
/ 2L t 2y t % 2s k 2s i . (8)
2U i = 1 2y t 2s t e k = i + 1 2s k - 1 o 2U
t
yt-1
y
2L t =
/ 2L t 2y t % 2s k 2s i . (7)
2W i = 1 2y t 2s t e k = i + 1 2s k - 1 o 2W
t
Similarly, the derivative of L t with respect to U can be
formulated as follows:
Context Layer
(a)
t
s
U
W
V
st-1
U
x
yt
W
V
st
U
xt-1
yt+1
W
V
st+1
W
U
xt
xt+1
(b)
Figure 4. The illustration of RNN where U, V and W are respectively weights between input layer and hidden layer,
weights between hidden layer and output layer and weights
between hidden layer and context layer. (a) The structure of
RNN [11], (b) The inference process of RNN [43].
where conc(â·â) denotes the concatenation operator and
v denotes sigmoid function. These " gates " can flexibly
control the filtration of the information flow in the network. Thereby, the gradient problems of vanilla RNNs
can be solved via making the product of the partial derivatives always equal to either 0 or 1 when computing
the gradients.
Furthermore, many variants of the " gate " structures
are suggested to improve the performance. Gers and
Schmidhuber [45] enable the gates to observe the cell
state c t - 1 through some " peephole " connections. Cho
et al. [46] simplify the transmission structure of information flow via integrating the forgot gate and input
gate into one update gate. Moreover, hidden state and
cell state are also merged in [46]. Lei et al. [47] propose
a simple recurrent unit that can accelerate network
computing via parallelizing the recurrence. Recently,
another attempt to handle the gradient problem is to
replace the weight matrices of RNN with the unitary
matrix [48]. In addition, the attention mechanism is
also introduced into the structure of RNNs to improve
the flexibility and effectiveness in different application
scenarios, such as natural language processing [49] and
image captioning [50].
IEEE CIRCUITS AND SYSTEMS MAGAZINE
71
IEEE Circuits and Systems Magazine - Q4 2020
Table of Contents for the Digital Edition of IEEE Circuits and Systems Magazine - Q4 2020
Contents
IEEE Circuits and Systems Magazine - Q4 2020 - Cover1
IEEE Circuits and Systems Magazine - Q4 2020 - Cover2
IEEE Circuits and Systems Magazine - Q4 2020 - Contents
IEEE Circuits and Systems Magazine - Q4 2020 - 2
IEEE Circuits and Systems Magazine - Q4 2020 - 3
IEEE Circuits and Systems Magazine - Q4 2020 - 4
IEEE Circuits and Systems Magazine - Q4 2020 - 5
IEEE Circuits and Systems Magazine - Q4 2020 - 6
IEEE Circuits and Systems Magazine - Q4 2020 - 7
IEEE Circuits and Systems Magazine - Q4 2020 - 8
IEEE Circuits and Systems Magazine - Q4 2020 - 9
IEEE Circuits and Systems Magazine - Q4 2020 - 10
IEEE Circuits and Systems Magazine - Q4 2020 - 11
IEEE Circuits and Systems Magazine - Q4 2020 - 12
IEEE Circuits and Systems Magazine - Q4 2020 - 13
IEEE Circuits and Systems Magazine - Q4 2020 - 14
IEEE Circuits and Systems Magazine - Q4 2020 - 15
IEEE Circuits and Systems Magazine - Q4 2020 - 16
IEEE Circuits and Systems Magazine - Q4 2020 - 17
IEEE Circuits and Systems Magazine - Q4 2020 - 18
IEEE Circuits and Systems Magazine - Q4 2020 - 19
IEEE Circuits and Systems Magazine - Q4 2020 - 20
IEEE Circuits and Systems Magazine - Q4 2020 - 21
IEEE Circuits and Systems Magazine - Q4 2020 - 22
IEEE Circuits and Systems Magazine - Q4 2020 - 23
IEEE Circuits and Systems Magazine - Q4 2020 - 24
IEEE Circuits and Systems Magazine - Q4 2020 - 25
IEEE Circuits and Systems Magazine - Q4 2020 - 26
IEEE Circuits and Systems Magazine - Q4 2020 - 27
IEEE Circuits and Systems Magazine - Q4 2020 - 28
IEEE Circuits and Systems Magazine - Q4 2020 - 29
IEEE Circuits and Systems Magazine - Q4 2020 - 30
IEEE Circuits and Systems Magazine - Q4 2020 - 31
IEEE Circuits and Systems Magazine - Q4 2020 - 32
IEEE Circuits and Systems Magazine - Q4 2020 - 33
IEEE Circuits and Systems Magazine - Q4 2020 - 34
IEEE Circuits and Systems Magazine - Q4 2020 - 35
IEEE Circuits and Systems Magazine - Q4 2020 - 36
IEEE Circuits and Systems Magazine - Q4 2020 - 37
IEEE Circuits and Systems Magazine - Q4 2020 - 38
IEEE Circuits and Systems Magazine - Q4 2020 - 39
IEEE Circuits and Systems Magazine - Q4 2020 - 40
IEEE Circuits and Systems Magazine - Q4 2020 - 41
IEEE Circuits and Systems Magazine - Q4 2020 - 42
IEEE Circuits and Systems Magazine - Q4 2020 - 43
IEEE Circuits and Systems Magazine - Q4 2020 - 44
IEEE Circuits and Systems Magazine - Q4 2020 - 45
IEEE Circuits and Systems Magazine - Q4 2020 - 46
IEEE Circuits and Systems Magazine - Q4 2020 - 47
IEEE Circuits and Systems Magazine - Q4 2020 - 48
IEEE Circuits and Systems Magazine - Q4 2020 - 49
IEEE Circuits and Systems Magazine - Q4 2020 - 50
IEEE Circuits and Systems Magazine - Q4 2020 - 51
IEEE Circuits and Systems Magazine - Q4 2020 - 52
IEEE Circuits and Systems Magazine - Q4 2020 - 53
IEEE Circuits and Systems Magazine - Q4 2020 - 54
IEEE Circuits and Systems Magazine - Q4 2020 - 55
IEEE Circuits and Systems Magazine - Q4 2020 - 56
IEEE Circuits and Systems Magazine - Q4 2020 - 57
IEEE Circuits and Systems Magazine - Q4 2020 - 58
IEEE Circuits and Systems Magazine - Q4 2020 - 59
IEEE Circuits and Systems Magazine - Q4 2020 - 60
IEEE Circuits and Systems Magazine - Q4 2020 - 61
IEEE Circuits and Systems Magazine - Q4 2020 - 62
IEEE Circuits and Systems Magazine - Q4 2020 - 63
IEEE Circuits and Systems Magazine - Q4 2020 - 64
IEEE Circuits and Systems Magazine - Q4 2020 - 65
IEEE Circuits and Systems Magazine - Q4 2020 - 66
IEEE Circuits and Systems Magazine - Q4 2020 - 67
IEEE Circuits and Systems Magazine - Q4 2020 - 68
IEEE Circuits and Systems Magazine - Q4 2020 - 69
IEEE Circuits and Systems Magazine - Q4 2020 - 70
IEEE Circuits and Systems Magazine - Q4 2020 - 71
IEEE Circuits and Systems Magazine - Q4 2020 - 72
IEEE Circuits and Systems Magazine - Q4 2020 - 73
IEEE Circuits and Systems Magazine - Q4 2020 - 74
IEEE Circuits and Systems Magazine - Q4 2020 - 75
IEEE Circuits and Systems Magazine - Q4 2020 - 76
IEEE Circuits and Systems Magazine - Q4 2020 - 77
IEEE Circuits and Systems Magazine - Q4 2020 - 78
IEEE Circuits and Systems Magazine - Q4 2020 - 79
IEEE Circuits and Systems Magazine - Q4 2020 - 80
IEEE Circuits and Systems Magazine - Q4 2020 - 81
IEEE Circuits and Systems Magazine - Q4 2020 - 82
IEEE Circuits and Systems Magazine - Q4 2020 - 83
IEEE Circuits and Systems Magazine - Q4 2020 - 84
IEEE Circuits and Systems Magazine - Q4 2020 - 85
IEEE Circuits and Systems Magazine - Q4 2020 - 86
IEEE Circuits and Systems Magazine - Q4 2020 - 87
IEEE Circuits and Systems Magazine - Q4 2020 - 88
IEEE Circuits and Systems Magazine - Q4 2020 - Cover3
IEEE Circuits and Systems Magazine - Q4 2020 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q1
https://www.nxtbookmedia.com