IEEE Circuits and Systems Magazine - Q2 2023 - 25

Table 6.
Comparison of different model configurations on
one-billion [39]. Tensorized Transformer is the
model with multi-linear attention. Core-1 denotes
a model with a single-block term tensor, whereas
core-2 denotes a model with two block term
tensors.
Model
RNN-1024+9 Gram [72]
LSTM-2018-512 [73]
GCNN-14 bottleneck [74]
LSTM-8192-1024+CNN
Input [73]
High-Budget MoE [75]
LSTM+Mos [76]
Transformer+adaptive
input [77]
Transformer-XL Base [78]
Transformer-XL Large [78]
Tensorized Transformer
core-1 [39]
Tensorized Transformer
core-2 [39]
Params
20B
0.83B
-
1.04B
5B
113M
0.46B
0.46B
0.8B
0.16B
0.16B
Test PPL
51.3
43.7
31.9
30.0
28.0
37.10
23.7
23.5
21.8
20.5
19.5
tasks [36]. Three datasets were chosen: one of small
size (PTB), one of medium size (WikiText-103) and one
of large size (One-billion). PTB contains 929,900 training
tokens, 73,900 validation words, and 82,900 test words
[71]. WikiText-103 has 267,735 distinct tokens. The dataset
is a long-term dependency word-level language modeling
benchmark. It contains 103 million training tokens
from 28 thousand articles, with an average length of
3.6 thousand tokens per article. The One-Billion Word
benchmark is a large dataset with 829 250 940,, tokens
over a vocabulary of 793,471 words. Models were evaluated
based on the average per-word log-probability,
Perplexity (PPL). The lower the PPL, the more accurate
the model. The standard multi-head attention layers in
Transformer were replaced with Multi-linear attention.
A comparison of different model configurations on different
datasets is shown in Tables 6 and 7. Notice that
the tensorized transformer with Multi-linear attention
achieves lower PPL with much fewer parameters than
other models in the three datasets.
Neural Machine Translation involves translating
text or speech from one language to another. The
baseline is a vanilla Transformer trained on WMT
2016 English-German dataset [83]. For comparison,
Table 7.
Comparison of different model configurations on PTB and WiKiTEXT-103 [39]. Tensorized transformer is the model
with multi-linear attention. Core-1 denotes a model with a single-block term tensor, whereas core-2 denotes a
model with two block term tensors.
PTB
Model
Params
LSTM+augmented loss
[79]
Variational RHN [80]
4-layer QRNN [81]
AWD-LSTM-MoS [76]
Transformer+adaptive
input [77]
Transformer-XL-Base
[78]
Transformer-XL-Large
[78]
Transformer-XL+TT [38]
Sparse Transformer [82]
Tensorized Transformer
core-1 [39]
Tensorized Transformer
core-2 [39]
SECOND QUARTER 2023
24M
23M
-
22M
24M
24M
-
18M
14M
12M
12M
Val
PPL
75.7
67.9
-
58.08
59.1
56.72
-
57.9
74.0
60.5
54.25
Test
PPL
48.7
65.4
-
55.97
57
54.52
-
55.4
73.1
57.9
49.8
WikiText-103
Params
-
-
151M
-
247M
151M
257M
130M
174M
85.3M
85.3M
Val
PPL
-
-
-
29.0
19.8
23.1
-
23.61
38.98
22.7
19.7
Test PPL
48.7
45.2
33.0
29.2
20.5
24.0
18.3
25.70
40.23
20.9
18.9
IEEE CIRCUITS AND SYSTEMS MAGAZINE
25

IEEE Circuits and Systems Magazine - Q2 2023

Table of Contents for the Digital Edition of IEEE Circuits and Systems Magazine - Q2 2023

Contents
IEEE Circuits and Systems Magazine - Q2 2023 - Cover1
IEEE Circuits and Systems Magazine - Q2 2023 - Cover2
IEEE Circuits and Systems Magazine - Q2 2023 - Contents
IEEE Circuits and Systems Magazine - Q2 2023 - 2
IEEE Circuits and Systems Magazine - Q2 2023 - 3
IEEE Circuits and Systems Magazine - Q2 2023 - 4
IEEE Circuits and Systems Magazine - Q2 2023 - 5
IEEE Circuits and Systems Magazine - Q2 2023 - 6
IEEE Circuits and Systems Magazine - Q2 2023 - 7
IEEE Circuits and Systems Magazine - Q2 2023 - 8
IEEE Circuits and Systems Magazine - Q2 2023 - 9
IEEE Circuits and Systems Magazine - Q2 2023 - 10
IEEE Circuits and Systems Magazine - Q2 2023 - 11
IEEE Circuits and Systems Magazine - Q2 2023 - 12
IEEE Circuits and Systems Magazine - Q2 2023 - 13
IEEE Circuits and Systems Magazine - Q2 2023 - 14
IEEE Circuits and Systems Magazine - Q2 2023 - 15
IEEE Circuits and Systems Magazine - Q2 2023 - 16
IEEE Circuits and Systems Magazine - Q2 2023 - 17
IEEE Circuits and Systems Magazine - Q2 2023 - 18
IEEE Circuits and Systems Magazine - Q2 2023 - 19
IEEE Circuits and Systems Magazine - Q2 2023 - 20
IEEE Circuits and Systems Magazine - Q2 2023 - 21
IEEE Circuits and Systems Magazine - Q2 2023 - 22
IEEE Circuits and Systems Magazine - Q2 2023 - 23
IEEE Circuits and Systems Magazine - Q2 2023 - 24
IEEE Circuits and Systems Magazine - Q2 2023 - 25
IEEE Circuits and Systems Magazine - Q2 2023 - 26
IEEE Circuits and Systems Magazine - Q2 2023 - 27
IEEE Circuits and Systems Magazine - Q2 2023 - 28
IEEE Circuits and Systems Magazine - Q2 2023 - 29
IEEE Circuits and Systems Magazine - Q2 2023 - 30
IEEE Circuits and Systems Magazine - Q2 2023 - 31
IEEE Circuits and Systems Magazine - Q2 2023 - 32
IEEE Circuits and Systems Magazine - Q2 2023 - 33
IEEE Circuits and Systems Magazine - Q2 2023 - 34
IEEE Circuits and Systems Magazine - Q2 2023 - 35
IEEE Circuits and Systems Magazine - Q2 2023 - 36
IEEE Circuits and Systems Magazine - Q2 2023 - 37
IEEE Circuits and Systems Magazine - Q2 2023 - 38
IEEE Circuits and Systems Magazine - Q2 2023 - 39
IEEE Circuits and Systems Magazine - Q2 2023 - 40
IEEE Circuits and Systems Magazine - Q2 2023 - 41
IEEE Circuits and Systems Magazine - Q2 2023 - 42
IEEE Circuits and Systems Magazine - Q2 2023 - 43
IEEE Circuits and Systems Magazine - Q2 2023 - 44
IEEE Circuits and Systems Magazine - Q2 2023 - 45
IEEE Circuits and Systems Magazine - Q2 2023 - 46
IEEE Circuits and Systems Magazine - Q2 2023 - 47
IEEE Circuits and Systems Magazine - Q2 2023 - 48
IEEE Circuits and Systems Magazine - Q2 2023 - 49
IEEE Circuits and Systems Magazine - Q2 2023 - 50
IEEE Circuits and Systems Magazine - Q2 2023 - 51
IEEE Circuits and Systems Magazine - Q2 2023 - 52
IEEE Circuits and Systems Magazine - Q2 2023 - 53
IEEE Circuits and Systems Magazine - Q2 2023 - 54
IEEE Circuits and Systems Magazine - Q2 2023 - 55
IEEE Circuits and Systems Magazine - Q2 2023 - 56
IEEE Circuits and Systems Magazine - Q2 2023 - 57
IEEE Circuits and Systems Magazine - Q2 2023 - 58
IEEE Circuits and Systems Magazine - Q2 2023 - 59
IEEE Circuits and Systems Magazine - Q2 2023 - 60
IEEE Circuits and Systems Magazine - Q2 2023 - 61
IEEE Circuits and Systems Magazine - Q2 2023 - 62
IEEE Circuits and Systems Magazine - Q2 2023 - 63
IEEE Circuits and Systems Magazine - Q2 2023 - 64
IEEE Circuits and Systems Magazine - Q2 2023 - 65
IEEE Circuits and Systems Magazine - Q2 2023 - 66
IEEE Circuits and Systems Magazine - Q2 2023 - 67
IEEE Circuits and Systems Magazine - Q2 2023 - 68
IEEE Circuits and Systems Magazine - Q2 2023 - 69
IEEE Circuits and Systems Magazine - Q2 2023 - 70
IEEE Circuits and Systems Magazine - Q2 2023 - 71
IEEE Circuits and Systems Magazine - Q2 2023 - 72
IEEE Circuits and Systems Magazine - Q2 2023 - Cover3
IEEE Circuits and Systems Magazine - Q2 2023 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q1
https://www.nxtbookmedia.com