IEEE Circuits and Systems Magazine - Q3 2023 - 9

M
I. Overview of Tiny Machine Learning
achine learning (ML) has made significant
impacts on various fields, including vision,
language, and audio. However, state-of-theart
models often come at the cost of high computation
and memory, making them expensive to deploy. To address
this, researchers have been working on efficient
algorithms, systems, and hardware to reduce the cost
of machine learning models in various deployment scenarios.
There are two main subdomains of efficient ML:
EdgeML and CloudML (Figure 1). While CloudML focuses
on improving latency and throughput on cloud servers,
EdgeML focuses on improving energy efficiency, latency,
and privacy on edge devices. These two domains
also intersect in areas such as hybrid inference [1],
[2], over-the-air (OTA) updates, and federated learning
between the edge and cloud [3]. In recent years, there
has been significant progress in extending the scope of
EdgeML to ultra-low-power devices such as IoT devices
and microcontrollers, known as TinyML.
TinyML has several key advantages. It enables machine
learning using only a few hundred kilobytes of
memory which greatly reduces the cost. With billions of
IoT devices producing more and more data in our daily
lives, there is a growing need for low-power, always-on,
on-device AI. By performing on-device inference near
the sensor, TinyML enables better responsiveness and
privacy while reducing the energy cost associated with
wireless communication. On-device processing of data
can be beneficial for applications where real-time decision-making
is crucial, such as autonomous vehicles.
In addition to inference, we push the frontier of TinyML
to enable on-device training on IoT devices. Itrevolutionizes
EdgeAI through continuous and lifelong
learning. Edge device can finetune the model on itself
rather than transmitting data to cloud servers, which
protects privacy. On-device learning has numerous benefits
and a variety of applications. For example, home
cameras can continuously recognize new faces, and
email clients can gradually improve their prediction by
updating customized language models. It also enables
IoT applications that do not have a physical connection
to the internet to adapt to the environment, such as precision
agriculture and ocean sensing.
In this review, we will first discuss the definition and
challenges of TinyML, analyzing why we can't directly
scale mobile ML or cloud ML models for tinyML. Then
we delve into the importance of system-algorithm codesign
in TinyML. We will then survey recent literature
and the progress of the field, presenting a holistic survey
and comparison in Tables 2 and 3. Next, we will introduce
our TinyML project, MCUNet, which combines
efficient system and algorithm design to enable TinyML
for both inference to training. Finally, we will discuss
several emerging topics for future research directions
in the field.
A. Challenges of TinyML
The success of deep learning models often comes at the
cost of high computation, which is not feasible for use
in TinyML applications due to the strict resource constraints
of devices such as microcontrollers. Deploying
and training AI models on MCU is extremely hard: No
DRAM, no operating systems (OS), and strict memory
constraints (SRAM is smaller than 256 kB, and FLASH
is read-only). The available resources on these devices
are orders of magnitude smaller than those available on
mobile platforms (see Table 1). Previous work in the field
has either (I) focused on reducing model parameters
without addressing the real bottleneck of activations or
(II) only optimized operator kernels without considering
improving the network architecture design. Neither of
which considers the problem from a co-design perspective,
and this has led to less optimal solutions for TinyML
applications. We observe several unique challenges of
TinyML and postulate how they might be overcome:
Figure 1. Efficiency is critical
CloudML
TinyML.
targets
for CloudML, EdgeML, and
high-throughput
accelerators
like GPUs, while EdgeML focuses on portable devices like
mobile phones. TinyML further pushes the efficiency boundary,
enabling powerful ML models to run on ultra-low-power
devices such as microcontrollers.
1) Models Designed for Mobile Platforms
Does Not Fit TinyML
There has been a lot of effort optimizing deep learning
models for mobile platforms like MobileNets [4], [5]
and ShuffleNet [6]. However, since mobile devices have
sufficient memory resources (Table 2), the model designs
focus on parameters/FLOPs/latency reduction but
not peak memory usage. As shown in Figure 2 left and
middle, comparing two models with the same level of
The authors are with the Massachusetts Institute of Technology, Cambridge, MA 02139 USA (e-mail: songhan@mit.edu).
THIRD QUARTER 2023
IEEE CIRCUITS AND SYSTEMS MAGAZINE
9

IEEE Circuits and Systems Magazine - Q3 2023

Table of Contents for the Digital Edition of IEEE Circuits and Systems Magazine - Q3 2023

Contents
IEEE Circuits and Systems Magazine - Q3 2023 - Cover1
IEEE Circuits and Systems Magazine - Q3 2023 - Cover2
IEEE Circuits and Systems Magazine - Q3 2023 - Contents
IEEE Circuits and Systems Magazine - Q3 2023 - 2
IEEE Circuits and Systems Magazine - Q3 2023 - 3
IEEE Circuits and Systems Magazine - Q3 2023 - 4
IEEE Circuits and Systems Magazine - Q3 2023 - 5
IEEE Circuits and Systems Magazine - Q3 2023 - 6
IEEE Circuits and Systems Magazine - Q3 2023 - 7
IEEE Circuits and Systems Magazine - Q3 2023 - 8
IEEE Circuits and Systems Magazine - Q3 2023 - 9
IEEE Circuits and Systems Magazine - Q3 2023 - 10
IEEE Circuits and Systems Magazine - Q3 2023 - 11
IEEE Circuits and Systems Magazine - Q3 2023 - 12
IEEE Circuits and Systems Magazine - Q3 2023 - 13
IEEE Circuits and Systems Magazine - Q3 2023 - 14
IEEE Circuits and Systems Magazine - Q3 2023 - 15
IEEE Circuits and Systems Magazine - Q3 2023 - 16
IEEE Circuits and Systems Magazine - Q3 2023 - 17
IEEE Circuits and Systems Magazine - Q3 2023 - 18
IEEE Circuits and Systems Magazine - Q3 2023 - 19
IEEE Circuits and Systems Magazine - Q3 2023 - 20
IEEE Circuits and Systems Magazine - Q3 2023 - 21
IEEE Circuits and Systems Magazine - Q3 2023 - 22
IEEE Circuits and Systems Magazine - Q3 2023 - 23
IEEE Circuits and Systems Magazine - Q3 2023 - 24
IEEE Circuits and Systems Magazine - Q3 2023 - 25
IEEE Circuits and Systems Magazine - Q3 2023 - 26
IEEE Circuits and Systems Magazine - Q3 2023 - 27
IEEE Circuits and Systems Magazine - Q3 2023 - 28
IEEE Circuits and Systems Magazine - Q3 2023 - 29
IEEE Circuits and Systems Magazine - Q3 2023 - 30
IEEE Circuits and Systems Magazine - Q3 2023 - 31
IEEE Circuits and Systems Magazine - Q3 2023 - 32
IEEE Circuits and Systems Magazine - Q3 2023 - 33
IEEE Circuits and Systems Magazine - Q3 2023 - 34
IEEE Circuits and Systems Magazine - Q3 2023 - 35
IEEE Circuits and Systems Magazine - Q3 2023 - 36
IEEE Circuits and Systems Magazine - Q3 2023 - 37
IEEE Circuits and Systems Magazine - Q3 2023 - 38
IEEE Circuits and Systems Magazine - Q3 2023 - 39
IEEE Circuits and Systems Magazine - Q3 2023 - 40
IEEE Circuits and Systems Magazine - Q3 2023 - 41
IEEE Circuits and Systems Magazine - Q3 2023 - 42
IEEE Circuits and Systems Magazine - Q3 2023 - 43
IEEE Circuits and Systems Magazine - Q3 2023 - 44
IEEE Circuits and Systems Magazine - Q3 2023 - 45
IEEE Circuits and Systems Magazine - Q3 2023 - 46
IEEE Circuits and Systems Magazine - Q3 2023 - 47
IEEE Circuits and Systems Magazine - Q3 2023 - 48
IEEE Circuits and Systems Magazine - Q3 2023 - 49
IEEE Circuits and Systems Magazine - Q3 2023 - 50
IEEE Circuits and Systems Magazine - Q3 2023 - 51
IEEE Circuits and Systems Magazine - Q3 2023 - 52
IEEE Circuits and Systems Magazine - Q3 2023 - 53
IEEE Circuits and Systems Magazine - Q3 2023 - 54
IEEE Circuits and Systems Magazine - Q3 2023 - 55
IEEE Circuits and Systems Magazine - Q3 2023 - 56
IEEE Circuits and Systems Magazine - Q3 2023 - 57
IEEE Circuits and Systems Magazine - Q3 2023 - 58
IEEE Circuits and Systems Magazine - Q3 2023 - 59
IEEE Circuits and Systems Magazine - Q3 2023 - 60
IEEE Circuits and Systems Magazine - Q3 2023 - 61
IEEE Circuits and Systems Magazine - Q3 2023 - 62
IEEE Circuits and Systems Magazine - Q3 2023 - 63
IEEE Circuits and Systems Magazine - Q3 2023 - 64
IEEE Circuits and Systems Magazine - Q3 2023 - 65
IEEE Circuits and Systems Magazine - Q3 2023 - 66
IEEE Circuits and Systems Magazine - Q3 2023 - 67
IEEE Circuits and Systems Magazine - Q3 2023 - 68
IEEE Circuits and Systems Magazine - Q3 2023 - 69
IEEE Circuits and Systems Magazine - Q3 2023 - 70
IEEE Circuits and Systems Magazine - Q3 2023 - 71
IEEE Circuits and Systems Magazine - Q3 2023 - 72
IEEE Circuits and Systems Magazine - Q3 2023 - 73
IEEE Circuits and Systems Magazine - Q3 2023 - 74
IEEE Circuits and Systems Magazine - Q3 2023 - 75
IEEE Circuits and Systems Magazine - Q3 2023 - 76
IEEE Circuits and Systems Magazine - Q3 2023 - 77
IEEE Circuits and Systems Magazine - Q3 2023 - 78
IEEE Circuits and Systems Magazine - Q3 2023 - 79
IEEE Circuits and Systems Magazine - Q3 2023 - 80
IEEE Circuits and Systems Magazine - Q3 2023 - 81
IEEE Circuits and Systems Magazine - Q3 2023 - 82
IEEE Circuits and Systems Magazine - Q3 2023 - 83
IEEE Circuits and Systems Magazine - Q3 2023 - 84
IEEE Circuits and Systems Magazine - Q3 2023 - 85
IEEE Circuits and Systems Magazine - Q3 2023 - 86
IEEE Circuits and Systems Magazine - Q3 2023 - 87
IEEE Circuits and Systems Magazine - Q3 2023 - 88
IEEE Circuits and Systems Magazine - Q3 2023 - Cover3
IEEE Circuits and Systems Magazine - Q3 2023 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2023Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2022Q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021Q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2021q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2020q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2019q1
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q4
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q3
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q2
https://www.nxtbook.com/nxtbooks/ieee/circuitsandsystems_2018q1
https://www.nxtbookmedia.com