IEEE Spectrum - North American - March 2017 - 36

Input layer

Hidden layer 2

Hidden layer 1
36

|

MAR 2017

|

NORTh AMERICAN

types of signals it would later have to
classify on the fly.
Then we used samples of noisy speech
and their corresponding results on the
ideal binary mask to complete the second phase of training, which was the
supervised learning. In particular, the
set of 1s and 0s that make up the ideal
binary mask was like an answer sheet
that we used to test and improve our program's ability to separate speech from
noise. For each new sample, the program
would extract a set of attributes from
the noisy speech. Then, after analyzing
these attributes-frequencies, intensities,
and so on-the filter performed a provisional classification-was it speech? was it
noise?-and compared the result to what
the ideal binary mask would determine
in the same situation. If the result was
different from the 1s and 0s within our
perfect binary mask filter, we tweaked
the neural network's parameters accordingly, so that the network would produce
results closer to the 1s and 0s of the ideal
binary mask on its next try.
To make these adjustments, we first calculated the error of the neural network,
measured as the discrepancy between the
ideal binary mask and the result at the neural network's final layer, which is known as
the output layer. Once we computed this
error, we would then use it to change the
weights of the neural network's connections so that if the same classification was
carried out again, the discrepancy would
be reduced. The training of the neural network consisted of performing this procedure thousands of times.
One important refinement along the
way was to build a second deep neural
network that would be fed by the first
one and fine-tune its results. While
Output
layer

Hidden layer 3
|

SPECTRUM.IEEE.ORG

sMart layers: A deep
neural network consists
of two or more processing layers in between
the input layer, through
which information is fed
into the system [left], and
the output layer, which
reveals the results [right].
To improve performance,
researchers can adjust
the system's parameters
and tweak the connections between layers. 

1880 to 1920

1921 to 1953

1984 to present

that first network had focused on labeling attributes within each individual
time-frequency unit, the second network would examine the attributes of
several units near a particular one. In
other words, the second network provided the first network with extra context
about the speech and noise it processed
and further improved its classification
accuracy. For example, a syllable may
span many time-frequency units, but the
background noise could change abruptly
while it was being spoken. In our case,
having contextual clues could help the
program to more accurately separate
speech from noise within the syllable.  
At the end of the supervised training, the deep-neural-network classifier proved to be far superior to earlier
methods at separating speech from
noise. In fact, this algorithm was the
first, of any technique relying on monaural techniques, to achieve major
improvements in hearing-impaired listeners' ability to make sense of spoken
phrases obscured by noise.
To test it with human subjects, we
asked 12 hearing-impaired people and
12 with normal hearing to listen through
illustration by

Erik Vrielink

From Top: hearing aiD museum; Joe haupT/Flickr; sspl/geTTy images

Neural networks come in many shapes
and sizes and with varying degrees of
complexity. Deep neural networks are
defined as having at least two "hidden"
processing layers, which are not directly
connected to a system's input or output.
Each hidden layer refines the results fed
to it by previous layers, adding in new
considerations based on prior knowledge.
For example, a program designed
to verify a customer's signature might
begin by comparing a new signature to a
sample included in a training database.
However, that program also knows from
its training that the new signature does
not need to precisely match the original.
Other layers can determine if the new
signature shares certain qualities that
tend to remain consistent in a person's
signature, such as the angle of slant, or
the failure to dot the letter i. 
To build our own deep neural network, we began by writing algorithms
to extract features that could distinguish
voices from noise based on common
changes in amplitude, frequency, and
the modulations of each. We identified
dozens of attributes that could help our
program discriminate between speech
and noise to some extent, and we used
all 85 of them to make the algorithms as
powerful as possible. Among the most
important attributes we identified were
the frequencies of the sounds and their
intensities (loud or soft).
Next, we trained the deep neural network to use these 85 attributes to distinguish speech from noise. This training
occurred in two phases: First, we set the
program's parameters through unsupervised learning. This means we loaded
many examples of the attributes into
the program in order to prime it for the


http://SPECTRUM.IEEE.ORG

Table of Contents for the Digital Edition of IEEE Spectrum - North American - March 2017

Contents
IEEE Spectrum - North American - March 2017 - Cover1
IEEE Spectrum - North American - March 2017 - Cover2
IEEE Spectrum - North American - March 2017 - 1
IEEE Spectrum - North American - March 2017 - 2
IEEE Spectrum - North American - March 2017 - Contents
IEEE Spectrum - North American - March 2017 - 4
IEEE Spectrum - North American - March 2017 - 5
IEEE Spectrum - North American - March 2017 - 6
IEEE Spectrum - North American - March 2017 - 7
IEEE Spectrum - North American - March 2017 - 8
IEEE Spectrum - North American - March 2017 - 9
IEEE Spectrum - North American - March 2017 - 10
IEEE Spectrum - North American - March 2017 - 11
IEEE Spectrum - North American - March 2017 - 12
IEEE Spectrum - North American - March 2017 - 13
IEEE Spectrum - North American - March 2017 - 14
IEEE Spectrum - North American - March 2017 - 15
IEEE Spectrum - North American - March 2017 - 16
IEEE Spectrum - North American - March 2017 - 17
IEEE Spectrum - North American - March 2017 - 18
IEEE Spectrum - North American - March 2017 - 19
IEEE Spectrum - North American - March 2017 - 20
IEEE Spectrum - North American - March 2017 - 21
IEEE Spectrum - North American - March 2017 - 22
IEEE Spectrum - North American - March 2017 - 23
IEEE Spectrum - North American - March 2017 - 24
IEEE Spectrum - North American - March 2017 - 25
IEEE Spectrum - North American - March 2017 - 26
IEEE Spectrum - North American - March 2017 - 27
IEEE Spectrum - North American - March 2017 - 28
IEEE Spectrum - North American - March 2017 - 29
IEEE Spectrum - North American - March 2017 - 30
IEEE Spectrum - North American - March 2017 - 31
IEEE Spectrum - North American - March 2017 - 32
IEEE Spectrum - North American - March 2017 - 33
IEEE Spectrum - North American - March 2017 - 34
IEEE Spectrum - North American - March 2017 - 35
IEEE Spectrum - North American - March 2017 - 36
IEEE Spectrum - North American - March 2017 - 37
IEEE Spectrum - North American - March 2017 - 38
IEEE Spectrum - North American - March 2017 - 39
IEEE Spectrum - North American - March 2017 - 40
IEEE Spectrum - North American - March 2017 - 41
IEEE Spectrum - North American - March 2017 - 42
IEEE Spectrum - North American - March 2017 - 43
IEEE Spectrum - North American - March 2017 - 44
IEEE Spectrum - North American - March 2017 - 45
IEEE Spectrum - North American - March 2017 - 46
IEEE Spectrum - North American - March 2017 - 47
IEEE Spectrum - North American - March 2017 - 48
IEEE Spectrum - North American - March 2017 - 49
IEEE Spectrum - North American - March 2017 - 50
IEEE Spectrum - North American - March 2017 - 51
IEEE Spectrum - North American - March 2017 - 52
IEEE Spectrum - North American - March 2017 - 53
IEEE Spectrum - North American - March 2017 - 54
IEEE Spectrum - North American - March 2017 - 55
IEEE Spectrum - North American - March 2017 - 56
IEEE Spectrum - North American - March 2017 - Cover3
IEEE Spectrum - North American - March 2017 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1217
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1117
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1017
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0917
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0817
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0717
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0617
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0517
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0417
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0317
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0217
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0117
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1216
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1116
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1016
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0916
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0816
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0716
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0616
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0516
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0416
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0316
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0216
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0116
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1215
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1115
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1015
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0915
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0815
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0715
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0615
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0515
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0415
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0315
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0215
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0115
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1214
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1114
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1014
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0914
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0814
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0714
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0614
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0514
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0414
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0314
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0214
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0114
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1213
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1113
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1013
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0913
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0813
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0713
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0613
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0513
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0413
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0313
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0213
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0113
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1212
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1112
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1012
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0912
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0812
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0712
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0612
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0512
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0412
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0312
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0212
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0112
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1211
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1111
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1011
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0911
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0811
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0711
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0611
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0511
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0411
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0311
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0211
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0111
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1210
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1110
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1010
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0910
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0810
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0710
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0610
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0510
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0410
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0310
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0210
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0110
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1209
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1109
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1009
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0909
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0809
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0709
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0609
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0509
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0409
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0309
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0209
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0109
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1208
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1108
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1008
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0908
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0808
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0708
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0608
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0508
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0408
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0308
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0208
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0108
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1207
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1107
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_1007
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0907
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0807
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0707
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0607
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0507
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0407
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0307
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0207
https://www.nxtbook.com/nxtbooks/ieee/spectrum_na_0107
https://www.nxtbookmedia.com