IEEE Signal Processing - May 2018 - 109

[48] S. V. David, N. Mesgarani, and S. A. Shamma, "Estimating sparse spectrotemporal receptive fields with natural stimuli," Network: Comput. Neural Syst., vol.
18, no. 3, pp. 191-212, 2007.

[74] D. Giacobello, M. G. Christensen, M. N. Murthi, S. H. Jensen, and M.
Moonen, "Retrieving sparse patterns using a compressed sensing framework," IEEE
Signal Process. Lett., vol. 17, no. 1, pp. 103-106, 2010.

[49] M. Kleinschmidt, "Methods for capturing spectro-temporal modulations
in automatic speech recognition," Acta Acust. United Acust., vol. 88, no. 3,
pp. 416-422, 2002.

[75] J.-M. Valin, K. Vos, and T. Terriberry (2012, Sept.) Definition of the Opus
audio codec. Internet Engineering Task Force (IETF). [Online]. Available: http://
tools.ietf.org/html/rfc6716

[50] N. Mesgarani and S. Shamma, "Speech processing with a cortical representation of audio," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
(ICASSP), Prague, Czech Republic, 2011, pp. 5872-5875.

[76] N. Jaitly and G. Hinton, "Learning a better representation of speech soundwaves using restricted boltzmann machines," in Proc. IEEE Int. Conf. Acoustics,
Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011, pp.
5884-5887.

[51] A. Hyafil, L. Fontolan, C. Kabdebon, B. Gutkin, A. L. Giraud, and H.
Brownell, "Speech encoding by coupled cortical theta and gamma oscillations,"
eLife, vol. 4, pp. 1-23, May 2015.
[52] M. Cernak, A. Asaei, and H. Bourlard, "On structured sparsity of phonological
posteriors for linguistic parsing," Speech Commun., vol. 84, pp. 36-45, Nov. 2016.
[53] A. Asaei, M. Cernak, and H. Bourlard, "On compressibility of neural network
phonological features for low bit rate speech coding," in Proc. Interspeech,
Dresden, Germany, Sept. 2015, pp. 418-422.
[54] E. L. Saltzman and K. G. Munhall, "A dynamical approach to gestural patterning in speech production," Ecological Psych., vol. 1, no. 4, pp. 333-382, 1989.
[55] M. Cernak, B. Potard, and P. N. Garner, "Phonological vocoding using artificial neural networks," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal
Processing (ICASSP), Brisbane, Australia, Apr. 2015, pp. 4844-4848.

[77] G. Fant. (2001). On the speech code. KTH Computer Science and
Communication. [Online]. Available: https://pdfs.semanticscholar.org/6881/
d93c8c74e64cb823d9ce1c6587df91ff05c9.pdf
[78] A. M. Liberman, F. S. Cooper, D. P. Shankweiler, and M. Studdert-Kennedy,
"Perception of the speech code," Psychol. Rev., vol. 74, no. 6, pp. 431-461, Nov.
1967.
[79] C. A. Fowler, D. Shankweiler, and M. Studdert-Kennedy, "Perception of the
speech code revisited: Speech ss alphabetic after all." Psych. Rev., vol. 123, no. 2, p.
125, Aug. 2015.
[80] A. M. Liberman and I. G. Mattingly, "The motor theory of speech perception
revised," Cognition, vol. 21, no. 1, pp. 1-36, 1985.

[56] T. Painter and A. Spanias, "Perceptual coding of digital audio," Proc. IEEE,
vol. 88, no. 4, pp. 451-515, 2000.

[81] F. H. Guenther and G. Hickok, "Role of the auditory system in speech production," in Handbook of Clinical Neurology, vol. 129, M. J. Aminoff, F. Boller, and
D. F. Swaab, Eds. Amsterdam, The Netherlands: Elsevier, 2015, pp. 161-175.

[57] M. Hasegawa-Johnson and A. Alwan, Wiley Encyclopedia of Telecommunications. Hoboken, NJ: Wiley, 2003.

[82] C. P. Browman and L. M. Goldstein, "Articulatory gestures as phonological
units," Phonology, vol. 6, no. 2, pp. 201-251, 1989.

[58] J. Herre and M. Lutzky, "Perceptual audio coding of speech signals," in
Springer Handbook of Speech Processing, J. Benesty, Sondhi, and Y. Huang, Eds.
Berlin, Germany: Springer-Verlag, 2008, pp. 393-410.

[83] C. P. Browman and L. M. Goldstein, "Articulatory phonology: An overview,"
Phonetica, vol. 49, no. 3-4, pp. 155-180, 1992.

[59] J. Gibson, "Speech Compression," Inform., vol. 7, no. 2, p. 32, June 2016.

[84] P. K. Kuhl, "Early language acquisition: Cracking the speech code," Nature
Rev. Neurosci., vol. 5, no. 11, pp. 831-843, 2004.

[60] A. Rämö and H. Toukomaa, "Subjective quality evaluation of the 3gpp evs
codec," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
(ICASSP), Brisbane, Australia, pp. 5157-5161.

[85] L. Deng, M. L. Seltzer, D. Yu, A. Acero, A.-r. Mohamed, and G. E. Hinton,
"Binary coding of speech spectrograms using a deep auto-encoder," in Proc.
Interspeech, Makuhari, Japan, 2010, pp. 1692-1695.

[61] J. B. Allen, "Nonlinear cochlear signal processing and masking in speech perception," in Springer Handbook of Speech Processing, J. Benesty, Sondhi, and Y.
Huang, Eds. Berlin, Germany: Springer-Verlag, 2008, pp. 27-60.

[86] H. Dudley, "Remaking speech," J. Acoust. Soc. Amer. vol. 11, no. 2, pp. 169-
177, 1939.

[62] A. van den Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves,
N. Kalchbrenner, A. Senior, and K. Kavukcuoglu, (2016). Wavenet: A generative
model for raw audio. CoRR. [Online]. Available: https://arxiv.org/abs/1609.03499

[87] E. J. Candès and M. B. Wakin, "An introduction to compressive sampling,"
IEEE Signal Process. Mag., vol. 25, no. 2, pp. 21-30, 2008.
[88] M. Grant, S. Boyd, and Y. Ye. (2008). Cvx: MATLAB software for disciplined
convex programming. [Online]. Available: http://cvxr.com/cvx/

[63] M. R. Schroeder, B. S. Atal, and J. Hall, "Optimizing digital speech coders by
exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, no.
6, pp. 1647-1652, 1979.

[89] A. Asaei, H. Bourlard, and V. Cevher, "Model-based compressive sensing for
multi-party distant speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech,
and Signal Processing (ICASSP), Prague, Czech Republic, 2011.

[64] M. Vetterli and J. Kovacˇ evic, Wavelets and Subband Coding. Englewood
Cliffs, NJ: Prentice Hall, 1995.

[90] M. Cernak, A. Lazaridis, A. Asaei, and P. N. Garner, "Composition of deep
and spiking neural networks for very low bit rate speech coding," IEEE/ACM Trans.
Audio, Speech, Lang. Process., vol. 24, no. 12, pp. 2301-2312, Dec. 2016.

[65] J. G. Beerends, C. Schmidmer, J. Berger, M. Obermann, R. Ullmann, J. Pomy,
and M. Keyhl, "Perceptual objective listening quality assessment (POLQA), the
third generation ITU-T standard for end-to-end speech quality measurement part I-
itemporal alignment," J. Audio Eng. Soc., vol. 61, no. 6, pp. 366-384, 2013.
[66] D. Wong, B.H. Juang, and A. Gray, "An 800 bit/s vector quantization LPC
vocoder," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 30, no. 5, pp.
770-780, Oct. 1982.
[67] S. Roucos, R. Schwartz, and J. Makhoul, "Segment quantization for very-low-rate
speech coding," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
(ICASSP), Paris, France, May 1982, pp. 1565-1568.
[68] S. Roucos, R. Schwartz, and J. Makhoul, "A segment vocoder at 150 b/s," in
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Boston,
MA, vol. 8., Apr. 1983, pp. 61-64.
[69] D. Wong, B. Juang, and D. Cheng, "Very low data rate speech compression with
LPC vector and matrix quantization," in Proc. IEEE Int. Conf. Acoustics, Speech,
and Signal Processing (ICASSP), Boston, MA, vol. 8., Apr. 1983, pp. 65-68.
[70] C. Tsao and R. Gray, "Matrix quantizer design for LPC speech using the generalized Lloyd algorithm," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 33,
no. 3, pp. 537-545, June 1985.
[71] Y. Shiraki and M. Honda, "LPC speech coding based on variable-length segment quantization," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 36, no.
9, pp. 1437-1444, Sept. 1988.
[72] G. Davidson and A. Gersho, "Complexity reduction methods for vector excitation coding," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
(ICASSP), vol. 11., Tokyo, Japan, 1986, pp. 3055-3058.
[73] C. Laflamme, J.P. Adoul, H. Su, and S. Morissette, "On reducing computational complexity of codebook search in celp coder through the use of algebraic codes,"
in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP),
Albuquerque, NM, 1990, pp. 177-180.

[91] R. Rasipuram, M. Cernak, A. Nachen, and M. Magimai-Doss, "Automatic
accentedness evaluation of non-native speech using phonetic and sub-phonetic
posterior probabilities," in Proc. Interspeech, Dresden, Germany, 2015, pp.
648-652.
[92] R. Rasipuram, M. Cernak, and M. Magimai-Doss, "HMM-based non-native
accent assessment using posterior features," in Proc. Interspeech, San Francisco,
CA, 2016, pp. 3137-3141.
[93] N. Chomsky and M. Halle, The Sound Pattern of English. New York: Harper
& Row, 1968.
[94] C. H. Lee, M. A. Clements, S. Dusan, E. Fosler-Lussier, K. Johnson, B.-H.
Juang, and L. R. Rabiner, "An overview on Automatic Speech Attribute
Transcription (ASAT)," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1825-
1828.
[95] T. Nagamine, M. L. Seltzer, and N. Mesgarani, "Exploring how deep neural
networks form phonemic categories," in Proc. Interspeech, Dresden, Germany,
2015, pp. 1912-1916.
[96] C. M. Ribeiro and I. Trancoso, "Phonetic vocoding with speaker adaptation."
in Proc. Eurospeech, 1997, pp. 1291-1294.
[97] S. O. Ark, G. Diamos, A. Gibiansky, J. Miller, K. Peng, W. Ping, J. Raiman,
and Y. Zhou, "Deep Voice 2: Multi-speaker neural text-to-speech," in Proc. 31st
Conf. Neural Information Processing Systems (NIPS), Long Beach, CA, 2017.
[Online]. Available: http://research.baidu.com/wp-content/uploads/2017/05/DeepVoice-2-Complete-Arxiv.pdf
[98] D. Gelbart. (2018, Feb.). Gabor feature extraction for automatic speech recognition. International Computer Science Institute. [Online]. Available: http://www1.icsi
.berkeley.edu/Speech/papers/gabor/

IEEE Signal Processing Magazine

SP

|

May 2018

|

109


http://tools.ietf.org/html/rfc6716 http://tools.ietf.org/html/rfc6716 https://pdfs.semanticscholar.org/6881/d93c8c74e64cb823d9ce1c6587df91ff05c9.pdf https://pdfs.semanticscholar.org/6881/d93c8c74e64cb823d9ce1c6587df91ff05c9.pdf https://www.arxiv.org/abs/1609.03499 http://www.cvxr.com/cvx/ http://research.baidu.com/wp-content/uploads/2017/05/Deep-Voice-2-Complete-Arxiv.pdf http://research.baidu.com/wp-content/uploads/2017/05/Deep-Voice-2-Complete-Arxiv.pdf http://www1.icsi.berkeley.edu/Speech/papers/gabor/ http://www1.icsi.berkeley.edu/Speech/papers/gabor/

Table of Contents for the Digital Edition of IEEE Signal Processing - May 2018

Contents
IEEE Signal Processing - May 2018 - Cover1
IEEE Signal Processing - May 2018 - Cover2
IEEE Signal Processing - May 2018 - Contents
IEEE Signal Processing - May 2018 - 2
IEEE Signal Processing - May 2018 - 3
IEEE Signal Processing - May 2018 - 4
IEEE Signal Processing - May 2018 - 5
IEEE Signal Processing - May 2018 - 6
IEEE Signal Processing - May 2018 - 7
IEEE Signal Processing - May 2018 - 8
IEEE Signal Processing - May 2018 - 9
IEEE Signal Processing - May 2018 - 10
IEEE Signal Processing - May 2018 - 11
IEEE Signal Processing - May 2018 - 12
IEEE Signal Processing - May 2018 - 13
IEEE Signal Processing - May 2018 - 14
IEEE Signal Processing - May 2018 - 15
IEEE Signal Processing - May 2018 - 16
IEEE Signal Processing - May 2018 - 17
IEEE Signal Processing - May 2018 - 18
IEEE Signal Processing - May 2018 - 19
IEEE Signal Processing - May 2018 - 20
IEEE Signal Processing - May 2018 - 21
IEEE Signal Processing - May 2018 - 22
IEEE Signal Processing - May 2018 - 23
IEEE Signal Processing - May 2018 - 24
IEEE Signal Processing - May 2018 - 25
IEEE Signal Processing - May 2018 - 26
IEEE Signal Processing - May 2018 - 27
IEEE Signal Processing - May 2018 - 28
IEEE Signal Processing - May 2018 - 29
IEEE Signal Processing - May 2018 - 30
IEEE Signal Processing - May 2018 - 31
IEEE Signal Processing - May 2018 - 32
IEEE Signal Processing - May 2018 - 33
IEEE Signal Processing - May 2018 - 34
IEEE Signal Processing - May 2018 - 35
IEEE Signal Processing - May 2018 - 36
IEEE Signal Processing - May 2018 - 37
IEEE Signal Processing - May 2018 - 38
IEEE Signal Processing - May 2018 - 39
IEEE Signal Processing - May 2018 - 40
IEEE Signal Processing - May 2018 - 41
IEEE Signal Processing - May 2018 - 42
IEEE Signal Processing - May 2018 - 43
IEEE Signal Processing - May 2018 - 44
IEEE Signal Processing - May 2018 - 45
IEEE Signal Processing - May 2018 - 46
IEEE Signal Processing - May 2018 - 47
IEEE Signal Processing - May 2018 - 48
IEEE Signal Processing - May 2018 - 49
IEEE Signal Processing - May 2018 - 50
IEEE Signal Processing - May 2018 - 51
IEEE Signal Processing - May 2018 - 52
IEEE Signal Processing - May 2018 - 53
IEEE Signal Processing - May 2018 - 54
IEEE Signal Processing - May 2018 - 55
IEEE Signal Processing - May 2018 - 56
IEEE Signal Processing - May 2018 - 57
IEEE Signal Processing - May 2018 - 58
IEEE Signal Processing - May 2018 - 59
IEEE Signal Processing - May 2018 - 60
IEEE Signal Processing - May 2018 - 61
IEEE Signal Processing - May 2018 - 62
IEEE Signal Processing - May 2018 - 63
IEEE Signal Processing - May 2018 - 64
IEEE Signal Processing - May 2018 - 65
IEEE Signal Processing - May 2018 - 66
IEEE Signal Processing - May 2018 - 67
IEEE Signal Processing - May 2018 - 68
IEEE Signal Processing - May 2018 - 69
IEEE Signal Processing - May 2018 - 70
IEEE Signal Processing - May 2018 - 71
IEEE Signal Processing - May 2018 - 72
IEEE Signal Processing - May 2018 - 73
IEEE Signal Processing - May 2018 - 74
IEEE Signal Processing - May 2018 - 75
IEEE Signal Processing - May 2018 - 76
IEEE Signal Processing - May 2018 - 77
IEEE Signal Processing - May 2018 - 78
IEEE Signal Processing - May 2018 - 79
IEEE Signal Processing - May 2018 - 80
IEEE Signal Processing - May 2018 - 81
IEEE Signal Processing - May 2018 - 82
IEEE Signal Processing - May 2018 - 83
IEEE Signal Processing - May 2018 - 84
IEEE Signal Processing - May 2018 - 85
IEEE Signal Processing - May 2018 - 86
IEEE Signal Processing - May 2018 - 87
IEEE Signal Processing - May 2018 - 88
IEEE Signal Processing - May 2018 - 89
IEEE Signal Processing - May 2018 - 90
IEEE Signal Processing - May 2018 - 91
IEEE Signal Processing - May 2018 - 92
IEEE Signal Processing - May 2018 - 93
IEEE Signal Processing - May 2018 - 94
IEEE Signal Processing - May 2018 - 95
IEEE Signal Processing - May 2018 - 96
IEEE Signal Processing - May 2018 - 97
IEEE Signal Processing - May 2018 - 98
IEEE Signal Processing - May 2018 - 99
IEEE Signal Processing - May 2018 - 100
IEEE Signal Processing - May 2018 - 101
IEEE Signal Processing - May 2018 - 102
IEEE Signal Processing - May 2018 - 103
IEEE Signal Processing - May 2018 - 104
IEEE Signal Processing - May 2018 - 105
IEEE Signal Processing - May 2018 - 106
IEEE Signal Processing - May 2018 - 107
IEEE Signal Processing - May 2018 - 108
IEEE Signal Processing - May 2018 - 109
IEEE Signal Processing - May 2018 - 110
IEEE Signal Processing - May 2018 - 111
IEEE Signal Processing - May 2018 - 112
IEEE Signal Processing - May 2018 - 113
IEEE Signal Processing - May 2018 - 114
IEEE Signal Processing - May 2018 - 115
IEEE Signal Processing - May 2018 - 116
IEEE Signal Processing - May 2018 - 117
IEEE Signal Processing - May 2018 - 118
IEEE Signal Processing - May 2018 - 119
IEEE Signal Processing - May 2018 - 120
IEEE Signal Processing - May 2018 - 121
IEEE Signal Processing - May 2018 - 122
IEEE Signal Processing - May 2018 - 123
IEEE Signal Processing - May 2018 - 124
IEEE Signal Processing - May 2018 - 125
IEEE Signal Processing - May 2018 - 126
IEEE Signal Processing - May 2018 - 127
IEEE Signal Processing - May 2018 - 128
IEEE Signal Processing - May 2018 - 129
IEEE Signal Processing - May 2018 - 130
IEEE Signal Processing - May 2018 - 131
IEEE Signal Processing - May 2018 - 132
IEEE Signal Processing - May 2018 - Cover3
IEEE Signal Processing - May 2018 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201809
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201807
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201805
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201803
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_201801
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1117
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0917
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0717
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0517
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0317
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0117
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1116
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0916
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0716
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0516
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0316
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0116
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1115
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0915
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0715
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0515
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0315
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0115
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1114
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0914
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0714
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0514
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0314
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0114
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1113
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0913
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0713
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0513
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0313
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0113
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1112
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0912
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0712
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0512
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0312
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0112
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1111
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0911
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0711
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0511
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0311
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0111
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1110
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0910
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0710
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0510
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0310
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0110
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1109
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0909
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0709
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0509
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0309
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0109
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_1108
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0908
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0708
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0508
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0308
https://www.nxtbook.com/nxtbooks/ieee/signalprocessing_0108
https://www.nxtbookmedia.com