End to End Deep Neural Network Frequency Demodulation of Speech Signals

Elbaz, Dan; Zibulevsky, Michael

doi:10.1007/978-3-030-03402-3_1

Dan Elbaz¹⁷ &
Michael Zibulevsky¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 886))

Included in the following conference series:

Future of Information and Communication Conference

1109 Accesses
5 Citations

Abstract

Frequency modulation (FM) is a form of radio broadcasting which is widely used nowadays and has been for almost a century. We suggest a software-defined-radio (SDR) receiver for FM demodulation that adopts an end-to-end learning based approach and utilizes the prior information of transmitted speech message in the demodulation process. The receiver detects and enhances speech from the in-phase and quadrature components of its base band version. The new system yields high performance detection for both acoustical disturbances, and communication channel noise and is foreseen to out-perform the established methods for low signal to noise ratio (SNR) conditions.

This research was supported by the Intel Collaborative Research Institute for Computational Intelligence (ICRI-CI).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amini, M., Balarastaghi, E.: Universal neural network demodulator for software defined radio. Int. J. Mach. Learn. Comput. 1(3), 305–310 (2011)
Article Google Scholar
Fan, M., Wu, L.: 2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE) (2017)
Google Scholar
Garofolo, J.S., Lamel, L.F., Fischer, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L.: DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM. NASA STI/Recon Technical report N, 0, pp. 1–94, January 1993
Google Scholar
Goehring, T., Bolner, F., Monaghan, J.J.M., van Dijk, B., Zarowski, A., Bleeck, S.: Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users. Hear. Res. 344, 183–194 (2016)
Article Google Scholar
Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks. In: ICASSP, no. 3, pp. 6645–6649 (2013)
Google Scholar
Graves, A.: Generating sequences with recurrent neural networks. preprint. arXiv:1308.0850 (2013)
Hatai, I., Chakrabarti, I.: A new high-performance digital FM modulator and demodulator for software-defined radio and its FPGA implementation. Int. J. Reconfigurable Comput. 2011 (2011)
Article Google Scholar
Hochreiter, S., Schmidhuber, J.U.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Kolbaek, M., Tan, Z.-H., Jensen, J.: Speech enhancement using long short-term memory based recurrent neural networks for noise robust speaker verification. In: IEEE Workshop on Spoken Language Technology (SLT), no. 1, pp. 305–311 (2016)
Google Scholar
Kumar, A., Florêncio, D.: Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks. CoRR, abs/1605.0 (2016)
Google Scholar
Li, X., Wu, X.: Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4520–4524 (2014)
Google Scholar
Önder, M., Akan, A., Doǧan, H.: Advanced neural network receiver design to combat multiple channel impairments. Turkish J. Electr. Eng. Comput. Sci. 24(4), 3066–3077 (2016)
Article Google Scholar
Pascanu, R., Mikolov, T., Bengio, Y.: On the difficulty of training recurrent neural networks. JMLR.org (2013)
Google Scholar
Rohani, K., Manry, M.T.: The design of multi-layer perceptrons using building blocks (1991)
Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)
Article Google Scholar
Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Sig. Process. 45(11), 2673–2681 (1997)
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
MathSciNet MATH Google Scholar
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 3104–3112 (2014)
Google Scholar
Tieleman, T., Hinton, G.: Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. In: COURSERA: Neural Networks for Machine Learning (2012)
Google Scholar
Turner, R.E., Sahani, M.: Demodulation as probabilistic inference. IEEE Trans. Audio Speech Lang. Process. 19(8), 2398–2411 (2011)
Article Google Scholar
Wornell, G.W.: Efficient symbol-spreading strategies for wireless communication. Research Laboratory of Electronics, Massachusetts Institute of Technology (1994)
Google Scholar
Xu, Y., Du, J., Dai, L.-R., Lee, C.-H.: A regression approach to speech enhancement based on deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 23(1), 7–19 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Technion Israel Institute of Technology, 32000, Haifa, Israel
Dan Elbaz & Michael Zibulevsky

Authors

Dan Elbaz
View author publications
You can also search for this author in PubMed Google Scholar
Michael Zibulevsky
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dan Elbaz .

Editor information

Editors and Affiliations

Faculty of Science and Engineering, Saga University, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, London, UK
Supriya Kapoor
The Science and Information (SAI) Organization, Bradford, UK
Rahul Bhatia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Elbaz, D., Zibulevsky, M. (2019). End to End Deep Neural Network Frequency Demodulation of Speech Signals. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Advances in Information and Communication Networks. FICC 2018. Advances in Intelligent Systems and Computing, vol 886. Springer, Cham. https://doi.org/10.1007/978-3-030-03402-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-03402-3_1
Published: 06 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03401-6
Online ISBN: 978-3-030-03402-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics