An Age Adapting Electrolarynx – A Feasibility Study

Coetzee, Pieter; Lamfel, Joice; Rubin, David M.; Aharonson, Vered

doi:10.1007/978-3-319-94947-5_86

Pieter Coetzee¹⁶,
Joice Lamfel¹⁶,
David M. Rubin¹⁶ &
…
Vered Aharonson^16,17

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 794))

Included in the following conference series:

International Conference on Applied Human Factors and Ergonomics

3319 Accesses

Abstract

We propose a mathematical model for voice aging that could be used in the design of an age-adapting Electrolarynx. Voice data from public figures, at the ages of 30, 40, 50 and 60 years old, were acquired from a YouTube corpus. The voice processing consisted of an extraction of 70 Mel-Frequency Cepstral Coefficients (MFCCs) and a computation of their statistical features. ANOVA F-tests were used to determine which of these features change with age. Significant differences between age groups were found only for the first 40 MFCCs. The aging model was then constructed using non-linear regression and an averaged quadratic polynomial fit on these coefficients. Model age-adapted voices were reconstructed from the young dataset speakers’ voices and compared to their voices at older ages. The model was validated by the correlation between speakers’ MFCCs at older ages and the model-aged MFCCs. The average correlation results were in the range of 0.62 to 0.93. The results imply that the first 40 MFCCs are more susceptible to age related changes and that the proposed model has the potential to enhance the Electrolarynx by providing age adaptation as the speaker grows older.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Goode, R.L.: Artificial laryngeal devices in post-laryngectomy rehabilitation. Laryngoscope 85(4), 677–689 (1975)
Article Google Scholar
Kaye, R., Tang, C.G., Sinclair, C.F.: The electrolarynx: voice restoration after total laryngectomy. Med. Dev. (Auck) 10, 133–140 (2017)
Google Scholar
Mittal, T., Barthwal, A., Koolagudi, S.G.: Age approximation from speech using Gaussian mixture models. In: Proceedings of the 2nd International Conference on Advanced Computing, Networking and Security, pp. 74–78, December 2013
Google Scholar
Erokyar, H.: Age and gender recognition for speech applications based on support vector machines. Graduate Theses and Dissertations (2014). http://scholarcommons.usf.edu/etd/5356. Accessed 05 Dec 2017
Bocklet, Z.V., Stemmer, T., Aeissler, V., Noeth, E.: Age and gender recognition based on multiple systems-early vs. late fusion. In: Interspeech 2010 (2010)
Google Scholar
Eadie, T.: Characteristics of the aging female voice. J. Speech Lang. Pathol. Audiol. 24(4), 162–179 (2000)
Google Scholar
Lee, M.-W., Kwak, K.-C.: Performance comparison of gender and age group recognition for human-robot interaction. Int. J. Adv. Comput. Sci. Appl. 3, 12 (2012)
Google Scholar
Kim, H.J., Bae, K., Yoon, H.S.: Age and gender classification for a home-robot service. In: 16th IEEE International Conference on Robot & Human Interactive Communication, pp. 122–126, August 2007
Google Scholar
Mendoza, L.A.F., Cataldo, E., Vellasco, M., Silva, M.A., Cañón, A.D.O., de Seixas, J.M.: Classification of voice aging using ANN and glottal signal parameters. In: ANDESCON 2010, pp. 1–5. IEEE (2010)
Google Scholar
Fairhurst, M., Erbilek, M., Da Costa-Abreu, M.: Selective review and analysis of aging effects in biometric system implementation. IEEE Trans. Hum. Mach. Syst. 45, 294–303 (2015)
Article Google Scholar
Boucheron, L.E., De Leon, P.L., Sandoval, S.: Low bit-rate speech coding through quantization of mel-frequency cepstral coefficients. IEEE Trans. Audio Speech Lang. Process. 20(2), 610–619 (2012)
Google Scholar
Gang, M., Zhang, X., Yang, J., Zou, X.: Speech reconstruction from mel-frequency cepstral coefficients via 1-norm minimization. In: IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), pp. 1–5, October 2015
Google Scholar
Gelman, A., Imbens, G.: Why high-order polynomials should not be used in regression discontinuity designs. J. Bus. Econ. Stat. 2017. https://doi.org/10.1080/07350015.2017.1366909. Accessed 27 Oct 2017
Ganchev, T., Fakotakis, N., Kokkinakis, G.: Comparative evaluation of various MFCC implementations on the speaker verification task. In: Proceedings of SPECOM, pp. 191–194, October 2005
Google Scholar
Griffin, D., Lim, J.: Signal estimation from modified short-time Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 32(2), 236–243 (1984)
Article Google Scholar
Gang, M.: Reconstruct speech from MFCCs (v.3), October 2016. https://www.Mathworks.com/matlabcentral/fileexchange/53186-invmfccs?requestedDomain=www.mathworks.com. Accessed 28 Oct 2017
Gordon-Salant, G.S., Frisina, R.D., Fay, R.R., Popper, A.: The Aging Auditory System, p. 115. Springer (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Biomedical Engineering Research Group, School of Electrical and Information Engineering, University of the Witwatersrand, Johannesburg, South Africa
Pieter Coetzee, Joice Lamfel, David M. Rubin & Vered Aharonson
Department of Electrical Engineering, Afeka Tel Aviv Academic College of Engineering, Tel Aviv, Israel
Vered Aharonson

Authors

Pieter Coetzee
View author publications
You can also search for this author in PubMed Google Scholar
Joice Lamfel
View author publications
You can also search for this author in PubMed Google Scholar
David M. Rubin
View author publications
You can also search for this author in PubMed Google Scholar
Vered Aharonson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pieter Coetzee .

Editor information

Editors and Affiliations

University of Central Florida, Orlando, FL, USA
Tareq Z. Ahram
Catholic University of Pernambuco, Boa Viagem, Pernambuco, Brazil
Christianne Falcão

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Coetzee, P., Lamfel, J., Rubin, D.M., Aharonson, V. (2019). An Age Adapting Electrolarynx – A Feasibility Study. In: Ahram, T., Falcão, C. (eds) Advances in Usability, User Experience and Assistive Technology. AHFE 2018. Advances in Intelligent Systems and Computing, vol 794. Springer, Cham. https://doi.org/10.1007/978-3-319-94947-5_86

Download citation

DOI: https://doi.org/10.1007/978-3-319-94947-5_86
Published: 28 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94946-8
Online ISBN: 978-3-319-94947-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics