Abstract
We propose a mathematical model for voice aging that could be used in the design of an age-adapting Electrolarynx. Voice data from public figures at the ages of 30, 40, 50 and 60 years were acquired from a YouTube corpus. Voice processing consisted of extracting 70 Mel-Frequency Cepstral Coefficients (MFCCs) and computing their statistical features. ANOVA F-tests were used to determine which of these features change with age. Significant differences between age groups were found only for the first 40 MFCCs. The aging model was then constructed using non-linear regression and an averaged quadratic polynomial fit on these coefficients. Model age-adapted voices were reconstructed from the speakers’ voices in the young dataset and compared to the same speakers’ voices at older ages. The model was validated by the correlation between speakers’ MFCCs at older ages and the model-aged MFCCs. The average correlations ranged from 0.62 to 0.93. The results imply that the first 40 MFCCs are more susceptible to age-related changes than the remaining coefficients, and that the proposed model has the potential to enhance the Electrolarynx by providing age adaptation as the speaker grows older.
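The quadratic aging fit and the correlation-based validation described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are synthetic stand-ins for real MFCC statistics, the function names (`fit_aging_model`, `trend`, `age_mfcc`, `pearson`) are hypothetical, and applying the model by adding the fitted change between age 30 and the target age is an assumed reading of how the model-aged MFCCs are produced.

```python
import numpy as np

AGES = np.array([30.0, 40.0, 50.0, 60.0])  # ages sampled in the study


def fit_aging_model(mfcc_by_age):
    """Fit a quadratic in age per speaker and coefficient, then average.

    mfcc_by_age: array of shape (n_speakers, n_ages, n_coeffs).
    Returns an array of shape (3, n_coeffs) holding [a, b, c]
    for the averaged polynomial a*t**2 + b*t + c.
    """
    # np.polyfit accepts a 2-D y and fits every column (coefficient) at once.
    per_speaker = [np.polyfit(AGES, speaker, deg=2) for speaker in mfcc_by_age]
    return np.mean(per_speaker, axis=0)


def trend(coeffs, age):
    """Evaluate the averaged quadratic for every coefficient at one age."""
    a, b, c = coeffs
    return a * age**2 + b * age + c


def age_mfcc(young_mfcc, coeffs, target_age, base_age=30.0):
    """Assumed aging rule: shift the young MFCCs by the fitted change."""
    return young_mfcc + trend(coeffs, target_age) - trend(coeffs, base_age)


def pearson(x, y):
    """Pearson correlation between two MFCC vectors."""
    return float(np.corrcoef(x, y)[0, 1])


# Synthetic stand-in data (NOT from the paper): 6 speakers, 40 coefficients.
rng = np.random.default_rng(0)
n_speakers, n_coeffs = 6, 40
true_a = rng.normal(0.0, 1e-3, n_coeffs)
true_b = rng.normal(0.0, 1e-2, n_coeffs)
baseline = rng.normal(0.0, 1.0, (n_speakers, 1, n_coeffs))
t = AGES[None, :, None]
noise = rng.normal(0.0, 0.01, (n_speakers, AGES.size, n_coeffs))
data = baseline + true_a * t**2 + true_b * t + noise

# Hold out speaker 0, fit on the rest, then age speaker 0 from 30 to 60.
model = fit_aging_model(data[1:])
aged_60 = age_mfcc(data[0, 0], model, target_age=60.0)
r = pearson(aged_60, data[0, 3])
print(f"correlation between model-aged and actual MFCCs at 60: {r:.3f}")
```

Holding out the test speaker keeps the validation from being trivially optimistic. The synthetic data follow an exact quadratic trend by construction, so the correlation printed here says nothing about the values reported for real voices.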
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Coetzee, P., Lamfel, J., Rubin, D.M., Aharonson, V. (2019). An Age Adapting Electrolarynx – A Feasibility Study. In: Ahram, T., Falcão, C. (eds) Advances in Usability, User Experience and Assistive Technology. AHFE 2018. Advances in Intelligent Systems and Computing, vol 794. Springer, Cham. https://doi.org/10.1007/978-3-319-94947-5_86
DOI: https://doi.org/10.1007/978-3-319-94947-5_86
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94946-8
Online ISBN: 978-3-319-94947-5
eBook Packages: Engineering, Engineering (R0)