Abstract
Voice-based gender detection is an interesting problem. This article shows our attempts to observe the impact of various short-term spectral features with varying number of dimensions on gender recognition systems. We demonstrate our experiments on SITW and ELSDSR databases to determine the best combination of features for improved performance. An attempt has been made to investigate the effect of these systems under mismatched conditions, and it is seen that the existing scenario needs better algorithms to improve cross-corpus performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alhussein, M., Ali, Z., Imran, M., Abdul, W.: Automatic gender detection based on characteristics of vocal folds for mobile healthcare system. Mob. Inf. Syst. 1–12 (2016)
Ambikairajah, E.: Emerging features for speaker recognition. In: ICICS, pp. 1081–1084 (2007)
Anjos, A., El Shafey, L., Wallace, R., Günther, M., McCool, C., Marcel, S.: Bob: a free signal processing and machine learning toolbox for researchers. In: 20th ACM Conference on Multimedia Systems (ACMMM), Nara, Japan, Oct 2012
Bocklet, T., Maier, A., Bauer, J.G., Burkhardt, F., Noth, E.: Age and gender recognition for telephone applications based on GMM supervectors and support vector machines. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2008)
Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4), 357–366 (1980)
DeMarco, A., Cox, S.J.: An accurate and robust gender identification algorithm (2011)
Feng, L.: Speaker recognition. Master’s thesis, Institute of Informatics and Mathematical Modelling, Technical University of Denmark (2004)
Feng, L., Hansen, L.K.: A new database for speaker recognition (2005)
Harb, H., Chen, L.: Voice-based gender identification in multimedia applications. In: Springer (2005)
Levitan, S.I., Mishra, T., Bangalore, S.: Automatic identification of gender from speech (2016)
Li, M., Han, K.J., Narayanan, S.: Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput. Speech Lang. 27, 151–167 (2013)
Lingenfelser, F., Wagner, J., Vogt, T., Kim, J., Andre, E.: Age and gender classification from speech using decision level fusion and ensemble based techniques. In: INTERSPEECH (2010)
Mendoza, E., Valencia, N., Munoz, J., Trujillo, H.: Differences in voice quality between men and women: use of the long-term average spectrum (ltas). J. Voice 10(1), 59–66 (1996)
Metze, F., et al.: Comparison of four approaches to age and gender recognition for telephone applications. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2007)
Pronobis, M., Magimai-Doss, M.: Analysis of F0 and cepstral features for robust automatic gender recognition. Technical report, IDIAP Research Institute, Nov 2009
Ranjan, S., Liu, G., Hansen, J.H.L.: An i-vector PLDA based gender identification approach for severely distorted and multiligual DARPA RATS data. In: ASRU (2015)
Sahidullah, M., Kinnunen, T., Hanilçi, C.: A comparison of features for synthetic speech detection. In: Sixteenth Annual Conference of the International Speech Communication Association (2015)
Sarria-Paja, M., Falk, T.H., OShaughnessy, D.: Whispered speaker verification and gender detection using weighted instantaneous frequencies. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2013)
Vergin, R., Farhat, A., O’Shaughnessy, D.: Robust gender-dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male-female classification. Int. Conf. Spok. Lang. Process. (ICSLP) 2, 1081–1084 (1996)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kanani, I., Shah, H., Mankad, S.H. (2019). On the Performance of Cepstral Features for Voice-Based Gender Recognition. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems . Smart Innovation, Systems and Technologies, vol 107. Springer, Singapore. https://doi.org/10.1007/978-981-13-1747-7_31
Download citation
DOI: https://doi.org/10.1007/978-981-13-1747-7_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1746-0
Online ISBN: 978-981-13-1747-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)