On the Performance of Cepstral Features for Voice-Based Gender Recognition

Kanani, Isha; Shah, Heenal; Mankad, Sapan H.

doi:10.1007/978-981-13-1747-7_31

Isha Kanani⁵,
Heenal Shah⁵ &
Sapan H. Mankad⁵

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 107))

1457 Accesses
1 Citations

Abstract

Voice-based gender detection is an interesting problem. This article shows our attempts to observe the impact of various short-term spectral features with varying number of dimensions on gender recognition systems. We demonstrate our experiments on SITW and ELSDSR databases to determine the best combination of features for improved performance. An attempt has been made to investigate the effect of these systems under mismatched conditions, and it is seen that the existing scenario needs better algorithms to improve cross-corpus performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Voice Gender Recognition Using Acoustic Features, MFCCs and SVM

Improving Speaker Gender Detection by Combining Pitch and SDC

Cepstral Coefficient-Based Gender Classification Using Audio Signals

References

Alhussein, M., Ali, Z., Imran, M., Abdul, W.: Automatic gender detection based on characteristics of vocal folds for mobile healthcare system. Mob. Inf. Syst. 1–12 (2016)
Article Google Scholar
Ambikairajah, E.: Emerging features for speaker recognition. In: ICICS, pp. 1081–1084 (2007)
Google Scholar
Anjos, A., El Shafey, L., Wallace, R., Günther, M., McCool, C., Marcel, S.: Bob: a free signal processing and machine learning toolbox for researchers. In: 20th ACM Conference on Multimedia Systems (ACMMM), Nara, Japan, Oct 2012
Google Scholar
Bocklet, T., Maier, A., Bauer, J.G., Burkhardt, F., Noth, E.: Age and gender recognition for telephone applications based on GMM supervectors and support vector machines. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2008)
Google Scholar
Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4), 357–366 (1980)
Article Google Scholar
DeMarco, A., Cox, S.J.: An accurate and robust gender identification algorithm (2011)
Google Scholar
Feng, L.: Speaker recognition. Master’s thesis, Institute of Informatics and Mathematical Modelling, Technical University of Denmark (2004)
Google Scholar
Feng, L., Hansen, L.K.: A new database for speaker recognition (2005)
Google Scholar
Harb, H., Chen, L.: Voice-based gender identification in multimedia applications. In: Springer (2005)
Google Scholar
Levitan, S.I., Mishra, T., Bangalore, S.: Automatic identification of gender from speech (2016)
Google Scholar
Li, M., Han, K.J., Narayanan, S.: Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput. Speech Lang. 27, 151–167 (2013)
Article Google Scholar
Lingenfelser, F., Wagner, J., Vogt, T., Kim, J., Andre, E.: Age and gender classification from speech using decision level fusion and ensemble based techniques. In: INTERSPEECH (2010)
Google Scholar
Mendoza, E., Valencia, N., Munoz, J., Trujillo, H.: Differences in voice quality between men and women: use of the long-term average spectrum (ltas). J. Voice 10(1), 59–66 (1996)
Article Google Scholar
Metze, F., et al.: Comparison of four approaches to age and gender recognition for telephone applications. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2007)
Google Scholar
Pronobis, M., Magimai-Doss, M.: Analysis of F0 and cepstral features for robust automatic gender recognition. Technical report, IDIAP Research Institute, Nov 2009
Google Scholar
Ranjan, S., Liu, G., Hansen, J.H.L.: An i-vector PLDA based gender identification approach for severely distorted and multiligual DARPA RATS data. In: ASRU (2015)
Google Scholar
Sahidullah, M., Kinnunen, T., Hanilçi, C.: A comparison of features for synthetic speech detection. In: Sixteenth Annual Conference of the International Speech Communication Association (2015)
Google Scholar
Sarria-Paja, M., Falk, T.H., OShaughnessy, D.: Whispered speaker verification and gender detection using weighted instantaneous frequencies. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2013)
Google Scholar
Vergin, R., Farhat, A., O’Shaughnessy, D.: Robust gender-dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male-female classification. Int. Conf. Spok. Lang. Process. (ICSLP) 2, 1081–1084 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Technology, Nirma University, Ahmedabad, 382481, India
Isha Kanani, Heenal Shah & Sapan H. Mankad

Authors

Isha Kanani
View author publications
You can also search for this author in PubMed Google Scholar
Heenal Shah
View author publications
You can also search for this author in PubMed Google Scholar
Sapan H. Mankad
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sapan H. Mankad .

Editor information

Editors and Affiliations

School of Computer Engineering, KIIT Deemed to be University, Bhubaneswar, India
Suresh Chandra Satapathy
Sabar Institute of Technology, Gujarat Technological University, Ahmedabad, Gujarat, India
Amit Joshi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kanani, I., Shah, H., Mankad, S.H. (2019). On the Performance of Cepstral Features for Voice-Based Gender Recognition. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems . Smart Innovation, Systems and Technologies, vol 107. Springer, Singapore. https://doi.org/10.1007/978-981-13-1747-7_31

Download citation

DOI: https://doi.org/10.1007/978-981-13-1747-7_31
Published: 15 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1746-0
Online ISBN: 978-981-13-1747-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

On the Performance of Cepstral Features for Voice-Based Gender Recognition

Abstract

Access this chapter

Similar content being viewed by others

Voice Gender Recognition Using Acoustic Features, MFCCs and SVM

Improving Speaker Gender Detection by Combining Pitch and SDC

Cepstral Coefficient-Based Gender Classification Using Audio Signals

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

On the Performance of Cepstral Features for Voice-Based Gender Recognition

Abstract

Access this chapter

Similar content being viewed by others

Voice Gender Recognition Using Acoustic Features, MFCCs and SVM

Improving Speaker Gender Detection by Combining Pitch and SDC

Cepstral Coefficient-Based Gender Classification Using Audio Signals

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation