On the Performance of Cepstral Features for Voice-Based Gender Recognition

  • Conference paper
Information and Communication Technology for Intelligent Systems

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 107))

Abstract

Voice-based gender detection is an interesting problem. This article reports our attempts to observe the impact of various short-term spectral features, with varying numbers of dimensions, on gender recognition systems. We run our experiments on the SITW and ELSDSR databases to determine the best combination of features for improved performance. We also investigate how these systems behave under mismatched conditions and find that better algorithms are needed to improve cross-corpus performance.
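
As a rough illustration of what "short-term cepstral features with a varying number of dimensions" can mean in practice, the sketch below computes the real cepstrum of a single windowed frame and keeps only its first few coefficients. It is a minimal, standard-library-only example and not the feature pipeline used in the paper: the function names, the 64-sample frame, the 8 kHz sample rate, the synthetic harmonic signals, and the dimension choices (8, 13, 20) are all assumptions made for illustration.

```python
import cmath
import math


def hamming(n):
    """Hamming window of length n."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def dft(x):
    """Naive O(N^2) discrete Fourier transform (adequate for short frames)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def real_cepstrum(frame, n_dims):
    """First n_dims real-cepstrum coefficients of one frame."""
    n = len(frame)
    windowed = [s * w for s, w in zip(frame, hamming(n))]
    # Log-magnitude spectrum; the small constant avoids log(0).
    log_mag = [math.log(abs(c) + 1e-10) for c in dft(windowed)]
    # Inverse DFT of the log-magnitude spectrum. For a real input frame
    # the log-magnitude is symmetric, so only the real part is kept.
    return [
        sum(log_mag[k] * cmath.exp(2j * math.pi * k * q / n) for k in range(n)).real / n
        for q in range(n_dims)
    ]


def make_frame(f0, sample_rate=8000, length=64):
    """Hypothetical test signal: three harmonics of the fundamental f0 (Hz)."""
    return [
        sum(math.sin(2 * math.pi * h * f0 * t / sample_rate) / h for h in range(1, 4))
        for t in range(length)
    ]


# Two synthetic frames with different (arbitrarily chosen) fundamentals.
low = make_frame(120.0)
high = make_frame(210.0)

# Feature vectors of several dimensionalities for each frame.
features = {d: (real_cepstrum(low, d), real_cepstrum(high, d)) for d in (8, 13, 20)}
```

Each entry of `features` holds one feature vector per frame at a given dimensionality; in an actual system such vectors would be extracted from every frame of real speech and passed to a classifier, which this sketch does not attempt.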

Author information

Corresponding author

Correspondence to Sapan H. Mankad.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Kanani, I., Shah, H., Mankad, S.H. (2019). On the Performance of Cepstral Features for Voice-Based Gender Recognition. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems. Smart Innovation, Systems and Technologies, vol 107. Springer, Singapore. https://doi.org/10.1007/978-981-13-1747-7_31
