Is it possible to predict speaker’s body size and oral cavity characteristics from speech signals: a preliminary study on Mandarin Chinese

Puyang Geng; Hong Guo; Qimeng Lu; Jinhua Zeng; Yan Li

doi:10.1117/12.2661788

28 December 2022 Is it possible to predict speaker’s body size and oral cavity characteristics from speech signals: a preliminary study on Mandarin Chinese

Puyang Geng, Hong Guo, Qimeng Lu, Jinhua Zeng, Yan Li

Author Affiliations +

Proceedings Volume 12506, Third International Conference on Computer Science and Communication Technology (ICCSCT 2022); 125062P (2022) https://doi.org/10.1117/12.2661788
Event: International Conference on Computer Science and Communication Technology (ICCSCT 2022), 2022, Beijing, China

Abstract

This paper proposes a study on whether the speaker’s body size (height, weight) and oral cavity (lip protrusion LP, lip opening LO, front cavity FC) characteristics can be predicted based on the acoustic features of speech. Firstly, Pearson’s correlation analysis was first conducted to examine the relationships between acoustic features and body size and oral cavity characteristics. Further, the effects of acoustic features in predicting body size and oral cavity characteristics were examined using random forest and decision tree models. The results showed that fundamental frequency statistics (i.e., mean, max, min) exhibited significant negative correlations with height, weight, and FC. Besides, good accuracies of classification in height, LP range, LO range, and FC range could be achieved based on the acoustic features. The findings in the current paper imply that acoustic features could be the potential features for identification of the speaker’s body size and oral cavity characteristics. This paper will not only contribute to the research and practices in forensic speaker profiling and but also provides foundations for the technology of automatic speaker recognition.

Citation Download Citation

Puyang Geng, Hong Guo, Qimeng Lu, Jinhua Zeng, and Yan Li "Is it possible to predict speaker’s body size and oral cavity characteristics from speech signals: a preliminary study on Mandarin Chinese", Proc. SPIE 12506, Third International Conference on Computer Science and Communication Technology (ICCSCT 2022), 125062P (28 December 2022); https://doi.org/10.1117/12.2661788

ACCESS THE FULL ARTICLE

PROCEEDINGS
7 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Acoustics

Laser induced plasma spectroscopy

Sensors

Forensic science

Profiling

Head

Speaker recognition

Show All Keywords

RELATED CONTENT

Laser ablation molecular isotopic spectroscopy (LAMIS) towards the determination of...
Proceedings of SPIE (May 01 2017)

Fiber Optic Pressure Sensors Employing Reflective Diaphragm Techniques
Proceedings of SPIE (February 01 1989)

Magnetic scanner for forensic examination of audiotapes
Proceedings of SPIE (February 04 1999)

A novel optical accelerometer with wide operation range
Proceedings of SPIE (November 05 2005)

Modern microphone array for hearing aid and speech processing
Proceedings of SPIE (October 22 1996)

Miniature six-DOF inertial system for tracking HMDs
Proceedings of SPIE (August 11 1998)

Operational amplifier based micro eddy current sensor and its application...
Proceedings of SPIE (December 28 2010)

Subscribe to Digital Library

Receive Erratum Email Alert

Show All Keywords

Keywords/Phrases

Search In:

Publication Years