Controlling Speech Confounding in Psychophysiology Research: Speech detection via respiratory inductance plethysmography, thoracic impedance, accelerometers and gyroscopes

Melisa Saygin; Myrte Schoenmakers; Martin Gevonden; Eco de Geus

doi:10.22541/au.172114399.90044532/v1

loading page

Controlling Speech Confounding in Psychophysiology Research: Speech detection via respiratory inductance plethysmography, thoracic impedance, accelerometers and gyroscopes

Melisa Saygin,
Myrte Schoenmakers,
Martin Gevonden,
Eco de Geus

Abstract

Speech production interferes with accurate measurement of cardiac vagal activity during acute stress, attenuating the expected drop in heart rate variability in the respiratory frequency band. Speech also induces sympathetic changes similar to those induced by psychological stress. In the laboratory, confounding of physiological stress reactivity by speech may be controlled experimentally. In ambulatory assessments, however, detection of speech episodes is necessary to separate the physiological effects of psychosocial stress from those of speech. Using supervised machine learning (https://osf.io/bk9nf), we trained and tested speech classification models on data obtained from 56 participants (ages 18-39). They were equipped with privacy-secure wearables measuring thoracoabdominal respiratory inductance plethysmography (RIP from a single and a dual-band set-up), thoracic impedance pneumography, and an upper-sternum positioned unit with triaxial accelerometers and gyroscopes. Following an 80/20 train-test set split, nested cross-validations were run with the machine learning algorithms XGBoost, Gradient Boosting, Random Forest, and Logistic Regression on the training set to get unbiased generalized performance estimates. Speech classification by the best model per method was then validated in the unseen test-set. Speech versus no-speech classification performance (AUC) for both nested cross-validation and test-set predictions was excellent for thorax-abdomen RIP (nested cross-validation: 96.6%, test-set prediction: 98.5%), thorax-only RIP (97.5%, 99.1%), impedance (97.0%, 97.8%) and accelerometry (99.3%, 99.6%). The sternal accelerometer outperformed the other methods. These novel open-access models that leverage privacy-secure biosignals will enable researchers to detect speech and control for its confounding effects in ambulatory recordings, thereby enhancing the trustworthiness of psychophysiological findings.

Submitted to Psychophysiology

Show details

Hide details

Submission Checks Completed

Assigned to Editor

Reviewer(s) Assigned

26 Jun 2024Review(s) Completed, Editorial Evaluation Pending

15 Jul 2024Reviewer(s) Assigned

13 Sep 2024Editorial Decision: Revise Minor

24 Nov 20241st Revision Received

26 Nov 2024Submission Checks Completed

26 Nov 2024Assigned to Editor

26 Nov 2024Review(s) Completed, Editorial Evaluation Pending

29 Nov 2024Reviewer(s) Assigned

21 Jan 2025Editorial Decision: Revise Minor

23 Jan 20252nd Revision Received

24 Jan 2025Assigned to Editor

24 Jan 2025Submission Checks Completed

24 Jan 2025Review(s) Completed, Editorial Evaluation Pending

30 Jan 2025Editorial Decision: Accept

Abstract

Peer review status:ACCEPTED