Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model.

Journal: Journal of affective disorders

Volume: 355

Issue: 

Year of Publication: 

Affiliated Institutions:  Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK. Electronic address: yuezhou.zhang@kcl.ac.uk. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; University College London, London, UK; South London and Maudsley NHS Foundation Trust, London, UK; Health Data Research UK London, University College London, London, UK. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; University of Augsburg, Augsburg, Germany. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; Department of Computer Science, University of Sheffield, Sheffield, UK. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; School of Psychology, University of Sussex, Falmer, East Sussex, UK. Department of Psychiatry, Amsterdam Public Health Research Institute and Amsterdam Neuroscience, Amsterdam University Medical Centre, Vrije Universiteit and GGZ InGeest, Amsterdam, the Netherlands. Parc Sanitari Sant Joan de Déu, Fundació Sant Joan de Déu, CIBERSAM, Universitat de Barcelona, Barcelona, Spain. University of Augsburg, Augsburg, Germany; GLAM - Group on Language, Audio, & Music, Imperial College London, London, UK. Janssen Research and Development LLC, Titusville, NJ, USA. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; South London and Maudsley NHS Foundation Trust, London, UK. Davos Alzheimer's Collaborative, Geneva, Switzerland. Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK. Electronic address: nick.cummins@kcl.ac.uk.

Abstract summary 

Prior research has associated spoken language use with depression, yet studies often involve small or non-clinical samples and face challenges in the manual transcription of speech. This paper aimed to automatically identify depression-related topics in speech recordings collected from clinical samples.The data included 3919 English free-response speech recordings collected via smartphones from 265 participants with a depression history. We transcribed speech recordings via automatic speech recognition (Whisper tool, OpenAI) and identified principal topics from transcriptions using a deep learning topic model (BERTopic). To identify depression risk topics and understand the context, we compared participants' depression severity and behavioral (extracted from wearable devices) and linguistic (extracted from transcribed texts) characteristics across identified topics.From the 29 topics identified, we identified 6 risk topics for depression: 'No Expectations', 'Sleep', 'Mental Therapy', 'Haircut', 'Studying', and 'Coursework'. Participants mentioning depression risk topics exhibited higher sleep variability, later sleep onset, and fewer daily steps and used fewer words, more negative language, and fewer leisure-related words in their speech recordings.Our findings were derived from a depressed cohort with a specific speech task, potentially limiting the generalizability to non-clinical populations or other speech tasks. Additionally, some topics had small sample sizes, necessitating further validation in larger datasets.This study demonstrates that specific speech topics can indicate depression severity. The employed data-driven workflow provides a practical approach for analyzing large-scale speech data collected from real-world settings.

Authors & Co-authors:  Zhang Folarin Dineley Conde de Angel Sun Ranjan Rashid Stewart Laiou Sankesara Qian Matcham White Oetzmann Lamers Siddi Simblett Schuller Vairavan Wykes Haro Penninx Narayan Hotopf Dobson Cummins

Study Outcome 

Source Link: Visit source

Statistics
Citations : 
Authors :  28
Identifiers
Doi : 10.1016/j.jad.2024.03.106
SSN : 1573-2517
Study Population
Male,Female
Mesh Terms
Other Terms
Automatic speech recognition;Depression;Smartphone;Speech;Topic modeling
Study Design
Study Approach
Country of Study
Publication Country
Netherlands