Automatic recognition of depression based on audio and video: A review.

Journal: World journal of psychiatry

Volume: 14

Issue: 2

Year of Publication: 

Affiliated Institutions:  Shandong Mental Health Center, Shandong University, Jinan , Shandong Province, China. Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan , Shandong Province, China. Department of Ward Two, Shandong Mental Health Center, Shandong University, Jinan , Shandong Province, China. Shandong Mental Health Center, Shandong University, Jinan , Shandong Province, China. wangqx@qlu.edu.cn.

Abstract summary 

Depression is a common mental health disorder. With current depression detection methods, specialized physicians often engage in conversations and physiological examinations based on standardized scales as auxiliary measures for depression assessment. Non-biological markers-typically classified as verbal or non-verbal and deemed crucial evaluation criteria for depression-have not been effectively utilized. Specialized physicians usually require extensive training and experience to capture changes in these features. Advancements in deep learning technology have provided technical support for capturing non-biological markers. Several researchers have proposed automatic depression estimation (ADE) systems based on sounds and videos to assist physicians in capturing these features and conducting depression screening. This article summarizes commonly used public datasets and recent research on audio- and video-based ADE based on three perspectives: Datasets, deficiencies in existing research, and future development directions.

Authors & Co-authors:  Han Li Yi Zheng Xia Liu Wang

Study Outcome 

Source Link: Visit source

Statistics
Citations :  Benazzi F. Various forms of depression. Dialogues Clin Neurosci. 2006;8:151–161.
Authors :  7
Identifiers
Doi : 10.5498/wjp.v14.i2.225
SSN : 2220-3206
Study Population
Male,Female
Mesh Terms
Other Terms
Audio processing;Automatic depression estimation System;Deep learning;Depression recognition;Feature fusion;Future development;Image processing
Study Design
Study Approach
Country of Study
Publication Country
United States