Depression is a widespread mental health disorder that affects individuals globally and can be detected using both traditional clinical methods and advanced technological approaches. This study presents a methodology for depression detection using a multimodal dataset comprising textual, audio, and visual modalities. Features from the audio and visual modalities are clustered and then classified; the same clustering-and-classification pipeline is applied to six audio features and two visual features. The proposed system incorporates an ensemble model composed of logistic regression, a support vector classifier, random forest, and gradient boosting (LSRG). The final depression level is determined by averaging the predictions of all modalities through late fusion, allowing the system to predict depression levels from heterogeneous data sources. On the Extended Distress Analysis Interview Corpus (E-DAIC), the Mamdani fuzzy inference system achieves 93.10% accuracy on the textual modality, the LSRG ensemble achieves 98.21% accuracy on the labeled E-DAIC data, and late fusion achieves 99.54% accuracy against E-DAIC's PHQ-8 labels. The study thus integrates multiple models through better-performing techniques and offers practical guidance for patients on using a multimodal depression detection system.
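As a rough illustration of the pipeline described above, the sketch below builds one LSRG ensemble per modality and fuses their outputs by averaging class probabilities. The feature shapes, modality names, soft-voting choice, and the use of scikit-learn's VotingClassifier are assumptions for illustration, not the exact implementation reported in the study.

```python
# A minimal sketch of the LSRG ensemble with late fusion, assuming
# scikit-learn and pre-extracted per-modality feature matrices.
# Feature shapes, modality names, and soft voting are illustrative
# assumptions, not the paper's exact configuration.
import numpy as np
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, VotingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC


def make_lsrg():
    """LSRG: Logistic regression, Svc, Random forest, Gradient boosting."""
    return VotingClassifier(
        estimators=[
            ("lr", LogisticRegression(max_iter=1000)),
            ("svc", SVC(probability=True)),  # probability=True enables soft voting
            ("rf", RandomForestClassifier()),
            ("gb", GradientBoostingClassifier()),
        ],
        voting="soft",
    )


# Hypothetical per-modality features and binary depression labels.
rng = np.random.default_rng(0)
X_audio = rng.normal(size=(200, 6))   # six audio features
X_visual = rng.normal(size=(200, 2))  # two visual features
y = rng.integers(0, 2, size=200)

# One LSRG ensemble per modality.
models = {}
for name, X in [("audio", X_audio), ("visual", X_visual)]:
    models[name] = make_lsrg().fit(X, y)

# Late fusion: average the per-modality probabilities, then threshold.
fused_prob = np.mean(
    [models["audio"].predict_proba(X_audio)[:, 1],
     models["visual"].predict_proba(X_visual)[:, 1]],
    axis=0,
)
fused_prediction = (fused_prob >= 0.5).astype(int)
```

In a full system, the averaged score from the textual Mamdani fuzzy model would enter the same fusion step alongside the audio and visual probabilities before the final depression level is assigned.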