Open Access Paper
11 September 2024 Predicting mild depression using multidomain brain functional features and machine learning
Lei Yang, Yifu Li, Shouliang Qi
Author Affiliations +
Proceedings Volume 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024); 132700R (2024) https://doi.org/10.1117/12.3047772
Event: 2024 International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 2024, Shenyang, China
Abstract
Objective: Depression is a serious psychiatric illness, which seriously affects the life and property safety of patients. However, the clinical diagnostic gold standard is still lacking, and the pathological mechanism is still unclear. This study used machine learning methods to conduct classification prediction research and attempt to explore the pathogenesis of depression. Methods: This study concluded resting-state functional magnetic resonance images from 51 patients with mild depression and 21 healthy control subjects in the OpenNeuro dataset. Multi-domain brain connectomic features, including FOurdimensional Consistency of local neural Activities (FOCA), functional connectivity density (FCD), and degree centrality (DC) were calculated and selected for support vector machine (SVM) model to distinguish mild depression from healthy controls. Results: After statistical analysis and feature selection, 10 brain regions were used as features for machine learning model construction. Most of these brain regions are located in the default mode network (DMN) network. The accuracy of the SVM classification model is 81.9%. Conclusion: The number of important features that are effective in identifying depression is less than 10. The SVM model with multi-domain brain connectomic features has good classification performance for distinguishing depression.

1.

INTRODUCTION

Depression is a kind of psychiatric illness that is difficult to cure, bringing great pain and inconvenience to patients, and even leading to suicide. At present, there is still no effective clinical treatment for depression and a lack of accurate biomarkers for early diagnosis. The pathogenesis of depression is still unclear[1,2].

Resting-state functional magnetic resonance imaging (rs-fMRI), as an important medical image, has been used in depression research. It is non-invasive and does not require injection of contrast agents. With high resolution, it can reflect the neural activity of the brain[3]. Based on rs-fMRI, more and more studies and measurement methods such as regional homogeneity (ReHo)[4] and amplitude of low-frequency fluctuation (ALFF)[5], have been widely used in research of the pathophysiological mechanism of depression.

Several new research methods have emerged to describe neural activity. Connectomic biomarkers are specific patterns of brain connectivity that can be used to diagnose and understand neurological and psychiatric disorders. These biomarkers are derived from connectomics, the comprehensive study of neural connections within the brain. FOur-dimensional Consistency of local neural Activities (FOCA) describes the temporal consistency of adjacent voxel activities in local brain regions[6]. In addition, compared with functional connectivity (FC) methodology, functional connectivity density (FCD) is also a novel connectomic parameter[7]. It captures FC patterns throughout the brain without prior knowledge, focusing on specific regions or networks FCD analysis can be used to explore brain FC patterns associated with neurological diseases, helping to uncover potential biomarkers and disease mechanisms[8]. Degree centrality (DC) is an important index based on graph theory methods[9]. The main theoretical basis of the graph theory method is to define the brain regions as nodes, and the FC between brain regions as edges. A higher DC value represents the higher importance of the node. Machine learning methods are also widely used in the field of depression and neurological research, making important contributions to clinically assisted diagnosis. Support vector machine (SVM), logistic regression, and other methods have good performance [10].

In this study, we used rs-fMRI data of mild depression (MD) and healthy controls (HC) from the OpenNeuro dataset [11]. Firstly, 4 multi-domain brain connectomic parameters (FOCA, short-range FCD (SR-FCD) long-range FCD (LR-FCD), and degree centrality (DC)) of MD and HC subjects were calculated. Then, statistical analysis and feature selection methods were used to extract brain regions that show significant differences in the parameters above. Finally, brain regions with significant differences were selected as features to build a machine learning model for classification and prediction research (Figure 1).

Figure 1.

The main steps of the study. (Rs-fMRI, resting-state functional MRI; T1-WI, T1 weighted; AAL, AAL: Anatomical Automatic Labeling; FOCA, FOur-dimensional Consistency of local neural Activities; FCD, functional connectivity density; DC, degree centrality; SVM, support vector machine)

00187_PSISDG13270_132700R_page_2_1.jpg

2.

METHODS

2.1

Participants and MRI acquisition

Rs-fMRI data and T1-weighted images of 51 patients with MD and 21 HCs from the OpenNeuro dataset (ds002748) (https://openneuro.org/datasets/ds002748/versions/1.0.0) were used in this study. All participants were rigorously screened: participants had no mental disorders on a neurological or psychiatric level and were not taking psychotropic drugs or drugs that severely affected blood flow. This study has been approved by the review committees of all participating centers. All participants are informed and consenting.

Imaging information was acquired from the OpenNeuro dataset. The fMRI study was carried out in the International Tomography Center, Novosibirsk, using a 3T Ingenia scanner (Philips). Functional images were acquired using the following parameters: T2*-weighted Ssh echo planar, voxel size 2 × 2 × 5 mm, repetition time (TR) 2500ms, echo time (TE) 35ms. The T1-weighted image was obtained by the T1-weighted 3D turbo field echo, voxel size 1 × 1 × 1 mm. The scan lasted about six minutes, and the participants were asked to remain resting, close their eyes, and not think about anything else.

2.2

Preprocessing of rs-fMRI data

Data Processing & Analysis for Brain Imaging (DPABI) version 4.6 software toolbox was carried out for image preprocessing. The specific preprocessing steps are as follows: (1) remove the first 10 time points to exclude information from the unstable state; (2) slice timing correction and realignment to reduce the impact of factors such as head movement and heartbeat; (3) data point scrubbing: remove the points of framewise displacement > 0.2 mm; excluding the subjects with head movement > 2 mm and rotation > 2°; (4) co-registration for the T1 image and echo planar imaging; (5) segmentation of gray matter (GM), cerebrospinal fluid (CSF), and white matter (WM); (6) regression of nuisance covariates (GM, CSF, and WM); (6) spatial normalization; (7) temporal bandpass filter (0.01–0.08 Hz); (8) image smoothing using a Gaussian kernel with a full-width at half maximum (FWHM) of 6 mm × 6 mm × 6 mm. Particularly, FOCA calculation does not require prior smoothing or filtering.

2.3

Multi-domain brain connectomic parameters calculation

The Neuroscience Information Toolbox (NIT version 1.3, https://www.neuro.uestc.edu.cn/NIT.html) toolbox was used to calculate the FOCA and FCD. The value of FOCA is the product of the time correlation coefficient and the space correlation coefficient calculation, it can be calculated as follows: First, calculate the average FOCA value of each voxel in the whole brain. Secondly, the FOCA value of each voxel divided by the average FOCA value can obtain the normalized FOCA value. Then the normalized FOCA can be used for further statistical analysis.

Global FCD is defined as the number of FCs between a given voxel and other voxels in a binary brain network. Global FCD consists of short-range and long-range FCD. SR-FCD represents the sum of all elements in clusters that are adjacent and directly functionally connected. The long-range FCD can be defined as the global FCD minus the SR-FCD.

The Graph Theoretical Network Analysis (GRETNA) toolbox (https://www.nitrc.org/projects/gretna) was used to calculate the DC value. DC value can be calculated by:

00187_PSISDG13270_132700R_page_3_1.jpg

where CD (Ni) is the DC value of the node Ni, g is the number of nodes in a graph, dij represents the number of all edges of the node Ni and the other nodes in the graph, and i ≠ j represents that edges between nodes and themselves are not included.

The Anatomical Automatic Labeling (AAL) template was used to segment the whole brain into 116 brain regions and parameter values for each region were extracted.

2.4

Statistical analysis

The demographic data of 72 participants were processed by Statistical Package for the Social Sciences software, version 24.0 (SPSS, https://www.ibm.com/products/spss-statistics). The chi-square test was used for analyzing the gender differences, and two sample t-tests were carried out for the statistical analysis of age and BDI scores between MD and HC groups.

The parameters of FOCA, SR-FCD, LR-FCD, and DC were analyzed by two-sample t-tests with Bonferroni correction (p < 0.05) based on the Matlab platform. In the process of statistical analysis, age, head movement, and other factors were treated as covariables to avoid affecting the statistical results.

2.5

Feature selection and SVM classification

The LIBSVM toolbox was utilized to train the SVM model. The least absolute shrinkage and selection operator (LASSO)[12] method was performed to further select the features based on the results of statistical analysis for SVM model construction. Hyperparameter tuning was conducted using a grid search approach to identify the optimal values for the penalty parameter (C) and kernel parameter (gamma). A range of values for C and gamma were systematically explored to find the combination that provided the best performance. To ensure robust performance estimation, we employed leave-one-out cross-validation (LOOCV). In LOOCV, each sample in the dataset is used once as a validation set while the remaining samples form the training set. The performance of the SVM model was evaluated using accuracy (ACC), area under the curve (AUC), specificity, and sensitivity.

3.

RESULTS

3.1

Demographic and clinical characteristics

Table 1 presents the demographic and clinical characteristics of 51 MD subjects and 21 HC subjects. The two groups were matched in terms of age and gender (p > 0.05) with no significant differences. However, p < 0.001 indicated that the MD group and HC group have significant differences in depression test scores.

Table 1.

Demographic and clinical characteristics of participants.

CharacteristicsMD (n=51)HC (n=21)p-value
Gender, female/male38/1315/61.00a
Age, years33.1±9.533.8±8.50.326b
BDI-II score20.7±10.04.6±4.5<0.001b

a

p value: Chi-square test

b

p value: two-sample t-test.

3.2

Statistical analysis and feature selection

As shown in Table 2 and Figure 2 (A), 12 Brain regions, including 6 regions with significant differences in FOCA, 1 region with significant differences in LR-FCD, 2 regions with significant differences in SR-FCD, 2 regions with significant differences in DC, and 1 region with significant differences both in FOCA and LR-FCD were identified between MD and HC group by two-sample t-test.

Figure 2.

Locations of brain regions associated with each selected feature for each binary classification. A. Brain regions after statistical analysis between MD and HC groups; B. Features selected for SVM model to classify MD and HC groups. (FOCA & LR-FCD, both have significant differences in both FOCA and LR-FCD)

00187_PSISDG13270_132700R_page_4_1.jpg

Table 2.

Features for classification of MD and HC.

CategoryFeatureAAL indexBrain RegionNetwork
Selected by statistical analysisFOCA14IFGtriang.RDMN
FOCA50SOG.RVN
FOCA62IPL.RDMN
FOCA82STG.RDMN
FOCA88TPOmid.RAN
FOCA110Vermis3Cerebellum
FOCA114Vermis8Cerebellum
SRFCD67PCUN.LDMN
SRFCD68PCUN.RDMN
LRFCD27REC.LDMN
LRFCD88TPOmid.RAN
DC77THA.LDMN
DC112Vermis6Cerebellum
Selected by LASSOFOCA14IFGtriang.RDMN
FOCA50SOG.RVN
FOCA62IPL.RDMN
FOCA82STG.RDMN
FOCA88TPOmid.RAN
FOCA110Vermis3Cerebellum
SRFCD68PCUN.RDMN
LRFCD27REC.LDMN
DC77THA.LDMN
DC112Vermis6Cerebellum

Using the LASSO method, 10 regions (Table 2 and Figure 2 (B)) with significant differences in the four parameters, including 6 regions in FOCA, 1 region in SR-FCD, 1 region in LR-FCD, and 2 regions in DC were extracted as features for the SVM model to classify MD and HC groups.

3.3

Performances of SVM classification

In the classification task of MD and HC, the SVM model exhibited a high ACC of 81.9%, and an AUC of 0.86, demonstrating the effectiveness of the SVM model in distinguishing depression. The receiver operating characteristic (ROC) curve is shown in Figure 3. While evaluating the SVM model, the specificity was 86.2% and the sensitivity was 71.4%, respectively.

Figure 3.

ROC curve of the SVM model for classifying MD and HC groups. (ROC: Receiver operating characteristic; AUC: area under the curve)

00187_PSISDG13270_132700R_page_5_1.jpg

4.

DISCUSSION

This is the first study that combined FOCA, FCD, and DC to construct SVM models for differentiating MD and HC. Fewer features were used to obtain a good classification performance. Using 10 features, the ACC of our SVM model is up to 81.9%.

4.1

LASSO for feature dimensionality reduction

Using the LASSO method for feature selection offers significant advantages in neuroimaging analysis. LASSO effectively retains important features by shrinking the coefficients of irrelevant or less significant features to zero, thus simplifying the model. This process helps in identifying key brain regions associated with specific cognitive or behavioral functions. Due to the regularization effect in feature selection, LASSO not only prevents overfitting but also enhances the interpretability of the model. This allows researchers to more clearly understand which brain regions play crucial roles in various neural functions.

4.2

The important role of DMN

Of the 10 important brain regions, 6 regions are located in DMN. This suggests that DMN is strongly associated with depression. The DMN is a group of interconnected brain regions that show high activity when a person is at rest, introspecting, or thinking state. Key areas of the DMN include the prefrontal cortex, posterior cingulate cortex, medial prefrontal cortex, and medial temporal lobes[13]. It is associated with self-awareness, memory recall, future planning, and social cognition. Abnormal activity or connectivity in the DMN was reported linked to depression, indicating its crucial role in depressive conditions[14]. Zhang et al. studied the classification of major depressive disorder, subclinical depression, and HC groups. This study reported 48 effective brain regions as features to construct SVM models, among which several functional connectivity features were associated with DMN[10]. Another study[15] based on meta-analysis demonstrated the abnormal connectivity between regions in DMN and frontoparietal.

4.3

Limitations

This study still has limitations. First, the sample size of the dataset is far from enough, which may affect the accuracy of the findings. Extending the data set can enhance the robustness of the results. Secondly, only rs-fMRI data was relatively simple. Integrating other modalities, such as task-based fMRI, and structural MRI can provide a more complete understanding of the underlying neural mechanisms.

5.

CONCLUSION

In this study, the multi-domain connectomic parameters (FOCA, long-range FCD, short-range FCD, and DC) were analyzed to construct a classification and prediction model of the SVM. These results highlight the effectiveness of SVM models in depression prediction tasks, demonstrate the potential clinical value of machine learning in mental health diagnosis and brain imaging studies, and provide relevant evidence for underlying mechanisms of depression.

REFERENCES

[1] 

Catrambone, V., Benvenuti, S. M., Gentili, C., Valenza, G., “Intensification of functional neural control on heartbeat dynamics in subclinical depression,” Transl. Psychiat, 11221 (2021). Google Scholar

[2] 

Yang, L., Jin, C., Qi, S., Teng, Y., Li, C., Yao, Y., et al., “Alterations of functional connectivity of the lateral habenula in subclinical depression and major depressive disorder,” BMC Psychiatry, 22 588 (2022). https://doi.org/10.1186/s12888-022-04221-6 Google Scholar

[3] 

Jin, C., Qi, S., Teng, Y., Li, C., Yao, Y., Ruan, X., Wei, X., “Altered Degree Centrality of Brain Networks in Parkinson’s Disease With Freezing of Gait: A Resting-State Functional MRI Study,” Front Neurol, 12 743135 (2021). https://doi.org/10.3389/fneur.2021.743135 Google Scholar

[4] 

Zang, Y., Jiang, T., Lu, Y., He, Y., Tian, L., “Regional homogeneity approach to fMRI data analysis,” Neuroimage, 22 (1), 394 –400 (2004). https://doi.org/10.1016/j.neuroimage.2003.12.030 Google Scholar

[5] 

Zang, Y., He, Y., Zhu, C., Cao, Q., Sui, M., Liang, M., “Altered baseline brain activity in children with ADHD revealed by resting-state functional MRI,” Brain and Development, 29 (2), 83 –91 (2007). https://doi.org/10.1016/j.braindev.2006.07.002 Google Scholar

[6] 

Dong, L., Luo, C., Cao, W., Zhang, R., Gong, J., Gong, D., Yao, D., “Spatiotemporal consistency of local neural activities: A new imaging measure for functional MRI data,” J Magn Reson Imaging, 42 (3), 729 –36 (2015). https://doi.org/10.1002/jmri.v42.3 Google Scholar

[7] 

Mao, Y., Liao, Z., Liu, X., Li, T., Hu, J., Le, D., et al, “Disrupted balance of long and short-range functional connectivity density in Alzheimer’s disease (AD) and mild cognitive impairment (MCI) patients: A resting-state fMRI study,” Ann Transl Med, 9 (1), 65 (2021). https://doi.org/10.21037/atm Google Scholar

[8] 

Tomasi, D., Volkow, N.D., “Functional connectivity density mapping,” Proc Natl Acad Sci U S A, 107 9885 –90 (2010). https://doi.org/10.1073/pnas.1001414107 Google Scholar

[9] 

Sporns, O., “Graph theory methods: applications in brain networks,” Dialogues in clinical neuroscience, 20 2111 –121 (2022). Google Scholar

[10] 

Zhang, B., Liu, S., Liu, X., et al, “Discriminating subclinical depression from major depression using multi-scale brain functional features: A radiomics analysis,” Journal of Affective Disorders, 297 542 –552 (2022). https://doi.org/10.1016/j.jad.2021.10.122 Google Scholar

[11] 

Bezmaternykh, D.D., Melnikov, M.Y., Savelov, A.A., Kozlova, L.I., Petrovskiy, E.D., Natarova, K.A., Shtark, M.B., “Brain Networks Connectivity in Mild to Moderate Depression: Resting State fMRI Study with Implications to Nonpharmacological Treatment,” Neural Plast, 15 2021 (2021). Google Scholar

[12] 

Zou, H., et al. Taylor & Francis Online:, “The Adaptive Lasso and Its Oracle Properties,” Journal of the American Statistical Association, 101 (476), 1418 –1429 (2006). https://doi.org/10.1198/016214506000000735 Google Scholar

[13] 

Besteher, B., Gaser, C., Langbein, K., Dietzek, M., Sauer, H., Nenadic, I., “Effects of subclinical depression, anxiety and somatization on brain structure in healthy subjects,” J. Affect. Disord, 215 111 –117 (2017). https://doi.org/10.1016/j.jad.2017.03.039 Google Scholar

[14] 

Yang, L., Qi, S., Jin, C., et al., “Aberrant degree centrality of functional brain networks in subclinical depression and major depressive disorder,” Frontiers in Psychiatry, 14 1084443 (2023). https://doi.org/10.3389/fpsyt.2023.1084443 Google Scholar

[15] 

Kaiser, R.H., Andrews-Hanna, J.R., Wager, T.D., Pizzagalli, D.A., “Large-scale network dysfunction in major depressive disorder: a meta-analysis of resting-state functional connectivity,” JAMA. Psychiat, 72 (6), 1 –10 (2015). https://doi.org/10.1001/jamapsychiatry.2015.0071 Google Scholar
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Lei Yang, Yifu Li, and Shouliang Qi "Predicting mild depression using multidomain brain functional features and machine learning", Proc. SPIE 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 132700R (11 September 2024); https://doi.org/10.1117/12.3047772
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Brain

Machine learning

Feature selection

Support vector machines

Back to Top