Open Access Paper
11 September 2024 Analysis of continuous glucose monitoring data in patients with type 2 diabetes mellitus based on entropy analysis
Author Affiliations +
Proceedings Volume 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024); 132701C (2024) https://doi.org/10.1117/12.3045353
Event: 2024 International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 2024, Shenyang, China
Abstract
Objective: The effectiveness of an analysis system of a continuous glucose monitoring (CGM) in the management of Type 2 Diabetes Mellitus (T2DM), focusing on the quality of glycemic control via sophisticated time series analysis. Methods: Approximate entropy (ApEn) and other entropies were used for analysis in the post hoc analysis of 858 subjects with T2MD. Spearman's correlation coefficient was calculated between entropy values and selected physiologic indicators to verify the possible clinical validity of these indicators. Main results: The application of entropy analysis enhanced the quantification of glycemic control complexity. Correlation for glycated hemoglobin A1c (HbA1c), glycated albumin (GA) was demonstrated with ApEn: −0.40, and −0.39, while the correlation coefficient for sample entropy (SampEn) was −0.29, and −0.26, respectively (all P < 0.001). These large negative correlations confirmed the validity of entropy measures in interpreting CGM data. Correlation analysis between entropy measures and metrics such as HbA1c highlighted the potential of this approach to provide insights into diabetes management. Conclusions: The use of entropy analysis has theoretically enriched the methodology for analyzing CGM measurements and provided a valuable tool for clinical practice. The method improves the management of glycemic control among patients with diabetes, potentially influencing personalized treatment strategies and improving overall diabetes care.

1.

INTRODUCTION

As wearable glucose monitors continue to mature and gain popularity, continuous glucose monitoring (CGM) devices have been widely integrated into patients’ daily lives [1,2]. It has become an indispensable part of diabetes management. CGM data can reflect the 24-hour glucose excursions in patients with type 2 diabetes mellitus (T2DM). However, analyzing these extensive, complex CGM data and extracting their deep clinical value is crucial for improving diabetes outcomes and patients’ quality of life, which need to be solved urgently [3].

Existing studies mainly focus on using clinical knowledge to extract CGM metrics, such as time in range (TIR), standard deviation of sensor glucose (SDSG), and mean sensor glucose (MSG), and hence analyzing the relevant properties of glucose levels. Even though these methods have succeed to some extent, they ignored the fact that glucose dynamics are non-stationary, dynamically fluctuating, non-linear, irregular, and exhibit substantial complexity [46]. Therefore, a novel method to deeply parse CGM data was needed.

Entropy analysis has shown its value as a statistical tool for assessing the complexity and unpredictability of physiological time series data, such as ECG and EEG. Ning et al. [7] adopted the approximate entropy (ApEn), initially introduced by Pincus et al. [8], and developed an enhanced ApEn algorithm, focusing on wave modes to distinguish between patients with heart disease and healthy individuals. There have been many scholars to analyze the CGM time series data by entropy value. For example, Li et al. [9] analyzed the complexity of glucose time series index using a refined composite multiscale entropy method to characterize the CGM dataset. Entropy analysis could measure the irregularity and complexity of data. Therefore, we proposed the approach using entropy analysis to analyze the complexity of CGM data.

2.

MATERIALS AND METHODS

2.1. Study population

The dataset used in the current study was derived from comprehensive survey data of CGM from multiple centers in China between 2007 and 2009 [10,11], which included records from 858 type 2 diabetes patients. The survey included CGM records of 858 patients with type 2 diabetes. Participants were outpatient volunteers from 11 different Chinese hospitals. The criteria for participant selection have been described in earlier publications [10,11]. The present study adhered to the Declaration of Helsinki and was approved by the corresponding ethics committee.

2.2.

CGM parameters

All subjects in this study received a retrospective analysis of a CGM system. The device used is the CGMS GOLD (Medtronic Inc., Northridge, CA, USA). The same group of trained nurses was responsible for inserting the CGM system at each hospital. The first calibration via fingertip blood was performed at the hospital 1 hour after initialization of the CGM system, after which the subjects wore the CGM device for 7 consecutive days. Subjects were instructed to perform at least four fingertip blood glucose measurements per day using a SureStep blood glucose meter (LifeScan, Milpitas, CA, USA) for CGM calibration. Participants were recommended to follow a prescribed diet with specific meal times, and to maintain normal daily activities without heavy physical activity to improve the authenticity and reliability of data analysis.

2.3.

Calculation of entropy

First, the ApEn of the CGM time series data was calculated. The ApEn is a tool used to quantify complexity in timeseries data by measuring the frequency of new or irregular patterns within the dataset. The calculation of ApEn follows this formula:

00207_PSISDG13270_132701C_page_2_1.jpg

where 𝑚 denotes the embedding dimension, which is the number of data points used in similarity calculations; 𝑟 is the threshold of similarity; and 𝑁 is the total number of data points in the time series.

Then, we calculated the composite multiscale entropy (CMSE) [12] for the CGM time series data. The core principle of the CMSE algorithm is to employ sample entropy (SampEn) for assessing CGM time-series data irregularities across multiple time scales, thereby unveiling the data’s inherent dynamic complexity. CMSE is calculated using the following formula:

00207_PSISDG13270_132701C_page_2_2.jpg

where 𝑠 is the number of time scales; 𝑚 is the embedding dimension, here the embedding dimension is taken to be 2; 𝑟 is the threshold of similarity, which is 20% of the standard deviation of the data series; 𝑁 is the total number of data points in the original time series; and 𝑁τ is the total number of data points in the coarsely grained time series at time scale τ. At each time scale, the original CGM time-series data is divided into several non-overlapping segments, and the data points in each segment are averaged to obtain the coarse-grained time series at the corresponding scale. Subsequently, SampEn is computed on each coarse-grained time series using the CMSE method to quantify the complexity of the CGM time series data at that scale.

In our analysis of ApEn, we primarily focus on the cases where the embedding dimension is 2. And we focus on the case in the study of CMSE where the time-scale factor is 1. We notice that with a timescale factor of 1, CMSE actually reverts to SampEn. Originally proposed by Richman et al. [13], it is an important tool in assessing the complexity of the data and is particularly suited to analyzing the case of short-time-series data. The elimination of repetitive patterns in the data is then a much clearer and stronger way to measure the irregularity and complexity of the data in time series. Further research will continue investigating the application of these two specific cases and their possible value for blood glucose monitoring data analysis.

2.4.

Correlation analysis

A Spearman correlation matrix evaluated the association between ApEn, SampEn, and physiological indexes like HbA1c. The correlation matrix is expressed as:

00207_PSISDG13270_132701C_page_2_3.jpg

wherere 𝑆𝑖𝑗 presents the Spearman correlation coefficient between the i-th class entropy (ApEn or SampEn) and the j-th class physiological indicator 𝑃𝑗.

2.5.

Statistical analyses

The ApEn and SampEn were calculated for the CGM data using the EntropyHub toolbox in MATLAB [14]. In addition, all other data analysis procedures were performed in the MATLAB 2022b and Python 3.11.9 environment. The correlation between the complexity of the CGM time series data and the levels of physiological indices, such as HbA1c and GA were evaluated using Spearman’s correlation coefficient, the p-values for significance testing were derived using a t-test. Statistical significance was determined at P<0.05 (two-tailed test).

3.

RESULTS

3.1.

Clinical characteristics of the participants with T2DM

In this study, we applied entropy analysis to explore the inherent complexity of glucose series. After screening and cleaning data, a total of 858 patients (328 women and 530 men, median age 61.0 years) were included from which detailed clinical characteristics were extracted (Table 1).

Table 1.

Clinical characteristics of the participants with T2DM

FeaturesValues
Number of participants858
Gender (male/female)530/328
Age (years)61.0 (51.0-67.0)
Diabetes duration (years)12.0 (5.0-19.0)
BMI (kg/m2)24.63 (22.49-27.08)
SBP (mmHg)132.0 (121.0-144.0)
DBP (mmHg)80.0 (75.0-88.0)
HbA1c (%)8.4 (7.1-9.7)
GA (%)20.3 (16.5-24.9)
FPG (mmol/L)6.54 (5.45-8.11)
TIR (%)74.57 (58.72-88.26)
TOR (%)25.43 (11.74-41.28)
LBGI0.15 (0.03-0.45)
HBGI5.55 (3.01-9.02)
AUCIR (%)82.74 (66.16-94.35)

Given the skewed distribution of the data, they are expressed as median (interquartile spacing range 25%-75%) or number of individuals n. BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure; HbA1c, glycated hemoglobin A1c; GA, glycated albumin; FPG, fasting plasma glucose. TIR, time in range; TOR, time out of range; LBGI, low blood glucose index; HBGI, high blood glucose index; AUCIR, range of increase in blood glucose within area curve.

3.2.

Correlation analysis between entropy and clinical characteristics

In this study, we further elaborated the correlation of entropy values with clinical characteristics of diabetic patients; entropy value was used as a quantitative index of dynamic changes in blood glucose to understand its correlation with clinical parameters of patients. The present study disclosed results related to specific correlation analysis in Table 2.

Table 2.

Correlation analysis between entropy values and clinical characteristics

 HbA1c GA FPG 
rP valuerP valuerP value
ApEn-0.40<0.001-0.39<0.001-0.07<0.001
SampEn-0.29<0.001-0.26<0.001-0.060.07101

The correlation coefficients between ApEn and HbA1c, GA and fasting glucose (FPG) were -0.40, -0.39 and -0.07, respectively, while the correlation coefficients of SampEn were -0.29, -0.26 and -0.06, respectively, (with the correlation coefficient of -0.06 for FPG not passing the significance test, all other P values < 0.001), These results confirmed the effectiveness of entropy analysis in the assessment of glycemic control quality, suggesting that ApEn and SampEn can be used as sensitive indicators of glycemic control quality, which can effectively reflect the changes of glycemic stability in diabetic patients, and have the potential to be used in clinical practice to assess glycemic control.

3.3.

Estimate steady-state beta cell function (HOMA-B) and insulin sensitivity (HOMA-S)

In this study, we used the Homeostasis Model Assessment (HOMA) calculator (http://www.dtu.ox.ac.uk) [15], a common evaluation tool for β-cell function (HOMA-B) and insulin sensitivity (HOMA-S), for comprehensive understanding of the metabolic status of diabetic patients. Comparisons were made between β-cell function (HOMA-B) and insulin sensitivity (HOMA-S) assessed by the HOMA calculator with non-stationary models like hyperinsulinemic, hyperglycemic clamps and glucose tolerance tests. The results showed high agreement, evidencing that the HOMA model assessment of β-cell function and insulin sensitivity is reliable.

Additionally, we paid attention to the correlation between the three metrics HOMA2 %B, HOMA2 %S and HOMA2 insulin resistance (HOMA2 IR), and the entropy values obtained through CGM data (Table 3). This approach aimed to uncover and clarify how HOMA modeling metrics correlate with glycemic variability. Our goal was to deepen the understanding and improve the management of metabolic status in patients with diabetes.

Table 3.

Correlation analysis of entropy value and HOMA2

 HOMA2-B HOMA2-S HOMA2-IR 
rP valuerP valuerP value
ApEn0.29<0.001-0.14<0.0010.14<0.001
SampEn0.17<0.001-0.12<0.0010.12<0.001

Beta cell function HOMA2 %B showed a significant positive correlation with both ApEn and SampEn (correlation coefficients of 0.29 and 0.17, respectively, P<0.001), whereas HOMA2 %S showed a significant negative correlation with ApEn and SampEn (correlation coefficients of - 0.14 and -0.12, both P<0.001). This preliminary result revealed a positive association between beta cell function and glucose dynamics, and a potential negative association between insulin sensitivity and glucose dynamics. The findings of this study provided new perspectives for understanding changes in insulin secretion and sensitivity in patients with T2DM.

4.

DISCUSSION

This study focused on CGM-derived time series data analyzed by an entropy analysis approach. Initial results suggested that the complexity of the CGM-derived glucose time series had significant correlation with physiological and CGM metrics The Correlations for β-cell function (HOMA-B) and insulin sensitivity (HOMA-S)-related metrics were also high. These results propose that the entropy analysis method may provide a novel methodology in further clinical investigations.

A new experiment was designed further to illustrate the effectiveness of the entropy analysis method. The HbA1c levels were categorized into two classes based on the threshold of 6.5%, which was set as the threshold in clinical. Logistic regression modeling was then applied. The whole dataset contained 858 patients was divided into a training set 644 and a test set 214, with a ratio of 75% and 25%, respectively. This division of data was very helpful for proper training and validation of the model and at the same time provided a reasonable assessment of the generalization ability of the model. The predictive features of ApEn and SampEn showed the experimental results in models performed at 0.9023 with an accuracy, 0.9476 with F1 score, and an AUC of 0.81. The ROC curve performance of the model is evaluated and presented in Figure 1. The curve depicted the relation of the true positive rate (TPR) and false positive rate (FPR) of the model at various thresholds. As the FPR increases, the TPR also increases, indicating that the model has a strong discriminatory ability at each decision threshold. Furthermore, the stepped curve pattern indicates changes in model sensitivity at specific threshold points. The area under the receiver operating characteristic curve (AUC) was 0.81, significantly exceeding the criterion for random guessing (AUC=0.5), demonstrating the model’s high diagnostic accuracy. This result indicated the effectiveness of the entropy analysis in enhancing the model’s ability to predict glycemic control.

Figure 1.

Receiver Operating Characteristic, ROC curve

00207_PSISDG13270_132701C_page_5_1.jpg

The experiment also divided the GA level based on the 17% threshold and applied the logistic regression model (figure 1). The results showed that using entropy values as predictive features resulted in an AUC score of 0.73, Overall, the results of this experiment further validated the value of using ApEn and SampEn metrics in the prediction of glycemic control status.

Although this study provided useful insights into the state of glycemic control, there were some limitations. First, the length of CGM data used in the study was only 7 days, which may not have been sufficient to assess individual glucose fluctuations comprehensively. Second, although participants received dietary and activity recommendations, actual adherence was not rigorously monitored, which may have affected the study results. Therefore, our findings need to be scrutinized under a more rigorous analytical framework.

In summary, this study not only revealed the significance of entropy values in identifying and assessing the quality of glycemic control, but also emphasized the significant link between entropy values and clinical features. We have established a time-series analysis method using entropy analysis that enhances understanding of CGM complexity. In clinical practice, with the deep integration of CGM technology and AI, complex analysis of glucose time series is expected to provide more information about glucose characteristics and further enhance the efficiency of diabetes management.

ACKNOWLEDGEMENT

This work was supported by National Natural Science Foundation of China (61973067) and the Northeastern University (China) Student Innovation Training Program Grant (230268).

REFERENCES

[1] 

T. Battelino, T. Danne, R.M. Bergenstal, S.A. Amiel, R. Beck, T. Biester, E. Bosi, B.A. Buckingham, W.T. Cefalu, K.L. Close, C. Cobelli, E. Dassau, J.H. DeVries, K.C. Donaghue, K. Dovc, F.J. Doyle, S. Garg, G. Grunberger, S. Heller, L. Heinemann, I.B. Hirsch, R. Hovorka, W. Jia, O. Kordonouri, B. Kovatchev, A. Kowalski, L. Laffel, B. Levine, A. Mayorov, C. Mathieu, H.R. Murphy, R. Nimri, K. Norgaard, C.G. Parkin, E. Renard, D. Rodbard, B. Saboo, D. Schatz, K. Stoner, T. Urakami, S.A. Weinzimer, M. Phillip, “Clinical Targets for Continuous Glucose Monitoring Data Interpretation: Recommendations From the International Consensus on Time in Range,” Diabetes Care, 42 1593 –1603 (2019). https://doi.org/10.2337/dci19-0028 Google Scholar

[2] 

P. Rossetti, J. Bondia, J. Vehí, C.G. Fanelli, “Estimating Plasma Glucose fr om Interstitial Glucose: The Issue of Calibration Algorithms in Commercial Continuous Glucose Monitoring Devices,” Sensors, 10 10936 –10952 (2010). https://doi.org/10.3390/s101210936 Google Scholar

[3] 

G. Scheiner, “CGM Retrospective Data Analysis,” Diabetes Technology & Therapeutics, 18 S2-14 –S2-22 (2016). https://doi.org/10.1089/dia.2015.0281 Google Scholar

[4] 

H. Ogata, K. Tokuyama, S. Nagasaka, T. Tsuchita, I. Kusaka, S. Ishibashi, H. Suzuki, N. Yamada, K. Hamano, K. Kiyono, Z.R. Struzik, Y. Yamamoto, “The lack of long-range negative correlations in glucose dynamics is associated with worse glucose control in patients with diabetes mellitus,” Metab.-Clin. Exp., 61 1041 –1050 (2012). https://doi.org/10.1016/j.metabol.2011.12.007 Google Scholar

[5] 

X. Chen, D. Wang, J. Lin, T. Zhang, S. Deng, L. Huang, Y. Jin, C. Chen, Z. Zhang, J. Zheng, B. Sun, P. Bogdan, X.D. Zhang, “Analyzing Complexity and Fractality of Glucose Dynamics in a Pregnant Woman with Type 2 Diabetes under Treatment,” Int. J. Biol. Sci., 15 2373 –2380 (2019). https://doi.org/10.7150/ijbs.33825 Google Scholar

[6] 

M.D. Costa, T. Henriques, M.N. Munshi, A.R. Segal, A.L. Goldberger, “Dynamical glucometry: Use of multiscale entropy analysis in diabetes,” Chaos, 24 033139 (2014). https://doi.org/10.1063/1.4894537 Google Scholar

[7] 

X. Ning, Y. Xu, J. Wang, X. Ma, “Approximate entropy analysis of short-term HFECG based on wave mode,” Physica A: Statistical Mechanics and Its Applications, 346 475 –483 (2005). https://doi.org/10.1016/j.physa.2004.07.040 Google Scholar

[8] 

S. Pincus, “Approximate Entropy (apen) as a Complexity Measure,” Chaos, 5 110 –117 (1995). https://doi.org/10.1063/1.166092 Google Scholar

[9] 

C. Li, X. Ma, J. Lu, R. Tao, X. Yu, Y. Mo, W. Lu, Y. Bao, J. Zhou, W. Jia, “Decreasing complexity of glucose time series derived from continuous glucose monitoring is correlated with deteriorating glucose regulation,” Front. Med., 17 68 –74 (2023). https://doi.org/10.1007/s11684-022-0955-9 Google Scholar

[10] 

Zhou J, Li H, Ran X, Yang W, Li Q, Peng Y, Li Y, Gao X, Luan X, Wang W, Jia W., “Reference values for continuous glucose monitoring in Chinese subjects,” Diabetes Care, 32 (7), 1188 –1193 (2009). https://doi.org/10.2337/dc09-0076 Google Scholar

[11] 

Zhou J, Mo Y, Li H, Ran X, Yang W, Li Q, Peng Y, Li Y, Gao X, Luan X, Wang W, Xie Y, Jia W., “Relationship between HbA1c and continuous glucose monitoring in Chinese population: a multicenter study,” PLoS One, 8 (12), e83827 (2013). https://doi.org/10.1371/journal.pone.0083827 Google Scholar

[12] 

S.-D. Wu, C.-W. Wu, S.-G. Lin, C.-C. Wang, K.-Y. Lee, “Time Series Analysis Using Composite Multiscale Entropy,” Entropy, 15 1069 –1084 (2013). https://doi.org/10.3390/e15031069 Google Scholar

[13] 

J.S. Richman, J.R. Moorman, “Physiological time-series analysis using approximate entropy and sample entropy,” Am. J. Physiol.-Heart Circul. Physiol., 278 H2039 –H2049 (2000). https://doi.org/10.1152/ajpheart.2000.278.6.H2039 Google Scholar

[14] 

M.W. Flood, B. Grimm, “EntropyHub: An open-source toolkit for entropic time series analysis,” PLoS One, 16 e0259448 (2021). https://doi.org/10.1371/journal.pone.0259448 Google Scholar

[15] 

S.E. Manley, S.D. Luzio, I.M. Stratton, T.M. Wallace, P.M.S. Clark, “Preanalytical, analytical, and computational factors affect homeostasis model assessment estimates,” Diabetes Care, 31 1877 –1883 (2008). https://doi.org/10.2337/dc08-0097 Google Scholar
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yiming Sun, Rui Tao, Yaxin Wang, Jinghao Cai, Hongru Li, Jian Zhou, and Xia Yu "Analysis of continuous glucose monitoring data in patients with type 2 diabetes mellitus based on entropy analysis", Proc. SPIE 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 132701C (11 September 2024); https://doi.org/10.1117/12.3045353
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Glucose

Correlation coefficients

Blood

Data analysis

Education and training

Statistical analysis

Clinical practice

Back to Top