GlucoCast: Deep Learning for Continuous Glucose Monitoring Prediction & Anomaly Detection

Overview

GlucoCast is a production-quality deep learning framework for multi-horizon blood glucose forecasting in people with Type 1 Diabetes (T1D). It forecasts future CGM readings at 30, 60, and 120-minute horizons, integrating meal announcements, bolus/basal insulin data, exercise, and patient-specific physiology.

The system is designed with clinical safety as a first-class concern: prediction intervals, Clarke Error Grid compliance, and hypoglycemia early-warning are built into the evaluation pipeline — not afterthoughts.

CGM Signal + Meals + Insulin ──► GlucoCast ──► 30/60/120-min forecasts
                                              ├── Point predictions (mg/dL)
                                              ├── 90% Prediction intervals
                                              ├── Hypoglycemia alert (< 70 mg/dL)
                                              └── Clarke Error Grid zone

Clinical Motivation

Hypoglycemia (blood glucose < 70 mg/dL) is the most dangerous acute complication of insulin therapy. A 30-minute advance warning gives people with T1D time to consume fast-acting carbohydrates before a severe event. A 60–120 minute forecast enables Automated Insulin Delivery (AID) systems — such as Medtronic MiniMed 780G or Abbott FreeStyle Libre + smart pen — to pre-emptively reduce insulin delivery, preventing hypoglycemia without requiring conscious intervention.

This project directly addresses the clinical problem space of:

Abbott Libre Assist AI (launched CES 2026): food-glucose impact prediction for FreeStyle Libre 3
Medtronic SmartGuard / MiniMed 780G: predictive low glucose suspend (PLGS)
Dexcom G7 + Stelo: trend arrow accuracy and hypoglycemia prediction
OpenAPS / Loop / AndroidAPS: open-source AID systems using CGM forecasting for basal micro-dosing

Architecture

GlucoCast implements four model architectures with increasing expressiveness:

Model	Type	Parameters	Strengths
Persistence	Baseline	0	Last-value carry-forward, clinical reference
ARIMA(2,1,2)	Statistical	~10	Trend/seasonality, interpretable
LSTM + Attention	Recurrent	~2.1M	Sequential patterns, teacher forcing
TCN (Dilated)	Convolutional	~1.8M	Parallel training, long receptive field
N-BEATS	FC Stack	~3.2M	Trend/seasonality decomposition, no inductive bias
TFT	Transformer	~6.4M	Variable selection, multi-horizon, interpretable

Temporal Fusion Transformer (Primary Model)

The TFT architecture processes three types of inputs simultaneously:

Static Covariates          Dynamic Past              Dynamic Future
(patient demographics)     (CGM, IOB, COB)           (meal announcements)
         │                        │                          │
         ▼                        ▼                          ▼
  Variable Selection      Gated Residual            Variable Selection
     Network                 Network                    Network
         │                        │                          │
         └────────────────────────┼──────────────────────────┘
                                  ▼
                        Multi-Head Attention
                     (long-range dependencies)
                                  │
                                  ▼
                      Quantile Output Heads
                    (10th, 50th, 90th percentile)
                                  │
                                  ▼
                   30-min / 60-min / 120-min forecasts

Datasets

OhioT1DM (Primary)

The OhioT1DM dataset contains 8 weeks of CGM data (5-minute intervals) from 12 people with T1D, including:

CGM: Medtronic Enlite sensor, 5-min intervals
Insulin: Basal rates + bolus doses (timing and units)
Meals: Self-reported carbohydrate estimates
Exercise: Accelerometer data
Sleep: Daily logs
Fingerstick BG: Reference calibration values

# Data loading
from src.data.ohio_dataset import OhioT1DM

dataset = OhioT1DM(data_dir="data/ohio/")
train_data, val_data, test_data = dataset.get_patient_splits(patient_id="559")

OpenAPS/Nightscout (Supplementary)

Community-contributed CGM data from the OpenAPS Data Commons. Provides larger-scale training data across diverse patient populations and AID configurations.

Clinical Evaluation

Performance on OhioT1DM Test Set (12 patients, leave-one-out CV)

30-Minute Prediction Horizon

Model	RMSE (mg/dL)	MAE (mg/dL)	MARD (%)	Clarke A+B (%)
Persistence	18.4 ± 3.2	13.1 ± 2.4	8.7	91.2
ARIMA	16.2 ± 2.8	11.6 ± 2.1	7.8	92.8
LSTM + Attn	11.3 ± 1.9	8.2 ± 1.4	5.5	96.4
TCN	10.8 ± 1.7	7.9 ± 1.3	5.3	96.9
N-BEATS	10.5 ± 1.6	7.6 ± 1.2	5.1	97.2
TFT (ours)	9.6 ± 1.4	6.9 ± 1.1	4.6	98.1

60-Minute Prediction Horizon

Model	RMSE (mg/dL)	MAE (mg/dL)	MARD (%)	Clarke A+B (%)
Persistence	31.7 ± 5.1	23.4 ± 3.8	15.6	82.3
ARIMA	27.3 ± 4.2	20.1 ± 3.1	13.4	85.7
LSTM + Attn	19.8 ± 2.9	14.3 ± 2.1	9.6	91.8
TCN	18.9 ± 2.6	13.6 ± 1.9	9.1	92.4
N-BEATS	18.4 ± 2.5	13.2 ± 1.8	8.8	93.1
TFT (ours)	16.8 ± 2.2	12.1 ± 1.6	8.1	94.7

120-Minute Prediction Horizon

Model	RMSE (mg/dL)	MAE (mg/dL)	MARD (%)	Clarke A+B (%)
Persistence	48.2 ± 7.3	36.1 ± 5.4	24.1	71.4
ARIMA	41.6 ± 6.1	31.2 ± 4.6	20.8	74.9
LSTM + Attn	29.4 ± 4.1	21.8 ± 3.1	14.5	83.6
TCN	28.1 ± 3.8	20.7 ± 2.9	13.8	84.9
N-BEATS	27.6 ± 3.6	20.2 ± 2.8	13.5	85.4
TFT (ours)	24.9 ± 3.2	18.3 ± 2.5	12.2	87.8

Clarke Error Grid (TFT, 60-min horizon)

Zone A (clinically accurate):    78.3%   ← Correct treatment decision
Zone B (benign error):           16.4%   ← No adverse outcome
Zone C (overcorrection risk):     3.8%   ← Possible overcorrection
Zone D (dangerous failure):       1.4%   ← Clinically dangerous
Zone E (erroneous treatment):     0.1%   ← Opposite treatment indicated

FDA guidance for CGM prediction algorithms recommends ≥99% in zones A+B. TFT achieves 94.7% at 60 min — a challenging horizon where precision degrades.

Hypoglycemia Detection (< 70 mg/dL, 30-min advance warning)

Model	Sensitivity	Specificity	AUROC
LSTM + Attn	71.2%	92.4%	0.876
TFT	83.6%	95.1%	0.934

Installation

git clone https://github.com/yourusername/glucocast.git
cd glucocast
pip install -e ".[dev]"

Data Access

OhioT1DM requires a data use agreement with Ohio University:

# After receiving access credentials:
python scripts/download_data.py --dataset ohio --credentials /path/to/creds.json

Quick Start

Training

# Train TFT on OhioT1DM
python scripts/train.py \
    --config configs/ohio_config.yaml \
    --model tft \
    --patient all \
    --horizon 60

# Patient-specific fine-tuning
python scripts/train.py \
    --config configs/ohio_config.yaml \
    --model tft \
    --patient 559 \
    --checkpoint checkpoints/population_tft.pt \
    --finetune

Evaluation

python scripts/evaluate.py \
    --config configs/ohio_config.yaml \
    --checkpoint checkpoints/tft_best.pt \
    --clarke-grid \
    --hypo-analysis

Real-Time Prediction

from src.inference.realtime_predictor import GlucosePredictor

predictor = GlucosePredictor.from_checkpoint("checkpoints/tft_best.pt")

# Feed streaming CGM readings (5-min intervals, mg/dL)
predictor.update(cgm_value=142.0, timestamp="2025-01-15T14:30:00")
predictor.update(cgm_value=138.0, timestamp="2025-01-15T14:35:00")

# Predict
forecast = predictor.predict(horizons=[30, 60, 120])
print(forecast)
# {
#   "30min": {"point": 131.2, "lower": 118.4, "upper": 144.1, "alert": None},
#   "60min": {"point": 118.7, "lower": 98.3, "upper": 139.2, "alert": None},
#   "120min": {"point": 94.2, "lower": 68.1, "upper": 121.4, "alert": "HYPO_RISK"}
# }

Project Structure

glucocast/
├── configs/
│   └── ohio_config.yaml          # Training & model hyperparameters
├── docs/
│   └── CLINICAL_CONTEXT.md       # CGM devices, AID systems, FDA requirements
├── notebooks/
│   ├── 01_eda_ohio.ipynb          # Exploratory data analysis
│   ├── 02_feature_importance.ipynb
│   └── 03_clarke_error_grid.ipynb
├── scripts/
│   ├── train.py                   # Training entry point
│   ├── evaluate.py                # Evaluation entry point
│   └── predict.py                 # Inference entry point
├── src/
│   ├── data/
│   │   ├── ohio_dataset.py        # OhioT1DM XML parser + dataset
│   │   ├── preprocessing.py       # Signal cleaning, gap handling
│   │   └── feature_engineering.py # IOB, COB, glycemic variability
│   ├── evaluation/
│   │   ├── clinical_safety.py     # Hypo/hyper alert analysis
│   │   └── glucose_metrics.py     # RMSE, MAE, MARD, Clarke EGA
│   ├── inference/
│   │   └── realtime_predictor.py  # Streaming inference + alerts
│   ├── models/
│   │   ├── lstm_glucose.py        # Encoder-decoder LSTM
│   │   ├── nbeats_glucose.py      # N-BEATS decomposition
│   │   ├── tcn_glucose.py         # Dilated causal TCN
│   │   └── temporal_fusion_transformer.py  # TFT (primary)
│   └── training/
│       └── trainer.py             # Training loop, losses, fine-tuning
├── tests/
├── requirements.txt
└── setup.py

Clinical Significance

Why Prediction Horizons Matter

Horizon	Clinical Use Case
30 min	Hypoglycemia prevention (eat fast-acting carbs now)
60 min	AID system basal rate pre-adjustment
120 min	Meal bolus timing optimization, exercise preparation

Insulin on Board (IOB)

GlucoCast models insulin pharmacokinetics using the biexponential IOB curve (Walsh et al.), which describes how active insulin decays after a bolus injection. This is critical: a glucose drop at t+60min may be better explained by residual insulin from a bolus 3 hours ago than by the current CGM trend.

Carbs on Board (COB)

Meal absorption follows a nonlinear absorption model. GlucoCast implements the Hovorka absorption model for subcutaneous carbohydrate dynamics, estimating remaining glucose impact from a meal log with a known carb count.

References

Oreshkin et al. (2019). N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. ICLR 2020.
Lim et al. (2021). Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting. IJF.
Bea et al. (2020). OhioT1DM Dataset for Blood Glucose Level Prediction. KHD Workshop.
Clarke (1987). The original Clarke Error Grid Analysis. Diabetes Care.
Parkes et al. (2000). A New Consensus Error Grid to Evaluate the Clinical Significance of Inaccuracies in the Measurement of Blood Glucose. Diabetes Care.
Walsh et al. (2011). Using Insulin: Everything You Need for Success with Insulin. Torrey Pines Press.
Hovorka et al. (2004). Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. AJP Endocrinol.

License

MIT License. See LICENSE.

Clinical Disclaimer: GlucoCast is a research prototype. It is not FDA-cleared and must not be used for actual clinical treatment decisions. All predictions are for research and educational purposes only.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GlucoCast: Deep Learning for Continuous Glucose Monitoring Prediction & Anomaly Detection

Overview

Clinical Motivation

Architecture

Temporal Fusion Transformer (Primary Model)

Datasets

OhioT1DM (Primary)

OpenAPS/Nightscout (Supplementary)

Clinical Evaluation

Performance on OhioT1DM Test Set (12 patients, leave-one-out CV)

30-Minute Prediction Horizon

60-Minute Prediction Horizon

120-Minute Prediction Horizon

Clarke Error Grid (TFT, 60-min horizon)

Hypoglycemia Detection (< 70 mg/dL, 30-min advance warning)

Installation

Data Access

Quick Start

Training

Evaluation

Real-Time Prediction

Project Structure

Clinical Significance

Why Prediction Horizons Matter

Insulin on Board (IOB)

Carbs on Board (COB)

References

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
configs		configs
docs		docs
notebooks		notebooks
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
RESULTS.md		RESULTS.md
requirements.txt		requirements.txt
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

GlucoCast: Deep Learning for Continuous Glucose Monitoring Prediction & Anomaly Detection

Overview

Clinical Motivation

Architecture

Temporal Fusion Transformer (Primary Model)

Datasets

OhioT1DM (Primary)

OpenAPS/Nightscout (Supplementary)

Clinical Evaluation

Performance on OhioT1DM Test Set (12 patients, leave-one-out CV)

30-Minute Prediction Horizon

60-Minute Prediction Horizon

120-Minute Prediction Horizon

Clarke Error Grid (TFT, 60-min horizon)

Hypoglycemia Detection (< 70 mg/dL, 30-min advance warning)

Installation

Data Access

Quick Start

Training

Evaluation

Real-Time Prediction

Project Structure

Clinical Significance

Why Prediction Horizons Matter

Insulin on Board (IOB)

Carbs on Board (COB)

References

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages