IDEA 25.12.0 Release #193

Merged: cfsnate merged 7 commits into main from dev-25.12.0 on Dec 17, 2025
cfsnate (Collaborator) commented Dec 17, 2025

[25.12.0] - 2025-12-17

Upgrade Instructions:

  • A full cluster upgrade is recommended because the base AMIs and other settings have been updated:
./idea-admin.sh upgrade-cluster --aws-region $IDEA_AWS_REGION --cluster-name $IDEA_CLUSTER_NAME

(Upgrade Documentation)

🔧 Improvements

  • Dependencies: Updated AWS CDK (2.1031.2), Python (3.13.9), Node (22.21.1), and all Python packages
  • NVIDIA GPU Drivers: Updated driver versions
    • LTSB from 470.239.06 to 470.256.02
    • Production from 580.65.06 to 580.105.08
  • AMI Updates: Refreshed base and software stack AMI IDs across all regions
    • Amazon Linux 2023 updated to kernel 6.12
  • HPC Job Provisioning: Added configurable job reconciler interval (scheduler.job_provisioning.job_reconciler_interval_seconds, default: 60 seconds)
  • FSx Lustre Support: Added FSx Lustre support for modern kernels
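
The configurable reconciler interval can be pictured with a minimal Python sketch. This is not IDEA's actual implementation: only the setting name and the 60-second default come from these notes. The flat dict used as configuration and the helpers `reconciler_interval`, `find_missed_jobs`, and `run_reconciler` are hypothetical names chosen for illustration.

```python
import threading
from typing import Any, Callable, Iterable, List, Mapping

# Setting name and default are taken from the release notes.
CONFIG_KEY = "scheduler.job_provisioning.job_reconciler_interval_seconds"
DEFAULT_INTERVAL_SECONDS = 60


def reconciler_interval(config: Mapping[str, Any]) -> int:
    """Return the configured interval, or the documented default if unset.

    Assumption: configuration is modelled as a flat dict keyed by the
    dotted setting name; the real cluster config store may differ.
    """
    interval = int(config.get(CONFIG_KEY, DEFAULT_INTERVAL_SECONDS))
    if interval <= 0:
        raise ValueError("reconciler interval must be a positive number of seconds")
    return interval


def find_missed_jobs(queued: Iterable[str], provisioned: Iterable[str]) -> List[str]:
    """Return queued job IDs that have not been provisioned yet."""
    seen = set(provisioned)
    return [job_id for job_id in queued if job_id not in seen]


def run_reconciler(
    config: Mapping[str, Any],
    list_queued: Callable[[], Iterable[str]],
    list_provisioned: Callable[[], Iterable[str]],
    provision: Callable[[str], None],
    stop_event: threading.Event,
) -> None:
    """Periodically provision jobs that the normal (hook-driven) path missed."""
    interval = reconciler_interval(config)
    while not stop_event.is_set():
        for job_id in find_missed_jobs(list_queued(), list_provisioned()):
            provision(job_id)
        # wait() returns early once stop_event is set, so shutdown is prompt.
        stop_event.wait(interval)
```

In this sketch a shorter interval picks up missed jobs sooner at the cost of polling the scheduler more often; the 60-second default is the value stated above.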

🐛 Bug Fixes

  • HPC Job Monitor: Fixed a job monitor thread crash caused by unhandled exceptions in the finished job processor
  • HPC Job Provisioning: Fixed jobs getting stuck when PBS hooks don't fire immediately
    • Added job reconciler fallback to catch missed jobs
  • EC2 Instance Type Cache: Fixed a cache overflow that prevented job provisioning for new instance types
    • Increased the cache size and added on-demand fetching
  • Lustre Client Repo: Fixed repo family on Ubuntu
  • CDK Template: Removed invalid TTL on Alias record
  • GovCloud VDC Deployment: Fixed EventBridge scheduler rule deployment failure in AWS GovCloud
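
As a rough illustration of the instance type cache fix, the sketch below shows one common way to combine a bounded cache with on-demand fetching, so that an instance type missing from the cache is looked up when requested instead of causing a failure. It is an assumption-laden example, not the code in this release: the class name `InstanceTypeCache`, the `fetch` callback, the `max_size` value, and the least-recently-used eviction policy are all hypothetical.

```python
from collections import OrderedDict
from typing import Any, Callable


class InstanceTypeCache:
    """Bounded cache that fetches missing entries on demand.

    Hypothetical sketch: `fetch` stands in for whatever call retrieves
    instance type details (for example, an EC2 describe call).
    """

    def __init__(self, fetch: Callable[[str], Any], max_size: int = 1024) -> None:
        if max_size <= 0:
            raise ValueError("max_size must be positive")
        self._fetch = fetch
        self._max_size = max_size
        self._entries: "OrderedDict[str, Any]" = OrderedDict()

    def __len__(self) -> int:
        return len(self._entries)

    def get(self, instance_type: str) -> Any:
        if instance_type in self._entries:
            # Cache hit: mark the entry as most recently used.
            self._entries.move_to_end(instance_type)
            return self._entries[instance_type]
        # Cache miss: fetch on demand rather than failing the lookup.
        info = self._fetch(instance_type)
        self._entries[instance_type] = info
        if len(self._entries) > self._max_size:
            # Evict the least recently used entry to stay within bounds.
            self._entries.popitem(last=False)
        return info
```

With this shape, a full cache never blocks a lookup for a new instance type: the oldest entry is evicted and the new one is fetched and stored.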

cfsnate requested a review from cfssteve on December 17, 2025, 17:06
cfsnate merged commit 9bd17d5 into main on Dec 17, 2025
2 checks passed
cfsnate deleted the dev-25.12.0 branch on December 17, 2025, 20:05