fix: prevent atom exhaustion in merge_projects mix task #3973

elias-ba · 2025-11-12T16:11:04Z

Description

This PR fixes a security vulnerability in the mix lightning.merge_projects task where malicious JSON input with arbitrary keys could cause atom exhaustion and crash the VM.

The fix uses String.to_existing_atom/1 to safely convert JSON keys to atoms, only allowing keys that already exist in the system. This prevents creation of unlimited atoms from malicious input while maintaining compatibility with the merge algorithm.

Closes #3956

Validation steps

Test basic merge functionality:
```
mix test test/mix/tasks/merge_projects_test.exs
```
All tests should pass.
Test with non-UUID IDs (Joe's requirement):
- Run merge with projects using simple IDs like "1", "2", "test-source-1"
- Verify merge works without requiring valid UUIDs
Test security:
- Try merging a project with unknown JSON keys
- Should raise clear error about unknown fields
Test offline operation:
- Run mix task without database connection
- Should work without any database access

Additional notes for the reviewer

Implementation approach: Uses atomize_keys/1 function that recursively converts only map keys to atoms (not values). UUIDs and other string values remain as strings.
No Provisioner coupling: Intentionally does not use Provisioner.parse_document/1 to avoid coupling to validation rules and usage limiting checks (per Joe's feedback).
Lets merge fail naturally: If project structure is truly invalid, the merge algorithm itself will fail with appropriate errors rather than blocking upfront.

AI Usage

Please disclose how you've used AI in this work (it's cool, we just want to know!):

You can read more details in our Responsible AI Policy

Pre-submission checklist

I have performed a self-review of my code.
I have implemented and tested all related authorization policies. (e.g., :owner, :admin, :editor, :viewer)
I have updated the changelog.
I have ticked a box in "AI usage" in this PR

codecov · 2025-11-12T16:29:20Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 88.67%. Comparing base (cc02dd3) to head (172e011).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3973      +/-   ##
==========================================
+ Coverage   88.65%   88.67%   +0.02%     
==========================================
  Files         422      422              
  Lines       18913    18913              
==========================================
+ Hits        16767    16771       +4     
+ Misses       2146     2142       -4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

elias-ba · 2025-11-12T17:34:37Z

Hey @rorymckinley and @josephjclark 👋

This is a new PR that supersedes #3956. Based on Joe's feedback about not coupling the merge task to Provisioner validation or database requirements, I've implemented a much simpler approach.

What changed from #3956

The new implementation:

No Provisioner dependency (avoids coupling to validation rules and usage limits)
No database access (works completely offline as Joe needed)
Simple String.to_existing_atom/1 for security (prevents atom exhaustion)
Supports non-UUID IDs like "1", "test-source-1" (Joe's requirement for testing)
Lets the merge algorithm handle validation naturally

How it works

Takes JSON files as input
Converts string keys to existing atoms using String.to_existing_atom/1
Passes atom-keyed maps to MergeProjects.merge_project/2
Returns the merged result

This approach keeps the Mix task decoupled from Lightning's validation constraints while still preventing the atom exhaustion security vulnerability.

@josephjclark - As discussed on Slack, this should satisfy your testing needs. When you have time, please test this branch thoroughly and let me know if you see any issues. I've added you as a reviewer.

@rorymckinley - When you get a chance, would you be able to help review this and get it merged? I believe this approach is cleaner and aligns better with the Mix task's purpose of being a simple, offline utility.

Thanks both! 🙏

josephjclark

Not going to pretend to 100% understand what's happening here - but this passes against my test suite, so I'm all for it!

rorymckinley

@elias-ba I have not had a chance to really step through the test changes, will do that tomorrow, but I am shutting down now and don't want to take the chance that github forgets my current comments - so please consider this to be part 1 :).

I have 'Requested Changes' primarily because of my confusion re: Jason.decode - is it sill in the execution path, or are my eyes just tired?

rorymckinley · 2025-11-12T18:40:08Z