Task 2: MatrixRunner backend + APIs Integration for multiple jobs #89
Conversation
Walkthrough
A new matrix job management feature called "MatrixRunner" was introduced to the backend, including a persistent state mechanism and multiple new API endpoints for task control and monitoring. Supporting code and a state file were added.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Client
    participant FlaskAPI
    participant MatrixRunner
    participant StateFile
    Client->>FlaskAPI: POST /api/matrix-runner/start (params)
    FlaskAPI->>MatrixRunner: generate(param_dict)
    MatrixRunner->>StateFile: save()
    FlaskAPI-->>Client: job count
    Client->>FlaskAPI: POST /api/matrix-runner/next
    FlaskAPI->>MatrixRunner: next_job()
    MatrixRunner->>StateFile: save()
    FlaskAPI-->>Client: next job info
    Client->>FlaskAPI: POST /api/matrix-runner/complete (job_id)
    FlaskAPI->>MatrixRunner: complete_job(job_id)
    MatrixRunner->>StateFile: save()
    FlaskAPI-->>Client: status
    Client->>FlaskAPI: POST /api/matrix-runner/pause/resume
    FlaskAPI->>MatrixRunner: pause()/resume()
    MatrixRunner->>StateFile: save()
    FlaskAPI-->>Client: status
    Client->>FlaskAPI: GET /api/matrix-runner/status
    FlaskAPI->>MatrixRunner: status()
    FlaskAPI-->>Client: summary
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~35 minutes
Reviewer's Guide
This PR introduces a MatrixRunner utility to generate and track grid-based image-generation jobs with pause/resume and JSON persistence, and wires it into the existing backend via dedicated API endpoints for sweep control and status monitoring.

Sequence diagram for MatrixRunner API job lifecycle

```mermaid
sequenceDiagram
    actor User
    participant API
    participant MatrixRunner
    User->>API: POST /api/matrix-runner/start
    API->>MatrixRunner: generate(param_dict)
    MatrixRunner-->>API: jobs initialized
    API-->>User: total_jobs
    User->>API: POST /api/matrix-runner/next
    API->>MatrixRunner: next_job()
    MatrixRunner-->>API: next job (marked running)
    API-->>User: job details
    User->>API: POST /api/matrix-runner/complete
    API->>MatrixRunner: complete_job(job_id)
    MatrixRunner-->>API: job marked done
    API-->>User: status ok
```
Class diagram for MatrixRunner job management

```mermaid
classDiagram
    class MatrixRunner {
        - state_file: str
        - jobs: list
        - index: int
        - paused: bool
        + __init__(state_file)
        + generate(param_dict)
        + next_job()
        + complete_job(job_id)
        + pause()
        + resume()
        + save()
        + load()
    }
    MatrixRunner --> "*" Job
    class Job {
        + id: int
        + status: str
        + [dynamic parameters: seeds, steps, samplers, ...]
    }
```
Hey @nataliarodriguez-uc - I've reviewed your changes - here's some feedback:
- Wrap MatrixRunner state modifications and file writes in a thread lock to prevent race conditions when multiple API requests invoke next_job/complete_job concurrently.
- Move the persisted state file out of the source directory (or at least add it to .gitignore) so you don’t accidentally commit runtime state into version control.
- Remove duplicate imports and any unused dependencies in task_runner.py to keep the module clean and maintainable.
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- Wrap MatrixRunner state modifications and file writes in a thread lock to prevent race conditions when multiple API requests invoke next_job/complete_job concurrently.
- Move the persisted state file out of the source directory (or at least add it to .gitignore) so you don’t accidentally commit runtime state into version control.
- Remove duplicate imports and any unused dependencies in task_runner.py to keep the module clean and maintainable.
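For the first point, a minimal sketch of what the locking could look like, assuming a shared `threading.RLock` (pause handling and persistence are omitted; this is not the PR's code):

```python
import threading

class MatrixRunner:
    def __init__(self, state_file="matrix_jobs.json"):
        self.state_file = state_file
        self.jobs, self.index, self.paused = [], 0, False
        # RLock so methods that persist while holding the lock can nest safely.
        self._lock = threading.RLock()

    def next_job(self):
        # Serialize cursor advancement so two requests never receive the same job.
        with self._lock:
            while self.index < len(self.jobs):
                job = self.jobs[self.index]
                self.index += 1
                if job["status"] == "pending":
                    job["status"] = "running"
                    return job
            return None
```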
## Individual Comments
### Comment 1
<location> `dream_layer_backend/dream_layer_backend_utils/task_runner.py:59` </location>
<code_context>
+ return job
+ return None # No more jobs
+
+ def complete_job(self, job_id):
+ """Mark job as done"""
+ for job in self.jobs:
</code_context>
<issue_to_address>
complete_job does not handle the case where job_id is not found.
Currently, if job_id is missing, no feedback is given. Adding a warning or exception would improve error visibility.
</issue_to_address>
<suggested_fix>
<<<<<<< SEARCH
def complete_job(self, job_id):
"""Mark job as done"""
for job in self.jobs:
if job["id"] == job_id:
job["status"] = "done"
break
self.save()
=======
def complete_job(self, job_id):
"""Mark job as done. Raises ValueError if job_id not found."""
found = False
for job in self.jobs:
if job["id"] == job_id:
job["status"] = "done"
found = True
break
if not found:
raise ValueError(f"Job with id {job_id} not found.")
self.save()
>>>>>>> REPLACE
</suggested_fix>
### Comment 2
<location> `dream_layer_backend/dream_layer_backend_utils/task_runner.py:77` </location>
<code_context>
+ self.save()
+
+ # --- Persistence ---
+ def save(self):
+ with open(self.state_file, "w") as f:
+ json.dump(
</code_context>
<issue_to_address>
save does not handle file write errors.
Wrap the file write in a try/except block and log errors to prevent process crashes.
Suggested implementation:
```python
def save(self):
try:
with open(self.state_file, "w") as f:
json.dump(
{"jobs": self.jobs, "index": self.index, "paused": self.paused}, f, indent=2
)
except Exception as e:
import logging
logging.error(f"Failed to save state to {self.state_file}: {e}")
```
If `logging` is already imported at the top of the file, you can remove the `import logging` line inside the except block.
</issue_to_address>
### Comment 3
<location> `dream_layer_backend/dream_layer.py:447` </location>
<code_context>
+ data = request.json or {}
+ # accept lists/ranges for seeds, steps, samplers, etc.
+ # keep only list-valued keys for the sweep
+ param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
+ runner.generate(param_dict) # deterministic expansion
+ return jsonify({"status":"success","total_jobs": len(runner.jobs)})
</code_context>
<issue_to_address>
Only list-valued keys are used for job generation, which may ignore scalar parameters.
Scalar values from the API client are currently excluded from job generation. To ensure all parameters are considered, normalize scalars to single-item lists before processing.
</issue_to_address>
<suggested_fix>
<<<<<<< SEARCH
# accept lists/ranges for seeds, steps, samplers, etc.
# keep only list-valued keys for the sweep
param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
runner.generate(param_dict) # deterministic expansion
return jsonify({"status":"success","total_jobs": len(runner.jobs)})
=======
# accept lists/ranges for seeds, steps, samplers, etc.
# normalize scalars to single-item lists for the sweep
param_dict = {k: v if isinstance(v, list) else [v] for k, v in data.items()}
runner.generate(param_dict) # deterministic expansion
return jsonify({"status":"success","total_jobs": len(runner.jobs)})
>>>>>>> REPLACE
</suggested_fix>
```python
    def complete_job(self, job_id):
        """Mark job as done"""
        for job in self.jobs:
            if job["id"] == job_id:
                job["status"] = "done"
                break
        self.save()
```
suggestion: complete_job does not handle the case where job_id is not found.
Currently, if job_id is missing, no feedback is given. Adding a warning or exception would improve error visibility.
```diff
 def complete_job(self, job_id):
-    """Mark job as done"""
+    """Mark job as done. Raises ValueError if job_id not found."""
+    found = False
     for job in self.jobs:
         if job["id"] == job_id:
             job["status"] = "done"
+            found = True
             break
+    if not found:
+        raise ValueError(f"Job with id {job_id} not found.")
     self.save()
```
```python
        self.save()

    # --- Persistence ---
    def save(self):
```
suggestion: save does not handle file write errors.
Wrap the file write in a try/except block and log errors to prevent process crashes.
Suggested implementation:
```python
    def save(self):
        try:
            with open(self.state_file, "w") as f:
                json.dump(
                    {"jobs": self.jobs, "index": self.index, "paused": self.paused}, f, indent=2
                )
        except Exception as e:
            import logging
            logging.error(f"Failed to save state to {self.state_file}: {e}")
```
If `logging` is already imported at the top of the file, you can remove the `import logging` line inside the except block.
```python
        # accept lists/ranges for seeds, steps, samplers, etc.
        # keep only list-valued keys for the sweep
        param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
        runner.generate(param_dict)  # deterministic expansion
        return jsonify({"status":"success","total_jobs": len(runner.jobs)})
```
suggestion: Only list-valued keys are used for job generation, which may ignore scalar parameters.
Scalar values from the API client are currently excluded from job generation. To ensure all parameters are considered, normalize scalars to single-item lists before processing.
```diff
 # accept lists/ranges for seeds, steps, samplers, etc.
-# keep only list-valued keys for the sweep
-param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
+# normalize scalars to single-item lists for the sweep
+param_dict = {k: v if isinstance(v, list) else [v] for k, v in data.items()}
 runner.generate(param_dict)  # deterministic expansion
 return jsonify({"status":"success","total_jobs": len(runner.jobs)})
```
```python
        job = runner.next_job()
        if not job:
            return jsonify({"status":"empty"})
```
issue (code-quality): We've found these issues:
- Use named expression to simplify assignment and conditional (`use-named-expression`)
- Lift code into else after jump in control flow (`reintroduce-else`)
- Swap if/else branches (`swap-if-else-branches`)
Actionable comments posted: 7
🧹 Nitpick comments (3)
dream_layer_backend/dream_layer.py (3)
125-128: Persisting state under the code dir is fine; ensure it's ignored and not committed
Good location, but pair this with a fixed .gitignore and remove the tracked JSON as discussed.
If you'd prefer a writable data dir, consider `os.path.join(parent_dir, "Dream_Layer_Resources", "state")`.
463-474: Consider delegating status aggregation to the runner under lock
Avoids races and keeps the logic encapsulated.
Example:

```python
# in MatrixRunner
def summary(self):
    with self._lock:
        counts = {"pending": 0, "running": 0, "done": 0}
        for j in self.jobs:
            s = j.get("status")
            if s in counts:
                counts[s] += 1
        return {"total_jobs": len(self.jobs), **counts, "paused": self.paused}
```

Then in the endpoint:

```python
return jsonify({"status": "ok", **runner.summary()})
```
475-482: Endpoint OK; optionally use 204 No Content when empty
Returning 204 improves semantics; the current response is acceptable.

```diff
-    if not job:
-        return jsonify({"status":"empty"})
+    if not job:
+        return ("", 204)
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (4)
- .gitignore (1 hunks)
- dream_layer_backend/dream_layer.py (3 hunks)
- dream_layer_backend/dream_layer_backend_utils/task_runner.py (1 hunks)
- dream_layer_backend/matrix_runner_state.json (1 hunks)
🧰 Additional context used
🧠 Learnings (1)
📚 Learning: 2025-07-16T18:40:41.273Z
Learnt from: divyaprakash0426
PR: DreamLayer-AI/DreamLayer#40
File: docker/Dockerfile.backend.dev:4-6
Timestamp: 2025-07-16T18:40:41.273Z
Learning: The DreamLayer project follows an iterative approach to Docker development, where basic Docker setup is established first, and missing dependencies (like PyYAML) are addressed in subsequent iterations when related services (like ComfyUI) are added to the Docker files.
Applied to files:
.gitignore
🪛 Ruff (0.12.2)
dream_layer_backend/dream_layer_backend_utils/task_runner.py
11-11: `random` imported but unused. Remove unused import: `random` (F401)
14-14: `flask.Flask` imported but unused. Remove unused import (F401)
14-14: `flask.request` imported but unused. Remove unused import (F401)
14-14: `flask.jsonify` imported but unused. Remove unused import (F401)
15-15: `flask_cors.CORS` imported but unused. Remove unused import: `flask_cors.CORS` (F401)
16-16: `PIL.Image` imported but unused. Remove unused import: `PIL.Image` (F401)
17-17: `numpy` imported but unused. Remove unused import: `numpy` (F401)
21-21: Redefinition of unused `itertools` from line 13. Remove definition: `itertools` (F811)
22-22: Redefinition of unused `json` from line 10. Remove definition: `json` (F811)
23-23: Redefinition of unused `os` from line 9. Remove definition: `os` (F811)
🔇 Additional comments (3)
dream_layer_backend/dream_layer.py (3)
13-13: Import looks good. Clean, targeted import of the new runner.
453-457: Pause endpoint LGTM
458-462: Resume endpoint LGTM
```
# DreamLayer logs and test logs
logs
test-logs
test-logsdream_layer_backend/matrix_runner_state.json
```
💡 Verification agent
🧩 Analysis chain
Fix malformed .gitignore entry; split into two lines
The entry concatenated two paths, so neither is ignored. Restore `test-logs` and add the state file on its own line.
Apply:

```diff
-test-logsdream_layer_backend/matrix_runner_state.json
+test-logs
+dream_layer_backend/matrix_runner_state.json
```
Then ensure the state file is untracked:
🏁 Script executed:
#!/bin/bash
set -euo pipefail
echo "Context around the malformed line:"
nl -ba .gitignore | sed -n '205,220p'
echo -e "\nIs the state file tracked?"
git ls-files -v dream_layer_backend/matrix_runner_state.json || true
echo -e "\nIf listed, untrack it:"
git rm --cached -f dream_layer_backend/matrix_runner_state.json || true
Length of output: 229
🏁 Script executed:
#!/bin/bash
set -euo pipefail
echo "Matched lines in .gitignore:"
grep -n 'test-logsdream_layer_backend' .gitignore || true
echo -e "\nContext around the malformed entry:"
grep -C2 'test-logsdream_layer_backend' .gitignore || true
echo -e "\nTracking status of the state file:"
git ls-files -v dream_layer_backend/matrix_runner_state.json || echo "Not tracked"
Length of output: 618
Fix malformed .gitignore entry for test logs and state file
The entry at line 213 in `.gitignore` currently reads as a single concatenated path and therefore ignores neither target. Split it into two separate lines:

```diff
@@ -213,1 +213,2 @@
-test-logsdream_layer_backend/matrix_runner_state.json
+test-logs
+dream_layer_backend/matrix_runner_state.json
```

Then untrack the state file (it is currently tracked; `git ls-files -v` shows `H dream_layer_backend/matrix_runner_state.json`):

```bash
git rm --cached -f dream_layer_backend/matrix_runner_state.json
```
🤖 Prompt for AI Agents
In the .gitignore file at line 213, the entry combines two paths into one, causing neither to be ignored. Split this line into two separate lines, one for test-logs and one for dream_layer_backend/matrix_runner_state.json. Then, to untrack the currently tracked state file dream_layer_backend/matrix_runner_state.json, run the command git rm --cached -f dream_layer_backend/matrix_runner_state.json.
```python
import os
import json
import random
import logging
import itertools
from flask import Flask, request, jsonify
from flask_cors import CORS
from PIL import Image
import numpy as np
```
🛠️ Refactor suggestion
Remove unused and duplicate imports; keep this module framework-agnostic
Multiple imports are unused and some are duplicated, tripping Ruff (F401/F811) and coupling this utility to Flask/PIL/NumPy unnecessarily.
```diff
-import os
-import json
-import random
-import logging
-import itertools
-from flask import Flask, request, jsonify
-from flask_cors import CORS
-from PIL import Image
-import numpy as np
+import os
+import json
+import logging
+import itertools
@@
-import itertools
-import json
-import os
```

Optionally, add a module logger (`logger = logging.getLogger(__name__)`) or remove `import logging` if it isn't needed.
Also applies to: 21-24
🤖 Prompt for AI Agents
In dream_layer_backend/dream_layer_backend_utils/task_runner.py around lines 9
to 18 and 21 to 24, remove all unused imports including flask, flask_cors, PIL,
numpy, and any duplicates to keep the module framework-agnostic and avoid Ruff
warnings. Also, if logging is not used, remove the import logging statement and
any logger initialization. Only keep imports that are actually used in the code.
```python
class MatrixRunner:
    def __init__(self, state_file="matrix_jobs.json"):
        self.state_file = state_file
        self.jobs = []
        self.index = 0
        self.paused = False
        self.load()

    # --- Core Job Management ---
```
🛠️ Refactor suggestion
Add thread-safety and atomic persistence to prevent race conditions and file corruption
Concurrent `/next` and `/complete` calls can return duplicate jobs or corrupt the JSON (partial writes). Guard state with a lock and write atomically.

```diff
@@
-class MatrixRunner:
+import threading
+
+class MatrixRunner:
def __init__(self, state_file="matrix_jobs.json"):
self.state_file = state_file
self.jobs = []
self.index = 0
self.paused = False
- self.load()
+ self._lock = threading.RLock()
+ self.load()
@@
- def generate(self, param_dict):
+ def generate(self, param_dict):
"""Create all jobs from parameter ranges and reset state"""
- keys = list(param_dict.keys())
- combos = list(itertools.product(*param_dict.values()))
- self.jobs = [
- {"id": i, **dict(zip(keys, combo)), "status": "pending"}
- for i, combo in enumerate(combos)
- ]
- self.index = 0
- self.paused = False
- self.save()
+ keys = list(param_dict.keys())
+ combos = list(itertools.product(*param_dict.values()))
+ with self._lock:
+ self.jobs = [
+ {"id": i, **dict(zip(keys, combo)), "status": "pending"}
+ for i, combo in enumerate(combos)
+ ]
+ self.index = 0
+ self.paused = False
+ self.save()
@@
- def next_job(self):
+ def next_job(self):
"""Get the next pending job and mark it as running"""
- if self.paused:
- return None
- while self.index < len(self.jobs):
- job = self.jobs[self.index]
- self.index += 1
- if job["status"] == "pending":
- job["status"] = "running"
- self.save()
- return job
+ with self._lock:
+ if self.paused:
+ return None
+ while self.index < len(self.jobs):
+ job = self.jobs[self.index]
+ self.index += 1
+ if job.get("status") == "pending":
+ job["status"] = "running"
+ self.save()
+ return job
return None # No more jobs
@@
- def complete_job(self, job_id):
+ def complete_job(self, job_id) -> bool:
"""Mark job as done"""
- for job in self.jobs:
- if job["id"] == job_id:
- job["status"] = "done"
- break
- self.save()
+ found = False
+ with self._lock:
+ for job in self.jobs:
+ if job.get("id") == job_id:
+ job["status"] = "done"
+ found = True
+ break
+ self.save()
+ return found
@@
- def pause(self):
- self.paused = True
- self.save()
+ def pause(self):
+ with self._lock:
+ self.paused = True
+ self.save()
@@
- def resume(self):
- self.paused = False
- self.save()
+ def resume(self):
+ with self._lock:
+ self.paused = False
+ self.save()
@@
- def save(self):
- with open(self.state_file, "w") as f:
- json.dump(
- {"jobs": self.jobs, "index": self.index, "paused": self.paused}, f, indent=2
- )
+ def save(self):
+ """Atomic write to avoid partial/corrupt files."""
+ state = {"jobs": self.jobs, "index": self.index, "paused": self.paused}
+ tmp = f"{self.state_file}.tmp"
+ with open(tmp, "w") as f:
+ json.dump(state, f, indent=2)
+ os.replace(tmp, self.state_file)
@@
- def load(self):
- if os.path.exists(self.state_file):
- with open(self.state_file, "r") as f:
- state = json.load(f)
- self.jobs = state.get("jobs", [])
- self.index = state.get("index", 0)
- self.paused = state.get("paused", False)
+ def load(self):
+ if os.path.exists(self.state_file):
+ try:
+ with open(self.state_file, "r") as f:
+ state = json.load(f)
+ self.jobs = state.get("jobs", [])
+ self.index = state.get("index", 0)
+ self.paused = state.get("paused", False)
+ except Exception as e:
+ logging.getLogger(__name__).warning("Failed to load state (%s); starting fresh", e)
+ self.jobs, self.index, self.paused = [], 0, False
```
Optional: convert any lingering "running" jobs back to "pending" on load to recover from worker crashes.
Also applies to: 46-57, 59-66, 68-75, 77-82, 83-89
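A rough sketch of that optional recovery step, assuming the state shape shown above (any job still marked "running" at startup is treated as orphaned by a crash):

```python
    def load(self):
        if not os.path.exists(self.state_file):
            return
        with open(self.state_file, "r") as f:
            state = json.load(f)
        self.jobs = state.get("jobs", [])
        self.paused = state.get("paused", False)
        # Requeue anything that was mid-flight when the previous worker died.
        for job in self.jobs:
            if job.get("status") == "running":
                job["status"] = "pending"
        # Rescan from the start; next_job() skips jobs that are already done.
        self.index = 0
```

Resetting the cursor to 0 trades a little rescanning for simplicity, since next_job() ignores non-pending entries.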
🤖 Prompt for AI Agents
In dream_layer_backend/dream_layer_backend_utils/task_runner.py around lines 25
to 33 and also lines 46-57, 59-66, 68-75, 77-82, and 83-89, the current
implementation lacks thread-safety and atomic file writes, which can cause race
conditions and JSON corruption during concurrent job state updates. To fix this,
add a threading lock to guard all accesses and modifications to the shared job
state and ensure that writes to the state_file are done atomically, for example
by writing to a temporary file and then renaming it. Additionally, modify the
load method to reset any jobs marked as "running" back to "pending" to recover
from worker crashes.
```python
    def generate(self, param_dict):
        """Create all jobs from parameter ranges and reset state"""
        keys = list(param_dict.keys())
        combos = list(itertools.product(*param_dict.values()))
        self.jobs = [
            {"id": i, **dict(zip(keys, combo)), "status": "pending"}
            for i, combo in enumerate(combos)
        ]
        self.index = 0
        self.paused = False
        self.save()
```
🛠️ Refactor suggestion
Guard against explosive cartesian products
A broad grid can easily generate millions of jobs, exhausting memory.
Add a cap (env-configurable) and reject requests exceeding it:
- Pre-compute size via product of lengths, bail out if > MAX_JOBS.
- Or stream generation instead of materializing the full list.
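One possible shape for that precheck at the top of generate(), assuming an env var named MATRIX_MAX_JOBS (the variable name and default are illustrative):

```python
import os
from math import prod  # Python 3.8+

def generate(self, param_dict):
    """Create all jobs from parameter ranges, refusing oversized grids."""
    max_jobs = int(os.environ.get("MATRIX_MAX_JOBS", "10000"))
    total = prod(len(v) for v in param_dict.values())
    if total > max_jobs:
        raise ValueError(f"Grid would expand to {total} jobs (limit {max_jobs})")
    # ...proceed with the itertools.product expansion as before...
```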
🤖 Prompt for AI Agents
In dream_layer_backend/dream_layer_backend_utils/task_runner.py around lines 34
to 45, the generate method creates all job combinations at once, which can cause
memory exhaustion for large parameter grids. To fix this, add an
environment-configurable MAX_JOBS limit, pre-compute the total number of
combinations by multiplying the lengths of each parameter list, and if this
exceeds MAX_JOBS, raise an exception or reject the request before generating
jobs. This prevents creating an excessively large list and guards against memory
issues.
```python
@app.route('/api/matrix-runner/start', methods=['POST'])
def matrix_runner_start():
    try:
        data = request.json or {}
        # accept lists/ranges for seeds, steps, samplers, etc.
        # keep only list-valued keys for the sweep
        param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
        runner.generate(param_dict)  # deterministic expansion
        return jsonify({"status":"success","total_jobs": len(runner.jobs)})
    except Exception as e:
        return jsonify({"status":"error","message": str(e)}), 500
```
🛠️ Refactor suggestion
Validate inputs and cap the sweep size to avoid OOM
Sanity-check list inputs and limit total jobs to a safe maximum; return 400 if exceeded.
```diff
@app.route('/api/matrix-runner/start', methods=['POST'])
def matrix_runner_start():
try:
- data = request.json or {}
+ data = request.json or {}
# accept lists/ranges for seeds, steps, samplers, etc.
# keep only list-valued keys for the sweep
- param_dict = {k: v for k, v in data.items() if isinstance(v, list)}
- runner.generate(param_dict) # deterministic expansion
- return jsonify({"status":"success","total_jobs": len(runner.jobs)})
+ param_dict = {k: v for k, v in data.items() if isinstance(v, list) and len(v) > 0}
+ if not param_dict:
+ return jsonify({"status":"error","message":"Provide at least one non-empty list parameter"}), 400
+ from math import prod
+ max_jobs = int(os.environ.get("MATRIX_MAX_JOBS", "10000"))
+ total = prod(len(v) for v in param_dict.values())
+ if total > max_jobs:
+ return jsonify({"status":"error","message": f"Grid too large ({total} > {max_jobs})"}), 400
+ runner.generate(param_dict) # deterministic expansion
+ return jsonify({"status":"success","total_jobs": len(runner.jobs)})
except Exception as e:
return jsonify({"status":"error","message": str(e)}), 500
Committable suggestion skipped: line range outside the PR's diff.
🤖 Prompt for AI Agents
In dream_layer_backend/dream_layer.py around lines 441 to 452, the
matrix_runner_start endpoint currently accepts list inputs without validation or
limits, which risks excessive job creation and potential OOM errors. Add input
validation to ensure list parameters are sane, calculate the total number of
jobs generated, and if it exceeds a predefined safe maximum, return a 400
response with an appropriate error message. This prevents resource exhaustion by
capping the sweep size before calling runner.generate.
```python
@app.route('/api/matrix-runner/complete', methods=['POST'])
def matrix_runner_complete():
    data = request.json or {}
    job_id = data.get("job_id")
    runner.complete_job(job_id)
    return jsonify({"status":"ok"})
```
🛠️ Refactor suggestion
Validate job_id and return 404 when not found
Currently `None` (or an invalid ID) silently succeeds. Also align with the updated `complete_job` bool return.

```diff
@app.route('/api/matrix-runner/complete', methods=['POST'])
def matrix_runner_complete():
- data = request.json or {}
- job_id = data.get("job_id")
- runner.complete_job(job_id)
- return jsonify({"status":"ok"})
+ data = request.json or {}
+ job_id = data.get("job_id")
+ if job_id is None:
+ return jsonify({"status":"error","message":"job_id is required"}), 400
+ ok = runner.complete_job(job_id)
+ if not ok:
+ return jsonify({"status":"error","message":f"job_id {job_id} not found"}), 404
+ return jsonify({"status":"ok"})
```
🤖 Prompt for AI Agents
In dream_layer_backend/dream_layer.py around lines 483 to 489, the
matrix_runner_complete function does not validate the job_id from the request
JSON and always returns a success status even if job_id is None or invalid.
Update the function to check if job_id is provided and call
runner.complete_job(job_id), which now returns a boolean indicating success. If
the job_id is missing or complete_job returns False, return a 404 response;
otherwise, return the success JSON response.
```json
{
  "jobs": [
    {
      "id": 0,
      "seeds": 1,
      "steps": 10,
      "Samplers": "euler",
      "status": "done"
    },
    {
      "id": 1,
      "seeds": 1,
      "steps": 20,
      "Samplers": "euler",
      "status": "running"
    },
    {
      "id": 2,
      "seeds": 2,
      "steps": 10,
      "Samplers": "euler",
      "status": "pending"
    },
    {
      "id": 3,
      "seeds": 2,
      "steps": 20,
      "Samplers": "euler",
      "status": "pending"
    },
    {
      "id": 4,
      "seeds": 3,
      "steps": 10,
      "Samplers": "euler",
      "status": "pending"
    },
    {
      "id": 5,
      "seeds": 3,
      "steps": 20,
      "Samplers": "euler",
      "status": "pending"
    }
  ],
  "index": 2,
  "paused": false
}
```
Do not commit runtime state; remove file and rely on persistence via .gitignore
This JSON is mutable, environment-specific state. Keep it out of VCS to avoid drift, merge conflicts, and accidental resets. Also, key naming is inconsistent (`Samplers` vs likely `samplers`).
Recommended actions:
- Delete from repo and ignore it (see .gitignore fix above).
- Regenerate at runtime; initialize empty state when absent.
- Normalize keys to snake_case if you need a seed file in examples.
🤖 Prompt for AI Agents
In dream_layer_backend/matrix_runner_state.json lines 1 to 48, this file
contains mutable runtime state that should not be committed to version control
to prevent conflicts and accidental resets. Remove this file from the repository
and add it to .gitignore to avoid tracking it. Ensure the application
regenerates this state at runtime and initializes it as empty if the file is
missing. Also, if a seed file is needed for examples, normalize all keys to
snake_case, changing "Samplers" to "samplers" for consistency.
Hey! Thanks for raising the PR :) Can you please include a few DreamLayer UI screenshots? That would make it easier for us to review.
Summary
Implements Task 2: MatrixRunner backend + APIs to support matrix/grid-based image generation workflows with persistent state, pause/resume capability, and job tracking.
Key Features
- `MatrixRunner` class in `dream_layer_backend_utils/task_runner.py`, with job status tracking (`pending`, `running`, `done`)
- New API routes in `dream_layer.py`:
  - POST /api/matrix-runner/start – Initialize job sweep
  - POST /api/matrix-runner/pause – Pause execution
  - POST /api/matrix-runner/resume – Resume execution
  - GET /api/matrix-runner/status – Retrieve current state of all jobs
  - POST /api/matrix-runner/next – Fetch the next pending job
  - POST /api/matrix-runner/complete – Mark a job as completed

Testing
Tested locally using curl; a sketch of equivalent calls follows below.
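The exact curl commands are not included above; here is a rough Python equivalent of that smoke test (the host and port are assumptions, and the /next success payload is not shown in this PR's excerpts):

```python
import requests

BASE = "http://localhost:5000/api/matrix-runner"  # host/port assumed

# Start a sweep: 2 seeds x 2 step counts x 1 sampler -> 4 jobs.
resp = requests.post(f"{BASE}/start",
                     json={"seeds": [1, 2], "steps": [10, 20], "samplers": ["euler"]})
print(resp.json())  # e.g. {"status": "success", "total_jobs": 4}

# Fetch the next pending job; an empty queue returns {"status": "empty"}.
print(requests.post(f"{BASE}/next").json())

# Pause/resume the sweep and inspect overall progress.
requests.post(f"{BASE}/pause")
requests.post(f"{BASE}/resume")
print(requests.get(f"{BASE}/status").json())
```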