Feat/seed for structure generation by Gitdowski · Pull Request #336 · glasagent/amorphouspy

Gitdowski · 2026-05-06T13:19:52Z

Following PR #332, where a random seed to the get_structure_dict() has been added, this PR adds the new input argument structure_seed to the API workflow. If now a replica simulation is started with the same composition, system size and settings, but with a different structure_seed it will trigger a new simulation.
Also, the structure_seed is also part of the hashing. Therefore, an existing database needs to be migrated.

added structure_seed to the MeltQuenchParams class and the generate_structure()
added tests

codecov · 2026-05-06T13:21:39Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Gitdowski · 2026-05-06T13:21:43Z

For reference, at the current stage, a database can be migrated (requires write access) using:

# run once with app env
from amorphouspy_api.database import get_job_store, Job
from amorphouspy_api.models import JobSubmission
from amorphouspy_api.routers.jobs_helpers import _job_hash

store = get_job_store()
with store.session() as s:
    rows = s.query(Job).filter(Job.status == "completed").all()
    for row in rows:
        req = row.request_data or {}
        sim = req.get("simulation", {})
        if "structure_seed" not in sim:
            sim["structure_seed"] = 42
            req["simulation"] = sim
            sub = JobSubmission(**req)
            new_hash = _job_hash(sub, row.composition)
            row.request_hash = new_hash
            row.request_data = req
    s.commit()

Copilot

Pull request overview

This PR extends the API workflow to accept a structure_seed parameter for structure generation, ensuring that identical compositions/settings can produce different initial structures (and thus distinct jobs) when the seed changes, and that the seed is included in job hashing for caching/deduplication.

Changes:

Add structure_seed to MeltQuenchParams and include it in the job hash via submission.simulation.model_dump().
Forward structure_seed to core structure generation as random_seed in generate_structure().
Add API and workflow tests to verify hashing, forwarding, and request acceptance.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
amorphouspy_api/src/amorphouspy_api/models.py	Adds `structure_seed` to the simulation parameter model exposed by the API.
amorphouspy_api/src/amorphouspy_api/workflows/meltquench.py	Passes the new seed through to `get_structure_dict(..., random_seed=...)` during structure generation.
amorphouspy_api/src/tests/test_jobs.py	Adds tests covering hash changes, seed forwarding, and POST acceptance.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Atilaac · 2026-05-06T13:45:52Z

Thanks @Gitdowski, can you add this to the documentation as well!

Gitdowski · 2026-05-07T08:34:59Z

@Atilaac: Good catch. Is this another place in the documentation where you would add a word on this, or is this fine?

ltalirz · 2026-05-07T08:42:48Z


+If specified manually, using (combinations of) an unrealistically high `density`, too high `min_distance`, too many `target_atoms` or `n_molecules`, or too few `max_attempts_per_atom` can lead to placement failures. 
+
+Internally, the random placement of atoms is controlled by the `structure_seed` parameter. This ensures reproducibility on the one hand. On the other hand, if statistics are checked and the same system is simulated several times it is recommended to use different seeds for each run to get a better sampling of the configuration space.


Can you check: at the moment, if I specify the same structure_seed, do I get the same structure?
My suspicion would be: yes, on the same computer; maybe not on different computers.

What you may want to communicate here is: if you use a fixed structure_seed , you may get the same structure (not guaranteed). If you use a different structure_seed, you are guaranteed to get a different structure

I changed the RNG to explicitely call the PCG64, PreComputed Generator, instead of relying on the default_rng() (which currently is the PCG64). Therefore, we are safe in case numpy decides to switch to a different RNG for a future release.
Furthermore, PCG64 is frozen and designed to be reproducible across different CPU achitectures and operating systems.

I just checked this for a small system on two different machines. With the default seed the same structures are generated on both machines. Also using a non-default seed produces the same structures on both machines (and of course different from the structures generated by the default seed).

Gitdowski added 2 commits May 6, 2026 14:59

add seed to the pipeline

cb88f62

add tests for structure_seed

e8840a6

Gitdowski requested a review from Copilot May 6, 2026 13:20

github-actions Bot added the type: feature Changelog: new feature or performance improvement → bumps minor version label May 6, 2026

Copilot started reviewing on behalf of Gitdowski May 6, 2026 13:20 View session

Gitdowski requested a review from ltalirz May 6, 2026 13:22

Copilot AI reviewed May 6, 2026

View reviewed changes

Comment thread amorphouspy_api/src/amorphouspy_api/models.py

Gitdowski mentioned this pull request May 6, 2026

feature request: average across multiple samples #191

Open

Potential fix for pull request finding

e4d19f0

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Gitdowski and others added 2 commits May 7, 2026 10:29

Merge branch 'main' into feat/seed-for_structure-generation

fccd672

update structure.md (on structure_seed)

544765d

ltalirz reviewed May 7, 2026

View reviewed changes

Gitdowski added 2 commits May 7, 2026 11:02

rng reproducibility

b1096f0

clarify reproducibility

9db9d75

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/seed for structure generation#336

Feat/seed for structure generation#336
Gitdowski wants to merge 7 commits intomainfrom
feat/seed-for_structure-generation

Gitdowski commented May 6, 2026

Uh oh!

codecov Bot commented May 6, 2026

Uh oh!

Gitdowski commented May 6, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Atilaac commented May 6, 2026

Uh oh!

Gitdowski commented May 7, 2026

Uh oh!

ltalirz May 7, 2026

Uh oh!

Gitdowski May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		If specified manually, using (combinations of) an unrealistically high `density`, too high `min_distance`, too many `target_atoms` or `n_molecules`, or too few `max_attempts_per_atom` can lead to placement failures.

		Internally, the random placement of atoms is controlled by the `structure_seed` parameter. This ensures reproducibility on the one hand. On the other hand, if statistics are checked and the same system is simulated several times it is recommended to use different seeds for each run to get a better sampling of the configuration space.

Conversation

Gitdowski commented May 6, 2026

Uh oh!

codecov Bot commented May 6, 2026

Codecov Report

Uh oh!

Gitdowski commented May 6, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Atilaac commented May 6, 2026

Uh oh!

Gitdowski commented May 7, 2026

Uh oh!

ltalirz May 7, 2026

Choose a reason for hiding this comment

Uh oh!

Gitdowski May 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants