Conversation

@myandpr commented Oct 11, 2025

Problem

We encountered an issue when submitting a job using the following command:
raydp-submit --ray-conf /root/ray.conf --py-files file.zip main.py

The file-distribution options (such as --py-files) are not propagated to the executors. As a result, the executors cannot access or import the code from the files specified in --py-files.

Below is the error stack trace:

File "/usr/local/spark-current/python/lib/pyspark.zip/pyspark/worker.py", line 601, in main
    func, profiler, deserializer, serializer = read_command(pickleSer, infile)
  File "/usr/local/spark-current/python/lib/pyspark.zip/pyspark/worker.py", line 71, in read_command
    command = serializer._read_with_length(file)
  File "/usr/local/spark-current/python/lib/pyspark.zip/pyspark/serializers.py", line 160, in _read_with_length
    return self.loads(obj)
  File "/usr/local/spark-current/python/lib/pyspark.zip/pyspark/serializers.py", line 430, in loads
    return pickle.loads(obj, encoding=encoding)
ModuleNotFoundError: No module named 'XXX'
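
A quick way to confirm the symptom from the driver (a minimal diagnostic sketch; print_distribution_confs is a hypothetical helper, and it reads the standard Spark conf keys that back --files and --py-files):

def print_distribution_confs(spark):
    # With the bug present, the artifacts passed via --files / --py-files
    # never land in these conf keys, so executors cannot import them.
    conf = spark.sparkContext.getConf()
    for key in ("spark.files", "spark.submit.pyFiles"):
        print(key, "=", conf.get(key, "<unset>"))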

How to solve it

Previously, the Ray cluster master was grouped under OTHERS, so the option assigner skipped writing --files/--archives into spark.files and spark.archives, which left executors without the distributed files.

This PR explicitly recognizes ray:// masters as RAY and includes RAY in the relevant distribution logic, so those options now take effect.
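
To make the mechanism concrete, here is a minimal Python sketch of the idea (the real logic lives in the customized SparkSubmit.scala and uses Scala's OptionAssigner; the constants, masks, and function below are illustrative assumptions, not the actual code):

# Toy model of option assignment: each CLI option carries a bitmask of
# cluster managers and is written into the Spark conf only when the
# detected manager is in that mask. All names and values are illustrative.
YARN, STANDALONE, OTHERS, RAY = 0x1, 0x2, 0x4, 0x8

def assign_option(value, manager_mask, detected_manager, conf, conf_key):
    if value is not None and (detected_manager & manager_mask) != 0:
        conf[conf_key] = value

conf = {}
# Before the fix: a ray:// master was classified as OTHERS, and OTHERS was
# missing from the mask for spark.files, so the option was silently dropped.
assign_option("file.zip", YARN | STANDALONE, OTHERS, conf, "spark.files")
print(conf)  # {} -- executors never receive file.zip

# After the fix: ray:// is recognized as RAY, and RAY is added to the mask.
assign_option("file.zip", YARN | STANDALONE | RAY, RAY, conf, "spark.files")
print(conf)  # {'spark.files': 'file.zip'}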

myandpr force-pushed the support-distribute-files branch from 41b08b0 to b6ac995 on October 11, 2025 10:04
@myandpr (Author) commented Oct 11, 2025

@carsonwang @pang-wu Could you please help review this PR? Thank you very much!

@pang-wu (Collaborator) left a comment

@myandpr Thanks for the contribution; can you add a unit test?

myandpr force-pushed the support-distribute-files branch from acd2d01 to 72cd4e6 on October 12, 2025 16:14
module_path.write_text("VALUE = 'pyfiles works'\n")

py_files_path = tmp_path / "extra_module.zip"
with zipfile.ZipFile(py_files_path, "w") as zip_file:
    # assumed continuation of the truncated excerpt: add the module to the zip
    zip_file.write(module_path, arcname=module_path.name)
A collaborator commented on this diff:

You don't have to zip it; Spark should support submitting .py files directly.
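
For illustration, the test setup could then drop the zip step entirely (a sketch under the reviewer's suggestion; make_py_files_arg is a hypothetical helper name):

from pathlib import Path

def make_py_files_arg(tmp_path: Path) -> str:
    # Per the review suggestion: no need to zip a single module, since
    # --py-files accepts plain .py files (as well as .zip and .egg).
    module_path = tmp_path / "extra_module.py"
    module_path.write_text("VALUE = 'pyfiles works'\n")
    return str(module_path)  # pass this value straight to --py-files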

@carsonwang (Collaborator) commented

@myandpr Thank you for the PR. The cluster manager name "OTHERS" is something we added to this customized SparkSubmit.scala for Ray. We chose the general name "OTHERS" instead of "RAY" because in the early days we tried to upstream the changes to Spark. I feel there is no need to have both "OTHERS" and "RAY" in the file. A few options such as args.jars were already added for "OTHERS", but a few others are missing, as you have seen. I think you can just continue to use "OTHERS" and add the missing options.
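
In the same toy model as the sketch above, that alternative would look like this (again an illustrative assumption, not the actual SparkSubmit.scala code):

# carsonwang's alternative: keep classifying ray:// masters as OTHERS, and
# add OTHERS to the masks of the options that were missing for it.
YARN, STANDALONE, OTHERS = 0x1, 0x2, 0x4

def assign_option(value, manager_mask, detected_manager, conf, conf_key):
    if value is not None and (detected_manager & manager_mask) != 0:
        conf[conf_key] = value

conf = {}
assign_option("file.zip", YARN | STANDALONE | OTHERS, OTHERS, conf, "spark.files")
print(conf)  # {'spark.files': 'file.zip'} -- no separate RAY constant needed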

@pang-wu (Collaborator) commented Nov 4, 2025

I created another PR: #441
