1616</div >
1717
1818
19+ <div align =" center " >
20+
1921[ ![ License] ( https://img.shields.io/github/license/CrayLabs/SmartSim )] ( https://github.com/CrayLabs/SmartSim/blob/master/LICENSE.md )
2022![ GitHub last commit] ( https://img.shields.io/github/last-commit/CrayLabs/SmartSim )
2123![ GitHub deployments] ( https://img.shields.io/github/deployments/CrayLabs/SmartSim/github-pages?label=doc%20build )
2729[ ![ codecov] ( https://codecov.io/gh/CrayLabs/SmartSim/branch/develop/graph/badge.svg?token=96HFI2F45E )] ( https://codecov.io/gh/CrayLabs/SmartSim )
2830[ ![ Downloads] ( https://static.pepy.tech/personalized-badge/smartsim?period=total&units=international_system&left_color=grey&right_color=orange&left_text=Downloads )] ( https://pepy.tech/project/smartsim )
2931
32+ </div >
33+
3034------------
3135
3236# SmartSim
@@ -69,8 +73,6 @@ exchanged between applications at runtime without the utilization of MPI.
6973 - [ Local Launch] ( #local-launch )
7074 - [ Interactive Launch] ( #interactive-launch )
7175 - [ Batch Launch] ( #batch-launch )
72- - [ Ray] ( #ray )
73- - [ Ray on HPC] ( #ray-on-hpc )
7476- [ SmartRedis] ( #smartredis )
7577 - [ Tensors] ( #tensors )
7678 - [ Datasets] ( #datasets )
@@ -97,8 +99,8 @@ before using it on your system. Each tutorial is a Jupyter notebook that can be
9799which will run a jupyter lab with the tutorials, SmartSim, and SmartRedis installed.
98100
99101``` bash
100- docker pull ghcr.io/craylabs/smartsim-tutorials:v0.4.1
101- docker run -p 8888:8888 ghcr.io/craylabs/smartsim-tutorials:v0.4.1
102+ docker pull ghcr.io/craylabs/smartsim-tutorials:v0.4.2
103+ docker run -p 8888:8888 ghcr.io/craylabs/smartsim-tutorials:v0.4.2
102104# click on link to open jupyter lab
103105```
104106
@@ -284,7 +286,6 @@ initialization. Local launching does not support batch workloads.
284286
285287# Infrastructure Library Applications
286288 - Orchestrator - In-memory data store and Machine Learning Inference (Redis + RedisAI)
287- - Ray - Distributed Reinforcement Learning (RL), Hyperparameter Optimization (HPO)
288289
289290## Redis + RedisAI
290291
@@ -398,53 +399,6 @@ exp.stop(db_cluster)
398399python run_db_batch.py
399400```
400401
401- -----
402- ## Ray
403-
404- Ray is a distributed computation framework that supports a number of applications
405- - RLlib - Distributed Reinforcement Learning (RL)
406- - RaySGD - Distributed Training
407- - Ray Tune - Hyperparameter Optimization (HPO)
408- - Ray Serve - ML/DL inference
409- As well as other integrations with frameworks like Modin, Mars, Dask, and Spark.
410-
411- Historically, Ray has not been well supported on HPC systems. A few examples exist,
412- but none are well maintained. Because SmartSim already has launchers for HPC systems,
413- launching Ray through SmartSim is a relatively simple task.
414-
415- ### Ray on HPC
416-
417- Below is an example of how to launch a Ray cluster on an HPC system and connect to it.
418- In this example, we set ` batch=True ` , which means that the cluster will be started
419- requesting an allocation through the scheduler (Slurm, PBS, etc). If this code
420- is run within a sufficiently large interactive allocation, setting ` batch=False `
421- will spin the Ray cluster on the allocated nodes.
422-
423- ``` Python
424- import ray
425-
426- from smartsim import Experiment
427- from smartsim.exp.ray import RayCluster
428-
429- exp = Experiment(" ray-cluster" , launcher = ' auto' )
430- # 3 workers + 1 head node = 4 node-cluster
431- cluster = RayCluster(name = " ray-cluster" , run_args = {},
432- ray_args = {" num-cpus" : 24 },
433- launcher = ' auto' , num_nodes = 4 , batch = True )
434-
435- exp.generate(cluster, overwrite = True )
436- exp.start(cluster, block = False , summary = True )
437-
438- # Connect to the Ray cluster
439- ctx = ray.init(f " ray:// { cluster.get_head_address()} :10001 " )
440-
441- # <run Ray tune, RLlib, HPO...>
442- ```
443-
444- * New in 0.4.0* the auto argument enables the Ray Cluster to be launched
445- across scheduler types. Both batch launch and interactive launch commands
446- will be automatically detected and used by SmartSim.
447-
448402------
449403# SmartRedis
450404
@@ -498,7 +452,7 @@ which will run a jupyter lab with the tutorials, SmartSim, and SmartRedis instal
498452
499453``` bash
500454docker pull ghcr.io/craylabs/smartsim-tutorials:v1
501- docker run -p 8888:8888 ghcr.io/craylabs/smartsim-tutorials:v0.4.1
455+ docker run -p 8888:8888 ghcr.io/craylabs/smartsim-tutorials:v0.4.2
502456```
503457Each of the following examples can be found in the
504458[ SmartSim documentation] ( https://www.craylabs.org/docs/tutorials/getting_started/getting_started.html ) .
@@ -683,17 +637,17 @@ from C, C++, Fortran and Python with the SmartRedis Clients:
683637 </thead >
684638 <tbody style =" text-align :center " >
685639 <tr>
686- <td rowspan="3">1.2.3-1.2.4 </td>
640+ <td rowspan="3">1.2.7 </td>
687641 <td>PyTorch</td>
688- <td>1.7 .x</td>
642+ <td>1.11 .x</td>
689643 </tr>
690644 <tr>
691645 <td>TensorFlow\Keras</td>
692- <td>2.4.x-2.5 .x</td>
646+ <td>2.8 .x</td>
693647 </tr>
694648 <tr>
695649 <td>ONNX</td>
696- <td>1.9 .x</td>
650+ <td>1.11 .x</td>
697651 </tr>
698652 <td rowspan="3">1.2.5</td>
699653 <td>PyTorch</td>
0 commit comments