Deep Reinforcement Learning (DRL) for Coordinated Payload Transport in Biped-Wheeled Robots

A PyTorch-based framework, built around a unified kinematic model, that trains a DRL agent to coordinate two biped-wheeled robots for cooperative payload transport.
A demonstration repository showing:
- Deep Reinforcement Learning-based payload transport in simulation
- Sim-to-Real deployment on the Diablo biped-wheeled robots
Isaac Lab & Isaac Sim
- NVIDIA Omniverse Isaac Sim (4.5.0) & Isaac Lab (2.1) installed
- Installation guide
Workstation Requirements
- GPU: NVIDIA RTX 30xx series or higher (≥ 16 GB VRAM)
- CPU: Intel Core i7 (9th Gen) or AMD Ryzen 7
- RAM: ≥ 32 GB
- Ubuntu 22.04 LTS
Diablo Robot Hardware
- Direct Drive Technology's (DDT) Diablo biped-wheeled robots (x2)
- ROS Noetic (Linux)
- Diablo URDF + control stack
OptiTrack Motion Capture
- Motive v3.0+ installed & calibrated
- OptiTrack (NatNet) streaming engine with mocap_optitrack ROS package
- Guide
- dual_diablo - Contains the environment file and RL agent files
- dual_diablo.py - Contains the actuator and additional configurations of the biped-wheeled robot in simulation
- USD_DualDiablo - Contains the USD files of the payload and biped-wheeled robot
Copy the folder dual_diablo (environment and RL agent files) and dual_diablo.py (robot config file) into the IsaacLab directory as shown:
```
IsaacLab
└── source
    ├── isaaclab_assets
    │   └── isaaclab_assets
    │       └── robots
    │           └── dual_diablo.py
    └── isaaclab_tasks
        └── isaaclab_tasks
            └── direct
                └── dual_diablo
```
Ensure the waypoint/payload-path file paths are updated in the dual_diablo_env.py file and the USD asset path in the dual_diablo.py file.
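For orientation, the USD path edit in dual_diablo.py typically lives inside the robot's articulation configuration. A minimal sketch below uses Isaac Lab's `ArticulationCfg`/`UsdFileCfg` API; the variable name and path are placeholders, not the repository's actual values:

```python
import isaaclab.sim as sim_utils
from isaaclab.assets import ArticulationCfg

# Placeholder path -- point this at your local copy of the USD_DualDiablo assets.
DUAL_DIABLO_CFG = ArticulationCfg(
    spawn=sim_utils.UsdFileCfg(
        usd_path="/home/Your_Directory/USD_DualDiablo/diablo.usd",  # edit me
    ),
    # actuator and initial-state settings from the repository's dual_diablo.py go here
)
```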
Train:
```shell
./isaaclab.sh -p scripts/reinforcement_learning/rsl_rl/train.py --task DualDiablo_Task_Simple --num_envs 4096 --headless
```
Play a trained checkpoint:
```shell
./isaaclab.sh -p scripts/reinforcement_learning/rsl_rl/play.py --task DualDiablo_Task_Simple --num_envs 4 --checkpoint /home/Your_Directory/IsaacLab/logs/rsl_rl/dualdiablo_rsl_rl/2025-05-13_20-18-28/model_500.pt
```
Demo videos: DualDiabloTraining.2.mp4, SimGradualSineS1.2.mp4
The deployment is organized into three phases:
- OptiTrack Setup
- Robot & Payload Setup
- DRL Interface Initialization
- Cameras: 12 OptiTrack units arranged around the workspace
- Reflective markers: ≥ 3 per rigid body (we use 4 for extra accuracy)

Figure 2: Markers on robot & payload (geometric center tracking)
Install ROS-OptiTrack packages
Follow the OptiTrack + ROS tutorial.
Launch motion capture
```shell
roslaunch mocap_optitrack mocap.launch
```
Verify topics
```shell
rostopic list
```
Note: Repeat for each robot, swapping in its specific IP, hostname, and topic names.
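Downstream, the DRL interface needs each body's planar heading, while mocap_optitrack publishes orientation as a quaternion (geometry_msgs/PoseStamped). A minimal yaw-extraction helper is sketched below; it is illustrative, and the repository's actual conversion code may differ:

```python
import math

def quat_to_yaw(x: float, y: float, z: float, w: float) -> float:
    """Extract planar heading (yaw, radians) from a unit quaternion (x, y, z, w)."""
    # Standard ZYX-Euler yaw formula; valid for normalized quaternions.
    return math.atan2(2.0 * (w * z + x * y), 1.0 - 2.0 * (y * y + z * z))

# Example: a 90-degree rotation about the z-axis yields a yaw of pi/2.
yaw = quat_to_yaw(0.0, 0.0, math.sin(math.pi / 4), math.cos(math.pi / 4))
```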
- ROS Noetic (ROS 1)
- Diablo ROS1 SDK
```shell
mkdir -p ~/catkin_ws/src
cd ~/catkin_ws/src
git clone https://github.com/DDTRobot/diablo-sdk-v1.git
cd ~/catkin_ws
catkin_make
```
Add to your ~/.bashrc (replace <robot_ip> and <network_ip>):
```shell
source /opt/ros/noetic/setup.bash
source ~/catkin_ws/devel/setup.bash
export ROS_HOSTNAME="<robot_ip>"
export ROS_MASTER_URI="http://<network_ip>:11311"
```
- C++ Controller
  - File: diablo-sdk-v1/example/movement_ctrl/main.cpp
  - Swap in your cmd_vel_ego / cmd_vel_follower topics (see Diablo_Robot_Code/main.cpp for reference).
- Python Teleop
  - Script: script/teleop.py
  - Publishers: DJ_teleop (ego) and DJ_teleop2 (follower)
- Launch
```shell
rosrun diablo_sdk movement_ctrl_example
python3 teleop.py
```
Press k for mid-height, j for full-height.
Run the ONNX-based ROS interface in the Real_World_Code folder on the robot or the workstation (requires ONNX Runtime):
```shell
python3 Diablo_ROS_interface_ONNX_RSLRL.py
```
This script subscribes to OptiTrack topics, feeds observations into your DRL model, and publishes the resulting body twists back to each robot.
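On the robot side, a published body twist (linear velocity v, angular velocity omega) maps to left/right wheel speeds via standard differential-drive kinematics. The helper below is an illustrative sketch only; the wheel radius and track width are placeholder values, not Diablo's actual dimensions:

```python
def twist_to_wheel_speeds(v: float, omega: float,
                          wheel_radius: float = 0.09,
                          track_width: float = 0.48) -> tuple[float, float]:
    """Map a body twist (v in m/s, omega in rad/s) to (left, right) wheel
    angular velocities in rad/s using differential-drive kinematics.

    wheel_radius and track_width are illustrative placeholders, not the
    Diablo's real dimensions.
    """
    v_left = v - 0.5 * track_width * omega    # left wheel surface speed
    v_right = v + 0.5 * track_width * omega   # right wheel surface speed
    return v_left / wheel_radius, v_right / wheel_radius

# Pure rotation in place: wheels spin in opposite directions at equal speed.
wl, wr = twist_to_wheel_speeds(0.0, 1.0)
```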

Real World Video of Biped-Wheeled Robot: Diablo
If you find this research useful, please consider citing the paper:

```bibtex
@misc{mehta2024drl,
  author = {Mehta, Dhruv and Joglekar, Ajinkya and Krovi, Venkat},
  year   = {2024},
  month  = {09},
  title  = {Deep Reinforcement Learning for Coordinated Payload Transport in Biped-Wheeled Robots},
  doi    = {10.13140/RG.2.2.10251.71207/1}
}
```
