Welcome to the Moodist API documentation. Moodist is a PyTorch extension library that implements a high-performance process group for distributed computing, built on RDMA.
compile_op- Compile custom collective operations for arbitrary data movement patterns between ranksQueue- Distributed queue for inter-rank communication of tensors and objects
TcpStore- Distributed, decentralized key-value store for process group coordination
Documentation for additional Moodist features:
MoodistProcessGroup- Process group managementserialize/deserialize- Object serialization for distributed communicationenable_profiling,enable_cuda_allocator,enable_cpu_allocator- Performance tuningcuda_copy- CUDA tensor operations
For installation instructions and basic usage, see the main README.
All API documentation pages include practical examples. For a comprehensive guide to compile_op usage patterns, see the compile_op examples section.