Similar Items: Communication Offloading on SmartNIC DPUs: A Quantitative Approach
- LLM-Enhanced Deep Reinforcement Learning for Task Offloading in Collaborative Edge Computing
- ipc_shared_ptr: A Publish/Subscribe-Aware Smart Pointer for Cross-Process Object Lifetime Management
- Eliminating Hidden Serialization in Multi-Node Megakernel Communication
- SparseRL-Sync: Lossless Weight Synchronization with ~100x Less Communication
- ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training
- Relay Buffer Independent Communication over Pooled HBM for Efficient MoE Inference on Ascend