Similar Items: Cross-Layer Energy Analysis of Multimodal Training on Grace Hopper Superchips
- FedPLT: Scalable, Resource-Efficient, and Heterogeneity-Aware Federated Learning via Partial Layer Training
- FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training
- Efficient Training on Multiple Consumer GPUs with RoundPipe
- ResiHP: Taming LLM Training Failures with Dynamic Hybrid
- A Study on the Performance of Distributed Training of Data-driven CFD Simulations
- AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs