Similar Items: Replication in Graph Partitioning and Scheduling Problems
- On the Distortion of Partitioning Performance by Random Quantum Circuits
- Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale
- FATE: Future-State-Aware Scheduling for Heterogeneous LLM Workflows
- Taming Request Imbalance: SLO-Aware Scheduling for Disaggregated LLM Inference
- SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters
- GPU-Accelerated Simulations of Problems with Moving Boundaries and Fluid-Structure Interaction at Extreme Scales