Similar Items: FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving
- nvPAX: Constrained Optimization for Dynamic Power Allocation in Hierarchical and Multi-Tenant Systems
- EdgeServing: Deadline-Aware Multi-DNN Serving at the Edge
- ClusterLess: Deadline-Aware Serverless Workflow Orchestration on Federated Edge Clusters
- Orchestrating Serverless Applications in the Edge Cloud Space Continuum: What Breaks and What is Next?
- Space Network of Experts: Architecture and Expert Placement
- Coral: Cost-Efficient Multi-LLM Serving over Heterogeneous Cloud GPUs