Similar Items: A Semantic Quantum Circuit Cache for Scalable and Distributed Quantum-Classical Workflows
- Distributed Quantum Circuit Optimisation: Evaluating Global and Local encodings
- On the Distortion of Partitioning Performance by Random Quantum Circuits
- Irminsul: MLA-Native Position-Independent Caching for Agentic LLM Serving
- RcLLM: Accelerating Generative Recommendation via Beyond-Prefix KV Caching
- FATE: Future-State-Aware Scheduling for Heterogeneous LLM Workflows
- SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters