Similar Items: Token Arena: A Continuous Benchmark Unifying Energy and Cognition in AI Inference
- AI Inference as Relocatable Electricity Demand: A Latency-Constrained Energy-Geography Framework
- SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters
- Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel
- CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure
- Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents
- Stochastic Sparse Attention for Memory-Bound Inference