Similar Items: VitaLLM: A Versatile, Ultra-Compact Ternary LLM Accelerator with Dependency-Aware Scheduling
- VitaLLM: A Versatile and Tiny Accelerator for Mixed-Precision LLM Inference on Edge Devices
- LLM-Driven Design Space Exploration of FPGA-based Accelerators
- RCW-CIM: A Digital CIM-based LLM Accelerator with Read-Compute/Write
- Not All Thoughts Need HBM: Semantics-Aware Memory Hierarchy for LLM Reasoning
- HyDRA: Deadline and Reuse-Aware Cacheability for Hardware Accelerators
- Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference