Similar Items: CuLifter: Lifting GPU Binaries to Typed IR
- PipeRTL: Timing-Aware Pipeline Optimization at IR-Level for RTL Generation
- The Anatomy of Silent Data Corruption: GPU Error Pattern Study and Modeling Guidance
- DPU or GPU for Accelerating Neural Networks Inference -- Why not both? Split CNN Inference
- TLX: Hardware-Native, Evolvable MIMW GPU Compiler for Large-scale Production Environments
- AHASD: Asynchronous Heterogeneous Architecture for LLM Adaptive Drafting Speculative Decoding on Mobile Devices
- RecFlash: Fast Recommendation System on In-Storage Computing with Frequency-Based Data Mapping