Similar Items:
- Lifting to tensors when compiling scientific computing workloads for AI Engines
- Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems
- Nitsum: Serving Tiered LLM Requests with Adaptive Tensor Parallelism
- AutoSP: Unlocking Long-Context LLM Training Via Compiler-Based Sequence Parallelism
- Akita: A High Usability Simulation Framework for Computer Architecture
- Accelerating MoE with Dynamic In-Switch Computing on Multi-GPUs
- Adaptation of AI-accelerated CFD Simulations to the IPU platform