Full Text Available
Access Full Text at Repository
Search Results - "Technology"
-
FalconGEMM: Surpassing Hardware Peaks with Lower-Complexity Matrix Multiplication
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
Online Article RSS Article -
Relay Buffer Independent Communication over Pooled HBM for Efficient MoE Inference on Ascend
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
From Coordinate Matching to Structural Alignment: Rethinking Prototype Alignment in Heterogeneous Federated Learning
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
MoE-Hub: Taming Software Complexity for Seamless MoE Overlap with Hardware-Accelerated Communication on Multi-GPU Systems
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
SuperPaymaster: Eliminating Centralized Signer Authority via Asset-Oriented Abstraction to Reconcile Usability and Decentralization in Account Abstraction
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
A Privacy-Preserving Machine Learning Framework for Edge Intelligence: An Empirical Analysis
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
LLM-Enhanced Deep Reinforcement Learning for Task Offloading in Collaborative Edge Computing
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Irminsul: MLA-Native Position-Independent Caching for Agentic LLM Serving
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Accelerating MoE with Dynamic In-Switch Computing on Multi-GPUs
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
A Scalable Digital Twin Framework for Energy Optimization in Data Centers
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
EdgeServing: Deadline-Aware Multi-DNN Serving at the Edge
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
OpenG2G: A Simulation Platform for AI Datacenter-Grid Runtime Coordination
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Nitsum: Serving Tiered LLM Requests with Adaptive Tensor Parallelism
Published in ArXiv cs.DC Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Low-Latency Out-of-Core ANN Search in High-Dimensional Space
Published in ArXiv cs.DB Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
An Extensible and Verifiable Language for Query Rewrite Rules
Published in ArXiv cs.DB Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Anatomy of a Query: W5H Dimensions and FAR Patterns for Text-to-SQL Evaluation
Published in ArXiv cs.DB Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
Patch2Vuln: Agentic Reconstruction of Vulnerabilities from Linux Distribution Binary Patches
Published in ArXiv cs.CR Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
FedAttr: Towards Privacy-preserving Client-Level Attribution in Federated LLM Fine-tuning
Published in ArXiv cs.CR Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text
-
On the Security of Research Artifacts
Published in ArXiv cs.CR Recent Papers (2026)Subjects: “…Engineering & Technology…”
Get full text