Channels - Make Your LVLM KV Cache More Lightweight :: FRELIP Discovery

Similar Items: Make Your LVLM KV Cache More Lightweight

Quick Look
SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection
Quick Look
Where's the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Interventions
Quick Look
Beyond Pairs: Your Language Model is Secretly Optimizing a Preference Graph
Quick Look
LiVeAction: a Lightweight, Versatile, and Asymmetric Neural Codec Design for Real-time Operation
Quick Look
Don't Get Your Kroneckers in a Twist: Gaussian Processes on High-Dimensional Incomplete Grids
Quick Look
QKVShare: Quantized KV-Cache Handoff for Multi-Agent On-Device LLMs
Quick Look
Visual Latents Know More Than They Say: Unsilencing Latent Reasoning in MLLMs
Quick Look
Estimating the expected output of wide random MLPs more efficiently than sampling
Quick Look
Early Detection of Water Stress by Plant Electrophysiology: Machine Learning for Irrigation Management
Quick Look
Exponential families from a single KL identity
Quick Look
TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
Quick Look
A Unified Framework of Hyperbolic Graph Representation Learning Methods
Quick Look
Assessing the Role of Intersection Proximity in Pedestrian Crashes: Insights from Data Mining Approach
Quick Look
PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking
Quick Look
Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression
Quick Look
Neural Aided Kalman Filtering for UAV State Estimation in Degraded Sensing Environments
Quick Look
FiLMMeD: Feature-wise Linear Modulation for Cross-Problem Multi-Depot Vehicle Routing
Quick Look
Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing
Quick Look
Beyond Gaussian Bottlenecks: Topologically Aligned Encoding of Vision-Transformer Feature Spaces
Quick Look
Do Sparse Autoencoders Capture Concept Manifolds?
Quick Look
DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
Quick Look
Strait: Perceiving Priority and Interference in ML Inference Serving
Quick Look
PhyCo: Learning Controllable Physical Priors for Generative Motion
Quick Look
Mapping the Phase Diagram of the Vicsek Model with Machine Learning