Similar Items:
- Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models
- Crafting Reversible SFT Behaviors in Large Language Models
- Do Sparse Autoencoders Capture Concept Manifolds?
- Beyond Gaussian Bottlenecks: Topologically Aligned Encoding of Vision-Transformer Feature Spaces
- Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior
- Physiologically Grounded Driver Behavior Classification: SHAP-Driven Elite Feature Selection and Hybrid Gradient Boosting for Multimodal Physiological Signals
- Spectral Model eXplainer: A Chemically Grounded Explainability Framework for Spectral-Based Machine Learning Models