Similar Items:
- Discrete Flow Matching for Offline-to-Online Reinforcement Learning
- Reward Hacking in Rubric-Based Reinforcement Learning
- Towards Metric-Faithful Neural Graph Matching
- Possibilistic Predictive Uncertainty for Deep Learning
- Learning CLI Agents with Structured Action Credit under Selective Observation
- LLM as Clinical Graph Structure Refiner: Enhancing Representation Learning in EEG Seizure Diagnosis
- Learning Multimodal Energy-Based Model with Multimodal Variational Auto-Encoder via MCMC Revision