Similar Items: Learning, Fast and Slow: Towards LLMs That Adapt Continually
- Exploration Hacking: Can LLMs Learn to Resist RL Training?
- On the Hardness of Junking LLMs
- Memory-Efficient Continual Learning with CLIP Models
- Fast Byte Latent Transformer
- AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents
- SAVGO: Learning State-Action Value Geometry with Cosine Similarity for Continuous Control