Similar Items: Step Rejection Fine-Tuning: A Practical Distillation Recipe
- Trajectory as the Teacher: Few-Step Discrete Flow Matching via Energy-Navigated Distillation
- MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning
- Rethinking Local Learning: A Cheaper and Faster Recipe for LLM Post-Training
- Benchmarking Parameter-Efficient Fine-Tuning of Large Language Models for Low-Resource Tajik Text Generation with the Tajik Web Corpus
- KL for a KL: On-Policy Distillation with Control Variate Baseline
- Beyond "I cannot fulfill this request": Alleviating Rigid Rejection in LLMs via Label Enhancement