Similar Items: IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
- VoiceBench: Benchmarking LLM-Based Voice Assistants
- Accelerating Language Model Workflows with Prompt Choreography
- On the Limitations of Language-targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning
- Truth or Mirage? Towards End-To-End Factuality Evaluation with LLM-Oasis
- 🧑‍🍳 Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
- Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts