Similar Items: BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing -- A Proposed Framework
- RecRM-Bench: Benchmarking Multidimensional Reward Modeling for Agentic Recommender Systems
- OBLIQ-Bench: Exposing Overlooked Bottlenecks in Modern Retrievers with Latent and Implicit Queries
- FollowTable: A Benchmark for Instruction-Following Table Retrieval
- TabEmbed: Benchmarking and Learning Generalist Embeddings for Tabular Understanding
- InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search
- ASTRA-QA: A Benchmark for Abstract Question Answering over Documents