Similar Items: OxyEcomBench: Benchmarking Multimodal Foundation Models across E-Commerce Ecosystems
- EpiCastBench: Datasets and Benchmarks for Multivariate Epidemic Forecasting
- EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce
- FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting
- Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies
- HOME-KGQA: A Benchmark Dataset for Multimodal Knowledge Graph Question Answering on Household Daily Activities
- Prior-Aligned Data Cleaning for Tabular Foundation Models