Similar Items: PrepBench: How Far Are We from Natural-Language-Driven Data Preparation?
- Anatomy of a Query: W5H Dimensions and FAR Patterns for Text-to-SQL Evaluation
- FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting
- Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies
- An Extensible and Verifiable Language for Query Rewrite Rules
- DataClaw: An Autonomous Data Agent with Instant Messaging Integration
- SEMA-SQL: Beyond Traditional Relational Querying with Large Language Models