Similar Items: CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation
- BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD
- AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images
- Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study
- PhyGround: Benchmarking Physical Reasoning in Generative World Models
- A Benchmark for Interactive World Models with a Unified Action Generation Framework
- GlazyBench: A Benchmark for Ceramic Glaze Property Prediction and Image Generation