Similar Items: V4FinBench: Benchmarking Tabular Foundation Models, LLMs, and Standard Methods on Corporate Bankruptcy Prediction
- TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
- AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents
- Optimal Posterior Sampling for Policy Identification in Tabular Markov Decision Processes
- TabSurv: Adapting Modern Tabular Neural Networks to Survival Analysis
- On the Hardness of Junking LLMs
- The Generalized Turing Test: A Foundation for Comparing Intelligence