Similar Items:
- Weight-Decay Turns Transformer Loss Landscapes Villani: Functional-Analytic Foundations for Optimization and Generalization
- The Generalized Turing Test: A Foundation for Comparing Intelligence
- How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
- Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models
- Neural Weight Norm = Kolmogorov Complexity
- V4FinBench: Benchmarking Tabular Foundation Models, LLMs, and Standard Methods on Corporate Bankruptcy Prediction
- Spiking Sequence Machines and Transformers