Similar Items: Reparameterized Complex-valued Neurons Can Efficiently Learn More than Real-valued Neurons via Gradient Descent
- CHANI: Correlation-based Hawkes Aggregation of Neurons with bio-Inspiration
- Optimization and Generalization of Gradient Descent for Shallow ReLU Networks with Minimal Width
- A Symplectic Analysis of Alternating Mirror Descent
- Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection
- Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood
- Refined Risk Bounds for Unbounded Losses via Transductive Priors