Similar Items: Computation by infinite descent made explicit
- A Symplectic Analysis of Alternating Mirror Descent
- Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection
- Optimization and Generalization of Gradient Descent for Shallow ReLU Networks with Minimal Width
- Reparameterized Complex-valued Neurons Can Efficiently Learn More than Real-valued Neurons via Gradient Descent
- How Many Public Computers in the Library?
- Distributed Computing Algorithm of Nuclear Norm Minimization for Low-Rank Matrix Completion