Similar Items: Trust, but Verify: Peeling Low-Bit Transformer Networks for Training Monitoring
- Verifier-Backed Hard Problem Generation for Mathematical Reasoning
- Quantum Interval Bound Propagation for Certified Training of Quantum Neural Networks
- Globally Optimal Training of Spiking Neural Networks via Parameter Reconstruction
- Spiking Sequence Machines and Transformers
- Fast Byte Latent Transformer
- Transformers with Selective Access to Early Representations