arxiv:2509.25049
Bingrui Li
Bingrui
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
On the Optimization and Generalization of Two-layer Transformers with
Sign Gradient Descent
upvoted
a
paper
about 2 months ago
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative
Decoders
authored
a paper
2 months ago
Memory Efficient Optimizers with 4-bit States
Organizations
None yet