zafir stojanovski
- about
- senior ml engineer @ loka
- co-founded uxo.ai
- master in cs @ uni tuebingen
- papers
- momentum-based weight interpolation for continual learning
interpolate @ neurips 2022 (best paper award) - reasoning gym: environments for rl with verifiable rewards
arxiv 2025 - open-source
- writing
- posts
- 2024-08-04 key-value (kv) cache
- 2024-08-03 grouped-query attention (gqa)
- 2024-08-03 rmsnorm
- 2024-08-02 gated linear units (glu)
- 2024-07-15 rotary position embeddings (rope)
- 2024-07-13 adamw
- 2024-07-12 elastic weight consolidation (ewc)