zafir stojanovski

about

senior ml engineer @ loka
co-founded uxo.ai
master in cs @ uni tuebingen

papers

momentum-based weight interpolation for continual learning
interpolate @ neurips 2022 (best paper award)
reasoning gym: environments for rl with verifiable rewards
arxiv 2025

open-source

writing

posts

2024-08-04 key-value (kv) cache
2024-08-03 grouped-query attention (gqa)
2024-08-03 rmsnorm
2024-08-02 gated linear units (glu)
2024-07-15 rotary position embeddings (rope)
2024-07-13 adamw
2024-07-12 elastic weight consolidation (ewc)

x linkedin github