Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago
violetxi/opsd-physics-qwen3-4b-forward-kl-psonly published a model 1 day ago
violetxi/opsd-physics-qwen3-4b-forward-kl-psonly updated a model 4 days ago
violetxi/qwen35-4b-terminal-wm-summary-mixed