arxiv:2606.19162
🤝 Open to Collab
Felix Friedrich
felfri
AI & ML interests
Multimodal AI; Post-training/Inference; AI Safety; AI Alignment
Recent Activity
authored a paper about 9 hours ago
No Safe Dose: How Training Data Drives Unsafe Image Generation upvoted a paper 1 day ago
LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking upvoted a paper 1 day ago
No Safe Dose: How Training Data Drives Unsafe Image Generation