🤝 Open to Collab

Felix Friedrich

felfri

·

https://felifri.github.io/

AI & ML interests

Multimodal AI; Post-training/Inference; AI Safety; AI Alignment

Recent Activity

authored a paper about 9 hours ago

No Safe Dose: How Training Data Drives Unsafe Image Generation

upvoted a paper 1 day ago

LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking

upvoted a paper 1 day ago

No Safe Dose: How Training Data Drives Unsafe Image Generation

View all activity

Organizations