Claas A. Voelcker
PhD Candidate at UofT, RL researcher focused on too many things, he/him, 🏳️🌈 🤖 🧙

W1140-108 College Street
SR Innovation Campus
Toronto, ON
M5G 0C6
I am a PhD student in Reinforcement and Machine Learning at the University of Toronto and the Vector Institute, supervised by Profs. Amir-massoud Farahmand and Igor Gilitschenski. In November, I will be starting a postdoc at UT Austin with Profs. Peter Stone and Amy Zhang.
My research focuses on model based reinforcement learning and closing the gap between learning acurate models for future predictions and learning high performing models for planning. I am interested in using techniques for representation and world model learning to stablize notoriously brittle Deep Reinforcement Learning approaches. Finally, I like thinking about how we can do better science in RL by thinking about what problems we should be benchmarking our exciting advances on.
Originally from Germany, I received a Bachelor and Master degree from the University of Darmstadt with Honors. There, I had the great pleasure to be supervised and mentored by Profs. Kristian Kersting and Jan Peters.
I am proud to serve as a core organizer for Queer in AI, where I help promote the interests of queer researchers and practitioners at AI /ML conferences and in the wider community.
news
Jul 01, 2025 | Our paper Calibrated Value-Aware Model Learning with Probabilistic Environment Models will be presented at ICML 2025 in Vancouver next week! Let me know if you want to meet up for a coffee. |
---|---|
Jun 13, 2025 | I accepted a postdoc position with Peter Stone and Amy Zhang at UT Austin. Looking forward to work with so many amazing students and faculty in the Texas Robotics Ecosystem on RL that matters for real-world robotics. |
Mar 13, 2025 | Our paper MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL was awarded a spotlight award at ICLR 2025! See you in Singapore. |
Oct 13, 2024 | I made a new website! |
latest posts
Mar 15, 2025 | loss functions and calibration |
---|---|
Mar 15, 2025 | reward design and termination |