Uljad Berdica
AI Researcher and LLM Prompter
I am a 2022 Rhodes Scholar and PhD student in the CDT-AIMS program at the University of Oxford focusing on Reinforcement Learning, World Models, Exploration and Agentic LLMs. Working on meaningful AI that learns from experience, under the supervision of Prof. Jakob Foerster and Prof. Perla Maiolino from the Oxford Robotics Institute.
SR at Google DeepMind working on Game Theory, Reinforcement Learning LLMs.
My most recent papers include methods to make LLMs more effective and diverse, unification of Offline RL Algorithms to automatically discover new ones and bioimpedance measurement in Electronics. Currently interested in Backprop-Free methods and automating research with AI. Previously interned at J.P. Morgan’s AI Research group.
Served as a reviewer for IEEE Robotics Journals, conferences like ICML, ICLR, AAAI, IEEE RoboSoft and numerous workshops. Since moving out of my home country at 17 on a scholarship, I have lived in the USA, UAE, China, and France as part of my studies before moving to the UK for my PhD. I love doing stand-up comedy.
Education
Research Interests
News
| May 06, 2026 | Two papers accepted to RLC 2026 in Montreal |
|---|---|
| May 01, 2026 | Two papers accepted to ICML 2026 in Korea |
| Apr 20, 2026 | Released the PBT-NCA framework for Evolving Many Worlds |
| Dec 01, 2025 | Attended NeurIPS 2025 — Oral & Poster live |
| Oct 21, 2025 | Review paper accepted at the Journal of Clinical Medicine |
| Sep 18, 2025 | Reinforcement Learning paper accepted at NeurIPS as an Oral |
Selected Publications
-
When Do We Need LLMs? A Diagnostic for Language-Driven BanditsThe Reinforcement Learning Conference (RLC) - Main Track, 2026