Qizhen Zhang (Irene)
I am a first year machine learning PhD student at the University of Oxford, where I work on Large Language Models and reinforcement learning. My advisor is Jakob Foerster.
I'm also spending half of my time doing research at Cohere, hosted by Phil Blunsom.
Prior to my PhD, I was a member of technical staff at Cohere building frameworks and training LLMs. I wrote my Master's thesis on cooperative multi-agent reinforcement learning at the University of Toronto and the Vector Institute.