Hi, I'm Yiwei Yang.
PhD Student, University of Washington
LLM Agents • Reinforcement Learning • Trustworthy AI
I am a PhD student at the University of Washington, advised by Bill Howe. I received my B.S. in Computer Science from the University of Michigan.
My research focuses on the reliability of LLM agents and multimodal models. I study how models trained with reinforcement learning can develop shortcut behaviors—such as spurious tool-use, spurious correlations, and reward hacking—that lead to brittle performance under distribution shift. To address this, I design benchmarks, evaluation methods, and training interventions that encourage more robust reasoning and decision-making.
I am especially interested in roles related to LLM agents, RL for reasoning, multimodal learning, evaluation, and trustworthy AI.