Hi. I'm Yiwei Yang.

I am a PhD student at the University of Washington, advised by Bill Howe. I received my undergraduate degree in Computer Science in 2019 from the University of Michigan.

I am interested in making our models more reliable and trustworthy. Currently, I am building robust reward models to tackle reward hacking in large language models. Recently, I worked on benchmarking and mitigating spurious correlations in large multimodal models (LMMs).

Selected Publications

Contact

Email: yanyiwei@uw.edu