Sophia Xiao Pu

2113 Henley Hall
UC Santa Barbara
Santa Barbara, CA 93106
Welcome! I am a first-year CS PhD student at the University of California, Santa Barbara, advised by Prof. William Wang.
I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at the University of Washington.
My research interests lie broadly in Language and Vision, particularly in:
- Trustworthy AI: detecting machine-generated text (EMNLP 2023), attacking LLM watermarks (NAACL 2025), and evaluating oversensitivity in LLMs (EMNLP 2025, coming soon).
- Efficiency: prompt compression (EMNLP 2024) and overthinking in reasoning models (COLM 2025).
- Other: extrinsic evaluation for text summaries (COLING 2024).
(Only [co-]lead-author papers are listed.)
I am actively looking for motivated undergraduate or master’s students to collaborate on exciting topics such as multimodal evaluation, reasoning, and more.
news
Aug 20, 2025 | Our work on LLM oversensitivity will appear in EMNLP Findings! |
---|---|
Aug 08, 2025 | New on arXiv: LLM roleplay (yes, anime characters 😉) 🎭✨ |
Jul 07, 2025 | Our paper on overthinking in reasoning models got accepted to COLM! |
Jun 16, 2025 | I start my internship at AWS, Santa Clara. |
Jun 10, 2025 | We will be giving a tutorial on multimodal generation evaluation at CVPR 2025 in Nashville 🎶🍗 |
selected preprints & publications
* denotes equal contribution.
Outside of research, I’m also an amateur pipa player, a Dream of the Red Chamber (红楼梦) enthusiast, and a curious language learner.