Sophia Xiao Pu
2113 Henley Hall
UC Santa Barbara
Santa Barbara, CA 93106
Welcome! I am a second-year CS PhD student at University of California, Santa Barbara advised by Prof. William Wang.
I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at University of Washington.
My research interests lie broadly in Language and Vision, particularly in:
- Trustworthy AI: detecting machine-generated text (EMNLP 2023), attacking LLM watermarks (NAACL 2025), and evaluating oversensitivity in LLMs (EMNLP 2025 coming soon)
- Efficiency: prompt compression (EMNLP 2024), overthinking in reasoning models (COLM 2025).
- Other: extrinsic evaluation for text summaries (COLING 2024).
(Only [co-]lead-author papers are listed.)
I am actively looking for motivated undergraduate or master’s students to collaborate on exciting topics such as multimodal evaluation, reasoning, and more.
news
| Oct 08, 2025 | Presenting ThoughtTerminator at COLM, Montreal. |
|---|---|
| Aug 20, 2025 | Our work on LLM oversensitivity will appear in EMNLP Findings! |
| Aug 08, 2025 | New on arXiv: LLM roleplay (yes, anime characters 😉) 🎭✨ |
| Jul 07, 2025 | Our paper on overthinking in reasoning models got accepted to COLM! |
| Jun 16, 2025 | I start my internship at AWS, Santa Clara. |
selected preprints & publications
* denotes equal contribution.Outside of research, I’m also an amateur pipa player, a Dream of the Red Chamber (红楼梦) enthusiast, and a curious language learner.