Sophia Xiao Pu

2113 Henley Hall

UC Santa Barbara

Santa Barbara, CA 93106

Welcome! I am a first-year CS PhD student at the University of California, Santa Barbara, advised by Prof. William Wang.

I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at the University of Washington.

My research interests lie broadly in Language and Vision, particularly in:

  • Trustworthy AI: detecting machine-generated text (EMNLP 2023), attacking LLM watermarks (NAACL 2025), and evaluating oversensitivity in LLMs (EMNLP 2025, coming soon).
  • Efficiency: prompt compression (EMNLP 2024) and overthinking in reasoning models (COLM 2025).
  • Other: extrinsic evaluation for text summaries (COLING 2024).


(Only [co-]lead-author papers are listed.)

I am actively looking for motivated undergraduate or master’s students to collaborate on exciting topics such as multimodal evaluation, reasoning, and more.

news

Aug 20, 2025 Our work on LLM oversensitivity will appear in EMNLP Findings!
Aug 08, 2025 New on arXiv: LLM roleplay (yes, anime characters 😉) 🎭✨
Jul 07, 2025 Our paper on overthinking in reasoning models got accepted to COLM!
Jun 16, 2025 I started my internship at AWS in Santa Clara.
Jun 10, 2025 We will be giving a tutorial on multimodal generation evaluation at CVPR 2025 in Nashville 🎶🍗

selected preprints & publications

* denotes equal contribution.
  1. EMNLP
    Dynamic Evaluation for Oversensitivity in LLMs
    Sophia Xiao Pu, Sitao Cheng, Xin Eric Wang, and William Yang Wang
    Findings of EMNLP, 2025
  2. COLM
    THOUGHT TERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
    Sophia Xiao Pu*, Michael Saxon*, Wenyue Hua, and William Yang Wang
    COLM, 2025
  3. NAACL
    B^4: A Black-Box Scrubbing Attack on LLM Watermarks
    Baizhou Huang*, Sophia Xiao Pu*, and Xiaojun Wan
    NAACL (Oral), 2025
  4. EMNLP
    Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
    Sophia Xiao Pu, Tianxing He, and Xiaojun Wan
    Findings of EMNLP, 2024
  5. COLING
    Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
    Sophia Xiao Pu, Mingqi Gao, and Xiaojun Wan
    LREC-COLING (Oral), 2024
  6. EMNLP
    On the Zero-Shot Generalization of Machine-Generated Text Detectors
    Sophia Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, and Tianxing He
    Findings of EMNLP, NeurIPS-ENLSP, 2023
  7. arXiv
    Summarization is (almost) dead
    Sophia Xiao Pu*, Mingqi Gao*, and Xiaojun Wan
    arXiv preprint arXiv:2309.09558, 2023

Outside of research, I’m also an amateur pipa player, a Dream of the Red Chamber (红楼梦) enthusiast, and a curious language learner.