Sophia Xiao Pu

my_pic.jpg

2113 Henley Hall

UC Santa Barbara

Santa Barbara, CA 93106

Welcome! I am a first-year CS PhD student at University of California, Santa Barbara advised by Prof. William Wang.

I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at University of Washington.

My research interests lie broadly in Language and Vision, particularly in:

  • Trustworthy AI: detecting machine-generated text (EMNLP 2023), and removing LLM watermarks (NAACL 2025).
  • Evaluation: extrinsic evaluation for text summaries (COLING 2024).
  • Efficiency: compressing prompts (EMNLP 2024), evaluating and mitigating overthinking in reasoning models (COLM 2025).


(Only [co-]lead-author papers are listed.)

I am actively looking for motivated undergraduate or master’s students to collaborate on exciting topics such as multimodal evaluation, reasoning, and more.

news

Jun 16, 2025 Our paper on overthinking in reasoning models got accepted to COLM!
Jun 16, 2025 I start my internship at AWS, Santa Clara.
Jun 10, 2025 We will be giving a tutorial on multimodal generation evaluation at CVPR 2025 in Nashville 🎶🍗
Apr 29, 2025 Attending NAACL25 in Albuquerque!
Apr 20, 2025 New preprint on evaluating and mitigating overthinking is out!

selected preprints & publications

* denotes equal contribution.
  1. COLM
    thought-terminator.png
    THOUGHT TERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
    Xiao Pu*, Michael Saxon*, Wenyue Hua, and William Yang Wang
    COLM, 2025
  2. NAACL
    B4.png
    B^4: A Black-Box Scrubbing Attack on LLM Watermarks
    Baizhou Huang*Xiao Pu*, and Xiaojun Wan
    NAACL (Oral), 2025
  3. EMNLP
    style-compress.png
    Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
    Xiao Pu, Tianxing He, and Xiaojun Wan
    Findings of EMNLP, 2024
  4. COLING
    extrinsic-eval.png
    Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
    Xiao Pu, Mingqi Gao, and Xiaojun Wan
    LREC-COLING (Oral), 2024
  5. EMNLP
    zero-shot-detect.png
    On the Zero-Shot Generalization of Machine-Generated Text Detectors
    Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, and Tianxing He
    Findings of EMNLP, NeurIPS-ENLSP, 2023
  6. summ_is.png
    Summarization is (almost) dead
    Xiao Pu*, Mingqi Gao*, and Xiaojun Wan
    arXiv preprint arXiv:2309.09558, 2023

Outside of research, I’m also an amateur pipa player, a Dream of the Red Chamber (红楼梦) enthusiast, and a curious language learner.