Sophia Xiao Pu

my_pic.jpg

2113 Henley Hall

UC Santa Barbara

Santa Barbara, CA 93106

Welcome! I am a first-year CS PhD student at University of California, Santa Barbara advised by Prof. William Wang.

I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at University of Washington.

My research interests lie broadly in Language and Vision, particularly in:

  • Trustworthy AI: detecting machine-generated text (EMNLP 2023), and removing LLM watermarks (NAACL 2025).
  • Evaluation: building an extrinsic evaluation framework for text summarization (COLING 2024).
  • Efficiency: compressing prompts (EMNLP 2024), evaluating and mitigating overthinking in reasoning models (Arxiv 2025).


(Only [co-]lead-author papers are listed.)

I am actively looking for motivated undergraduate or master’s students to collaborate on exciting topics such as multimodal evaluation, reasoning, and more.

news

Apr 29, 2025 Attending NAACL25 in Albuquerque!
Apr 20, 2025 New preprint on evaluating and mitigating overthinking is out!
Jan 22, 2025 Our paper on removing LLM watermarks has been accepted to NAACL main conference (•̀ᴗ• )
Nov 22, 2024 Presenting my poster at the SoCal NLP Symposium 2024 hosted at UCSD.
Nov 10, 2024 ✈️ to EMNLP Miami, welcome to chat!

selected preprints & publications

* denotes equal contribution.
  1. thought-terminator.png
    THOUGHT TERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
    Xiao Pu*, Michael Saxon*, Wenyue Hua, and William Yang Wang
    2025
  2. NAACL
    B4.png
    B^4: A Black-Box Scrubbing Attack on LLM Watermarks
    Baizhou Huang*Xiao Pu*, and Xiaojun Wan
    NAACL (Oral), 2025
  3. EMNLP
    style-compress.png
    Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
    Xiao Pu, Tianxing He, and Xiaojun Wan
    Findings of EMNLP, 2024
  4. AAAI
    bt-eval.png
    Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
    Jie Ruan, Xiao Pu, Mingqi Gao, Xiaojun Wan, and Yuesheng Zhu
    AAAI, 2024
  5. COLING
    extrinsic-eval.png
    Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
    Xiao Pu, Mingqi Gao, and Xiaojun Wan
    LREC-COLING (Oral), 2024
  6. EMNLP
    zero-shot-detect.png
    On the Zero-Shot Generalization of Machine-Generated Text Detectors
    Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, and Tianxing He
    Findings of EMNLP, NeurIPS-ENLSP, 2023

Outside of research, I’m also an amateur pipa player, a Dream of the Red Chamber (红楼梦) enthusiast, and a curious language learner.