Xiao Pu

Xiao (晓) is pronounced like "sh-yow".

2113 Henley Hall

UC Santa Barbara

Santa Barbara, CA 93106

Welcome! I am a first-year CS PhD student at UC Santa Barbara advised by Prof. William Wang.

I obtained my bachelor’s degree from Peking University, where I was advised by Prof. Xiaojun Wan. I also worked with Prof. Tianxing He at Tsinghua University and Prof. Yulia Tsvetkov at the University of Washington.

My research interests lie broadly in Natural Language Processing, particularly in:

  • AI Safety: machine-generated text detection (EMNLP 2023), LLM watermarks (NAACL 2025) and VLM safety (ongoing work).
  • NLG Evaluation (COLING 2024)
  • Prompt Compression (EMNLP 2024)
  • Data Selection for Reasoning (ongoing work)


(Only [co-]lead-author papers are listed.)

news

Jan 22, 2025 Our paper on removing LLM watermarks has been accepted to the NAACL main conference (•̀ᴗ• )
Nov 22, 2024 Presenting my poster at the SoCal NLP Symposium 2024 hosted at UCSD.
Nov 10, 2024 ✈️ Heading to EMNLP in Miami. Feel free to come and chat!
Nov 06, 2024 New preprint is out! In this work, we propose a new Black-Box Scrubbing Attack on LLM Watermarks.
Oct 18, 2024 I’m attending the Responsible Machine Learning Summit 2024.

selected publications

* denotes equal contribution.
  1. NAACL
    B^4: A Black-Box Scrubbing Attack on LLM Watermarks
    Baizhou Huang*, Xiao Pu*, and Xiaojun Wan
    NAACL, Best Paper Nomination, 2025
  2. EMNLP
    Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
    Xiao Pu, Tianxing He, and Xiaojun Wan
    Findings of EMNLP, 2024
  3. AAAI
    Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
    Jie Ruan, Xiao Pu, Mingqi Gao, Xiaojun Wan, and Yuesheng Zhu
    AAAI, 2024
  4. COLING
    Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
    Xiao Pu, Mingqi Gao, and Xiaojun Wan
    LREC-COLING (oral), 2024
  5. EMNLP
    On the Zero-Shot Generalization of Machine-Generated Text Detectors
    Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, and Tianxing He
    Findings of EMNLP, NeurIPS-ENLSP, 2023