publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. EMNLP
    oversensitivity.png
    Dynamic Evaluation for Oversensitivity in LLMs
    Sophia Xiao Pu, Sitao Cheng, Xin Eric Wang, and William Yang Wang
    Findings of EMNLP, 2025
  2. COLM
    thought-terminator.png
    THOUGHT TERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
    Sophia Xiao Pu*, Michael Saxon*, Wenyue Hua, and William Yang Wang
    COLM, 2025
  3. anime.png
    LLMs vs. Chinese Anime Enthusiasts: A Comparative Study on Emotionally Supportive Role-Playing
    Lanlan Qiu, Sophia Xiao Pu, Yeqi Feng, and Tianxing He
    arXiv preprint arXiv:2508.06388, 2025
  4. NAACL
    B4.png
    B^4: A Black-Box Scrubbing Attack on LLM Watermarks
    Baizhou Huang*Sophia Xiao Pu*, and Xiaojun Wan
    NAACL (Oral), 2025
  5. CL
    llm-eval-survey.png
    LLM-based NLG evaluation: Current Status and Challenges
    Mingqi Gao, Xinyu Hu, Jie Ruan, Sophia Xiao Pu, and Xiaojun Wan
    Computational Linguistics, 2025

2024

  1. EMNLP
    style-compress.png
    Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
    Sophia Xiao Pu, Tianxing He, and Xiaojun Wan
    Findings of EMNLP, 2024
  2. ACL
    stumbling-blocks.png
    Stumbling blocks: Stress testing the robustness of machine-generated text detectors under attacks
    Yichen Wang, Shangbin Feng, Abe Bohan Hou, Sophia Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, and Tianxing He
    ACL, 2024
  3. AAAI
    bt-eval.png
    Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
    Jie Ruan, Sophia Xiao Pu, Mingqi Gao, Xiaojun Wan, and Yuesheng Zhu
    AAAI, 2024
  4. COLING
    extrinsic-eval.png
    Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
    Sophia Xiao Pu, Mingqi Gao, and Xiaojun Wan
    LREC-COLING (Oral), 2024

2023

  1. EMNLP
    zero-shot-detect.png
    On the Zero-Shot Generalization of Machine-Generated Text Detectors
    Sophia Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, and Tianxing He
    Findings of EMNLP, NeurIPS-ENLSP, 2023
  2. summ_is.png
    Summarization is (almost) dead
    Sophia Xiao Pu*, Mingqi Gao*, and Xiaojun Wan
    arXiv preprint arXiv:2309.09558, 2023