Rongwu Xu (许融武)


0xrwxu@gmail.com or
xrw22@mails.tsinghua.edu.cn

GitHub / Twitter / Curriculum Vitae / Résumé (in Chinese)

About [Chinese]

Rongwu Xu, male, born in 2000, Beijing. He holds a Bachelor's degree from Tsinghua University and is currently pursuing his Master's degree, also at Tsinghua University. His research interests include artificial intelligence and natural language processing. He has published multiple first-authored papers in top-tier conferences including ACL/EMNLP. He has received an ACL Outstanding Paper Award, the National Scholarship from the Ministry of Education of China, and multiple accolades from Tsinghua University.

Education

Research

My research interests lie in Natural Language Processing (NLP) and Large Language Models (LLMs). In general, I am fascinated by the notion of treating AI models as an approximation of human cognition and social interaction, beyond merely functional analogs. My previous work has focused on understanding how language models interact with the contexts in which human users engage. I have also worked on controlled NLG and evaluation frameworks that emulate human assessment techniques. My ultimate goal is to design language intelligence capable of thinking and acting like humans and augmenting human capabilities.

The following research topics, therefore, pique my interest:

Bridging LLM and Human Intelligence:
  1. Human-AI Alignment: Beyond machine learning algorithms, what are the key ingredients for aligning AI systems effectively with human values, behavioral patterns, and expectations?
  2. Machine Psychology: What are the key similarities and differences between AI models and the human mind? How can psychology-inspired behavioral experiments be used to test and understand LLMs?
Other General Interests:
  1. Science and Practice of Evaluation: What are the considerations for establishing a rigorous science and practice of evaluation that goes beyond task-driven approaches?
  2. LLM as Tools and Domain Applications: In what ways can LLMs be leveraged as tools across various domains, especially specialized ones? E.g., computational social science.
  3. AI Ethics and Safety: What are the primary ethical and safety concerns associated with the deployment and application of LLMs, and how can these be addressed to ensure responsible and trustworthy use?

News

Research in the rapidly evolving field of AI/NLP can be challenging for newcomers. If you are interested in my research or have preliminary ideas you'd like to explore, I'm more than willing to offer guidance. We can work towards submitting papers to top venues such as ACL/EMNLP/NAACL. Feel free to send me an email if interested.

  • Oct 2024 Six papers accepted to EMNLP 2024! Thanks to my collaborators!
  • Sep 2024 I received the National Scholarship from the Ministry of Education of China!
  • Aug 2024 My paper "The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation" received an Outstanding Paper Award at ACL 2024!
  • Jul 2024 Check out our talk (in Chinese) on knowledge conflicts for (RAG) LLMs! [Paper][Resource][机器之心][Slides]
  • May 2024 Two papers accepted to ACL 2024! Thanks to my collaborators!
  • May 2024 Check out the safety vulnerabilities of LLMs that we discovered by tricking them into believing misinformation! [Paper][Resource][机器之心][Video]
  • Apr 2024 I passed the PhD qualification exam (preliminary+oral) at IIIS, Tsinghua!
  • Dec 2023 I received the Overall Excellence Scholarship at Tsinghua!
  • Apr 2023 One paper accepted to EuroS&P 2023! Thanks to my collaborators!
  • Dec 2022 Debut of my academic homepage.
  • Aug 2022 Enrolled as a graduate student at IIIS, Tsinghua University.

Publications

See Google Scholar for a complete list and bibliometrics.

Preprints

  1. DebateQA: Evaluating Question Answering on Debatable Knowledge
    Rongwu Xu*, Xuan Qi*, Zehan Qi, Wei Xu, Zhijiang Guo
    arXiv Preprint (major revision at TACL)
    [Code]
  2. On the Role of Attention Heads in Large Language Model Safety
    Zhenhong Zhou, Haiyang Yu, Xinghua Zhang, Rongwu Xu, Fei Huang, Kun Wang, Yang Liu, Junfeng Fang, Yongbin Li
    arXiv Preprint

Conference Proceedings

  1. Course-Correction: Safety Alignment Using Synthetic Preferences
    Rongwu Xu*, Yishuo Cai*, Zhenhong Zhou, Renjie Gu, Haiqin Wang, Yan Liu, Tianwei Zhang, Wei Xu, Han Qiu
    EMNLP 2024 (Industry)
    [Code][Poster]
  2. MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
    Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, Jianbo Dai, Yuxuan Yao, Rongwu Xu, Zehan Qi, Wanru Zhao, Linling Shen, Jianqiao Lu, Haochen Tan, Yukang Chen, Hao Zhang, Zhan Shi, Bailin Wang, Zhijiang Guo, Jiaya Jia
    NeurIPS 2024
    [Code][Project Page]
  3. Knowledge Conflicts for LLMs: A Survey
    Rongwu Xu*, Zehan Qi*, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu
    EMNLP 2024 (Main)
    [Code][机器之心][Talk (Chinese)][Slides][Poster]
  4. LONG²RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
    Zehan Qi*, Rongwu Xu*, Zhijiang Guo, Cunxiang Wang, Hao Zhang, Wei Xu
    EMNLP 2024 (Findings)
  5. Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias
    Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu
    EMNLP 2024 (Main)
    [Poster]
  6. Sing it, Narrate it: Quality Musical Lyrics Translation
    Zhuorui Ye, Jinhan Li, Rongwu Xu
    EMNLP 2024 (Findings)
  7. How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
    Zhenhong Zhou, Haiyang Yu, Xinghua Zhang, Rongwu Xu, Fei Huang, Yongbin Li
    EMNLP 2024 (Findings)
    [Code][Poster]
  8. Preemptive Answer "Attacks" on Chain-of-Thought Reasoning
    Rongwu Xu*, Zehan Qi*, Wei Xu
    ACL 2024 (Findings)
    [Code][Poster]
  9. The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
    Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu
    ACL 2024 (Oral, Main)
    🏆 Outstanding Paper Award [Certificate]
    [Code][机器之心][Project Page][Video][Poster]
  10. Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings
    Rongwu Xu
    IJCNN 2024
  11. Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training
    Rongwu Xu and Zhixuan Fang
    IJCNN 2024
  12. MISO: Legacy-compatible Privacy-preserving Single Sign-on using Trusted Execution Environments
    Rongwu Xu, Sen Yang, Fan Zhang, Zhixuan Fang
    EuroS&P 2023
    [Project Page]
  13. LSync: A Universal Event-synchronizing Solution for Live Streaming
    Yifan Xu, Fan Dang, Rongwu Xu, Xinlei Chen, Yunhao Liu
    INFOCOM 2022
  14. LifeRec: A Mobile App for Lifelog Recording and Ubiquitous Recommendation
    Jiayu Li, Hantian Zhang*, Zhiyu He*, Rongwu Xu*, Pingfei Wu*, Min Zhang, Yiqun Liu, Shaoping Ma
    CHIIR 2022
    [Code]

Journal Articles

  1. LSync: A Universal Timeline-synchronizing Solution for Live Streaming
    Fan Dang*, Yifan Xu*, Rongwu Xu, Xinlei Chen, Yunhao Liu
    IEEE/ACM Trans. on Networking, 2024

* Equal Contribution, ^ Corresponding Author

Awards*

* Only honors recognized at or above the university level are shown.

Experience

Activity

Professional

Talks

Teaching

I was invited to share my teaching experience (see slides in Chinese) within my department (2023 and 2024) and was recognized as an excellent teaching assistant at Tsinghua University (2024).

Miscellaneous

Social Work and Volunteering

I have held positions in various student organizations at Tsinghua University and IIIS, gaining extensive experience in student work.

  • President, IIIS Graduate Student Union, Tsinghua University, Jun 2024-Jun 2025 (清华大学交叉信息院研究生会 主席)
    IIIS Tea Talks, Good Mentors and Helpful Friends, "12·9" themed event series, Weekend Screening Room, Graduate Student Congress
    [Photo][Photo]
  • Freshman Counselor, IIIS, Tsinghua University, May 2024-Jun 2025 (清华大学交叉信息院 新生助理)
    Freshman Fun Sports Day, Friendship Week
    [Photo]
  • Counselor, The 39th Summer School for Graduate Students, Tsinghua University, Aug 2024 (清华大学第十八届研究生新生骨干培训班暨第三十九期暑期团校 (研究生班) 辅导员)
    [Photo][Photo]
  • Social Practice Captain, IIIS, Tsinghua University, Apr 2024-Jul 2024 (清华大学交叉信息院暑期社会实践 支队长)
    [Photo][Photo]
    "Yanxing" (Flying Geese) themed activity
    Award: Outstanding individual (社会实践优秀个人)
  • Member, IIIS Graduate Student Union, Tsinghua University, Jun 2023-Jun 2024 (清华大学交叉信息院研究生会 干事)
    Chayuan (Tea Garden) Student Festival, "Winter Sonata" social mixer
    [Photo]
    Award: Outstanding individual 2023-2024 (2023年度院研会优秀个人)

Did You Know...

  • My favorite subjects include Psychology, Sociology and Philosophy. My personality type is ENTJ (so/sp 3w2 317 LIE).
  • I read books almost every day. My two favorite books are Night Flight and Nostromo: A Tale of the Seaboard.
  • I used to play the electric guitar and bass, see this. My favorite band is Pink Floyd. I also enjoy classical music, with Bruckner and Sibelius being my favorite composers.
  • Jobs involving management, scheduling, consulting and lecturing excite me. Other professions that interest me are diplomat and doctor.