Rongwu Xu

me

0xrwxu@gmail.com or
rongwuxu@cs.washington.edu

GitHub | X (Twitter) | LinkedIn
Google Scholar | OpenReview

Research

I research how AI’s design and human interactions shape its behavior and societal impact. My approach combines behavioral experiments, machine learning, interpretability, and psychology. I publish in NLP, AI, and ML communities, and also explore model evaluation and real-world applications.

News

  • Jun 2025 Graduated from Tsinghua University with the highest honor of Outstanding Graduate and Outstanding Master Thesis (both top 1 of my class). [Photo][Photo][Photo]
  • May 2025 Two papers accepted to ACL 2025. Thanks to my collaborators!
  • May 2025 Checkout our new review paper on AI awareness. [Paper][Project Page]
  • Apr 2025 Attending two AI safety & alignment conferences co-located with ICLR 2025 (Singapore): The Misalignment and Control Workshop (Apr 24th, our new paper on catastrophic risks and deception of LLM agents will be presented [Paper][Project Page]) and The Singapore Conference on AI (SCAI) (Apr 26th).
  • Mar 2025 Got accepted to UIUC CS, UW CSE and JHU CS. Grateful to the opportunities!
  • Jan 2025 Looking for PhD opportunities starting 2025. Don't hesitate to reach out if you think I can be a good candidate.
  • Oct 2024 Six papers accepted to EMNLP 2024. Thanks to my collaborators!
  • Sep 2024 Received the National Scholarship by the Ministry of Education of China.
  • Aug 2024 My paper "The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation" recieved an Outstanding Paper Award at ACL 2024!
  • Jul 2024 Check out our talk (Chinese) on knowledge conflicts for (RAG) LLMs. [Paper][Resource][机器之心][Slides]
  • May 2024 Two papers accepted to ACL 2024. Thanks to my collaborators!
  • May 2024 Check out LLMs' safety vulnerabilities discovered by tricking them to believe in misinformation. [Paper][Resource][机器之心][Video]
  • Apr 2024 Passed the PhD qualification exam (preliminary+oral) at IIIS, Tsinghua.
  • Dec 2023 Recieved the overall execellence scholarship at Tsinghua.
  • Apr 2023 One paper accepted to EuroS&P 2023. Thanks to my collaborators!
  • Dec 2022 Debut of my academic homepage.
  • Aug 2022 Enrolled as a graduate student at IIIS, Tsinghua University.

Personal

My name in Chinese is 许 融武 (Xu, Rongwu). Pronunciation: Rongwu → RONG-woo (stress on "RONG"), Xu → Shoo. I use he/him pronoun.