Research
I research how AI’s design and human interactions shape its behavior and societal impact. My approach combines
behavioral experiments, machine learning, interpretability, and psychology. I publish in NLP, AI, and ML
communities, and also explore model evaluation and real-world applications.
News
- Jun 2025 Graduated from Tsinghua University with the highest honor of Outstanding Graduate
and
Outstanding
Master Thesis (both top 1 of my class). [Photo][Photo][Photo]
- May 2025 Two papers accepted to ACL 2025. Thanks to my collaborators!
- May 2025 Checkout our new review paper on AI awareness. [Paper][Project
Page]
-
Apr 2025 Attending two AI safety & alignment conferences co-located with ICLR
2025 (Singapore): The Misalignment and Control Workshop (Apr 24th, our new paper on catastrophic risks
and deception of LLM agents will be presented [Paper][Project Page]) and The Singapore Conference on AI
(SCAI) (Apr 26th).
-
Mar 2025 Got accepted to UIUC CS, UW CSE and JHU CS. Grateful to the opportunities!
-
Jan 2025 Looking for PhD opportunities starting 2025. Don't hesitate to reach out if you think I
can be a good candidate.
-
Oct 2024 Six papers accepted to EMNLP 2024. Thanks to my collaborators!
-
Sep 2024 Received the National Scholarship by the Ministry of Education of China.
-
Aug 2024 My paper "The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation
via
Persuasive Conversation" recieved an Outstanding Paper Award at ACL 2024!
-
Jul 2024 Check out our talk
(Chinese) on knowledge conflicts for (RAG) LLMs. [Paper][Resource][机器之心][Slides]
-
May 2024 Two papers accepted to ACL 2024. Thanks to my collaborators!
-
May 2024 Check out LLMs' safety vulnerabilities discovered by tricking them to believe in
misinformation. [Paper][Resource][机器之心][Video]
-
Apr 2024 Passed the PhD qualification exam (preliminary+oral) at IIIS, Tsinghua.
-
Dec 2023 Recieved the overall execellence scholarship at Tsinghua.
-
Apr 2023 One paper accepted to EuroS&P 2023. Thanks to my collaborators!
-
Dec 2022 Debut of my academic homepage.
-
Aug 2022 Enrolled as a graduate student at IIIS, Tsinghua University.
Personal
My name in Chinese is 许 融武 (Xu, Rongwu). Pronunciation: Rongwu → RONG-woo (stress on "RONG"), Xu → Shoo. I use
he/him pronoun.