Chejian Xu

Chejian Xu

I am a fourth year CS Ph.D. candidate at University of Illinois, Urbana-Champaign (UIUC), advised by Prof. Bo Li. I received my Bachelor's degree from CS, Zhejiang University at CKC Honors College, advised by Prof. Shouling Ji and Prof. Siliang Tang.

My research focuses on making LLMs, VLMs, and AI agents safe, scalable, and interpretable at real-world scale. I work on scalable red-teaming to uncover realistic agent failures, interpretability-guided analysis of reasoning and hallucinations, and efficient post-training for robust deployment. My long-term goal is to build AI systems that are not only powerful but also reliable and practically deployable across real-world applications.

Email / Google Scholar / Github / LinkedIn

News

2025/09 - One paper got accepted to NeurIPS 2025.
2025/05 - I started my internship at Meta, working on VLM reasoning and interpretability.
2025/05 - One paper got accepted to ICML 2025.
2025/04 - We released UltraLong-8B, models with up to 4M context length and competitive performance on standard tasks.
2025/01 - Four papers got accepted to ICLR 2025.
2024/12 - Two papers got accepted to AAAI 2025.
2024/09 - We released ChatQA 2, a Llama 3.0-based model with enhanced long-context understanding and RAG capabilities.
2024/09 - Our paper, DecodingTrust, got the Cybersecurity award 2024 on Best Machine Learning and Security Paper.
2024/05 - I started my internship at NVIDIA, working on long context LLMs.
2024/04 - We are hosting the The Competition for LLM and Agent Safety 2024!
2024/02 - One paper got accepted to CVPR 2024.
2023/12 - Our paper, DecodingTrust, received the Outstanding Paper award at NeurIPS 2023.

Selected Publications

Full publication list available on my Google Scholar.

	GuardSet-X: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset Mintong Kang, Zhaorun Chen, Chejian Xu, Jiawei Zhang, Chengquan Guo, Minzhou Pan, Ivan Revilla, Yu Sun, Bo Li Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025 [PDF] [Code] [BibTeX]
	AdvAgent: Controllable Blackbox Red-teaming on Web Agents Chejian Xu, Mintong Kang, Jiawei Zhang, Zeyi Liao, Lingbo Mo, Mengqi Yuan, Huan Sun, Bo Li Forty-Second International Conference on Machine Learning (ICML), 2025 [PDF] [Code] [Website] [BibTeX]
	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Chejian Xu, Wei Ping, Peng Xu, Zihan Liu, Boxin Wang, Mohammad Shoeybi, Bo Li, Bryan Catanzaro Preprint, 2025 [PDF] [Website] [Model Weights 🤗] [BibTeX]
	MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models Chejian Xu, Jiawei Zhang, Zhaorun Chen, Chulin Xie, Mintong Kang, Zhuowen Yuan, Zidi Xiong, Chenhui Zhang, Lingzhi Yuan, Yi Zeng, Peiyang Xu, Chengquan Guo, Andy Zhou, Jeffrey Ziwei Tan, Zhun Wang, Alexander Xiong, Xuandong Zhao, Yu Gai, Francesco Pinto, Yujin Potter, Zhen Xiang, Zinan Lin, Dan Hendrycks, Dawn Song, Bo Li The Thirteenth International Conference on Learning Representations (ICLR), 2025 [PDF] [Code] [Website] [T2I Dataset 🤗] [I2T Dataset 🤗] [BibTeX]
	EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage Zeyi Liao, Lingbo Mo, Chejian Xu, Mintong Kang, Jiawei Zhang, Chaowei Xiao, Yuan Tian, Bo Li, Huan Sun The Thirteenth International Conference on Learning Representations (ICLR), 2025 [PDF] [Code] [BibTeX]
	ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Peng Xu, Wei Ping, Xianchao Wu, Chejian Xu, Zihan Liu, Mohammad Shoeybi, Bryan Catanzaro The Thirteenth International Conference on Learning Representations (ICLR), 2025 [PDF] [Website] [Model Weights 🤗] [Training Data 🤗] [BibTeX]
	AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models Mintong Kang, Chejian Xu, Bo Li The Thirteenth International Conference on Learning Representations (ICLR), 2025 [PDF] [BibTeX]
	DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T. Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zinan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, Bo Li Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023 (Outstanding Paper) [PDF] [Code] [Website] [BibTeX]
	SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles Chejian Xu, Wenhao Ding, Weijie Lyu, Zuxin Liu, Shuai Wang, Yihan He, Hanjiang Hu, Ding Zhao, Bo Li Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), 2022 [PDF] [Code] [Leaderboard] [BibTeX]
	SemAttack: Natural Textual Attacks via Different Semantic Spaces Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li North American Chapter of the Association for Computational Linguistics (NAACL), 2022 (Findings) [PDF] [Code] [BibTeX]
	Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models Boxin Wang, Chejian Xu, Shuohang Wang, Zhe Gan, Yu Cheng, Jianfeng Gao, Ahmed Hassan Awadallah, Bo Li Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021 (Oral) [PDF] [Leaderboard] [Dataset] [BibTeX]

Service

Conference Reviewer: NeurIPS 2022-2025, ICML 2025, ICLR 2025-2026, CVPR 2026, ICCV 2025, ACL 2025, EMNLP 2025, AISTATS 2025-2026, AAAI 2023-2025
Organizer: The Competition for LLM and Agent Safety 2024, CVPR 2023 SSAD Workshop, NeurIPS 2022 DMLW Workshop
Program Committee: ICLR 2025 SynthData Workshop, ICLR 2023 RTML Workshop