Kexun Zhang

I am an AI researcher currently interested in LLM for code generation and coding agents.

I used to be a competitive programmer and an amateur Xiangsheng performer for fun.

Email  /  GitHub  /  Scholar  /  Twitter

profile photo

Selected Research

* denotes equal contribution. † denotes project lead.

project image

Is Vibe Coding Safe? Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks


微博编码安全不?测量脆弱性属于经理人生成代码在真实任务中


Songwen Zhao, Danqing Wang†, Kexun Zhang†, Jiaxuan Luo, Zhuo Li, Lei Li
preprint, 2025
paper / code GitHub Repo stars /

project image

HardTests: Synthesizing High-Quality Test Cases for LLM Coding


硬测:合成高质量测试盒子为了大语言模型写代码


Zhongmou He*, Yee Man Choi*, Kexun Zhang*†, et al.
preprint, 2025
paper / code GitHub Repo stars / website /

project image

Integrating Expertise of Software Engineering Agents


整合专长来自软件工程特工


Kexun Zhang, Weiran Yao, Zuxin Liu, et al.
ICLR, 2025
paper / code GitHub Repo stars /

project image

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement


穗搜:加强软件特工使用莫奈特卡洛树搜索和逐步改进


Antonis Antoniades*, Albert Örwall*, Kexun Zhang, Yuxi Xie, Anirudh Goyal, William Wang
ICLR, 2025
paper / code GitHub Repo stars /

project image

ALGO: Synthesizing Algorithmic Programs with LLM-Generated Oracle Verifiers


阿尔戈:合成算法程序使用大语言模型生成的神谕验证器


Kexun Zhang, Danqing Wang, Jingtao Xia, William Yang Wang, Lei Li
NeurIPS, 2023
paper / code GitHub Repo stars /





Design and source code from Jon Barron's website