
Chenlu Ye

Hi! My name is Chenlu Ye (叶晨璐).

I am a Ph.D. student at UIUC, advised by Prof. Tong Zhang. Previously, I received my MPhil from HKUST and B.S. from USTC.

My research interests lie at the intersection of reinforcement learning and large language model post-training. Recently, I have focused on RL for reasoning and post-training of LLMs.
chenluy3[AT]illinois.edu

Selected Publications

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL
Chenlu Ye*, Xuanchang Zhang*, Yifan Hao*, Zhou Yu, Ziji Zhang, Abhinav Gullapalli, Hao Chen, Tong Zhang
Preprint

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training
Wei Xiong*, Chenlu Ye*, Baohao Liao*, Hanze Dong*, Xinxing Xu, Christof Monz, Jiang Bian, Nan Jiang, Tong Zhang
Preprint

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
Chenlu Ye, Zhou Yu, Ziji Zhang, Hao Chen, Narayanan Sadagopan, Jing Huang, Tong Zhang, Anurag Beniwal
Preprint

Self-Rewarding Correction for Mathematical Reasoning
Wei Xiong*, Hanning Zhang*, Chenlu Ye*, Lichang Chen, Nan Jiang, Tong Zhang
Preprint

Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye*, Wei Xiong*, Yuheng Zhang*, Hanze Dong*, Nan Jiang, Tong Zhang
NeurIPS 2024

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong*, Hanze Dong*, Chenlu Ye*, Han Zhong, Nan Jiang, Tong Zhang
ICML 2024

Experiences