Welcome!
Hi, I’m Qianli — a Research Scientist at Alibaba Group’s Tongyi Lab. I split my time between research on large language models (LLMs) and building open-source AI infrastructure.
As a core contributor and maintainer, I am taking care of
- Data-Juicer - Data processing for and with foundation models!
- Data-Juicer Agents - A Suite of Agents for Agentic Data Processing.
Outside of work, I enjoy 🐱🏀🤿🎿🧑🍳🎮🃏 …
I received my Ph.D. from the School of Computing at the National University of Singapore, advised by Prof. Kenji Kawaguchi. Before that, I earned a B.S. in Computer Science from Peking University, where I worked with Prof. Zhanxing Zhu. I’ve also spent time at Baichuan Inc., Sea AI Lab, and as a visiting researcher at Georgia Tech.
Email: shenqianlilili[at]gmail.com
[CV] [Google Scholar] [GitHub] [WeChat]
News
-
[2026-01-26] One paper BOTS accepted by ICLR26🇧🇷! See you in Rio! Looking for diving buddies🤿 (scuba and/or freediving) in South America before or after the conference.
-
[2025-09-08] I joined Tongyi Lab at Alibaba Group as a Research Scientist.
-
[2025-08-01] I received my Ph.D. degree 🎓.
Selected Publications
BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning [arxiv][code]
Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization [arxiv][code]
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline [arxiv][code]
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models [arxiv][code]
PICProp: Physics-Informed Confidence Propagation for Uncertainty Quantification [arxiv][code]
Deep Reinforcement Learning with Robust and Smooth Policy [arxiv]
Softwares