Shaobo Wang (王少博)
Ph.D Candidate, SAI, SJTU

Mail: gszfwsb@gmail.com
Tel: (+86) 15000937315
City: Shanghai, 200240
I am now a first-year Ph.D Candidate in the School of Artificial Intelligence, Shanghai Jiao Tong University (SJTU), fortunate to be advised by Prof. Linfeng Zhang. Currently, I am also a Research Intern at
Alibaba Qwen Team, supervised by Dr. Dayiheng Liu, Xingzhang Ren, and Kexin Yang.
Previously, I was a master’s student of ReThinkLab at SJTU, where I was grateful to be mentored by Prof. Junchi Yan. Additionally, I collaborate closely with Prof. Zhuoran Yang at Yale University, Prof. Xuming Hu at
Hong Kong University of Science and Technology (Guangzhou), and Dr. Conghui He at
Shanghai AI Laboratory.
Research. I approach data from both empirical and theoretical perspectives. My current research is centered on data selection and synthesis, especially on LLM pre-training, post-training, and inference-time scaling. I used to worked on Explainable AI, especially on the Shapley value.
Short bio. I was born in Hefei, China. Outside of academia, I have been playing the piano for over a decade. I once had the honor of performing alongside the world-renowned pianist Lang Lang. My favorite composers include Frédéric Chopin and Franz Liszt. I also like R&B and Neo-Soul. During my teenage years, I won several chess championships in Anhui Province, China, under the guidance of Chess Grandmaster Chongsheng Zeng and Chess Master Yongjin Zhou.
We are currently seeking self-motivated and talented students (Undergraduate, Graduate, or PhD) to join our Data-Centric AI group at the EPIC Lab. Should you have any inquiries or are interested in collaborating, please do not hesitate to contact me!
News
- [July 2025] I am honored to be selected for the Tencent PhD Research Incentive Program (one of 23 recipients worldwide).
- [March 2025] Our paper, “Dataset Distillation with Neural Characteristic Function: A Minmax Perspective,” received full scores (5/5/5) from all three reviewers at CVPR 2025.
Selected Publications
* denotes the equal contribution.- Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning2025
- Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution2025
- Dataset Distillation with Neural Characteristic Function: A Minmax PerspectiveProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
-
-
-
-