Shaobo Wang (王少博)

Mail: gszfwsb@gmail.com

Tel: (+86) 15000937315

City: Shanghai, 200240

I am a first-year Ph.D. candidate in the School of Artificial Intelligence, Shanghai Jiao Tong University (SJTU), fortunate to be advised by Prof. Linfeng Zhang. I am also a Research Intern on the Alibaba Qwen Team, supervised by Dr. Dayiheng Liu, Xingzhang Ren, and Kexin Yang.

Previously, I was a master’s student in ReThinkLab at SJTU, where I was grateful to be mentored by Prof. Junchi Yan. I also collaborate closely with Prof. Zhuoran Yang at Yale University, Prof. Xuming Hu at Hong Kong University of Science and Technology (Guangzhou), and Dr. Conghui He at Shanghai AI Laboratory.

Research. I approach data from both empirical and theoretical perspectives. My current research centers on data selection, synthesis, and attribution, especially for foundation models. I previously worked on Explainable AI.

We are currently seeking self-motivated and talented students (undergraduate, graduate, or Ph.D.) to join our Data-Centric AI group at the [EPIC Lab](http://www.zhanglinfeng.tech/). If you have any questions or are interested in collaborating, please do not hesitate to contact me!

Curriculum Vitae | CV (Chinese)

Short bio. I was born in Hefei, China. Outside of academia, I have been playing the piano for over a decade. I once had the honor of performing alongside the world-renowned pianist Lang Lang. My favorite composers include Frédéric Chopin and Franz Liszt. I also like R&B and Neo-Soul. During my teenage years, I won several chess championships in Anhui Province, China, under the guidance of Chess Grandmaster Chongsheng Zeng and Chess Master Yongjin Zhou.

Selected Publications

* denotes equal contribution.

CVPR highlight

Dataset Distillation with Neural Characteristic Function: A Minmax Perspective

Shaobo Wang, Yicun Yang, Zhiyuan Liu, Chenghao Sun, Xuming Hu, Conghui He, and Linfeng Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

@inproceedings{wang2025dataset,
  title = {Dataset Distillation with Neural Characteristic Function: A Minmax Perspective},
  author = {Wang, Shaobo and Yang, Yicun and Liu, Zhiyuan and Sun, Chenghao and Hu, Xuming and He, Conghui and Zhang, Linfeng},
  year = {2025},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  news_zh = {https://mp.weixin.qq.com/s/VtIqPF_a098qAEvrTKbi6A}
}

ACL main

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Shaobo Wang, Xiangqi Jin, Ziming Wang, Jize Wang, Jiajun Zhang, Kaixin Li, Zichen Wen, Zhong Li, and 3 more authors

Annual Meeting of the Association for Computational Linguistics, 2025

@inproceedings{wang2025datawhisperer,
  title = {Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning},
  author = {Wang, Shaobo and Jin, Xiangqi and Wang, Ziming and Wang, Jize and Zhang, Jiajun and Li, Kaixin and Wen, Zichen and Li, Zhong and He, Conghui and Hu, Xuming and Zhang, Linfeng},
  year = {2025},
  booktitle = {Annual Meeting of the Association for Computational Linguistics},
}

ICLR

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers

Shaobo Wang, Hongxuan Tang, Mingyang Wang, Hongrui Zhang, Xuyang Liu, Weiya Li, Xuming Hu, and Linfeng Zhang

International Conference on Learning Representations, 2025

@inproceedings{wang2024gnothi,
  title = {Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers},
  author = {Wang, Shaobo and Tang, Hongxuan and Wang, Mingyang and Zhang, Hongrui and Liu, Xuyang and Li, Weiya and Hu, Xuming and Zhang, Linfeng},
  year = {2025},
  primaryclass = {cs.LG},
  booktitle = {International Conference on Learning Representations},
}

arXiv

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

Zekai Li, Xinhao Zhong, Samir Khaki, Zhiyuan Liang, Yuhao Zhou, Mingjia Shi, Ziqiao Wang, Xuanlei Zhao, and 3 more authors

2025

@misc{li2025dd,
  title = {{DD-Ranking}: Rethinking the Evaluation of Dataset Distillation},
  author = {Li, Zekai and Zhong, Xinhao and Khaki, Samir and Liang, Zhiyuan and Zhou, Yuhao and Shi, Mingjia and Wang, Ziqiao and Zhao, Xuanlei and Zhao, Wangbo and Qin, Ziheng and others},
  year = {2025},
}

arXiv

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Xuyang Liu*, Zichen Wen*, Shaobo Wang*, Junjie Chen, Zhishan Tao, Yubo Wang, Xiangqi Jin, Chang Zou, and 8 more authors

2025

@misc{liu2025shiftingaiefficiencymodelcentric,
  title = {Shifting AI Efficiency From Model-Centric to Data-Centric Compression},
  author = {Liu*, Xuyang and Wen*, Zichen and Wang*, Shaobo and Chen, Junjie and Tao, Zhishan and Wang, Yubo and Jin, Xiangqi and Zou, Chang and Wang, Yiyu and Liao, Chenfei and Zheng, Xu and Chen, Honggang and Li, Weijia and Hu, Xuming and He, Conghui and Zhang, Linfeng},
  year = {2025},
  eprint = {2505.19147},
  archiveprefix = {arXiv},
  primaryclass = {cs.CL},
}

arXiv

dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching

Zhiyuan Liu, Yicun Yang, Yaojie Zhang, Junjie Chen, Chang Zou, Qingyan Wei, Shaobo Wang, and Linfeng Zhang

2025

@misc{liu2025dllm,
  title = {dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching},
  author = {Liu, Zhiyuan and Yang, Yicun and Zhang, Yaojie and Chen, Junjie and Zou, Chang and Wei, Qingyan and Wang, Shaobo and Zhang, Linfeng},
  year = {2025},
}

arXiv

Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs

Yufa Zhou*, Shaobo Wang*, Xingyu Dong*, Xiangqi Jin, Yifang Chen, Yue Min, Xingzhang Ren, Kexin Yang, and 2 more authors

2025

@misc{recon,
  title = {Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs},
  author = {Zhou*, Yufa and Wang*, Shaobo and Dong*, Xingyu and Jin, Xiangqi and Chen, Yifang and Min, Yue and Ren, Xingzhang and Yang, Kexin and Liu, Dayiheng and Zhang, Linfeng},
  year = {2025},
}

ECCV

Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2)

Qifeng Li, Xiaosong Jia, Shaobo Wang, and Junchi Yan

European Conference on Computer Vision, 2024

@inproceedings{li2024think2drive,
  title = {Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2)},
  author = {Li, Qifeng and Jia, Xiaosong and Wang, Shaobo and Yan, Junchi},
  booktitle = {European Conference on Computer Vision},
  year = {2024},
}

ACMMM

Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching

Zhixin Zheng, Xinyu Wang, Chang Zou, Shaobo Wang, and Linfeng Zhang

ACM Multimedia, 2025

@inproceedings{zheng2025compute,
  title = {Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching},
  author = {Zheng, Zhixin and Wang, Xinyu and Zou, Chang and Wang, Shaobo and Zhang, Linfeng},
  booktitle = {ACM Multimedia},
  year = {2025},
}

ACMMM

SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching

Jiacheng Liu, Chang Zou, Yuanhuiyi Lyu, Fei Ren, Shaobo Wang, Kaixin Li, and Linfeng Zhang

ACM Multimedia, 2025

@inproceedings{liu2025speca,
  title = {SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching},
  author = {Liu, Jiacheng and Zou, Chang and Lyu, Yuanhuiyi and Ren, Fei and Wang, Shaobo and Li, Kaixin and Zhang, Linfeng},
  booktitle = {ACM Multimedia},
  year = {2025},
}

NeurIPS

Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers

Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang

Advances in Neural Information Processing Systems, 2024

@inproceedings{chen2024unveiling,
  title = {Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers},
  author = {Chen, Siyu and Sheen, Heejune and Wang, Tianhao and Yang, Zhuoran},
  booktitle = {Advances in Neural Information Processing Systems},
  year = {2024},
}

NeurIPS

Visualizing the Emergence of Intermediate Visual Patterns in DNNs

Mingjie Li, Shaobo Wang, and Quanshi Zhang

Advances in Neural Information Processing Systems, 2021

@inproceedings{li2021visualizing,
  title = {Visualizing the Emergence of Intermediate Visual Patterns in {DNNs}},
  author = {Li, Mingjie and Wang, Shaobo and Zhang, Quanshi},
  booktitle = {Advances in Neural Information Processing Systems},
  volume = {34},
  pages = {6594--6607},
  year = {2021},
}