Shaobo Wang (王少博)

Mail: gszfwsb@gmail.com

Tel: (+86) 15000937315

City: Shanghai, 200240

I am a second-year Ph.D. candidate in the School of Artificial Intelligence at Shanghai Jiao Tong University (SJTU), where I am fortunate to be advised by Prof. Linfeng Zhang. I am also a research intern on the Alibaba Qwen Team, supervised by Dr. Dayiheng Liu, Xingzhang Ren, and Kexin Yang, and I collaborate closely with Dr. Fei Huang and Huiqiang Jiang.

Previously, I was a master’s student in ReThinkLab at SJTU, where I was grateful to be mentored by Prof. Junchi Yan. I also collaborated closely with Prof. Xuming Hu at the Hong Kong University of Science and Technology (Guangzhou) and Dr. Conghui He at Shanghai AI Laboratory, and I previously worked with Prof. Zhuoran Yang at Yale University.

Research. I approach data from both empirical and theoretical perspectives. My current research centers on data selection, synthesis, and sampling, especially for LLM pre-training, post-training, and inference-time scaling. I previously worked on explainable AI, especially the Shapley value.

Curriculum Vitae | CV (Chinese)

Short bio. I was born in Hefei, China. Outside of academia, I have been playing the piano for over a decade. I once had the honor of performing alongside the world-renowned pianist Lang Lang. My favorite composers include Frédéric Chopin and Franz Liszt. I also like R&B, Jazz, and Neo-Soul. During my teenage years, I won several chess championships in Anhui Province, China, under the guidance of Chess Grandmaster Chongsheng Zeng and Chess Master Yongjin Zhou.

News

[July 2025] I am honored to be selected for the Tencent PhD Research Incentive Program (one of 23 recipients nationwide).
[March 2025] Our paper, “Dataset Distillation with Neural Characteristic Function: A Minmax Perspective,” received full scores (5/5/5) from all three reviewers at CVPR 2025.

Selected Publications

* denotes equal contribution.

CVPR highlight

Dataset Distillation with Neural Characteristic Function: A Minmax Perspective

Shaobo Wang , Yicun Yang , Zhiyuan Liu , Chenghao Sun , Xuming Hu , Conghui He , and Linfeng Zhang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Bib PDF Code News (Chinese)

@article{wang2025dataset,
  title = {Dataset Distillation with Neural Characteristic Function: A Minmax Perspective},
  author = {Wang, Shaobo and Yang, Yicun and Liu, Zhiyuan and Sun, Chenghao and Hu, Xuming and He, Conghui and Zhang, Linfeng},
  year = {2025},
  journal = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  news_zh = {https://mp.weixin.qq.com/s/VtIqPF_a098qAEvrTKbi6A}
}

ACL main

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Shaobo Wang , Xiangqi Jin , Ziming Wang , Jize Wang , Jiajun Zhang , Kaixin Li , Zichen Wen , Zhong Li , and 3 more authors

Annual Meeting of the Association for Computational Linguistics, 2025

Bib PDF Code Website

@article{wang2025datawhisperer,
  title = {Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning},
  author = {Wang, Shaobo and Jin, Xiangqi and Wang, Ziming and Wang, Jize and Zhang, Jiajun and Li, Kaixin and Wen, Zichen and Li, Zhong and He, Conghui and Hu, Xuming and Zhang, Linfeng},
  year = {2025},
  journal = {Annual Meeting of the Association for Computational Linguistics},
}

ICLR

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers

Shaobo Wang , Hongxuan Tang , Mingyang Wang , Hongrui Zhang , Xuyang Liu , Weiya Li , Xuming Hu , and Linfeng Zhang

International Conference on Learning Representations, 2025

Bib PDF Code

@article{wang2024gnothi,
  title = {Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers},
  author = {Wang, Shaobo and Tang, Hongxuan and Wang, Mingyang and Zhang, Hongrui and Liu, Xuyang and Li, Weiya and Hu, Xuming and Zhang, Linfeng},
  year = {2025},
  primaryclass = {cs.LG},
  journal = {International Conference on Learning Representations},
}

arXiv

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Shaobo Wang , Jiaming Wang , Jiajun Zhang , Cong Wang , Yue Min , Zichen Wen , Fei Huang , Huiqiang Jiang , and 3 more authors

2025

PDF

arXiv

Socratic-Zero: Bootstrapping Reasoning via Data-Free Agent Co-evolution

Shaobo Wang , Zhengbo Jiao , Zifan Zhang , Yilang Peng , Xu Ze , Boyu Yang , Wei Wang , Hu Wei , and 1 more author

2025

PDF

arXiv

Rethinking LLM Evaluation: Can We Evaluate LLMs with 200x Less Data?

Shaobo Wang , Cong Wang , Wenjie Fu , Yue Min , Mingquan Feng , Isabel Guan , Xuming Hu , Conghui He , and 6 more authors

2025

PDF

arXiv

VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction

Shaobo Wang , Tianle Niu , Runkang Yang , Deshan Liu , Xu He , Zichen Wen , Conghui He , Xuming Hu , and 1 more author

2025

Bib

@misc{wang2025videocompressa,
  title = {VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction},
  author = {Wang, Shaobo and Niu, Tianle and Yang, Runkang and Liu, Deshan and He, Xu and Wen, Zichen and He, Conghui and Hu, Xuming and Zhang, Linfeng},
  year = {2025},
  eprint = {2511.18831},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
  url = {https://arxiv.org/abs/2511.18831}
}

arXiv

CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs

Shaobo Wang , Yongliang Miao , Yuancheng Liu , Qianli Ma , Ning Liao , and Linfeng Zhang

2025

Bib PDF

@misc{wang2025circuitseermininghighqualitydata,
  title = {CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs},
  author = {Wang, Shaobo and Miao, Yongliang and Liu, Yuancheng and Ma, Qianli and Liao, Ning and Zhang, Linfeng},
  year = {2025},
  eprint = {2510.18470},
  archiveprefix = {arXiv},
  primaryclass = {cs.AI},
  url = {https://arxiv.org/abs/2510.18470},
}

NeurIPS

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Zichen Wen , Shaobo Wang , Yufa Zhou , Junyuan Zhang , Qintong Zhang , Yifeng Gao , Zhaorun Chen , Bin Wang , and 3 more authors

2025

Bib

@misc{zhang2025efficient,
  title = {Efficient Multi-modal Large Language Models via Progressive Consistency Distillation},
  author = {Wen, Zichen and Wang, Shaobo and Zhou, Yufa and Zhang, Junyuan and Zhang, Qintong and Gao, Yifeng and Chen, Zhaorun and Wang, Bin and Li, Weijia and He, Conghui and Zhang, Linfeng},
  year = {2025}
}

ECCV

Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2)

Qifeng Li , Xiaosong Jia , Shaobo Wang , and Junchi Yan

European Conference on Computer Vision, 2024

Bib PDF

@article{li2024think2drive,
  title = {Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2)},
  author = {Li, Qifeng and Jia, Xiaosong and Wang, Shaobo and Yan, Junchi},
  journal = {European Conference on Computer Vision},
  year = {2024},
}

AAAI

UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective

Furui Xu* , Shaobo Wang* , Jiajun Zhang , Chenghao Sun , Haixiang Tang , and Linfeng Zhang

Annual AAAI Conference on Artificial Intelligence, 2026

Bib PDF

@article{xu2026unseen,
  title = {UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective},
  author = {Xu*, Furui and Wang*, Shaobo and Zhang, Jiajun and Sun, Chenghao and Tang, Haixiang and Zhang, Linfeng},
  journal = {Annual AAAI Conference on Artificial Intelligence},
  year = {2026},
}

AAAI

ImagebindDC: Compressing Multimodal Data with Imagebind-based Condensation

Yue Min* , Shaobo Wang* , Jiaze Li , Tianle Niu , Junxin Fan , Yongliang Miao , Lijin Yang , and Linfeng Zhang

Annual AAAI Conference on Artificial Intelligence, 2026

Bib PDF

@article{min2026imagebinddc,
  title = {ImagebindDC: Compressing Multimodal Data with Imagebind-based Condensation},
  author = {Min*, Yue and Wang*, Shaobo and Li, Jiaze and Niu, Tianle and Fan, Junxin and Miao, Yongliang and Yang, Lijin and Zhang, Linfeng},
  journal = {Annual AAAI Conference on Artificial Intelligence},
  year = {2026},
}

EMNLP main

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Zichen Wen , Yifeng Gao , Shaobo Wang , Junyuan Zhang , Qintong Zhang , Weijia Li , Conghui He , and Linfeng Zhang

2025

Bib PDF

@article{wen2025stoplookingimportanttokens,
  title = {Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More},
  author = {Wen, Zichen and Gao, Yifeng and Wang, Shaobo and Zhang, Junyuan and Zhang, Qintong and Li, Weijia and He, Conghui and Zhang, Linfeng},
  year = {2025},
  eprint = {2502.11494},
  archiveprefix = {arXiv},
  primaryclass = {cs.CL},
}

ACMMM

Compute only 16 tokens in one timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching

Zhixin Zheng , Xinyu Wang , Chang Zou , Shaobo Wang , and Linfeng Zhang

ACM Multimedia, 2025

Bib PDF

@article{zheng2025compute,
  title = {Compute only 16 tokens in one timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching},
  author = {Zheng, Zhixin and Wang, Xinyu and Zou, Chang and Wang, Shaobo and Zhang, Linfeng},
  journal = {ACM Multimedia},
  year = {2025},
}

ACMMM

SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching

Jiacheng Liu , Chang Zou , Yuanhuiyi Lyu , Fei Ren , Shaobo Wang , Kaixin Li , and Linfeng Zhang

ACM Multimedia, 2025

Bib PDF

@article{liu2025speca,
  title = {SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching},
  author = {Liu, Jiacheng and Zou, Chang and Lyu, Yuanhuiyi and Ren, Fei and Wang, Shaobo and Li, Kaixin and Zhang, Linfeng},
  journal = {ACM Multimedia},
  year = {2025},
}

NeurIPS

Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers

Siyu Chen , Heejune Sheen , Tianhao Wang , and Zhuoran Yang

Advances in Neural Information Processing Systems, 2024

Bib PDF Code

@article{chen2024unveiling,
  title = {Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers},
  author = {Chen, Siyu and Sheen, Heejune and Wang, Tianhao and Yang, Zhuoran},
  journal = {Advances in Neural Information Processing Systems},
  year = {2024},
}

ICLR workshop

DRUPI: Dataset Reduction Using Privileged Information

Shaobo Wang , Yantai Yang , Shuaiyu Zhang , Chenghao Sun , Weiya Li , Xuming Hu , and Linfeng Zhang

The Future of Machine Learning Data Practices and Repositories at ICLR, 2024

Bib PDF

@article{wang2024drupi,
  title = {DRUPI: Dataset Reduction Using Privileged Information},
  author = {Wang, Shaobo and Yang, Yantai and Zhang, Shuaiyu and Sun, Chenghao and Li, Weiya and Hu, Xuming and Zhang, Linfeng},
  year = {2024},
  eprint = {2410.01611},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
  journal = {The Future of Machine Learning Data Practices and Repositories at ICLR},
}

CVPR workshop

Not All Samples Should Be Utilized Equally: Towards Understanding and Improving Dataset Distillation

Shaobo Wang , Yantai Yang , Qilong Wang , Kaixin Li , Linfeng Zhang , and Junchi Yan

Synthetic Data for Computer Vision Workshop at CVPR, 2025

Bib PDF

@article{wang2024samples,
  title = {Not All Samples Should Be Utilized Equally: Towards Understanding and Improving Dataset Distillation},
  author = {Wang, Shaobo and Yang, Yantai and Wang, Qilong and Li, Kaixin and Zhang, Linfeng and Yan, Junchi},
  year = {2025},
  eprint = {2408.12483},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
  journal = {Synthetic Data for Computer Vision Workshop at CVPR},
}

NeurIPS

Visualizing the Emergence of Intermediate Visual Patterns in DNNs

Mingjie Li , Shaobo Wang , and Quanshi Zhang

Advances in Neural Information Processing Systems, 2021

Bib PDF Code

@article{li2021visualizing,
  title = {Visualizing the Emergence of Intermediate Visual Patterns in {DNNs}},
  author = {Li, Mingjie and Wang, Shaobo and Zhang, Quanshi},
  journal = {Advances in Neural Information Processing Systems},
  volume = {34},
  pages = {6594--6607},
  year = {2021},
}