Zhiwei Bai (白志威)

alt text 

Ph.D. Student,
School of Mathematical Sciences,
Institute of Natural Sciences,
Shanghai Jiao Tong University,
Shanghai, China
E-mail: bai299@sjtu.edu.cn

About Me

I am currently a doctoral candidate at the School of Mathematical Sciences and the Institute of Natural Sciences, Shanghai Jiao Tong University, where I am pursuing a Ph.D. in Applied Mathematics. My academic journey began at Shanghai Jiao Tong University, where I graduated with a B.S. in Mathematics and Applied Mathematics in 2022, as a distinguished member of the Wu Wenjun Honors Class.

Under the mentorship of Professors Yaoyu Zhang and Zhi-Qin John Xu, my research primarily delves into the theoretical underpinnings of machine learning and deep learning. My work is driven by a deep curiosity about the mathematical and foundational principles underlying modern artificial intelligence, with a particular focus on understanding the complexities of deep learning models through a phenomenon-driven approach. I am especially interested in developing a bottom-up theoretical understanding of deep neural networks, ranging from the analysis of loss landscapes and training dynamics to implicit regularization mechanisms and the theoretical understanding of large language models (LLMs).

Research Interests

My research areas include:

  • Machine Learning and Deep Learning Theory

  • Large Language Models

  • Brain-Inspired Computing

  • Algebra

Current Works

  • Critical Points Analysis of Loss Landscapes in Deep Neural Networks

  • Training Dynamics and Induced Implicit Regularization in Matrix Factorization Models

  • Theory of Optimistic Sample Estimation for Nonlinear Models

  • Loss Spike Analysis in Adam Training

  • Complexity Control for Improving Reasoning Ability in LLMs

  • Self-Adjoint Functors in the Category of Nilpotent Morphisms

Recent Publications

  1. Zhiwei Bai, Tao Luo, Zhi-Qin John Xu*, Yaoyu Zhang*, "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks", CSIAM Transactions on Applied Mathematics, 5(2):350-389, 2024. ISSN 2708-0579. doi: https://doi.org/10.4208/csiam-am.SO-2023-0020. [pdf] and on [arXiv]

  2. Zhiwei Bai, Jiajie Zhao, Yaoyu Zhang*, "Connectivity shapes implicit regularization in matrix factorization models for matrix completion", In The Thirty-eighth Annual Conference on Neural Information Processing Systems, volume 37, pages 45914–45955, 2024. [pdf] and on [arXiv]

  3. Zhiwei Bai#, Zhangchen Zhou#, Jiajie Zhao, Xiaolong Li, Zhiyu Li, Feiyu Xiong, Hongkang Yang, Yaoyu Zhang*, Zhi-Qin John Xu*, "Adaptive Preconditioners Trigger Loss Spikes in Adam", arXiv:2506.04805, 2025 [pdf] and on [arXiv]

  4. Zhiwei Bai, Xiang Cao, Songtao Mao, Han Zhang, Yuehui Zhang*, "Nilpotent Category of Abelian Categories and Self-Adjoint Functors", Frontiers of Mathematics 18 (6), 1363-1377, 2023. [pdf] and on [arXiv]

  5. Yaoyu Zhang*, Leyang Zhang, Zhongwang Zhang, Zhiwei Bai, "Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization", Journal of Machine Learning Research 26 (69), 1-30, 2025. [pdf] and on [arXiv]

  6. Yaoyu Zhang*, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Zhi-Qin John Xu, "Optimistic Estimate Uncovers the Potential of Nonlinear Models", Journal of Machine Learning, 2025. [pdf] and on [arXiv]

  7. Yaoyu Zhang*, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Zhi-Qin John Xu, "Linear Stability Hypothesis and Rank Stratification for Nonlinear Models", arXiv:2211.11623, 2022. [pdf] and on [arXiv]

  8. Jiajie Zhao, Zhiwei Bai, Yaoyu Zhang*, "Disentangling Sample Size and Initialization Effects on Perfect Generalization for Single-Neuron Targets", arXiv:2405.13787, 2024. [pdf] and on [arXiv]

  9. Liangkai Hang#, Junjie Yao#, Zhiwei Bai, Tianyi Chen, Yang Chen, Rongjie Diao, Hezhou Li, Pengxiao Lin, Zhiwei Wang, Cheng Xu, Zhongwang Zhang*, Zhangchen Zhou, Zhiyu Li, Zehao Lin, Kai Chen, Feiyu Xiong*, Yaoyu Zhang*, Weinan E*, Hongkang Yang*, Zhi-Qin John Xu*, "Scalable Complexity Control Facilitates Reasoning Ability of LLMs", arXiv:2505.23013, 2025 [pdf] and on [arXiv]

Note: * indicates the corresponding author; # indicates equal contribution.

Full list of publications on Google Scholar.

Projects

  • Book (Chinese) in progress

    • Collaborating with supervisors Zhi-Qin John Xu and Yaoyu Zhang.

    • This book introduces fundamental concepts of deep learning with a phenomenon-driven approach. Please see the project on GitHub.

    • Your feedback is welcome!

Academic Service

Academic Conferences

  • Presented a oral talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Shanghai Jiao Tong University AI for math Workshop, Shanghai, China, May 2025.

  • Presented a poster titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Thirty-eighth Annual Conference on Neural Information Processing Systems, Vancouver, Canada, December 2024.

  • Presented a talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the NUS-SJTU PhD Forum, Singapore, Singapore, November 2024.

  • Presented a talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Peking University PhD Forum, Beijing, China, November 2024.

  • Presented a poster titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the The 22-nd Annual Meeting of the Chinese Society for Industrial and Applied Mathematics, Student Forum, Nanjing, China, October 2024.

  • Presented a talk titled "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks" at the 2024 Scientific Machine Learning Conference (CSML2024), Shanghai, China, August 2024.

  • Volunteer Manager for the 2024 Scientific Machine Learning Conference (CSML2024) [CSML2024].

  • Presented a talk titled "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks" at the 2023 Machine Learning and Materials Science Workshop, Shanghai, China, July 2023.

Teaching

  • Spring 2025: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

  • Fall 2024: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

  • Spring 2024: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

  • Fall 2023: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

  • Spring 2023: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

  • Fall 2022: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.

Education

2022–present, Ph.D. in Mathematics, School of Mathematical Sciences, Shanghai Jiao Tong University, China.

2018–2022, B.S. in Mathematics and Applied Mathematics (Wu Wenjun Honors Class), School of Mathematical Sciences, Shanghai Jiao Tong University, China.

2021–2022, Minor in AI+X, a collaborative program offered by: Shanghai Jiao Tong University, Zhejiang University, Fudan University, University of Science and Technology of China, Nanjing University, and Tongji University, China.

Activities

  • Peking University Graduate Applied Mathematics Workshop and Summer School, July–August 2023, Peking University, Beijing, China.
  • Deep Learning Theory and Application Summer School, July 2023, Shanghai Jiao Tong University, Shanghai, China.

Competitions and Awards

  • Special PhD Fellowship of the Young Talents Support Program, Chinese Society for Science and Technology(CAST), 2024

  • Best Paper Award, Shanghai Jiao Tong University AI for Math Workshop, 2025

  • Outstanding Poster Award, The 22-nd Annual Meeting of the Chinese Society for Industrial and Applied Mathematics, 2024

  • National Scholarship, Ministry of Education of the People's Republic of China, 2021

  • Shanghai Outstanding Graduate, Shanghai Municipal Education Commission, 2022

  • Outstanding Bachelor's Degree Thesis(top1%), Shanghai Jiao Tong University, 2022

  • First Prize, National Undergraduate Mathematical Contest in Modeling, Chinese Society of Industrial and Applied Mathematics, 2020

  • Second Prize, National Postgraduate Mathematical Contest in Modeling, Chinese Society for Academic Degrees and Postgraduate Education, 2023

  • Huatai Securities Technology Scholarship, Shanghai Jiao Tong University, 2023

  • Samsung Scholarship, Shanghai Jiao Tong University, 2020

  • Third Prize, National College Mathematics Competition, Chinese Mathematical Society, 2019

  • President, Mathematical Modeling Association, Shanghai Jiao Tong University, 2020–2023

Contact

I am always open to connecting with fellow researchers and enthusiasts in related fields. Feel free to reach out—let’s explore new ideas together!