Zhiwei Bai (白志威)
|
Ph.D. Student,
School of Mathematical Sciences,
Institute of Natural Sciences,
Shanghai Jiao Tong University,
Shanghai, China
E-mail: bai299@sjtu.edu.cn
|
About Me
I am currently a doctoral candidate at the School of Mathematical Sciences and the Institute of Natural Sciences, Shanghai Jiao Tong University, where I am pursuing a Ph.D. in Applied Mathematics. My academic journey began at Shanghai Jiao Tong University, where I graduated with a B.S. in Mathematics and Applied Mathematics in 2022, as a distinguished member of the Wu Wenjun Honors Class.
Under the mentorship of Professors Yaoyu Zhang and Zhi-Qin John Xu, my research primarily delves into the theoretical underpinnings of machine learning and deep learning. My work is driven by a deep curiosity about the mathematical and foundational principles underlying modern artificial intelligence, with a particular focus on understanding the complexities of deep learning models through a phenomenon-driven approach. I am especially interested in developing a bottom-up theoretical understanding of deep neural networks, ranging from the analysis of loss landscapes and training dynamics to implicit regularization mechanisms and the theoretical understanding of large language models (LLMs).
Research Interests
My research areas include:
Current Works
Critical Points Analysis of Loss Landscapes in Deep Neural Networks
Training Dynamics and Induced Implicit Regularization in Matrix Factorization Models
Theory of Optimistic Sample Estimation for Nonlinear Models
Loss Spike Analysis in Adam Training
Complexity Control for Improving Reasoning Ability in LLMs
Self-Adjoint Functors in the Category of Nilpotent Morphisms
Recent Publications
Zhiwei Bai, Tao Luo, Zhi-Qin John Xu*, Yaoyu Zhang*, "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks", CSIAM Transactions on Applied Mathematics, 5(2):350-389, 2024. ISSN 2708-0579. doi: https://doi.org/10.4208/csiam-am.SO-2023-0020. [pdf] and on [arXiv]
Zhiwei Bai, Jiajie Zhao, Yaoyu Zhang*, "Connectivity shapes implicit regularization in matrix factorization models for matrix completion", In The Thirty-eighth Annual Conference on Neural Information Processing Systems, volume 37, pages 45914–45955, 2024. [pdf] and on [arXiv]
Zhiwei Bai#, Zhangchen Zhou#, Jiajie Zhao, Xiaolong Li, Zhiyu Li, Feiyu Xiong, Hongkang Yang, Yaoyu Zhang*, Zhi-Qin John Xu*, "Adaptive Preconditioners Trigger Loss Spikes in Adam", arXiv:2506.04805, 2025 [pdf] and on [arXiv]
Zhiwei Bai, Xiang Cao, Songtao Mao, Han Zhang, Yuehui Zhang*, "Nilpotent Category of Abelian Categories and Self-Adjoint Functors", Frontiers of Mathematics 18 (6), 1363-1377, 2023. [pdf] and on [arXiv]
Yaoyu Zhang*, Leyang Zhang, Zhongwang Zhang, Zhiwei Bai, "Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization", Journal of Machine Learning Research 26 (69), 1-30, 2025. [pdf] and on [arXiv]
Yaoyu Zhang*, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Zhi-Qin John Xu, "Optimistic Estimate Uncovers the Potential of Nonlinear Models", Journal of Machine Learning, 2025. [pdf] and on [arXiv]
Yaoyu Zhang*, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Zhi-Qin John Xu, "Linear Stability Hypothesis and Rank Stratification for Nonlinear Models", arXiv:2211.11623, 2022. [pdf] and on [arXiv]
Jiajie Zhao, Zhiwei Bai, Yaoyu Zhang*, "Disentangling Sample Size and Initialization Effects on Perfect Generalization for Single-Neuron Targets", arXiv:2405.13787, 2024. [pdf] and on [arXiv]
Liangkai Hang#, Junjie Yao#, Zhiwei Bai, Tianyi Chen, Yang Chen, Rongjie Diao, Hezhou Li, Pengxiao Lin, Zhiwei Wang, Cheng Xu, Zhongwang Zhang*, Zhangchen Zhou, Zhiyu Li, Zehao Lin, Kai Chen, Feiyu Xiong*, Yaoyu Zhang*, Weinan E*, Hongkang Yang*, Zhi-Qin John Xu*, "Scalable Complexity Control Facilitates Reasoning Ability of LLMs", arXiv:2505.23013, 2025 [pdf] and on [arXiv]
Note: * indicates the corresponding author; # indicates equal contribution.
Full list of publications on Google Scholar.
Projects
Academic Service
Academic Conferences
Presented a oral talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Shanghai Jiao Tong University AI for math Workshop, Shanghai, China, May 2025.
Presented a poster titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Thirty-eighth Annual Conference on Neural Information Processing Systems, Vancouver, Canada, December 2024.
Presented a talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the NUS-SJTU PhD Forum, Singapore, Singapore, November 2024.
Presented a talk titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the Peking University PhD Forum, Beijing, China, November 2024.
Presented a poster titled "Connectivity shapes implicit regularization in matrix factorization models for matrix completion" at the The 22-nd Annual Meeting of the Chinese Society for Industrial and Applied Mathematics, Student Forum, Nanjing, China, October 2024.
Presented a talk titled "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks" at the 2024 Scientific Machine Learning Conference (CSML2024), Shanghai, China, August 2024.
Volunteer Manager for the 2024 Scientific Machine Learning Conference (CSML2024) [CSML2024].
Presented a talk titled "Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks" at the 2023 Machine Learning and Materials Science Workshop, Shanghai, China, July 2023.
Teaching
Spring 2025: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Fall 2024: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Spring 2024: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Fall 2023: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Spring 2023: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Fall 2022: Teaching Assistant, Mathematical Analysis, Shanghai Jiao Tong University Zhiyuan Honors Program.
Education
2022–present, Ph.D. in Mathematics, School of Mathematical Sciences, Shanghai Jiao Tong University, China.
2018–2022, B.S. in Mathematics and Applied Mathematics (Wu Wenjun Honors Class), School of Mathematical Sciences, Shanghai Jiao Tong University, China.
2021–2022, Minor in AI+X, a collaborative program offered by: Shanghai Jiao Tong University, Zhejiang University, Fudan University, University of Science and Technology of China, Nanjing University, and Tongji University, China.
Activities
- Peking University Graduate Applied Mathematics Workshop and Summer School, July–August 2023, Peking University, Beijing, China.
- Deep Learning Theory and Application Summer School, July 2023, Shanghai Jiao Tong University, Shanghai, China.
Competitions and Awards
Special PhD Fellowship of the Young Talents Support Program, Chinese Society for Science and Technology(CAST), 2024
Best Paper Award, Shanghai Jiao Tong University AI for Math Workshop, 2025
Outstanding Poster Award, The 22-nd Annual Meeting of the Chinese Society for Industrial and Applied Mathematics, 2024
National Scholarship, Ministry of Education of the People's Republic of China, 2021
Shanghai Outstanding Graduate, Shanghai Municipal Education Commission, 2022
Outstanding Bachelor's Degree Thesis(top1%), Shanghai Jiao Tong University, 2022
First Prize, National Undergraduate Mathematical Contest in Modeling, Chinese Society of Industrial and Applied Mathematics, 2020
Second Prize, National Postgraduate Mathematical Contest in Modeling, Chinese Society for Academic Degrees and Postgraduate Education, 2023
Huatai Securities Technology Scholarship, Shanghai Jiao Tong University, 2023
Samsung Scholarship, Shanghai Jiao Tong University, 2020
Third Prize, National College Mathematics Competition, Chinese Mathematical Society, 2019
President, Mathematical Modeling Association, Shanghai Jiao Tong University, 2020–2023
Contact
I am always open to connecting with fellow researchers and enthusiasts in related fields. Feel free to reach out—let’s explore new ideas together!
|