Publications | Yan Li

## Journal - **[Distributionally Robust Stochastic Optimal Control](https://arxiv.org/pdf/2406.05648)** Alexander Shapiro, **Yan Li** Operations Research Letters, accepted, 2024 - **[Rectangularity and Duality of Distributionally Robust Markov Decision Processes](https://arxiv.org/pdf/2308.11139.pdf)** **Yan Li**, Alexander Shapiro Mathematical Programming, under review - **[A Novel Catalyst Scheme for Stochastic Minimax Optimization](https://arxiv.org/abs/2311.02814)** Guanghui Lan, **Yan Li** Mathematical Programming, major revision - **[First-order Policy Optimization for Robust Policy Evaluation](https://arxiv.org/pdf/2307.15890.pdf)** **Yan Li**, Guanghui Lan Mathematical Programming, under review - **[First-order Policy Optimization for Robust Markov Decision Process](https://arxiv.org/pdf/2209.10579.pdf)** **Yan Li**, Guanghui Lan, Tuo Zhao Operations Research, major revision - **[Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data](https://arxiv.org/abs/2108.06808)** **Yan Li**, Caleb Ju, Ethan X. Fang, Tuo Zhao Transactions on Machine Learning Research, under review - **[Policy Mirror Descent Inherently Explores Action Space](https://arxiv.org/abs/2303.04386)** **Yan Li**, Guanghui Lan SIAM Journal on Optimization, accepted, 2024 - **[Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity](https://arxiv.org/abs/2201.09457)** **Yan Li**, Guanghui Lan, Tuo Zhao Mathematical Programming, 2023 Alice and John Jarvis Ph.D. Student Research Award - **[Block Policy Mirror Descent](https://arxiv.org/abs/2201.05756)** Guanghui Lan, **Yan Li**, Tuo Zhao SIAM Journal on Optimization, 2023 ## Conference - **[Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms](https://arxiv.org/pdf/2310.10810)** Alexander Bukharin, **Yan Li**, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, Tuo Zhao *Advances in Neural Information Processing Systems (NeurIPS), 2023* - **[Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits](https://arxiv.org/abs/2110.04844)** **Yan Li**, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan *International Conference on Learning Representations (ICLR), 2022* - **[Noise Regularizes Over-parameterized Rank One Matrix Recovery, Provably](https://arxiv.org/abs/2202.03535)** Tianyi Liu, **Yan Li**, Enlu Zhou, Tuo Zhao *International Conference on Artificial Intelligence and Statistics (AISTAT), 2022* - **[Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL](https://proceedings.neurips.cc/paper/2021/hash/9559fc73b13fa721a816958488a5b449-Abstract.html)** Minshuo Chen, **Yan Li**, Ethan Wang, Zhuoran Yang, Zhaoran Wang, Tuo Zhao *Advances in Neural Information Processing Systems (NeurIPS), 2021* - **[Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization](https://proceedings.mlr.press/v130/liu21e.html)** Tianyi Liu, **Yan Li**, Song Wei, Enlu Zhou, Tuo Zhao *International Conference on Artificial Intelligence and Statistics (AISTAT), 2021* - **[Deep Reinforcement Learning with Robust and Smooth Policy](http://proceedings.mlr.press/v119/shen20b.html)** **Yan Li**^*, Qianli Shen^*, Haoming Jiang, Zhaoran Wang, Tuo Zhao *International Conference on Machine Learning (ICML), 2020* - **[Implicit Bias of Gradient Descent based Adversarial Training on Separable Data](https://openreview.net/pdf?id=HkgTTh4FDH)** **Yan Li**, Huan Xu, Ethan X. Fang, Tuo Zhao *International Conference on Learning Representations (ICLR), 2020* - **[Toward Understanding the Importance of Noise in Training Neural Networks](https://proceedings.mlr.press/v97/zhou19d.html)** Mo Zhou, Tianyi Liu, **Yan Li**, Dachao Lin, Enlu Zhou, Tuo Zhao *International Conference on Machine Learning (ICML), 2019* - **[Non-convex Conditional Gradient Sliding](https://proceedings.mlr.press/v80/qu18a.html)** Chao Qu, **Yan Li**, Huan Xu *International Conference on Machine Learning (ICML), 2018* ## Preprints/Working Papers - **[Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning](https://arxiv.org/abs/2105.08268)** **Yan Li**, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha