5 Renyuan Xu
I am currently a WiSE Gabilan Assistant Professor in the Daniel J. Epstein Department of Industrial and Systems Engineering at the University of Southern California. From 2019-2021 I was a Hooke Research Fellow in the Mathematical Institute at the University of Oxford, and before that I completed my Ph.D. in 2019 at the University of California, Berkeley in the Department of Industrial Engineering and Operations Research.

My research interests include stochastic analysis, stochastic controls and games, machine learning theory, and mathematical finance. I am also interested in interdisciplinary topics that integrate methodologies in multiple fields such as applied probability, statistics, and optimization, along with their applications in addressing high-stake decision-making problems in modern large-scale systems. Some of the topics that I have been working on recently:
  • Stochastic games and mean-field games with applications in finance
  • Deep learning theory through the lens of stochastic differential equations
  • Reinforcement learning theory and data-driven decision making
  • Stochastic controls under imperfect observations and dynamic information acquisition
  • Principal-agent problems (mechanism design) under heterogeneous information structures

I am serving on the local organizer committee of ICCOPT 2025. I am co-organizing the World Online Seminar on Machine Learning in Finance (2021-). I was a program co-chair of the 3rd ACM International Conference on AI in Finance (ICAIF) in 2022 and the finance area chair of the Oxford Machine Learning Summer School in 2022 and 2023.

Please find my cv here.
Office: Olin Hall of Engineering 310K
Email: renyuanx (at) usc (dot) edu

Working Papers and Preprints

Decision Making Under Costly Sequential Information Acquisition: The Paradigm of Reversible and Irreversible Decisions
with Thaleia Zariphopoulou and Luhao Zhang (2023)
Submitted | SSRN|

Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
with Xin Guo and Xinyu Li (2023)
Submitted | SSRN|

Risk-sensitive Markov Decision Process and Learning under General Utilities
with Zhengqi Wu (2023)
Submitted | SSRN|

Implicit Regularization and Convergence of Gradient Descent for Deep Residual Networks
with Rama Cont and Alain Rossier (2022)
Submitted | arXiv|

Asymptotic Analysis of Deep Residual Networks
with Rama Cont and Alain Rossier (2022)
Submitted | arXiv|

Linear-quadratic Gaussian Games with Asymmetric Information: Belief Corrections Using the Opponents Actions
with Huining Yang and Ben Hambly (2023)
Revision, SIAM Journal on Control and Optimization | arXiv|

Model-free Analysis of Dynamic Trading Strategies
with Rama Cont and Anna Ananova (2023)
Revision, SIAM Journal on Financial Mathematics | arXiv|

Policy Gradient Finds Global Optimum of Nearly Linear-quadratic Control Systems
with Yinbin Han amd Meisam Razaviyayn (2022)
Revision, SIAM Journal on Control and Optimization| arXiv|

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
with Jingwei Ji and Ruihao Zhu (2023)
Revision, Operations Research| arXiv|

TailGAN: Nonparametric Scenario Generation for Tail Risk Estimation
with Rama Cont, Mihai Cucuringu and Chao Zhang (2022)
Major revision, Management Science | arXiv|

Jounal Publications

Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach
with Xin Guo, Haotian Gu and Xiaoli Wei (2021)
Accepted, Mathematics of Operations Research (2024) | arXiv|

Recent Advances in Reinforcement Learning in Finance
with Ben Hambly and Huining Yang (2021)
Mathematical Finance (2023)
arXiv| DOI

Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
with Ben Hambly and Huining Yang (2021)
Journal of Machine Learning Research (2022)
arXiv| DOI

Modelling COVID-19 Contagion: Risk Assessment and Targeted Mitigation Policies
with Rama Cont and Artur Kotlicki (2020)
Royal Society Open Science (2021)
medRxiv| DOI

Interbank Lending with Benchmark Rates: Pareto Optima for a Class of Singular Control Games
with Xin Guo and Rama Cont (2020)
Mathematical Finance (2021)
arXiv| DOI

Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
with Ben Hambly and Huining Yang (2020)
SIAM Journal on Control and Optimization (2021)
arXiv| DOI

Entropy Regularization for Mean Field Games with Learning
with Xin Guo and Thaleia Zariphopoulou (2020)
Mathematics of Operations Research (2022)
arXiv| DOI

Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis
with Xin Guo, Haotian Gu and Xiaoli Wei (2020)
SIAM Journal on Mathematics of Data Science (2021)
arXiv| DOI

A General Framework for Learning Mean-Field Games
with Xin Guo, Anran Hu and Junzi Zhang (2020)
Mathematics of Operations Research (2022)
arXiv| DOI

Delay-Adaptive Learning in Generalized Linear Contextual Bandits
with Jose Blanchet and Zhengyuan Zhou (2020)
Mathematics of Operations Research (2022)
arXiv| DOI

Dynamic Programming Principles for Mean-Field Controls with Learning
with Xin Guo, Haotian Gu and Xiaoli Wei (2019)
Operations Research (2022)
arXiv| DOI

Transaction Cost Data Analytics for Corporate Bonds
with Xin Guo and Charles-Albert Lehalle (2019)
Quantitative Finance (2022)
arXiv| DOI

A Class of Stochastic Games and Moving Free Boundary Problems
with Xin Guo and Wenpin Tang (2018)
SIAM Journal on Control and Optimization (2022)
arXiv| DOI

Stochastic Games for Fuel Followers Problem: N versus MFG
with Xin Guo (2018)
SIAM Journal on Control and Optimization (2019)
arXiv| DOI

Conference Preceedings

Neural Network-based Score Estimation in Diffusion Models: Optimization and Generalization
with Yinbin Han and Meisam Razaviyayn (2023)
International Conference on Learning Representation (ICLR) '24 | arXiv|

Risk-Aware Linear Bandits with Application in Smart Order Routing
with Jingwei Ji and Ruihao Zhu (2022)
proceeding

Scaling Properties of Deep Residual Networks
with Alain–Sam Cohen, Rama Cont, and Alain Rossier (2021)
ICML'21 | International Conference on Machine Learning
arXiv| proceeding

Learning in Generalized Linear Contextual Bandits with Stochastic Delays
with Jose Blanchet and Zhengyuan Zhou (2019)
NeurIPS'19 (Spotlight) | Conference on Neural Information Processing Systems
proceeding

Learning Mean-Field Games
with Xin Guo, Anran Hu and Junzi Zhang (2019)
NeurIPS'19 | Conference on Neural Information Processing Systems
arXiv| proceeding

Ph.D. Students

I am very fortunate to advise and work with the following Ph.D. students:

Recognitions and Awards

Teaching

I am teaching the following courses at USC:
  • ISE537 (Master level): Financial Analytics (Machine Learning in Finance), Fall 2021/2022/2023
  • ISE599 (Ph.D. level): Special Topics in Control Theory and Reinforcement Learning, Fall 2022
I was the tutor for the following courses at the University of Oxford:
  • Stochastic Control, Hilary Term 2020
  • Machine Learning, Hilary Term 2020
  • Market Microstructure and Algorithmic Trading, Hilary Term 2020
  • Statistics and Financial Data Analysis, Michaelmas Term 2019
Berkeley
2014 - 2019
Oxford
2019-2021
USC
2021 -