Research

(NEW) Interpretability of LLMs

While LLMs have show impressive capabilities, the internal mechanisms that give rise to such capabilities are still largely a mystery. Fascinated by such mystery, our group has been developing new interpretability tools and algorithms towards better scientific understandings of internal mechanisms of LLMs.

(NEW) Sampling-based Control

Sampling-based control like MPPI has been widely used in robotics, especially in contact-rich non-smooth optimal control problems (e.g. locomotion for legged robots). However, from an optimization standpoint, why they can converge in highly NON-CONVEX problems and how to design the best sampling strategy are poorly understood. In this line of work, we develop fundamental theoretical understandings of sampling-based control algorithms, based on which we propose novel new design principles, unlocking new capabilities in robotics.

The theory underpinning comes from this paper which reveals a coverage and optimality tradeoff in selecting the sampling variance, which explains the convergence property of sampling-based control:

The tradeoff has practical implications: leveraging the tradeoff, we develop new sampling-based methods with improved convergence properties, and demonstrate their superior performance in real-world robotic control problems. This includes an ICRA 2025 best paper finalist.

Reinforcement Learning for Networked Systems

Reinforcement Learning (RL) has achieved many sucess in single-agent systems, but its application to large scale networked systems face a major obstacle: scalability. Put more concretely, the scalibity issue lies in that the state or action space of such networked systems can be exponentially large in the number of nodes; further, each agent only has local observation of the state of the network. In this project, we investigate how we can use the network structure to make RL scalable for networked systems.

As an application, we have also applied RL to power systems.

Even without learning, the control of a networked systems is already a challenging problem. To that end, I have developed fundamental theories regarding how to design distributed algorithms for control and optimization of networked systems using only local information and local communication.

Model Predictive Control

Model Prediction Control (MPC) is one of the most popular and flexible controller design approaches, yet its performance guarantee has long been not well understood, particularly when it comes to time-varying systems and systems with constraints. In this project, we propose a general perturbation analsyis framework that bounds the regret of MPC.

Learn to Stabilize

Machine learning has been applied to control systems to learn to control an unknown system with provable performance guarantee (e.g. regret, competitive ratio). However, in addition to performance, an equally important property of control systems is stability, without which there is no performance to even talk about. In this project, we investigate the ``learn to stabilize’’ problem for an unknown system, and study fundamental questions like sample complexity.

Bridging Model-based and Model-Free Methods

Traditional controller synthesis typicalls starts with a first-princeples model and designs a controller with provable stability and robustness guarantee. In contrast, recent RL approaches do not assume a model and learns a controller (often times neural network based) in a data driven manner, which experimentally can perform well even for complex dynamical systems. However, the RL approach is often data and computation heavy, requires extensive tuning, and lacks provable guarantess. In this project, we seek to combine both approaches and achieve the best of both worlds.

Application: power systems

Many of our research is inspired by applications in power systems, particularly distributed control and coordination of distributed energy resources. Here is a list of relevant power system publications.

Optimization Theory