Undergraduate Student at
University of California, Berkeley
Tongji University, Shanghai
View my survey paper on robot skills with an interactive PDF viewer. Navigate pages, zoom in/out, and download directly from the browser without leaving the site.
Notes on Richard Sutton’s “Bitter Lesson”: human heuristics vs. large-scale learning and search, and what prediction learning means for how machines (and humans) learn.
An intuitive explanation of Flow Matching for generative modeling. Learn how to directly match probability flows as an efficient alternative to diffusion models.
Explanation of Policy Gradient and Actor-Critic methods in Reinforcement Learning, taught by Prof. Sergey Levine in CS185 2026
The contemporary RL algo used widely in robotics sim2real.
Complete notes on nonlinear systems: stability theory, Lyapunov methods, contraction mapping, and existence/uniqueness. Taught by Prof. Koushil Sreenath. in ME 237
Small facts and intuitions collected while studying nonlinear systems — phase planes, $\dot{x}=f(x)$ behavior, and the geometry behind them.
Why don’t we use triple-dot or higher-order derivatives to model dynamic systems?
Notes on the algebraic and differential Riccati equation in optimal control (LQR).
Why Euler angle parameterizations of rotation suffer from gimbal lock, and what alternatives (quaternions, rotation matrices) buy you.
Notes on the reaching law and reaching speed in sliding mode control, and how they trade off chattering against convergence time.