Over the Rainbow

Generating Lyapunov Functions for Gradient Descent by SDP

This post is a reading note on the following paper: [1] Taylor, Adrien, Bryan Van Scoy, and Laurent Lessard. “Lyapunov functions for first-order methods: Tight automated convergence guarante...

Travel Album | Hangzhou, Yichang, and Xi'an

Hangzhou: On May 10, I took a one-day trip with my family to Hangzhou, where we celebrated a special Mother’s Day by visiting West Lake and Lingyin Temple. West Lake at sun...

Collection | words that stay

This post is a collection of excerpts that have stayed with me: lines from poems, lyrics, and books. I’m not an expert in literature, but I write these notes to capture and share the personal connec...

Reflections from My First Academic Talk

I just gave my first academic talk at a conference at SJTU IIC yesterday. I presented an ongoing project, and I had hesitated for a long time about whether to give a talk on an “incomplete” work. I...

Helpful Resources for Grad School and Beyond

I’ve benefited greatly from reading advice posts—especially during my graduate school application process. In this post, I’ve collected some of the most helpful resources I’ve come across, covering...

Routines for Setting Up a New Server

Lately, I’ve been running deep learning experiments across different computing clusters. Every time I switch to a new server, I have to go through a series of setup steps to get my environment read...

Optimizing EPLB by Integer (Conic) Linear Programming

In the last post, I reviewed the code of EPLB (Expert Parallelism Load Balancer). As a quick recap, EPLB is a toolbox for expert load balancing in the MoE architecture; it outputs the expert replic...

Code Review | Expert Parallelism Load Balancer

DeepSeek recently released a simple yet effective toolbox for load balancing in Mixture of Experts (MoE) architectures. The EPLB toolbox consists of only one Python file and has already received 1....

Writing LaTeX Locally on macOS

Previously, I used Overleaf to write .tex files. It’s convenient, beginner-friendly, and great for collaboration. However, it only works online, which means you can’t draft your paper on a flight (...

A Random Trip in Beijing

Visa Interview Tips & Travel Snapshots

About the visa interview: I’m traveling to Beijing for my U.S. F1 visa interview. Good news first: my visa was approved within 60 seconds of the interview starting. Super smooth! I wanted to share a q...