Blogs

A Step-by-step Derivation of ADMM from DRS

In this note, we present a step-by-step derivation of the Alternating Direction Method of Multipliers (ADMM) from Douglas-Rachford Splitting (DRS). This derivation is adapted from the book below an...

Generating Lyapunov Functions for Gradient Descent by SDP

This post is a reading note on the following paper: [1] Taylor, Adrien, Bryan Van Scoy, and Laurent Lessard. "Lyapunov functions for first-order methods: Tight automated convergence guarante...

Reflections from My First Academic Talk

I just gave my first academic talk at a conference at SJTU IIC yesterday. I presented an ongoing project, and I had hesitated for a long time about whether to give a talk on an “incomplete” work. I...

Helpful Resources in Grad School

I’ve benefited greatly from reading advice posts—especially during my graduate school application. In this post, I’ve collected some of the most helpful resources I’ve come across, covering both gr...

Routines for Setting Up a New Server

Lately, I’ve been running deep learning experiments across different computing clusters. Every time I switch to a new server, I have to go through a series of setup steps to get my environment read...

Optimizing EPLB by Integer (Conic) Linear Programming

In the last post, I reviewed the code of EPLB (Expert Parallelism Load Balancer). As a quick recap, EPLB is a toolbox for expert load balancing in the MoE architecture; it outputs the expert replic...

Code Review | Expert Parallelism Load Balancer

DeepSeek recently released a simple yet effective toolbox for load balancing in Mixture of Experts (MoE) architectures. The EPLB toolbox consists of only one Python file and has already received 1....

Writing LaTeX Locally on macOS

Previously, I used Overleaf to write .tex files. It’s convenient, beginner-friendly, and great for collaboration. However, it only works online, which means you can’t draft your paper on a flight (...

High Probability Analysis for SGD

Beyond Bounded Domain and Bounded Gradients

Long time no see! This is the longest post I have written so far, so grab a drink; it will take a little time to read! For better readability, you can refer to the PDF version. I am learning ho...

Proof of the Contraction Properties of PDHG

In this post, we show a simple way to derive the nonexpansiveness and contraction properties of the primal-dual hybrid gradient (PDHG) iteration using the language of operator theory. In [...

What is Good Research? A Catalog of Professional Views

I’ve been quite busy with PhD interviews recently, and I’ve found the experience to be very rewarding. I see interviews as a great opportunity to engage in meaningful conversations with experts. Du...

TeXmacs Tips

Efficient Math Typing, Crash Fixes, and More

About TeXmacs: TeXmacs is my favorite text editor, especially useful for those who frequently need to type mathematical formulas. I highly recommend giving it a try! In case you need help getting ...

Performance Estimation Problems II

Convergence Proofs and Stepsize Optimization

This is the second post in a series on Performance Estimation Problems (PEP). In this post, I’ll introduce applications of the PEP framework, particularly in convergence proofs and stepsize optimiz...

Performance Estimation Problems I

Methodology Review

This is the first post in a new series on Performance Estimation Problems (PEP). I’ve divided the series into two parts: the first introduces the PEP framework, and the second covers applications o...

Polynomial Optimization II

Multivariate problems

This note is taken from a summer course taught by Prof. Cédric Josz, who makes everything clear and intuitive! This blog is about multivariate polynomial optimization, including both unconstrained and ...

Polynomial Optimization I

Univariate unconstrained problems

This note is taken from a summer course taught by Prof. Cédric Josz, who makes everything clear and intuitive! This blog is about univariate unconstrained polynomial optimization, and the multivariate ...

Equivalence of PDHG and DRS

It is common to hear that the Primal-Dual Hybrid Gradient (PDHG) method and Douglas-Rachford Splitting (DRS) are equivalent, and this post explains why. I read O'Connor and Vandenberghe's paper [1] and organize...