Operator splitting methods, like forward backward splitting (FBS) and Douglas-Rachford splitting (DRS), can be viewed as gradient descent on the envelope function. This point of view is useful because it turns a nonsmooth splitting iteration into a smooth optimization method on a surrogate objective. Related envelope constructions also appear for other splitting schemes, such as the Davis-Yin envelope.
In this post I first review the classical envelope functions for FBS and DRS, and then derive the corresponding envelope for extended DRS. Throughout, I write the composite objective as
\[F(u):=f(u)+g(u),\]where $f$ is the smooth block and $g$ is the prox-friendly, possibly nonsmooth block. For DRS, I also assume that the proximal map of $f$ is tractable.
1. Envelope functions for FBS and DRS
FBS. Suppose $f$ is convex, twice differentiable, and $L_f$-smooth, and $g$ is proper, closed, convex, and proximable. The FBS step with stepsize $\gamma>0$ is
\[x^+ = \operatorname{prox}_{\gamma g}\bigl(x-\gamma \nabla f(x)\bigr).\]The forward-backward envelope (FBE), introduced by Patrinos, Stella, and Bemporad, is
\[F_\gamma^{\mathrm{FB}}(x) := f(x)-\tfrac{\gamma}{2}\|\nabla f(x)\|^2 +g_\gamma\bigl(x-\gamma\nabla f(x)\bigr),\]where $g_\gamma$ is the Moreau envelope of $g$. Then FBS can be written as a variable-metric gradient step on the FBE:
\[x^+ = x-\gamma\bigl(I-\gamma\nabla^2 f(x)\bigr)^{-1} \nabla F_\gamma^{\mathrm{FB}}(x).\]When $0<\gamma<1/L_f$, the envelope is exact:
\[\min_x F(x)=\min_x F_\gamma^{\mathrm{FB}}(x), \qquad \operatorname{argmin} F = \operatorname{argmin} F_\gamma^{\mathrm{FB}}.\]So minimizing the FBE is equivalent to solving the original composite problem, but the FBE is differentiable even when $g$ is nonsmooth.
DRS. To obtain a smooth envelope, assume that $f$ is twice differentiable and $L_f$-smooth. Define
\[P_\gamma(x):=\operatorname{prox}_{\gamma f}(x), \qquad G_\gamma(x):=\operatorname{prox}_{\gamma g}\bigl(2P_\gamma(x)-x\bigr).\]The classical DRS step is
\[x^+=x+G_\gamma(x)-P_\gamma(x).\]The Douglas-Rachford envelope (DRE), introduced by Patrinos, Stella, and Bemporad, is
\[F_\gamma^{\mathrm{DR}}(x) := f_\gamma(x)-\gamma\|\nabla f_\gamma(x)\|^2 +g_\gamma\bigl(x-2\gamma\nabla f_\gamma(x)\bigr).\]DRS is again a variable-metric gradient method:
\[x^+ = x-\gamma \bigl(I-2\gamma\nabla^2 f_\gamma(x)\bigr)^{-1} \nabla F_\gamma^{\mathrm{DR}}(x).\]When $0<\gamma<1/L_f$, the DRE is exact up to the proximal map $P_\gamma$:
\[\min_u F(u)=\min_x F_\gamma^{\mathrm{DR}}(x), \qquad P_\gamma\bigl(\operatorname{argmin}F_\gamma^{\mathrm{DR}}\bigr) = \operatorname{argmin}F.\]2. Extended DRS
A recent paper by Nilsson, Åkerman, and Giselsson studies an extended DRS family with two proximal stepsizes. For the optimization problem
\[\min_u F(u),\]extended DRS takes the form
\[P_\alpha(x):=\operatorname{prox}_{\alpha f}(x),\] \[G_{\alpha,\beta}(x) := \operatorname{prox}_{\beta g} \left( \left(1+\tfrac{\beta}{\alpha}\right)P_\alpha(x) - \tfrac{\beta}{\alpha}x \right),\] \[x^+ = x+\theta\bigl(G_{\alpha,\beta}(x)-P_\alpha(x)\bigr).\]The standard DRS iteration is recovered by setting $\alpha=\beta=\gamma$ and $\theta=1$. For the convex optimization setting, the extended DRS paper gives the convergence range
\[0<\theta<\min\left\{2,\tfrac{2\alpha}{\beta}\right\}.\]3. Extended Douglas-Rachford envelope
The extended DRS envelope is
\[F_{\alpha,\beta}^{\mathrm{EDR}}(x) := f_\alpha(x) - \tfrac{\alpha+\beta}{2}\|\nabla f_\alpha(x)\|^2 + g_\beta\bigl(x-(\alpha+\beta)\nabla f_\alpha(x)\bigr).\]The key identity is
\[x-(\alpha+\beta)\nabla f_\alpha(x) = \left(1+\tfrac{\beta}{\alpha}\right)P_\alpha(x) - \tfrac{\beta}{\alpha}x.\]Therefore the last term in $F_{\alpha,\beta}^{\mathrm{EDR}}$ is exactly the Moreau envelope associated with the second proximal step in extended DRS.
Hence the extended DRS update can be written as
\[x^+ = x-\theta\beta \bigl(I-(\alpha+\beta)\nabla^2 f_\alpha(x)\bigr)^{-1} \nabla F_{\alpha,\beta}^{\mathrm{EDR}}(x),\]whenever $0<\beta<1/L_f$. Thus extended DRS is gradient descent on the extended DRE, under a variable metric.
Exactness of the extended envelope. The same envelope sandwich argument extends to $F_{\alpha,\beta}^{\mathrm{EDR}}$. For $P=P_\alpha(x)$, $G=G_{\alpha,\beta}(x)$, and $R=P-G$, we have
\[F(G) + \tfrac{1-\beta L_f}{2\beta}\|R\|^2 \le F_{\alpha,\beta}^{\mathrm{EDR}}(x) \le F(P) - \tfrac{1}{2\beta}\|R\|^2.\]Therefore, if $0<\beta<1/L_f$ and the original problem has a minimizer, then
\[\min_x F_{\alpha,\beta}^{\mathrm{EDR}}(x) = \min_u F(u), \quad P_\alpha\bigl(\operatorname{argmin}F_{\alpha,\beta}^{\mathrm{EDR}}\bigr) = \operatorname{argmin}F.\]Convex and smooth envelope in the quadratic case. Suppose
\[f(u)=\tfrac{1}{2}u^\top Q u+q^\top u,\; \mu I\preceq Q\preceq L I\]and $g$ is proper, closed, convex, and proximable. In this case, if $\alpha>0$ and $0<\beta\le 1/L$, then the extended DRE is smooth and convex with
\[L_{\alpha,\beta}^{\mathrm{EDR}} = \tfrac{1-\beta\mu}{\beta(1+\alpha\mu)^2}, \qquad \mu_{\alpha,\beta}^{\mathrm{EDR}} = \min\left\{ \tfrac{\mu(1-\beta\mu)}{(1+\alpha\mu)^2}, \tfrac{L(1-\beta L)}{(1+\alpha L)^2} \right\}.\]This gives a clean smooth convex objective on which the extended DRS iteration is simply a gradient method with a variable metric.
References
- Panagiotis Patrinos, Lorenzo Stella, and Alberto Bemporad. Forward-Backward Truncated Newton Methods for Convex Composite Optimization, 2014.
- Panagiotis Patrinos, Lorenzo Stella, and Alberto Bemporad. Douglas-Rachford Splitting: Complexity Estimates and Accelerated Variants, 2014.
- Max Nilsson, Anton Åkerman, and Pontus Giselsson. Extending Douglas-Rachford Splitting for Convex Optimization, 2025.
- Yanli Liu and Wotao Yin. An Envelope for Davis-Yin Splitting and Strict Saddle Point Avoidance, 2019.