---
date: '2024-12-10'
description: and what is she descending from, really?
id: gradient descent
modified: 2026-06-05 15:08:24 GMT-04:00
tags:
  - ml
title: gradient descent
created: '2024-12-10'
published: '2024-12-10'
pageLayout: default
slug: thoughts/gradient-descent
permalink: https://aarnphm.xyz/thoughts/gradient-descent.md
generator:
  quartz: v4.6.0
  hostedProvider: Cloudflare
  baseUrl: aarnphm.xyz
full: https://aarnphm.xyz/llms-full.txt
---
Let us define the standard [[thoughts/Vector calculus#gradient|gradient]] descent approach for minimizing a differentiable [[thoughts/Convex function|convex function]].

The gradient of a differentiable function $f : \mathbb{R}^d \to \mathbb{R}$ at $w$ is the vector of partial derivatives:

$$
\nabla f(w) = (\frac{\partial f(w)}{\partial w[1]},\ldots,\frac{\partial f(w)}{\partial w[d]})
$$
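
A minimal sketch of this definition, assuming a made-up function $f(w) = w[1]^2 + 3\,w[1]\,w[2]$ (not from the note; Python indexes from 0, so `w[0]` plays the role of $w[1]$). It compares the hand-derived partial derivatives with a central finite-difference estimate, perturbing one coordinate at a time:

```python
def f(w):
    # Hypothetical example function: f(w) = w[0]^2 + 3 * w[0] * w[1]
    return w[0] ** 2 + 3.0 * w[0] * w[1]


def grad_f(w):
    # Analytic partial derivatives of f, one entry per coordinate of w
    return [2.0 * w[0] + 3.0 * w[1], 3.0 * w[0]]


def numerical_grad(func, w, eps=1e-6):
    # Central differences: perturb a single coordinate and keep the rest fixed
    grad = []
    for k in range(len(w)):
        w_plus = list(w)
        w_minus = list(w)
        w_plus[k] += eps
        w_minus[k] -= eps
        grad.append((func(w_plus) - func(w_minus)) / (2.0 * eps))
    return grad


w = [1.0, 2.0]
analytic = grad_f(w)            # [8.0, 3.0]
numeric = numerical_grad(f, w)  # close to the analytic values
```

The finite-difference version is only a sanity check; it costs two function evaluations per coordinate, which is why analytic or automatic differentiation is preferred in practice.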

> \[!math\] 1. intuition
>
> $$
> w^{t+1} = w^t - \alpha \nabla f(w^t)
> $$
>
> <figure class="tikz" data-remark-tikz style=""><span class="tikz-mathml"><math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><annotation encoding="application/x-tex">"\\usepackage{pgfplots}\n\\pgfplotsset{compat=1.16}\n\n\\begin{document}\n\\begin{tikzpicture}\n  \\begin{scope}\n    \\clip(-4,-1) rectangle (4,4);\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(20/(sin(2*\\x)+2))},{sin(\\x)*sqrt(20/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(16/(sin(2*\\x)+2))},{sin(\\x)*sqrt(16/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(12/(sin(2*\\x)+2))},{sin(\\x)*sqrt(12/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(8/(sin(2*\\x)+2))},{sin(\\x)*sqrt(8/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(4/(sin(2*\\x)+2))},{sin(\\x)*sqrt(4/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(1/(sin(2*\\x)+2))},{sin(\\x)*sqrt(1/(sin(2*\\x)+2))});\n    \\draw plot[domain=0:360] ({cos(\\x)*sqrt(0.0625/(sin(2*\\x)+2))},{sin(\\x)*sqrt(0.0625/(sin(2*\\x)+2))});\n\n    \\draw[->,blue,ultra thick] (-2,3.65) to (-1.93,3);\n    \\draw[->,blue,ultra thick] (-1.93,3) to (-1.75,2.4);\n    \\draw[->,blue,ultra thick] (-1.75,2.4) to (-1.5,1.8);\n    \\draw[->,blue,ultra thick] (-1.5,1.8) to (-1.15,1.3);\n\n    \\node at (-1.4,3.8){\\scriptsize $w[0]$};\n    \\node at (-1.2,3.2){\\scriptsize $w[1]$};\n    \\node at (-1.05,2.6){\\scriptsize $w[2]$};\n    \\node at (-0.8,2){\\scriptsize $w[3]$};\n    \\node at (-0.6,1.4){\\scriptsize $w[4]$};\n  \\end{scope}\n\\end{tikzpicture}\n\\end{document}"</annotation></semantics></math></span><img 
src="data:image/svg+xml;base64,PHN2ZyB2ZXJzaW9uPSIxLjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgeG1sbnM6eGxpbms9Imh0dHA6Ly93d3cudzMub3JnLzE5OTkveGxpbmsiIHdpZHRoPSIyMjcuNjIxOTVwdCIgaGVpZ2h0PSIxNDIuMjYzNzJwdCIgdmlld0JveD0iLTcyIC03MiAyMjcuNjIxOTUgMTQyLjI2MzcyIj48ZyBzdHJva2UtbWl0ZXJsaW1pdD0iMTAiIHRyYW5zZm9ybT0idHJhbnNsYXRlKDQxLjU0MDk2OTg0ODYzMjgwNSw0MS41NDA5Njk4NDg2MzI4MDUpIHNjYWxlKDEsLTEpIj48ZyBzdHJva2U9IiMwMDAiIGZpbGw9IiMwMDAiPiA8ZyBzdHJva2Utd2lkdGg9IjAuNCI+IDxjbGlwUGF0aCBpZD0icGdmOGIwM2UzZWVjZWE5ODAzMGI4ODk4ZjAwYjM3Yzk3ZjhjcDEiPjxwYXRoIGQ9IiBNIC0xMTMuODEwOTcgLTI4LjQ1Mjc0IE0gLTExMy44MTA5NyAtMjguNDUyNzQgTCAtMTEzLjgxMDk3IDExMy44MTA5NyBMIDExMy44MTA5NyAxMTMuODEwOTcgTCAxMTMuODEwOTcgLTI4LjQ1Mjc0IFogTSAxMTMuODEwOTcgMTEzLjgxMDk3ICAiPjwvcGF0aD4gPC9jbGlwUGF0aD4gPGcgY2xpcC1wYXRoPSJ1cmwoI3BnZjhiMDNlM2VlY2VhOTgwMzBiODg5OGYwMGIzN2M5N2Y4Y3AxKSI+IDxwYXRoIGQ9IiBNIDg5Ljk3NTQ1IDAuMCBMIDc3LjczNDA0IDIwLjgyODU1IEwgNjUuMDkyMzMgMzcuNTgwODMgTCA1MS45NDcgNTEuOTQ3IEwgMzcuNTgwODMgNjUuMDkyMzMgTCAyMC44Mjg1NSA3Ny43MzQwNCBMIDAuMCA4OS45NzU0NSBMIC0yNi44ODk3OSAxMDAuMzU0MzUgTCAtNTkuNzQ1NzEgMTAzLjQ4Mjg2IEwgLTg5Ljk3NTQ1IDg5Ljk3NTQ1IEwgLTEwMy40ODI4NiA1OS43NDU3MSBMIC0xMDAuMzU0MzUgMjYuODg5NzkgTCAtODkuOTc1NDUgMC4wIEwgLTc3LjczNDA0IC0yMC44Mjg1NSBMIC02NS4wOTIzMyAtMzcuNTgwODMgTCAtNTEuOTQ3IC01MS45NDcgTCAtMzcuNTgwODMgLTY1LjA5MjMzIEwgLTIwLjgyODU1IC03Ny43MzQwNCBMIDAuMCAtODkuOTc1NDUgTCAyNi44ODk3OSAtMTAwLjM1NDM1IEwgNTkuNzQ1NzEgLTEwMy40ODI4NiBMIDg5Ljk3NTQ1IC04OS45NzU0NSBMIDEwMy40ODI4NiAtNTkuNzQ1NzEgTCAxMDAuMzU0MzUgLTI2Ljg4OTc5IEwgODkuOTc1NDUgMC4wICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPHBhdGggZD0iIE0gODAuNDc2MTUgMC4wIEwgNjkuNTI3MjIgMTguNjI5NTYgTCA1OC4yMjAxIDMzLjYxMzEgTCA0Ni40NjMyIDQ2LjQ2MzIgTCAzMy42MTMxIDU4LjIyMDEgTCAxOC42Mjk1NiA2OS41MjcyMiBMIDAuMCA4MC40NzYxNSBMIC0yNC4wNTA4NCA4OS43NjAxMiBMIC01My40MzgzMiA5Mi41NTc4IEwgLTgwLjQ3NjYgODAuNDc2NiBMIC05Mi41NTc4IDUzLjQzODMyIEwgLTg5Ljc2MDEyIDI0LjA1MDg0IEwgLTgwLjQ3NjE1IDAuMCBMIC02OS41MjcyMiAtMTguNjI5NTYgTCAtNTguMjIwMSAtMzMuNjEzMSBMIC00Ni40NjMyIC00Ni40NjMyIEwgLTMzLjYxMzEgLTU4LjIyMDEgT
CAtMTguNjI5NTYgLTY5LjUyNzIyIEwgMC4wIC04MC40NzYxNSBMIDI0LjA1MDg0IC04OS43NjAxMiBMIDUzLjQzODMyIC05Mi41NTc4IEwgODAuNDc2NiAtODAuNDc2NiBMIDkyLjU1NzggLTUzLjQzODMyIEwgODkuNzYwMTIgLTI0LjA1MDg0IEwgODAuNDc2MTUgMC4wICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPHBhdGggZD0iIE0gNjkuNjk0MzcgMC4wIEwgNjAuMjEyNDMgMTYuMTMzNiBMIDUwLjQyMDA3IDI5LjExMDA1IEwgNDAuMjM4MyA0MC4yMzgzIEwgMjkuMTEwMDUgNTAuNDIwMDcgTCAxNi4xMzM2IDYwLjIxMjQzIEwgMC4wIDY5LjY5NDM3IEwgLTIwLjgyODU1IDc3LjczNDA0IEwgLTQ2LjI3ODY5IDgwLjE1NzA2IEwgLTY5LjY5NDM3IDY5LjY5NDM3IEwgLTgwLjE1NzA2IDQ2LjI3ODY5IEwgLTc3LjczNDA0IDIwLjgyODU1IEwgLTY5LjY5NDM3IDAuMCBMIC02MC4yMTI0MyAtMTYuMTMzNiBMIC01MC40MjAwNyAtMjkuMTEwMDUgTCAtNDAuMjM4MyAtNDAuMjM4MyBMIC0yOS4xMTAwNSAtNTAuNDIwMDcgTCAtMTYuMTMzNiAtNjAuMjEyNDMgTCAwLjAgLTY5LjY5NDM3IEwgMjAuODI4NTUgLTc3LjczNDA0IEwgNDYuMjc4NjkgLTgwLjE1NzA2IEwgNjkuNjk0MzcgLTY5LjY5NDM3IEwgODAuMTU3MDYgLTQ2LjI3ODY5IEwgNzcuNzM0MDQgLTIwLjgyODU1IEwgNjkuNjk0MzcgMC4wICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPHBhdGggZD0iIE0gNTYuOTA1NDkgMC4wIEwgNDkuMTYzMjEgMTMuMTczMTEgTCA0MS4xNjc4MiAyMy43NjgyIEwgMzIuODU0MTkgMzIuODU0MTkgTCAyMy43NjgyIDQxLjE2NzgyIEwgMTMuMTczMTEgNDkuMTYzMjEgTCAwLjAgNTYuOTA1NDkgTCAtMTcuMDA2MjYgNjMuNDY5NDcgTCAtMzcuNzg2NjIgNjUuNDQ4MzMgTCAtNTYuOTA1MDQgNTYuOTA1MDQgTCAtNjUuNDQ4MzMgMzcuNzg2NjIgTCAtNjMuNDY5NDcgMTcuMDA2MjYgTCAtNTYuOTA1NDkgMC4wIEwgLTQ5LjE2MzIxIC0xMy4xNzMxMSBMIC00MS4xNjc4MiAtMjMuNzY4MiBMIC0zMi44NTQxOSAtMzIuODU0MTkgTCAtMjMuNzY4MiAtNDEuMTY3ODIgTCAtMTMuMTczMTEgLTQ5LjE2MzIxIEwgMC4wIC01Ni45MDU0OSBMIDE3LjAwNjI2IC02My40Njk0NyBMIDM3Ljc4NjYyIC02NS40NDgzMyBMIDU2LjkwNTA0IC01Ni45MDUwNCBMIDY1LjQ0ODMzIC0zNy43ODY2MiBMIDYzLjQ2OTQ3IC0xNy4wMDYyNiBMIDU2LjkwNTQ5IDAuMCAgIiBmaWxsPSJub25lIj48L3BhdGg+IDxwYXRoIGQ9IiBNIDQwLjIzODMgMC4wIEwgMzQuNzYzNjEgOS4zMTQ3NyBMIDI5LjEwOTYyIDE2LjgwNjU1IEwgMjMuMjMxMTYgMjMuMjMxMTYgTCAxNi44MDY1NSAyOS4xMDk2MiBMIDkuMzE0NzcgMzQuNzYzNjEgTCAwLjAgNDAuMjM4MyBMIC0xMi4wMjUyIDQ0Ljg3OTg0IEwgLTI2LjcxODcyIDQ2LjI3ODY5IEwgLTQwLjIzODMgNDAuMjM4MyBMIC00Ni4yNzg2OSAyNi43MTg3MiBMIC00NC44Nzk4NCAxMi4wMjUyIEwgLTQwLjIzODMgMC4wIEwgLTM0Ljc2MzYxIC05LjMxNDc3IEwgL
TI5LjEwOTYyIC0xNi44MDY1NSBMIC0yMy4yMzExNiAtMjMuMjMxMTYgTCAtMTYuODA2NTUgLTI5LjEwOTYyIEwgLTkuMzE0NzcgLTM0Ljc2MzYxIEwgMC4wIC00MC4yMzgzIEwgMTIuMDI1MiAtNDQuODc5ODQgTCAyNi43MTg3MiAtNDYuMjc4NjkgTCA0MC4yMzgzIC00MC4yMzgzIEwgNDYuMjc4NjkgLTI2LjcxODcyIEwgNDQuODc5ODQgLTEyLjAyNTIgTCA0MC4yMzgzIDAuMCAgIiBmaWxsPSJub25lIj48L3BhdGg+IDxwYXRoIGQ9IiBNIDIwLjExOTE0IDAuMCBMIDE3LjM4MTM2IDQuNjU3MTcgTCAxNC41NTQxNSA4LjQwMzA1IEwgMTEuNjE1MzYgMTEuNjE1MzYgTCA4LjQwMzA1IDE0LjU1NDE1IEwgNC42NTcxNyAxNy4zODEzNiBMIDAuMCAyMC4xMTkxNCBMIC02LjAxMjYgMjIuNDQwMTIgTCAtMTMuMzU4OTIgMjMuMTM4NjkgTCAtMjAuMTE5MTQgMjAuMTE5MTQgTCAtMjMuMTM4NjkgMTMuMzU4OTIgTCAtMjIuNDQwMTIgNi4wMTI2IEwgLTIwLjExOTE0IDAuMCBMIC0xNy4zODEzNiAtNC42NTcxNyBMIC0xNC41NTQxNSAtOC40MDMwNSBMIC0xMS42MTUzNiAtMTEuNjE1MzYgTCAtOC40MDMwNSAtMTQuNTU0MTUgTCAtNC42NTcxNyAtMTcuMzgxMzYgTCAwLjAgLTIwLjExOTE0IEwgNi4wMTI2IC0yMi40NDAxMiBMIDEzLjM1ODkyIC0yMy4xMzg2OSBMIDIwLjExOTE0IC0yMC4xMTkxNCBMIDIzLjEzODY5IC0xMy4zNTg5MiBMIDIyLjQ0MDEyIC02LjAxMjYgTCAyMC4xMTkxNCAwLjAgICIgZmlsbD0ibm9uZSI+PC9wYXRoPiA8cGF0aCBkPSIgTSA1LjAyOTY4IDAuMCBMIDQuMzQ1MDIgMS4xNjM5NiBMIDMuNjM3NzcgMi4xMDA0MyBMIDIuOTAzMTggMi45MDMxOCBMIDIuMTAwNDMgMy42Mzc3NyBMIDEuMTYzOTYgNC4zNDUwMiBMIDAuMCA1LjAyOTY4IEwgLTEuNTAxNzQgNS42MDUzNiBMIC0zLjMzOTA4IDUuNzgzOCBMIC01LjAyOTY4IDUuMDI5NjggTCAtNS43ODM4IDMuMzM5MDggTCAtNS42MDUzNiAxLjUwMTc0IEwgLTUuMDI5NjggMC4wIEwgLTQuMzQ1MDIgLTEuMTYzOTYgTCAtMy42Mzc3NyAtMi4xMDA0MyBMIC0yLjkwMzE4IC0yLjkwMzE4IEwgLTIuMTAwNDMgLTMuNjM3NzcgTCAtMS4xNjM5NiAtNC4zNDUwMiBMIDAuMCAtNS4wMjk2OCBMIDEuNTAxNzQgLTUuNjA1MzYgTCAzLjMzOTA4IC01Ljc4MzggTCA1LjAyOTY4IC01LjAyOTY4IEwgNS43ODM4IC0zLjMzOTA4IEwgNS42MDUzNiAtMS41MDE3NCBMIDUuMDI5NjggMC4wICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPGcgc3Ryb2tlPSIjMDBmIiBmaWxsPSIjMDBmIj4gPGcgc3Ryb2tlLXdpZHRoPSIxLjYiPiA8cGF0aCBkPSIgTSAtNTYuOTA1NDkgMTAzLjg1MjMzIEwgLTU1LjA4NDg1IDg2Ljk0ODk3ICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPGcgdHJhbnNmb3JtPSJtYXRyaXgoMC4xMDcwNiwtMC45OTQyMiwwLjk5NDIyLDAuMTA3MDYsLTU0Ljk5OTIsODYuMTUzNjEpIj4gPGcgc3Ryb2tlLWRhc2hhcnJheT0ibm9uZSIgc3Ryb2tlLWRhc2hvZmZzZXQ9IjAuMCI+IDxnIHN0cm9rZS1sa
W5lY2FwPSJyb3VuZCI+IDxnIHN0cm9rZS1saW5lam9pbj0icm91bmQiPiA8cGF0aCBkPSIgTSAtMy41MiA0LjU2NzczIEMgLTIuODc2OSAxLjgyNzA2IC0xLjQ0Mzg1IDAuNTMyOSAwLjAgMC4wIEMgLTEuNDQzODUgLTAuNTMyOSAtMi44NzY5IC0xLjgyNzA2IC0zLjUyIC00LjU2NzczICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPC9nPiA8L2c+IDwvZz4gIDwvZz4gPC9nPiA8L2c+IDxnIHN0cm9rZT0iIzAwZiIgZmlsbD0iIzAwZiI+IDxnIHN0cm9rZS13aWR0aD0iMS42Ij4gPHBhdGggZD0iIE0gLTU0LjkxMzU3IDg1LjM1ODIzIEwgLTUwLjI1MTk1IDY5LjgxODg1ICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPGcgdHJhbnNmb3JtPSJtYXRyaXgoMC4yODczLC0wLjk1Nzc4LDAuOTU3NzgsMC4yODczLC01MC4wMjIxMyw2OS4wNTI2MykiPiA8ZyBzdHJva2UtZGFzaGFycmF5PSJub25lIiBzdHJva2UtZGFzaG9mZnNldD0iMC4wIj4gPGcgc3Ryb2tlLWxpbmVjYXA9InJvdW5kIj4gPGcgc3Ryb2tlLWxpbmVqb2luPSJyb3VuZCI+IDxwYXRoIGQ9IiBNIC0zLjUyIDQuNTY3NzMgQyAtMi44NzY5IDEuODI3MDYgLTEuNDQzODUgMC41MzI5IDAuMCAwLjAgQyAtMS40NDM4NSAtMC41MzI5IC0yLjg3NjkgLTEuODI3MDYgLTMuNTIgLTQuNTY3NzMgICIgZmlsbD0ibm9uZSI+PC9wYXRoPiA8L2c+IDwvZz4gPC9nPiAgPC9nPiA8L2c+IDwvZz4gPGcgc3Ryb2tlPSIjMDBmIiBmaWxsPSIjMDBmIj4gPGcgc3Ryb2tlLXdpZHRoPSIxLjYiPiA8cGF0aCBkPSIgTSAtNDkuNzkyMyA2OC4yODY0IEwgLTQzLjI5NDQzIDUyLjY5MTgzICAiIGZpbGw9Im5vbmUiPjwvcGF0aD4gPGcgdHJhbnNmb3JtPSJtYXRyaXgoMC4zODQ1OCwtMC45MjMwMiwwLjkyMzAyLDAuMzg0NTgsLTQyLjk4Njc3LDUxLjk1MzQzKSI+IDxnIHN0cm9rZS1kYXNoYXJyYXk9Im5vbmUiIHN0cm9rZS1kYXNob2Zmc2V0PSIwLjAiPiA8ZyBzdHJva2UtbGluZWNhcD0icm91bmQiPiA8ZyBzdHJva2UtbGluZWpvaW49InJvdW5kIj4gPHBhdGggZD0iIE0gLTMuNTIgNC41Njc3MyBDIC0yLjg3NjkgMS44MjcwNiAtMS40NDM4NSAwLjUzMjkgMC4wIDAuMCBDIC0xLjQ0Mzg1IC0wLjUzMjkgLTIuODc2OSAtMS44MjcwNiAtMy41MiAtNC41Njc3MyAgIiBmaWxsPSJub25lIj48L3BhdGg+IDwvZz4gPC9nPiA8L2c+ICA8L2c+IDwvZz4gPC9nPiA8ZyBzdHJva2U9IiMwMGYiIGZpbGw9IiMwMGYiPiA8ZyBzdHJva2Utd2lkdGg9IjEuNiI+IDxwYXRoIGQ9IiBNIC00Mi42NzkxMSA1MS4yMTUwMSBMIC0zMy42MzggMzguMjk5NCAgIiBmaWxsPSJub25lIj48L3BhdGg+IDxnIHRyYW5zZm9ybT0ibWF0cml4KDAuNTczNDYsLTAuODE5MjEsMC44MTkyMSwwLjU3MzQ2LC0zMy4xNzkyNCwzNy42NDQwMykiPiA8ZyBzdHJva2UtZGFzaGFycmF5PSJub25lIiBzdHJva2UtZGFzaG9mZnNldD0iMC4wIj4gPGcgc3Ryb2tlLWxpbmVjYXA9InJvdW5kIj4gPGcgc3Ryb2tlLWxpbmVqb2luPSJyb3VuZCI+IDxwYXRoIGQ9IiBNI
C0zLjUyIDQuNTY3NzMgQyAtMi44NzY5IDEuODI3MDYgLTEuNDQzODUgMC41MzI5IDAuMCAwLjAgQyAtMS40NDM4NSAtMC41MzI5IC0yLjg3NjkgLTEuODI3MDYgLTMuNTIgLTQuNTY3NzMgICIgZmlsbD0ibm9uZSI+PC9wYXRoPiA8L2c+IDwvZz4gPC9nPiAgPC9nPiA8L2c+IDwvZz4gPGcgdHJhbnNmb3JtPSJ0cmFuc2xhdGUoLTQ3LjA3NzMzLDEwNi4zNzA1KSI+IDxnIHN0cm9rZT0iIzAwMCIgZmlsbD0iIzAwMCI+IDxnIHN0cm9rZT0ibm9uZSIgdHJhbnNmb3JtPSJzY2FsZSgtMSwxKSB0cmFuc2xhdGUoNDEuNTQwOTY5ODQ4NjMyODA1LDQxLjU0MDk2OTg0ODYzMjgwNSkgc2NhbGUoLTEsLTEpIj48ZyBmaWxsPSIjMDAwIj4gPGcgc3Ryb2tlPSJub25lIj4gPHRleHQgYWxpZ25tZW50LWJhc2VsaW5lPSJiYXNlbGluZSIgeT0iNDEuNTQwOTY5ODQ4NjMyODA1IiB4PSI0MS41NDA5Njk4NDg2MzI4MDUiIGZvbnQtZmFtaWx5PSJzZXJpZiIgZm9udC1zaXplPSI3IiBmaWxsPSJibGFjayIgZm9udC1zdHlsZT0iaXRhbGljIj53PC90ZXh0Pjx0ZXh0IGFsaWdubWVudC1iYXNlbGluZT0iYmFzZWxpbmUiIHk9IjQxLjU0MDk2OTg0ODYzMjgwNSIgeD0iNDcuNTE0Mzc2NjQwMzE5ODIiIGZvbnQtZmFtaWx5PSJzZXJpZiIgZm9udC1zaXplPSI3IiBmaWxsPSJibGFjayI+WzBdPC90ZXh0PjwvZz4gPC9nPiA8L2c+PC9nPiA8L2c+IDxnIHRyYW5zZm9ybT0idHJhbnNsYXRlKC00MS4zODY4Nyw4OS4yOTg2OSkiPiA8ZyBzdHJva2U9IiMwMDAiIGZpbGw9IiMwMDAiPiA8ZyBzdHJva2U9Im5vbmUiIHRyYW5zZm9ybT0ic2NhbGUoLTEsMSkgdHJhbnNsYXRlKDQxLjU0MDk2OTg0ODYzMjgwNSw0MS41NDA5Njk4NDg2MzI4MDUpIHNjYWxlKC0xLC0xKSI+PGcgZmlsbD0iIzAwMCI+IDxnIHN0cm9rZT0ibm9uZSI+IDx0ZXh0IGFsaWdubWVudC1iYXNlbGluZT0iYmFzZWxpbmUiIHk9IjQxLjU0MDk2OTg0ODYzMjgwNSIgeD0iNDEuNTQwOTY5ODQ4NjMyODA1IiBmb250LWZhbWlseT0ic2VyaWYiIGZvbnQtc2l6ZT0iNyIgZmlsbD0iYmxhY2siIGZvbnQtc3R5bGU9Iml0YWxpYyI+dzwvdGV4dD48dGV4dCBhbGlnbm1lbnQtYmFzZWxpbmU9ImJhc2VsaW5lIiB5PSI0MS41NDA5Njk4NDg2MzI4MDUiIHg9IjQ3LjUxNDM3NjY0MDMxOTgyIiBmb250LWZhbWlseT0ic2VyaWYiIGZvbnQtc2l6ZT0iNyIgZmlsbD0iYmxhY2siPlsxXTwvdGV4dD48L2c+IDwvZz4gPC9nPjwvZz4gPC9nPiA8ZyB0cmFuc2Zvcm09InRyYW5zbGF0ZSgtMzcuMTE5MTMsNzIuMjI3MykiPiA8ZyBzdHJva2U9IiMwMDAiIGZpbGw9IiMwMDAiPiA8ZyBzdHJva2U9Im5vbmUiIHRyYW5zZm9ybT0ic2NhbGUoLTEsMSkgdHJhbnNsYXRlKDQxLjU0MDk2OTg0ODYzMjgwNSw0MS41NDA5Njk4NDg2MzI4MDUpIHNjYWxlKC0xLC0xKSI+PGcgZmlsbD0iIzAwMCI+IDxnIHN0cm9rZT0ibm9uZSI+IDx0ZXh0IGFsaWdubWVudC1iYXNlbGluZT0iYmFzZWxpbmUiIHk9IjQxLjU0MDk2OTg0ODYzMjgwNSIgeD0iN
DEuNTQwOTY5ODQ4NjMyODA1IiBmb250LWZhbWlseT0ic2VyaWYiIGZvbnQtc2l6ZT0iNyIgZmlsbD0iYmxhY2siIGZvbnQtc3R5bGU9Iml0YWxpYyI+dzwvdGV4dD48dGV4dCBhbGlnbm1lbnQtYmFzZWxpbmU9ImJhc2VsaW5lIiB5PSI0MS41NDA5Njk4NDg2MzI4MDUiIHg9IjQ3LjUxNDM3NjY0MDMxOTgyIiBmb250LWZhbWlseT0ic2VyaWYiIGZvbnQtc2l6ZT0iNyIgZmlsbD0iYmxhY2siPlsyXTwvdGV4dD48L2c+IDwvZz4gPC9nPjwvZz4gPC9nPiA8ZyB0cmFuc2Zvcm09InRyYW5zbGF0ZSgtMzAuMDA1OTQsNTUuMTU1NDkpIj4gPGcgc3Ryb2tlPSIjMDAwIiBmaWxsPSIjMDAwIj4gPGcgc3Ryb2tlPSJub25lIiB0cmFuc2Zvcm09InNjYWxlKC0xLDEpIHRyYW5zbGF0ZSg0MS41NDA5Njk4NDg2MzI4MDUsNDEuNTQwOTY5ODQ4NjMyODA1KSBzY2FsZSgtMSwtMSkiPjxnIGZpbGw9IiMwMDAiPiA8ZyBzdHJva2U9Im5vbmUiPiA8dGV4dCBhbGlnbm1lbnQtYmFzZWxpbmU9ImJhc2VsaW5lIiB5PSI0MS41NDA5Njk4NDg2MzI4MDUiIHg9IjQxLjU0MDk2OTg0ODYzMjgwNSIgZm9udC1mYW1pbHk9InNlcmlmIiBmb250LXNpemU9IjciIGZpbGw9ImJsYWNrIiBmb250LXN0eWxlPSJpdGFsaWMiPnc8L3RleHQ+PHRleHQgYWxpZ25tZW50LWJhc2VsaW5lPSJiYXNlbGluZSIgeT0iNDEuNTQwOTY5ODQ4NjMyODA1IiB4PSI0Ny41MTQzNzY2NDAzMTk4MiIgZm9udC1mYW1pbHk9InNlcmlmIiBmb250LXNpemU9IjciIGZpbGw9ImJsYWNrIj5bM108L3RleHQ+PC9nPiA8L2c+IDwvZz48L2c+IDwvZz4gPGcgdHJhbnNmb3JtPSJ0cmFuc2xhdGUoLTI0LjMxNTQ4LDM4LjA4MzY2KSI+IDxnIHN0cm9rZT0iIzAwMCIgZmlsbD0iIzAwMCI+IDxnIHN0cm9rZT0ibm9uZSIgdHJhbnNmb3JtPSJzY2FsZSgtMSwxKSB0cmFuc2xhdGUoNDEuNTQwOTY5ODQ4NjMyODA1LDQxLjU0MDk2OTg0ODYzMjgwNSkgc2NhbGUoLTEsLTEpIj48ZyBmaWxsPSIjMDAwIj4gPGcgc3Ryb2tlPSJub25lIj4gPHRleHQgYWxpZ25tZW50LWJhc2VsaW5lPSJiYXNlbGluZSIgeT0iNDEuNTQwOTY5ODQ4NjMyODA1IiB4PSI0MS41NDA5Njk4NDg2MzI4MDUiIGZvbnQtZmFtaWx5PSJzZXJpZiIgZm9udC1zaXplPSI3IiBmaWxsPSJibGFjayIgZm9udC1zdHlsZT0iaXRhbGljIj53PC90ZXh0Pjx0ZXh0IGFsaWdubWVudC1iYXNlbGluZT0iYmFzZWxpbmUiIHk9IjQxLjU0MDk2OTg0ODYzMjgwNSIgeD0iNDcuNTE0Mzc2NjQwMzE5ODIiIGZvbnQtZmFtaWx5PSJzZXJpZiIgZm9udC1zaXplPSI3IiBmaWxsPSJibGFjayI+WzRdPC90ZXh0PjwvZz4gPC9nPiA8L2c+PC9nPiA8L2c+IDwvZz4gPC9nPiA8L2c+IDwvZz48L3N2Zz4=" alt="tikz diagram" loading="lazy" decoding="async"><figcaption><em>source code</em><button class="source-code-button" aria-label="copy source code for this tikz graph" title="copy source code for this 
tikz graph"><svg class="source-icon" xmlns="http://www.w3.org/2000/svg" width="12" height="16" viewBox="0 -4 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><use href="#code-icon"></use></svg><svg class="check-icon" xmlns="http://www.w3.org/2000/svg" width="12" height="16" viewBox="0 -4 16 16" fill="currentColor" stroke="none" stroke-width="0" stroke-linecap="round" stroke-linejoin="round"><use href="#github-check"></use></svg></button></figcaption></figure>

## idea

- initialize $w^0$
- for each iteration $t = 0, 1, 2, \ldots$:
  - $w^{t+1} = w^t - \alpha \nabla f(w^t)$
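
The loop above can be sketched in a few lines of plain Python. The objective $f(w) = (w[0] - 3)^2 + (w[1] + 1)^2$, the starting point, and the two step sizes are assumptions picked for illustration; the function name `gradient_descent` is hypothetical.

```python
def grad_f(w):
    # Gradient of the assumed objective f(w) = (w[0] - 3)^2 + (w[1] + 1)^2
    return [2.0 * (w[0] - 3.0), 2.0 * (w[1] + 1.0)]


def gradient_descent(grad, w0, alpha, num_steps):
    # Repeats the update w <- w - alpha * grad(w) for a fixed number of steps
    w = list(w0)
    for _ in range(num_steps):
        g = grad(w)
        w = [w_k - alpha * g_k for w_k, g_k in zip(w, g)]
    return w


# Small step size: the iterates approach the minimizer (3, -1)
w_final = gradient_descent(grad_f, w0=[0.0, 0.0], alpha=0.1, num_steps=200)

# Step size too large for this objective: the iterates move away instead
w_diverged = gradient_descent(grad_f, w0=[0.0, 0.0], alpha=1.1, num_steps=50)
```

For this particular quadratic each step multiplies the distance to the minimizer by $|1 - 2\alpha|$, so $\alpha = 0.1$ shrinks it and $\alpha = 1.1$ grows it.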

intuition: depending on the learning rate $\alpha$, the iterates should converge to a local minimum

> not necessarily a global minimum

For [[thoughts/Convex function|convex functions]], however, every local minimum is also a global minimum, so convergence is to a global minimum.
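
To see why the global guarantee needs convexity, here is a small sketch on an assumed non-convex function $f(w) = w^4 - 2w^2 + 0.5w$ (chosen for illustration, not from the note). It has two local minima with different values, and which one gradient descent reaches depends on the starting point:

```python
def f(w):
    # Assumed non-convex example with two local minima of different depth
    return w ** 4 - 2.0 * w ** 2 + 0.5 * w


def grad_f(w):
    return 4.0 * w ** 3 - 4.0 * w + 0.5


def descend(w0, alpha=0.01, num_steps=2000):
    # Plain gradient descent in one dimension
    w = w0
    for _ in range(num_steps):
        w = w - alpha * grad_f(w)
    return w


w_right = descend(2.0)   # settles in the shallower minimum on the positive side
w_left = descend(-2.0)   # settles in the deeper minimum on the negative side
```

Both runs end at points where the gradient is numerically zero, but only `w_left` is the global minimizer of this function.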

## calculate the gradient

$$
\begin{aligned}
E(w) &= L(w) + \lambda \text{Reg}(w) \\[8pt]
L(w) &= \sum_{i} l(f_w(x^i), y^i) \\[8pt]
\nabla_w (L(w)) &= \sum_{i} \nabla_w (l(f_w(x^i), y^i))
\end{aligned}
$$
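
A minimal sketch of these formulas under stated assumptions: a linear model $f_w(x) = w \cdot x$, the squared loss $l = \tfrac{1}{2}(f_w(x) - y)^2$, and $\text{Reg}(w) = \tfrac{1}{2}\lVert w \rVert^2$. None of these choices come from the note, and the data set is made up.

```python
def predict(w, x):
    # Assumed linear model f_w(x) = w . x
    return sum(w_k * x_k for w_k, x_k in zip(w, x))


def example_grad(w, x, y):
    # Gradient of the assumed loss 0.5 * (f_w(x) - y)^2 is (f_w(x) - y) * x
    residual = predict(w, x) - y
    return [residual * x_k for x_k in x]


def loss_grad(w, data):
    # grad L(w): sum of the per-example gradients
    total = [0.0] * len(w)
    for x, y in data:
        g = example_grad(w, x, y)
        total = [t + g_k for t, g_k in zip(total, g)]
    return total


def objective_grad(w, data, lam):
    # grad E(w) = grad L(w) + lam * grad Reg(w); with Reg = 0.5 * ||w||^2, grad Reg = w
    g = loss_grad(w, data)
    return [g_k + lam * w_k for g_k, w_k in zip(g, w)]


data = [([1.0, 0.0], 1.0), ([0.0, 1.0], 2.0), ([1.0, 1.0], 2.0)]
w = [0.0, 0.0]
g = objective_grad(w, data, lam=0.1)  # [-3.0, -4.0]
```

At $w = 0$ the regularizer contributes nothing, so the result is just the sum of the three per-example gradients.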

trick: partition the data into mini-batches $S_j$ and split the gradient into per-batch sums

$$
\begin{aligned}
\nabla_w^j &= \sum_{(x,y) \in S_j} \nabla_w (l(f_w(x), y))\\[8pt]
\nabla_w (L(w)) &= \sum_{j} \nabla_w^j
\end{aligned}
$$
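
The identity above can be checked numerically. This is a sketch under the same kind of assumptions as before (linear model, squared loss with a $\tfrac{1}{2}$ factor, made-up data, and a hypothetical `batch_grad` helper): the per-batch gradients over a partition of the data add up to the full gradient.

```python
def example_grad(w, x, y):
    # Gradient of 0.5 * (w . x - y)^2 with respect to w (assumed model and loss)
    residual = sum(w_k * x_k for w_k, x_k in zip(w, x)) - y
    return [residual * x_k for x_k in x]


def batch_grad(w, batch):
    # Sum of per-example gradients over one mini-batch S_j
    total = [0.0] * len(w)
    for x, y in batch:
        g = example_grad(w, x, y)
        total = [t + g_k for t, g_k in zip(total, g)]
    return total


data = [
    ([1.0, 0.0], 1.0),
    ([0.0, 1.0], 2.0),
    ([1.0, 1.0], 2.0),
    ([2.0, 1.0], 3.0),
]
w = [0.5, -0.5]

# Partition the data into consecutive mini-batches of size 2
batch_size = 2
batches = [data[i:i + batch_size] for i in range(0, len(data), batch_size)]

batch_grads = [batch_grad(w, b) for b in batches]
summed = [sum(column) for column in zip(*batch_grads)]  # sum over j of grad^j
full = batch_grad(w, data)                              # gradient over all data
```

Each `batch_grads[j]` only touches the examples in one batch, so the pieces can be computed one at a time (or in parallel) and accumulated.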

![[thoughts/university/twenty-four-twenty-five/sfwr-4ml3/Stochastic gradient descent|SGD]]

## analysis of GD for Convex-Lipschitz Functions

