What is the purpose of momentum in gradient descent optimization?

Machine Learning Medium

Machine Learning — Medium

What is the purpose of momentum in gradient descent optimization?

Key points

  • Momentum accumulates velocity in the direction of persistent gradients
  • Dampens oscillations and accelerates convergence through ravines
  • Helps optimizer navigate complex loss surfaces efficiently

Ready to go further?

Related questions