Machine Learning — Hard
Key points
- The target network prevents chasing moving targets by holding stable Q-value targets.
- It is updated periodically to maintain consistency in training.
- Updating the target network prevents oscillations and ensures more stable learning.
Ready to go further?
Related questions
