Machine Learning — Hard
Key points
- Double DQN separates action selection and evaluation
- Standard DQN uses the same network for both tasks
- Overestimation bias is a common challenge in Q-learning algorithms
Ready to go further?
Related questions
Machine Learning — Hard
Key points
Ready to go further?
Related questions