Machine Learning — Hard
Key points
- Experience replay stores past transitions for training
- Sampling random mini-batches breaks temporal correlations
- Improves sample efficiency and training stability
- Key concept in Deep Q-Networks
Ready to go further?
Related questions
