Machine Learning — Hard
Key points
- Online RL learns from real-time interactions
- Offline RL uses fixed pre-collected datasets
- Off-policy methods learn from different behavior policies
Ready to go further?
Related questions
Machine Learning — Hard
Key points
Ready to go further?
Related questions