PRIORITIZED
MODEL-AUGMENTED PRIORITIZED EXPERIENCE REPLAY
 **发表时间:**2022(ICLR 2022) **文章要点:**这篇文章想说Q网络通常会存在under- or ......
Prioritized Sequence Experience Replay
 **发表时间:**2020 **文章要点:**这篇文章提出了Prioritized Sequence Exper ......
Revisiting Prioritized Experience Replay: A Value Perspective
 **发表时间:**2021 **文章要点:**这篇文章想说Prioritized experience repla ......
Sep 2022-Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt
提出了Reducible Holdout Loss Selection (RHOLOSS),一种简单但有原则的技术,近似地选择那些最能减少模型泛化损失的点进行训练 ......