【论文随笔】SAC Soft Actor-Critic:Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | ICML 2018 2023-04-10 学术 #RL #Algorithm #Online
【论文随笔】MBPO When to Trust Your Model:Model-Based Policy Optimization | NeurIPS 2019 2023-04-04 学术 #RL #Offline #Algorithm
【论文随笔】Combustion Optimization for Thermal Power Generating Units DeepThermal:Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | AAAI 2022 2023-04-04 学术 #RL #Offline #Application
【论文随笔】COMBO Conservative Offline Model-Based Policy Optimization | NeurIPS 2021 2023-03-31 学术 #RL #Offline #Algorithm
【论文随笔】MOReL Model-Based Offline Reinforcement Learning | NeurIPS 2020 2023-03-31 学术 #RL #Offline #Algorithm
【论文随笔】MOPO Model-based Offline Policy Optimization | NeurIPS 2020 2023-03-31 学术 #RL #Offline #Algorithm